Split the subgraph_deployment table into two #6003

lutter · 2025-05-13T23:55:18Z

We split the table subgraphs.subgraph_deployment into two tables, subgraphs.head and subgraphs.deployment where the head table only contains the metadata that changes on every block.

This should help with situations where the subgraph_deployment table gets very bloated since the head table that gets bloated through frequent changes has much smaller rows than the current subgraph_deployment table. Rows in subgraph_deployment can grow as big as 500k, whereas rows in the head table will only take about 350 bytes at most.

Updates will also be marginally better on the heads table since it only has one index rather than the two that subgraph_deployment has.

This change has the downstream effect that there is no more sharded.subgraph_deployment table. Instead there are now sharded.head and sharded.deployment tables. This change will also require that all dashboards or other tools that used to access subgraph_deployment now access the two new tables.

Besides splitting tables, the new tables also use int4 columns throughout for block numbers rather than numeric since the rest of the system is restricted to int4 block numbers everywhere else.

zorancv

Looks good!

zorancv · 2025-05-16T17:34:41Z

store/postgres/src/detail.rs

+            .load::<(Deployment, Head)>(conn)?
+    }
+    .into_iter()
+    .map(DeploymentDetail::from)


This also makes the timeouts used for IPFS requests configurable; the default of 1s in debug builds is too short for the runner tests in CI and we therefore set it to the 60s for release builds for those tests.

We split the table `subgraphs.subgraph_deployment` into two tables, `subgraphs.head` and `subgraphs.deployment` where the `head` table only contains the metadata that changes on every block. This should help with situations where the `subgraph_deployment` table gets very bloated since the `head` table that gets bloated through frequent changes has much smaller rows than the current `subgraph_deployment` table. Rows in `subgraph_deployment` can grow as big as 500k, whereas rows in the `head` table will only take about 350 bytes at most. Updates will also be marginally better on the `heads` table since it only has one index rather than the two that `subgraph_deployment` has.

lutter self-assigned this May 13, 2025

lutter requested a review from zorancv May 13, 2025 23:55

lutter force-pushed the lutter/split-sd branch from 49cd0e1 to c7a2b77 Compare May 14, 2025 00:02

lutter requested a review from encalypto May 14, 2025 00:02

lutter force-pushed the lutter/split-sd branch 2 times, most recently from 8e47a78 to 899b13a Compare May 14, 2025 01:02

zorancv approved these changes May 16, 2025

View reviewed changes

store/postgres/src/detail.rs

.load::<(Deployment, Head)>(conn)?

}

.into_iter()

.map(DeploymentDetail::from)

Copy link

Contributor

zorancv May 16, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nifty!

lutter added 5 commits May 16, 2025 11:18

store: Streamline how we get DeploymentDetail a little

1a31994

store: Streamline how we get ErrorDetail a little

8aa1bd7

store: Streamline how we get SubgraphManifest a little

e449758

graph: Speed up some tests by using shorter timeouts in debug builds

e1d9876

This also makes the timeouts used for IPFS requests configurable; the default of 1s in debug builds is too short for the runner tests in CI and we therefore set it to the 60s for release builds for those tests.

lutter force-pushed the lutter/split-sd branch from 899b13a to 2301988 Compare May 16, 2025 18:18

lutter merged commit 2301988 into master May 16, 2025

lutter deleted the lutter/split-sd branch May 16, 2025 18:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Split the subgraph_deployment table into two #6003

Split the subgraph_deployment table into two #6003

Uh oh!

lutter commented May 13, 2025 •

edited

Loading

Uh oh!

zorancv left a comment

Uh oh!

zorancv May 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Split the subgraph_deployment table into two #6003

Split the subgraph_deployment table into two #6003

Uh oh!

Conversation

lutter commented May 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

zorancv left a comment

Choose a reason for hiding this comment

Uh oh!

zorancv May 16, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

lutter commented May 13, 2025 •

edited

Loading