
Add plan for coalescing operations during streaming #770

Draft

tsg wants to merge 2 commits into main from stream_batch_operations_plan

Conversation

Member
@tsg tsg commented Mar 11, 2026

Trying something new: this adds only the plan for solving #769, so we can discuss it before implementing.

Putting it in Draft mode because this is only meant for discussion.


## Expected impact

For the customer's workload (9,737 DELETEs in a single batch):
Collaborator

Did we see other columns in the WHERE condition in users' WAL events? In the future we could make it more flexible and batch DELETE statements with different column names. Example:

`DELETE FROM t1 WHERE my_column IN ($1, $2, $3)`

Member Author

WALs always refer to changes by their identity columns, so we don't have to support complex WHERE conditions.

Collaborator

@kvch kvch left a comment

This is a good improvement. Hopefully, it will solve most of the issues for us. If not, there are several ways we can improve the process.


Batch raw WAL events instead of pre-built SQL strings, then build bulk SQL at execution time.

- N DELETEs on the same table become: `DELETE FROM t WHERE "id" IN ($1, $2, ..., $N)`
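The coalescing in this bullet can be sketched in Go. This is a hedged illustration; `buildBulkDelete` and its signature are hypothetical, not the PR's actual API:

```go
package main

import (
	"fmt"
	"strings"
)

// buildBulkDelete collapses a run of N single-row DELETEs on the same table
// into one statement: DELETE FROM t WHERE "id" IN ($1, ..., $N).
func buildBulkDelete(table, idCol string, ids []any) (string, []any) {
	placeholders := make([]string, len(ids))
	for i := range ids {
		placeholders[i] = fmt.Sprintf("$%d", i+1)
	}
	sql := fmt.Sprintf(`DELETE FROM %s WHERE %q IN (%s)`,
		table, idCol, strings.Join(placeholders, ", "))
	return sql, ids
}

func main() {
	sql, args := buildBulkDelete("t", "id", []any{10, 11, 12})
	fmt.Println(sql)       // DELETE FROM t WHERE "id" IN ($1, $2, $3)
	fmt.Println(len(args)) // 3
}
```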
Collaborator

Should we also consider composite primary keys?

Collaborator

Ah, I see that they are included.
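For composite identity keys, the same coalescing idea can use row constructors in the `IN` list. A minimal sketch; `buildCompositeDelete` and its signature are illustrative, not the PR's actual API:

```go
package main

import (
	"fmt"
	"strings"
)

// buildCompositeDelete coalesces DELETEs keyed by several identity columns
// using row constructors: DELETE FROM t WHERE ("a", "b") IN (($1, $2), ...).
func buildCompositeDelete(table string, keyCols []string, rows [][]any) (string, []any) {
	quoted := make([]string, len(keyCols))
	for i, c := range keyCols {
		quoted[i] = fmt.Sprintf("%q", c) // double-quote each identifier
	}
	var tuples []string
	var args []any
	n := 1
	for _, row := range rows {
		ph := make([]string, len(row))
		for i := range row {
			ph[i] = fmt.Sprintf("$%d", n)
			n++
		}
		tuples = append(tuples, "("+strings.Join(ph, ", ")+")")
		args = append(args, row...)
	}
	sql := fmt.Sprintf("DELETE FROM %s WHERE (%s) IN (%s)",
		table, strings.Join(quoted, ", "), strings.Join(tuples, ", "))
	return sql, args
}

func main() {
	sql, args := buildCompositeDelete("t", []string{"a", "b"}, [][]any{{1, "x"}, {2, "y"}})
	fmt.Println(sql)       // DELETE FROM t WHERE ("a", "b") IN (($1, $2), ($3, $4))
	fmt.Println(len(args)) // 4
}
```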


1. Separate DDL and DML messages
2. For DDL: build and execute queries via existing `ddlAdapter`
3. For DML: walk messages in order, building "runs" of consecutive same-(schema, table, action) events:
Collaborator

Given that we already walk the messages in order, we can make the query aggregator logic a bit smarter and consider coalescing interleaved DELETEs when no INSERT or UPDATE statement conflicts with them.

Member Author

Yes, we can add this later. In some situations we can prove that it's still correct.
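Step 3's run-building (walking messages in order and grouping consecutive same-(schema, table, action) events) could look roughly like this. The types are simplified stand-ins for the PR's `walMessage`, not the real implementation:

```go
package main

import "fmt"

// walEvent is a simplified stand-in for the plan's walMessage.
type walEvent struct {
	Schema, Table, Action string // Action: "I", "U", or "D"
}

// runKey identifies a coalescible run.
type runKey struct{ Schema, Table, Action string }

// buildRuns walks events in order and groups consecutive events sharing
// (schema, table, action) into runs, preserving overall ordering.
func buildRuns(events []walEvent) [][]walEvent {
	var runs [][]walEvent
	var cur runKey
	for _, e := range events {
		k := runKey{e.Schema, e.Table, e.Action}
		if len(runs) == 0 || k != cur {
			runs = append(runs, nil) // start a new run on any key change
			cur = k
		}
		runs[len(runs)-1] = append(runs[len(runs)-1], e)
	}
	return runs
}

func main() {
	events := []walEvent{
		{"public", "t1", "D"}, {"public", "t1", "D"},
		{"public", "t2", "I"},
		{"public", "t1", "D"},
	}
	fmt.Println(len(buildRuns(events))) // 3
}
```

Note the interleaved `INSERT` on `t2` splits the `t1` DELETEs into two runs, which is exactly the case the comment above suggests relaxing later.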

Collaborator

@kvch kvch left a comment

Solid plan, small comments

3. For DML: look up `schemaInfo` via schema observer (cached), create `walMessage` with raw `wal.Data` + `schemaInfo`
4. Send `walMessage` to batch sender

### 4. Refactor BatchWriter.sendBatch — bulk query building
Collaborator

What happens if the sending fails? Do we fall back to separate DELETE statements?

Member Author

Maybe for simplicity, for now we log it as DATALOSS. I think this is similar to what we do for batch inserts in snapshot mode.


In practice this works well for the target workload: WAL events from bulk operations on the source database (batch purges, accounting reconciliation, ETL loads) naturally produce long runs of the same operation on the same table, which coalesce effectively.
5. Execute via existing `flushQueries` / `execQueries`
6. Respect PostgreSQL's 65,535 parameter limit — split runs at ~60,000 params
Collaborator

@kvch kvch Mar 11, 2026

I guess we can avoid this limit by using `WHERE id = ANY($1::bigint[])`, because the array is just one parameter.

So the code would be:

```go
sql := fmt.Sprintf("DELETE FROM %s WHERE id = ANY($1::%s[])", table, idType)
args := []any{idArray}
```

This would perform better for multiple reasons:

- only one parameter is passed to PostgreSQL, so the parameter-binding overhead is smaller
- in the planning phase, the original suggestion takes longer because the planner has to optimize a long condition with many OR operators; with the array, there is no such optimization

Member Author

Great suggestion! Seems like we can do this when the types are well-known scalar types, and fall back to `IN` for the corner cases.
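Under that assumption (a well-known scalar identity column), the `ANY` variant stays within a single bind parameter. A hedged sketch with illustrative names; actually executing it would go through a driver such as pgx, which binds a Go slice as a Postgres array:

```go
package main

import "fmt"

// buildAnyDelete emits a one-parameter DELETE using = ANY on a typed array,
// sidestepping the 65,535 bind-parameter limit. idType is the Postgres
// scalar type of the identity column, e.g. "bigint".
func buildAnyDelete(table, idCol, idType string, ids []int64) (string, []any) {
	sql := fmt.Sprintf("DELETE FROM %s WHERE %q = ANY($1::%s[])", table, idCol, idType)
	return sql, []any{ids} // the whole slice is a single parameter
}

func main() {
	sql, args := buildAnyDelete("t", "id", "bigint", []int64{10, 11, 12})
	fmt.Println(sql)       // DELETE FROM t WHERE "id" = ANY($1::bigint[])
	fmt.Println(len(args)) // 1
}
```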

@tsg tsg mentioned this pull request Mar 12, 2026