Fix publish replication reliability by Bojan131 · Pull Request #4093 · OriginTrail/dkg-engine

Bojan131 · 2026-03-18T10:59:13Z

What

Fixes the "Not replicated to enough nodes!" errors that happen when publishing large knowledge assets or during parallel publishes.

Changes

- Added a semaphore (max 3 concurrent messages) to avoid flooding all shard nodes at once
- Replication now contacts nodes in batches of minAcks + 2 and exits early once minimum replication is met
- Each node message is wrapped in try/catch, so one failing peer no longer kills the entire operation
- Added a single retry on NACK before giving up on a peer
- Raised the publish message timeout from 15s to 60s for larger payloads
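
The first three changes can be illustrated with a minimal sketch. It is not the code from this PR: `Semaphore`, `replicate`, and `SendFn` are hypothetical names, and the sketch assumes that the early-exit check happens between batches and that a peer resolves `true` on ACK.

```typescript
// Hypothetical sketch; names and shapes are assumptions, not dkg-engine APIs.
type SendFn = (nodeId: string) => Promise<boolean>; // resolves true on ACK

// Minimal counting semaphore: at most `limit` tasks hold a slot at once.
class Semaphore {
  private active = 0;
  private readonly waiters: Array<() => void> = [];
  private readonly limit: number;

  constructor(limit: number) {
    this.limit = limit;
  }

  async run<T>(task: () => Promise<T>): Promise<T> {
    if (this.active >= this.limit) {
      // No free slot: wait until a finishing task hands its slot over.
      await new Promise<void>((resolve) => this.waiters.push(resolve));
    } else {
      this.active += 1;
    }
    try {
      return await task();
    } finally {
      const next = this.waiters.shift();
      if (next) next(); // pass the slot directly to the next waiter
      else this.active -= 1; // nobody waiting: free the slot
    }
  }
}

// Contact nodes in batches of minAcks + 2 and stop once minAcks is reached.
async function replicate(
  nodes: string[],
  minAcks: number,
  send: SendFn,
  maxConcurrent = 3,
): Promise<number> {
  const semaphore = new Semaphore(maxConcurrent);
  const batchSize = minAcks + 2;
  let acks = 0;

  for (let i = 0; i < nodes.length && acks < minAcks; i += batchSize) {
    const batch = nodes.slice(i, i + batchSize);
    const results = await Promise.all(
      batch.map((nodeId) =>
        semaphore.run(async () => {
          try {
            return await send(nodeId);
          } catch {
            return false; // a failing peer is a miss, not a fatal error
          }
        }),
      ),
    );
    acks += results.filter(Boolean).length;
  }
  return acks; // the caller compares this against minAcks
}
```

Because every per-node task catches its own error, `Promise.all` never rejects here, so a single bad peer only lowers the ACK count for its batch.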

Why

Under load (parallel publishes with large assets), the node sent all replication messages simultaneously, and if any single peer failed or was slow, the whole publish failed. This change makes replication more resilient without changing the minimum replication requirements.
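
One way to keep a slow or failing peer from affecting the others is to reduce each peer interaction to a plain boolean. The sketch below is an illustration under stated assumptions, not the PR's implementation: `sendWithRetry`, `withTimeout`, and the `Reply` type are hypothetical, and it assumes that only a NACK triggers the single retry while a timeout or transport error ends the attempt for that peer.

```typescript
// Hypothetical sketch; names and the Reply shape are assumptions.
type Reply = "ACK" | "NACK";

// Reject if `promise` does not settle within `ms` milliseconds.
function withTimeout<T>(promise: Promise<T>, ms: number): Promise<T> {
  let timer: ReturnType<typeof setTimeout> | undefined;
  const timeout = new Promise<never>((_, reject) => {
    timer = setTimeout(() => reject(new Error(`timed out after ${ms} ms`)), ms);
  });
  // Clear the timer either way so it does not keep the process alive.
  return Promise.race([promise, timeout]).finally(() => clearTimeout(timer));
}

// Returns true on ACK; retries once on NACK; never throws.
async function sendWithRetry(
  send: () => Promise<Reply>,
  timeoutMs = 60_000, // the PR raises the publish timeout from 15s to 60s
  retries = 1,
): Promise<boolean> {
  for (let attempt = 0; attempt <= retries; attempt += 1) {
    try {
      const reply = await withTimeout(send(), timeoutMs);
      if (reply === "ACK") return true;
      // NACK: fall through and retry if attempts remain.
    } catch {
      return false; // timeout or transport error: give up on this peer
    }
  }
  return false;
}
```

A caller can then count `true` results against the minimum replication requirement without any per-peer exception handling.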

Made with Cursor

- Add semaphore (3 concurrent) to limit parallel replication messages
- Batch replication to groups of minAcks+2 with early exit when minimum reached
- Wrap individual node messages in try/catch so one failing peer doesn't kill the whole operation
- Add single retry on NACK before giving up on a peer
- Increase publish message timeout from 15s to 60s for large knowledge assets

Made-with: Cursor

Bojan131 requested review from Mihajlo-Pavlovic and branarakic as code owners March 18, 2026 10:59

Bojan131 mentioned this pull request Mar 18, 2026

Publish erros fix #4092

Closed

Bojan131 wants to merge 1 commit into v8/develop from fix/publish-replication
