Skip to content

[relay] Phase 5 — observability: Grafana dashboards, failure alerts, end-to-end lag #18

Description

@Dipraise1

Context

The relay exposes relay_client.status() (dlq_pending, dlq_abandoned, total_ok, last_ok_hash) and logs submissions to SQLite, but nothing is visualized or alerting.

Tasks

  • Grafana dashboard: relay latency, submission success rate, payload volume
  • Alert on 3+ consecutive relay failures
  • Track XERIS acknowledgement vs relay submission timestamp (end-to-end lag)

Notes

Validator runs on the VPS (/opt/engram); a Prometheus exporter or simple pushgateway off the SQLite relay log is probably the lightest path.

From TASKS.md — Relay Phase 5.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions