Skip to content

Create an update-in-place process for opensearch sync #5607

@tdlowden

Description

@tdlowden

[NOTE: For individual stories, use the user story template! Only use this template for larger things that will likely break down into a series of smaller stories over time.]

Feature/what we're after

Clearing opensearch and then re-filling isn't a good long term solution, so we need a way to sync harvester changes to opensearch in an "upsert and delete" type of way.

Anticipated/hypothesized benefits

  • opensearch is never cleared (unless manual sync is necessary).
  • opensearch has most recent harvested data (no daily delay)

Measurements/metrics

  • When a harvest is completed, then the changes (insertions, updates, deletions) are synced to open search and is able to be confirmed via search or API.

References/background

  • Related to recent needs to re-sync and having the counts go to 0 (and search being limited)

Sketch

This could be done on a dataset individual level or in a batch at the end of the harvest job, whichever is easier. Batch would probably be faster, but individual may be easier to code/implement with the different cases of addition, update, and delete.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    Status

    🌈 Catalog UI Project

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions