Skip to content

🙋 Nodes as mirrors for data redundancy #39

@mishaschwartz

Description

@mishaschwartz

Topic category

Select which category your topic relates to:

  • software architecture
  • potential risks
  • federation decisions
  • opportunities for growth
  • other

Topic summary

Should the data hosted on nodes in the network be available from at least two nodes to ensure data availability and redundancy across the network.

This would likely require that:

  1. an exact copy of a data file hosted at node A also be available on one of the other nodes in the network
  2. catalogs should refer to file assets on all nodes where a copy is hosted
  3. access permissions be synchronized between nodes for copies of the same data file
  4. file copies can be easily verified as being identical

Some possible solutions for the above:

  1. we have to decide if all data or just some data should be copied. When adding new data, node administrators can coordinate this.
  2. possible updates to the stac-populator to handle this
  3. could be handled by creating accounts using Magpie's network mode
  4. checksums (possibly stored in the catalog)

To decide:

  • Is this something we want to encourage?
  • If yes, do we want to require copies for all data, most data, some data?

Supporting documentation

Additional information

Metadata

Metadata

Assignees

No one assigned

    Labels

    meeting-topicProposed topic for a future meeting

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions