-
Notifications
You must be signed in to change notification settings - Fork 167
Closed as not planned
Labels
H2.0/Harvest-RunnerHarvest Source Processing for Harvesting 2.0Harvest Source Processing for Harvesting 2.0not-mvpremove-from-queue
Description
User Story
In order to harvest WAF sources effectively and at scale, datagovteam would like to harden the current WAF ETL pipeline.
Acceptance Criteria
[ACs should be clearly demoable/verifiable whenever possible. Try specifying them using BDD.]
- GIVEN [a contextual precondition]
[AND optionally another precondition]
WHEN [a triggering event] happens
THEN [a verifiable outcome]
[AND optionally another verifiable outcome]
Background
[Any helpful contextual notes or links to artifacts/evidence, if needed]
Security Considerations (required)
[Any security concerns that might be implicated in the change. "None" is OK, just be explicit here!]
Sketch
- add record partition logic into harvesting logic repo
- benchmark and report metrics on traversal and download ( how many files vs how long it took ). total processing time.
- get number of WAF harvest sources
- consider implementing download xml inside traversal instead of separate function depending if performance impact is noticeable
Metadata
Metadata
Assignees
Labels
H2.0/Harvest-RunnerHarvest Source Processing for Harvesting 2.0Harvest Source Processing for Harvesting 2.0not-mvpremove-from-queue
Type
Projects
Status
🗄 Closed