Skip to content

Comic issue-level ingestion (~5,000 issues with covers) #17

Description

@adhit-r

Skills: TypeScript · scrapers · MediaWiki API · image hosting · data design
Time: ~30+ hours
Good for: Marvel comic readers · Dark Horse / Legends fans · data engineers
Difficulty: Advanced


Context

The kb has 24 comic SERIES today. Star Wars has thousands of individual
comic issues across Marvel, Dark Horse, IDW. Issue-level data unlocks "Vader
in Marvel #1" → covers, writer, artist, story arc, plot summary.

Goal

Ingest individual comic issues with covers and per-issue metadata. Add an
issue gallery to the comic series Datapad.

Where to start

  • Wookieepedia has comprehensive per-issue articles. The MediaWiki API can
    list category members; covers come from the article's pageimage.
  • New entity type: comic_issue with relations to its parent series
  • Cover images go to public/images/comic-issues/<series>/<issue>.jpg
  • The series's Datapad gets an "Issues" section with a thumbnail grid

Acceptance criteria

  • At least the original Marvel run (1977-86, 107 issues) ingested
  • And the current Marvel canon Star Wars (2015+) ingested
  • Each issue has cover, title, year, writer, artist, summary
  • Performance: thumbnails are small (≤30 KB each); use lazy-loading for
    the grid
  • Cite Wookieepedia URLs as the source of truth

Lore reference

https://starwars.fandom.com/wiki/List_of_comics

Metadata

Metadata

Assignees

No one assigned

    Labels

    advancedDeep technical chops neededarea: dataPipelines, ingestion, schemaenhancementNew feature or requestlegendsStar Wars Legends / EU content

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions