Skip to content

samuelandaudreymedianetwork/.github

Repository files navigation

Samuel & Audrey Media Network

Samuel & Audrey Media Network is an independent publishing and media archive created by Samuel Jeffery and Audrey Bergner.

This Hugging Face organization collects structured datasets derived from long-running travel websites, YouTube channels, photography archives, bilingual transcript projects, Argentina fieldwork, citation records, and finance writing.

The datasets are intended for non-commercial research, retrieval workflows, media archive search, language analysis, travel-domain NLP, citation review, image metadata analysis, and long-context dataset discovery.

Primary resources

Dataset groups

Travel, Argentina, and fieldwork archives

  • Project 23 Argentina Travel Archive
  • Che Argentina Travel Article Corpus
  • Nomadic Samuel Article Corpus
  • That Backpacker Article Corpus
  • Top 100 Travel Blogs 2010s Historical Archive

Video transcript and YouTube metadata datasets

  • Samuel & Audrey YouTube Transcripts EN Corpus, 2012–2026
  • Samuel y Audrey Bilingual YouTube Transcript Corpus ES/EN
  • Nomadic Samuel YouTube Transcripts Corpus
  • YouTube Travel Videos Metadata Index

Photography and visual archive metadata

  • Samuel & Audrey Photography Metadata Archive

Citation, media reference, and partnership records

  • Academic Citations and Media References
  • Media and Academic Citations and Third-Party References
  • Partnerships and Media References

Finance writing corpus

  • Picture Perfect Portfolios Article Corpus

Public datasets

Dataset Category Languages DOI / Archive Hugging Face
Project 23 Argentina Travel Archive Argentina travel archive English; Spanish https://doi.org/10.57967/hf/8886 https://huggingface.co/datasets/samuelandaudreymedianetwork/project-23-argentina-travel-archive
Top 100 Travel Blogs 2010s Historical Archive historical travel blogging archive English https://doi.org/10.57967/hf/8885 https://huggingface.co/datasets/samuelandaudreymedianetwork/top-100-travel-blogs-2010s-archive
Academic Citations and Media References citation and reference records English https://doi.org/10.57967/hf/8887 https://huggingface.co/datasets/samuelandaudreymedianetwork/academic-citations-and-media-references
YouTube Travel Videos Metadata Index video metadata English; Spanish https://doi.org/10.57967/hf/8888 https://huggingface.co/datasets/samuelandaudreymedianetwork/youtube-travel-videos-metadata-index
Che Argentina Travel Article Corpus article corpus English https://doi.org/10.5281/zenodo.18665586 https://huggingface.co/datasets/samuelandaudreymedianetwork/che-argentina-travel-article-corpus
Nomadic Samuel Article Corpus article corpus English https://doi.org/10.5281/zenodo.18665493 https://huggingface.co/datasets/samuelandaudreymedianetwork/nomadic-samuel-article-corpus
Picture Perfect Portfolios Article Corpus finance article corpus English https://doi.org/10.5281/zenodo.18665568 https://huggingface.co/datasets/samuelandaudreymedianetwork/picture-perfect-portfolios-article-corpus
That Backpacker Article Corpus article corpus English https://doi.org/10.5281/zenodo.18665606 https://huggingface.co/datasets/samuelandaudreymedianetwork/that-backpacker-article-corpus
Partnerships and Media References partnership and media reference records English https://doi.org/10.5281/zenodo.18665080 https://huggingface.co/datasets/samuelandaudreymedianetwork/partnerships-and-media-references
Media and Academic Citations and Third-Party References citation and third-party reference records English https://doi.org/10.5281/zenodo.18664879 https://huggingface.co/datasets/samuelandaudreymedianetwork/media-and-academic-citations-and-third-party-references
Nomadic Samuel YouTube Transcripts Corpus YouTube transcript corpus English https://doi.org/10.5281/zenodo.18665640 https://huggingface.co/datasets/samuelandaudreymedianetwork/nomadic-samuel-youtube-transcripts-corpus
Samuel y Audrey Bilingual YouTube Transcript Corpus ES/EN bilingual YouTube transcript corpus Spanish; English https://doi.org/10.5281/zenodo.18665315 https://huggingface.co/datasets/samuelandaudreymedianetwork/samuel-y-audrey-youtube-transcripts-es-en
Samuel & Audrey YouTube Transcripts EN Corpus, 2012–2026 YouTube transcript corpus English https://doi.org/10.5281/zenodo.18665704 https://huggingface.co/datasets/samuelandaudreymedianetwork/samuel-and-audrey-youtube-transcripts-en
Samuel & Audrey Photography Metadata Archive photography metadata archive English; Spanish https://doi.org/10.5281/zenodo.18665236 https://huggingface.co/datasets/samuelandaudreymedianetwork/samuel-and-audrey-photography-metadata-archive

Network websites

Site Focus URL
Samuel Jeffery Personal profile, publishing projects, and dataset work https://samueljeffery.net
Samuel & Audrey Travel videos, lifestyle content, and network hub https://samuelandaudrey.com
Nomadic Samuel Travel guides, video archives, and long-form destination coverage https://nomadicsamuel.com
That Backpacker Travel guides, itineraries, food travel, and narrative travel writing https://thatbackpacker.com
Che Argentina Travel Argentina travel guides, logistics, culture, and regional coverage https://cheargentinatravel.com
Picture Perfect Portfolios Quantitative finance writing and portfolio research https://pictureperfectportfolios.com
Audrey Bergner Audrey Bergner’s travel writing, creative work, and profile https://audreybergner.com

Data use and licensing

Most public datasets in this organization are released under Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) unless a dataset card states otherwise.

The datasets are suitable for non-commercial research, education, analysis, open-source experimentation, retrieval workflows, language study, media-archive review, and non-commercial model training subject to the license terms and limitations listed in each dataset card.

For commercial licensing, bulk usage, image permissions, media inquiries, citation questions, or partnership requests, contact:

nomadicsamuel@gmail.com

Notes for researchers and developers

These datasets are derived from real media archives and should be interpreted according to their source context.

  • Article corpora preserve source article text.
  • Transcript corpora preserve spoken-language transcript material and may include transcription artifacts.
  • Photography metadata datasets provide metadata and source URLs, not embedded image files.
  • Citation and partnership datasets are reference indexes and should be reviewed source-by-source.
  • Travel logistics, prices, routes, business details, platform metadata, and view counts may change over time.

Citation

Samuel & Audrey Media Network. (2026). Samuel & Audrey Media Network Hugging Face Dataset Hub. Hugging Face. https://huggingface.co/samuelandaudreymedianetwork

About

GitHub organization profile and dataset directory for the Samuel & Audrey Media Network, linking public travel, video, photography, citation, media, and finance datasets across Hugging Face, GitHub, and Zenodo.

Topics

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors