scraping Spiegel archive

powerful set of scripts! However, I can't replicate the "Scraping Der Spiegel archive data" part (on Linux)

when running article_crawler.py, no file "dictionary.json" seems to be generated

could you upload that file? 

BTW, I had to manually edit article_crawler.py at line 62/63, because it threw an error when pattern matching the filenames
year=tmp.group(1)
month=tmp.group(2)

then the appropriate (yet empty) (sub)directories (years/issue numbers) get created in /data

Thanks in advance for any hints as to how to use the article_extract.py!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

scraping Spiegel archive #1

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

scraping Spiegel archive #1

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions