-
Notifications
You must be signed in to change notification settings - Fork 1
Open
Description
powerful set of scripts! However, I can't replicate the "Scraping Der Spiegel archive data" part (on Linux)
when running article_crawler.py, no file "dictionary.json" seems to be generated
could you upload that file?
BTW, I had to manually edit article_crawler.py at line 62/63, because it threw an error when pattern matching the filenames
year=tmp.group(1)
month=tmp.group(2)
then the appropriate (yet empty) (sub)directories (years/issue numbers) get created in /data
Thanks in advance for any hints as to how to use the article_extract.py!
Metadata
Metadata
Assignees
Labels
No labels