This repository has the set of files that gather information from the websites bild.de and wetter.de as a webscraping service, and from the weather channel by RESTful API calls.
The scripts that gather the data run on a server as cronjobs. The way they run is described by: crontab_info.txt
The structure for the RESTful API calls is the following:
-
api_info.pyhas the necessary information to access the wunderground API. -
constants.pyhas the global constants used across API scripts. -
city_location.pyis the script that gets the coordinates of specified named cities. -
daily_db.pyis the script that gathers daily data. -
hourly_db.pyis the script that gathers hourly data.
The structure for Wetter.de scraping is:
-
Wetter_de_scraping.pyscrapes hourly data. -
Web_Scraping_wetter_de_full_day.pyscrapes daily data. -
Web_Scraping_wetter_de_day_periods.pyscrapes periods of the day.
For bild.de:
bild_scraping.pydoes both daily and daily period scraping.
The helper scripts for database insertion are:
database.pydb_manager.pydb_info.py