webscraping_2018

This repository contains the scripts that gather data from the websites bild.de and wetter.de by web scraping, and from the Weather Channel through RESTful API calls. The scripts run on a server as cron jobs; their schedule is described in crontab_info.txt.

The scripts for the RESTful API calls are:

api_info.py holds the information needed to access the wunderground API.
constants.py defines the global constants used across the API scripts.
city_location.py looks up the coordinates of the specified cities.
daily_db.py gathers daily data.
hourly_db.py gathers hourly data.

The scripts for wetter.de scraping are:

Wetter_de_scraping.py scrapes hourly data.
Web_Scraping_wetter_de_full_day.py scrapes daily data.
Web_Scraping_wetter_de_day_periods.py scrapes periods of the day.
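
To illustrate what a scraping step of this kind can look like, here is a minimal sketch that extracts hourly values from an HTML snippet using only the Python standard library. It is not taken from the scripts above: the markup in `SAMPLE_HTML`, the class names `time` and `temp`, and the names `HourlyParser` and `parse_hourly` are all assumptions made for the example, and the real wetter.de page structure will differ.

```python
from html.parser import HTMLParser

# Hypothetical markup for illustration only; not the real wetter.de structure.
SAMPLE_HTML = """
<div class="hour"><span class="time">10:00</span><span class="temp">12</span></div>
<div class="hour"><span class="time">11:00</span><span class="temp">14</span></div>
"""


class HourlyParser(HTMLParser):
    """Collects (time, temperature) pairs from <span> elements."""

    def __init__(self):
        super().__init__()
        self._field = None   # name of the span currently being read, if any
        self._current = {}   # fields collected for the row in progress
        self.rows = []       # finished (time, temperature) tuples

    def handle_starttag(self, tag, attrs):
        css_class = dict(attrs).get("class")
        if tag == "span" and css_class in ("time", "temp"):
            self._field = css_class

    def handle_data(self, data):
        if self._field is not None:
            self._current[self._field] = data.strip()

    def handle_endtag(self, tag):
        if tag == "span" and self._field is not None:
            self._field = None
            # A row is complete once both fields have been seen.
            if len(self._current) == 2:
                self.rows.append(
                    (self._current["time"], int(self._current["temp"]))
                )
                self._current = {}


def parse_hourly(html):
    """Return a list of (time, temperature) tuples found in the HTML."""
    parser = HourlyParser()
    parser.feed(html)
    return parser.rows


rows = parse_hourly(SAMPLE_HTML)
```

In a real run the HTML would come from an HTTP request rather than a constant string, and the parsed rows would be passed on to the database helpers.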

For bild.de:

bild_scraping.py scrapes both daily and day-period data.

The helper scripts for database insertion are:

database.py
db_manager.py
db_info.py
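
As a rough idea of what database insertion for scraped rows involves, the sketch below writes a batch of rows with a parameterized query. It is an assumption-laden example, not the code in the helper scripts: it uses SQLite from the standard library because this README does not say which database the project uses, and the table name `hourly`, its columns, and the function `insert_rows` are hypothetical.

```python
import sqlite3


def insert_rows(conn, rows):
    """Insert (website, city, time, temperature) tuples into a table.

    The table name and columns are hypothetical, chosen for this example.
    """
    # Using the connection as a context manager commits on success
    # and rolls back if an exception is raised.
    with conn:
        conn.execute(
            "CREATE TABLE IF NOT EXISTS hourly ("
            "website TEXT, city TEXT, time TEXT, temperature REAL)"
        )
        # Placeholders keep scraped strings out of the SQL text itself.
        conn.executemany("INSERT INTO hourly VALUES (?, ?, ?, ?)", rows)


# An in-memory database keeps the example self-contained.
conn = sqlite3.connect(":memory:")
insert_rows(
    conn,
    [
        ("wetter.de", "Berlin", "10:00", 12.0),
        ("wetter.de", "Berlin", "11:00", 14.0),
    ],
)
count = conn.execute("SELECT COUNT(*) FROM hourly").fetchone()[0]
```

Batching the rows into one transaction means a failed scrape does not leave a partially written batch behind.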

BCCN-Prog/webscraping_2018