Skip to content

graemebalint/etl_pipeline

Repository files navigation

etl-pipeline

A proof of concept extract-transform-load pipeline using publicly available data about Canadian Business Licenses in Toronto, Edmonton and Calgary. The data was downloaded in CSV format from open data portals on municipal websites, and then read into Pandas dataframes to apply transformations. Once the transformations were complete, the data was imported into a PostreSQL database.

  • Programming languages: Python
  • Relevant libraries: Pandas, Numpy
  • Environment: Jupyter Notebook

About

A proof of concept extract-transform-load pipeline using publicly available data about Canadian Business Licenses from Toronto, Edmonton and Calgary.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors