Skip to content

wyp1125/etl-pipeline-cicd

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

etl-pipeline-cicd

This CICD demo contains source code for a AWS severless ETL pipeline including a Lambda function and a Glue ETL job. A Step Function orchestrates the Lambda Function and Glue ETL job. The ETL pipeline is automatically redeployed when any change is pushed to the main branch.

The business logic is to move data from raw (Bronze layer) to rawplus (Silver layer), and then conduct Spark transformation on rawplus and load the transformed data to destination also as a Glue table (Gold layer) for interactive analytics using Athena, Redshift Spectrum or Dremio.

About

This CI/CD repo with Github Actions contains source code and a workflow for an AWS severless ETL pipeline.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages