Skip to content

Tzsapphire/Analytics_DBT

Repository files navigation

Analytics Engineer Project

(Recreated with Snowflake, dbt (Data Build Tool),and Power BI)

This is an end-to-end analytics engineering project that follows the modern ELT model.


Project Architecture

Data flow

In this project, I extracted and loaded the raw data into Snowflake then transformed using dbt models (staging → silver → gold layers)

Tool Architecture

Project Flow

  1. Data Ingestion: Loaded raw data into Snowflake via dbt sources

  2. Modeling with dbt: loaded the staging models with the yaml source file pointing at the raw snowflake wh; created intermediate (silver) models which contained cleaned data and the final gold models used for further reporting. Modeling performed across three layers:

    - Source: Raw ingestion from Airbyte
    - Intermediate: Data cleansing, type casting, addition of surrogate keys, etc
    - Consumption (Marts): Star schema 
    

For ensuring Data Quality using dbt:

  Built-in tests (unique, not null)      
  WHERE clause filters to reduce query cost

Final tables view

Relationship diagram

!ERD diagram!

  1. Visualization with Power BI: Built an interactive dashboard to answer analytical questions

Power BI Dashboard

Home(Product) (Sales)


Learnings

  • How to use dbt for transformation and running tests smartly to optimize cost
  • Visual interactive dashboard creation using Power BI
  • Learning markdowns

Credits

Thanks to my mentors Ahmed O. for insights shared throughout the project.

About

An analytics pipeline

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors