Skip to content

DataCatalog to explore end-to-end lineage with Airbyte, Airflow, dbt, BigQuery

Notifications You must be signed in to change notification settings

iobruno/data-catalog-labs

Repository files navigation

Data Catalog with DataHub

Airbyte Airflow dbt BigQuery

This project aims to provision end-to-end pipeline lineage with Airbyte, Airflow, dbt, BigQuery and DataHub as the Data Catalog/Lineage platform. Also ensuring sibling relationships are not duplicate (e.g: Airbyte destination table for a given source matches the same entity as dbt source table)

Quick Start:

  1. Spin up DataHub
docker compose -f datahub/compose.yaml up -d
  1. Spin up Airflow
docker compose -f airflow/compose.yaml up --build --force-recreate -d
  1. Spin up Airbyte with abctl
brew tap airbytehq/tap
brew install abctl

abctl local install
  1. Fetch Airbyte credentials
abctl local credentials
  1. Build the dbt-bigquery Docker Image
docker build -t dbt-bigquery:latest dbt/ --no-cache
  1. Build the datahub-ingest Docker Image
docker build -t datahub-ingest:latest datahub/ --no-cache
  1. Terraform

Follow the instructions on terraform for guidelines on how to run/apply

Reference Docs

Refer to the specific project folder on how to start each component individually

About

DataCatalog to explore end-to-end lineage with Airbyte, Airflow, dbt, BigQuery

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •