francinaldocn/README.md

Francinaldo Nunes

Senior Data Engineer | Data Platform & Lakehouse Architecture

About

Senior Data Engineer with 15+ years in infrastructure, systems, databases and cloud, with the last 3 years focused on data engineering and data platform work.

I build ingestion pipelines from PostgreSQL, SQL Server and APIs into curated Iceberg tables, using PySpark for large volumes and Python (DuckDB, Pandas, Arrow) for high-performance batch jobs.

Tech stack

Kubernetes · ArgoCD · Helm · Kustomize · Spark Operator · Hive Metastore · MinIO
Iceberg · Trino · Dremio · Spark/PySpark · Airflow
AWS (S3, MWAA, Glue, Athena) · OCI · Python · SQL

Selected work

  • On-prem lakehouse: Designed and built a Kubernetes-based lakehouse with GitOps (ArgoCD/Helm/Kustomize). Added Dremio so analysts can query data on their own, without routing requests through the data team.
  • Optimization: Replaced financial reports that took 8h+ to run on PostgreSQL with PySpark jobs on AWS Glue, orchestrated with MWAA. Results land in Parquet tables queried by Athena, and runtime dropped from 8h to 40min.
  • Legacy migration: Moved ETL workflows from Pentaho and SSIS to the Kubernetes lakehouse platform.

Popular repositories

  1. buscadoe (Public)

    Search for terms in a PDF file on DOEPB.

    Python 2 1

  2. data-lakehouse-k8s (Public)

    Local Data Lakehouse on Kind with Argo CD, Trino, Iceberg, and Airflow. A fully reproducible GitOps stack for dev/test and prototyping.

    Shell 2

  3. covid_brasil (Public)

    Data on COVID-19 in Brazil

    Jupyter Notebook 1

  4. k8s-lab-platform (Public)

    EN: Professional v2.0 automated Kubernetes lab environment (Kind + Gateway API + Rancher). Engineered for security, robustness, and modern infrastructure standards. PT-BR: Plataforma v2.0 para auto…

    Shell 1

  5. francinaldocn (Public)

    Senior Data Engineer specializing in Data Platform & Lakehouse Architecture.

    1