Skip to content

neuefische/de-week-1-Python

Repository files navigation

πŸš€ Week 1 β€” Python for Data Engineering

Welcome! This repo contains everything you need for Week 1 of the Data Engineering program: environment setup, Bash/VS Code/Jupyter intros, Git + GitHub workflows, Python practice (basics β†’ intermediate β†’ advanced), and a Pandas primer.

πŸ’‘ Tip: Keep commits small (e.g., one per exercise). When a notebook’s tests pass, Restart Kernel β†’ Run All to ensure a clean run.


⚑ Quick Start

  1. Clone the repo
git clone <your-fork-or-classroom-url>.git
cd WEEK1-PYTHON-FOR-DATA-ENGINEERING
  1. Create a project virtual environment (recommended: Python 3.11.3)
  • Linux / macOS

    python -m venv .venv
    source .venv/bin/activate
  • Windows (PowerShell)

    python -m venv .venv
    .\.venv\Scripts\Activate.ps1
  1. (Optional) Jupyter & data stack for local runs
pip install notebook jupyterlab pandas numpy matplotlib seaborn
  1. Launch notebooks
jupyter lab
# or
jupyter notebook

πŸ’» In VS Code: Open the folder β†’ Command Palette β†’ β€œPython: Select Interpreter” β†’ pick .venv.


🎯 What You Will Learn This Week

  • πŸ›  Tooling: Bash, VS Code, Jupyter, Google Colab
  • 🌱 Version control: Git + GitHub (clone, branch, commit, PR)
  • 🐍 Python: Core syntax & flow, data structures, functions, error handling, iterators/generators, performance tips
  • πŸ“Š Pandas: Series/DataFrame basics, transforms, visualization

πŸ“‚ Folder Map

  • 01_welcome/

    • welcome.md β€” course kickoff and expectations. πŸŽ‰
  • 02_installation_setup/ [Time Allocation - 2nd half of Day 1]

    • setup_for_linux/ β€” Linux install guides (Bash, Docker, Git, Python/pyenv, VS Code, PostgreSQL/pgAdmin, Jupyter). 🐧
    • setup_for_mac/ β€” macOS install guides (Homebrew, Bash, Docker, Git, Python/pyenv, VS Code, PostgreSQL/pgAdmin, Jupyter). 🍎
    • setup_for_windows/ β€” Windows install guides (Git Bash/WSL notes, Docker Desktop, Git, Python/pyenv-win, VS Code, PostgreSQL/pgAdmin, Jupyter). πŸͺŸ
    • vscode_venv/ β€” how to use virtual environments with VS Code (Windows/macOS). πŸ–₯️ Use these if your machine isn’t set up yet.
  • 03_bash_jupyter_vscode_colab_intro/

    • bash.md β€” Bash intro + practice game (Bandit). πŸ”Ή
    • vscode.md β€” VS Code essentials for this course. ✨
    • jupyter.md β€” Jupyter Notebook/Lab walkthrough. πŸ““
    • colab.md β€” Using Google Colab. ☁️
  • 04_git_github/

    • git_github_intro.md β€” class workflow: fork/clone, feature branches, commits, PRs, resolving simple conflicts. 🧩
  • 05_python_practice/

    • python_basics/ β€” notebooks + exercise folder.

      • Topics:
      • Numeric variable types, Strings, If/Elif/Else, Loops 🐍
      • Lists, Sets, Mutability, Dictionaries, Comprehensions
      • Functions (intro/definitions/calling/challenge)
      • βœ… Each student notebook has TODOs and assert tests.
    • python_intermediate/ β€” notebooks + exercise folder: ⚑

      • Error handling, Iterators & Generators, Lambda/Map/Filter/Reduce, Performance.
    • python_advanced/ β€” notebooks + exercise folder: πŸš€

      • OOP introduction, Concurrency & Parallelism.
  • 06_pandas_intro/

    • 01_pandas.ipynb β†’ foundations (Series/DataFrame, indexing, I/O). πŸ“Š
    • 02_pandas_practice_1.ipynb, 04_pandas_practice_2.ipynb, 05_pandas_practice_3.ipynb β†’ progressively harder practice.
    • 03_pandas_visualization.ipynb β†’ quick plotting. πŸ“ˆ
    • data/ β€” sample CSVs/parquet used by the notebooks. Don’t move/rename. πŸ—‚οΈ

✨ Ready to dive in? Your Python + Pandas journey starts here! πŸπŸ’»πŸ“Š


About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •