You'd think that after as many stats classes as I took, this would be second nature. It's not 🙃 An exercise to compare EDA and modeling workflows in R and Python.

Meghansaha/ml-refresher

ML Refresher: Comparing R and Python Workflows

A side-by-side comparison of machine learning workflows using R's tidymodels and Python's scikit-learn ecosystems.

Overview

You'd think that after as many stats classes as I took, this would be second nature. It's not 🙃

This project serves two goals:

  1. Refresh my modeling skills using tidymodels (which actually makes sense to my brain)
  2. Practice Python by replicating a workflow I understand in R

Dataset

This project uses the Music & Mental Health Survey Results dataset from Kaggle to predict anxiety scores from music listening habits.

What's Inside

  • VERY BASIC exploratory data analysis (EDA) comparing anxiety scores across demographic groups
  • Simple ML workflow in both R (tidymodels) and Python (scikit-learn)
  • Side-by-side syntax comparison for common modeling tasks
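
For a rough idea of what the scikit-learn side of a workflow like this can look like, here is a minimal sketch. It is not code from this repo: the data is synthetic, the two features (`hours_per_day`, `age`) and the linear model are assumptions chosen for illustration, and the comments only point at the approximate tidymodels counterpart of each step.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_absolute_error
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in data. This is NOT the Kaggle survey, and the
# feature names below are hypothetical.
rng = np.random.default_rng(42)
n_rows = 200
hours_per_day = rng.uniform(0, 10, n_rows)
age = rng.integers(15, 70, n_rows)
X = np.column_stack([hours_per_day, age])

# Made-up anxiety score, clipped to a 0-10 range.
noise = rng.normal(0, 1, n_rows)
y = np.clip(3 + 0.4 * hours_per_day - 0.02 * age + noise, 0, 10)

# Train/test split (roughly rsample::initial_split() in tidymodels).
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=42
)

# Preprocessing + model bundled together (roughly a recipe with
# step_normalize() plus linear_reg(), combined in a workflow()).
model = make_pipeline(StandardScaler(), LinearRegression())

# Fit on the training rows, then predict the held-out rows.
model.fit(X_train, y_train)
predictions = model.predict(X_test)

# Evaluate (roughly yardstick::mae()).
mae = mean_absolute_error(y_test, predictions)
print(f"MAE on held-out rows: {mae:.2f}")
```

The pipeline object plays a similar role to a tidymodels workflow: the scaler is fit on the training rows only and then reapplied automatically at prediction time.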

Tech Stack

R: tidymodels, tidyverse, ggplot2
Python: scikit-learn, polars, plotnine

Status

🚧 Work in progress - because learning takes time and toddlers don't sleep
