A side-by-side comparison of machine learning workflows using R's tidymodels and Python's scikit-learn ecosystems.
You would think for as many stats classes I took, this would be 2nd nature. It's not 🙃
This project serves two goals:
- Refresh my modeling skills using tidymodels (which actually makes sense to my brain)
- Practice Python by replicating a workflow I understand in R
Using the Music & Mental Health Survey Results from Kaggle to predict anxiety scores from music listening habits.
- VERY BASIC Exploratory data analysis comparing anxiety across demographics
- Simple ML workflow in both R (tidymodels) and Python (scikit-learn)
- Side-by-side syntax comparison for common modeling tasks
R: tidymodels, tidyverse, ggplot2
Python: scikit-learn, polars, plotnine
🚧 Work in progress - because learning takes time and toddlers don't sleep