Skip to content

Latest commit

 

History

History
38 lines (19 loc) · 2.32 KB

File metadata and controls

38 lines (19 loc) · 2.32 KB

Binder

Machine Learning Workshop

This workshop was originally created and run by ozzy18 and lisafeets for Women Who Code.

This repository contains lecture slides and python code examples to help data science newbies learn the basics of creating and evaluating machine learning models.

To use the content found here as a full day workshop, we suggest using the following schedule

full day workshop schedule

Getting Started

Lecture slides (found at mlworkshop_slides.pdf), go over the fundamentals of machine learning, from definitions to building and evaluating models.

To run through the workshop labs, visit this mybinder.org link.

The labs have been inspired/adapted/expanded from the "Predicting Breast Cancer - Logistic Regression" Kaggle post. Data is sourced from Breast Cancer Wisconsin (Diagnostic) Data Set.

Labs are meant to be run in order. Executing commands in order within each lab jupyter notebook will result in locally saved data sets that can be used in the proceeding lab. Datasets for each lab are also made available in the /data_sets folder. Answers to exercises found in the labs are available in the workshop_cheat_sheet.pdf.

Lab 1. Loading and cleaning breast cancer data (jupyter notebook).

Lab 2. Worksheet to practice choosing machine learning models for business problems (pdf)

Lab 3. Simple data processing and feature selection (jupyter notebook).

Lab 4. Model application, evaluation and tuning (jupyter notebook).

Resources for Beginner-Friendly datasets:

  1. Best Machine Learning Datasets for Beginners
  2. Fun, beginner-friendly datasets
  3. Another greate resource: Datasets and Tutorial Kernels for Beginners