-
Notifications
You must be signed in to change notification settings - Fork 20
Expand file tree
/
Copy pathds-next-steps.Rmd
More file actions
89 lines (75 loc) · 3.36 KB
/
ds-next-steps.Rmd
File metadata and controls
89 lines (75 loc) · 3.36 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
---
title: "Data Science Next Steps"
output: html_document
---
### A. Skills Development
* Linear and Matrix Algebra
* Diversify your Programming Languages (Python, C++, Java, etc.)
* Kaggle competitions
* [Computefest 2020](https://computefest.seas.harvard.edu/)
* [The Harvard Big Data Club](https://harvardbigdata.com/people)
* [Data Science Meetups in Boston](https://www.meetup.com/topics/data-science/us/ma/boston/?_cookie-check=nhT68rU50hwPaQWQ)
### B. Spring Data Science Courses
#### BST 263 - Applied Machine Learning (Full Spring)
* **Language: R**
* Material: [An Introduction to Statistical Learning](http://faculty.marshall.usc.edu/gareth-james/ISL/)
* Additional Reading: [The Elements of Statistical Learning](https://web.stanford.edu/~hastie/Papers/ESLII.pdf)
* Probability Basics
* Assessing Model Accuracy
* Linear and Logistic Regression
* Classification
* Cross-validation and bootstrap
* Subset selection, Penalty-based methods
* Ridge regression and LASSO
* Dimension Reduction
* Polynomial Regression
* Step functions and Basis functions
* Generalized Additive Models (GAMs)
* Classification and Regression Trees (CART)
* Bagging and Random Forests
* Ensembles
* Support Vector Machines
* Unsupervised Learning
* Bayesian Methods
#### BST 261 - Data Science 2 (Spring 2)
* **Language: Python**
* Material: [Deep Learning](http://www.deeplearningbook.org/)
* Brief review of Python 3, Linear Algebra and Probability
* Review of Machine Learning
* Feedforward Networks (MLPs)
* Convolutional Networks (CNNs)
* Recurrent and Recursive Networks (RNNs)
* Hyperparameter tuning
* Model Selection
#### BST 234 - Introduction to Data Structures and Algorithms (Full Spring)
* **Language: Python**
* Overview of numerical analysis material
* Concepts of Algorithms, Complexity and Sorting Algorithms
* Data Structures and Heapsort
* Parallel Programming
* Greedy Algorithms and Dynamic Programming
* Numerical Stability
* (Pseudo) Random Number Generation
* Efficient Algorithms for Linear Algebra
* Least-Squares Problem, Eigenvalue Decompositions
* Algorithms for Numerical Integration, MC-Integration, Importance Sampling
* Graphs and Network Algorithms
### C. Other Courses
* BST 262: Computing for Big Data (Fall)
* BST 267: Introduction to Social and Biological Networks (Fall)
* APMTH 120: Applied Linear Algebra and Big Data
* CS 124: Data Structure and Algorithms
* CS 134: Networks
* CS 171: Visualization
* CS 181: Machine Learning
* CS 287: Machine Learning for Natural Language
### D. Data Science Applications
* [Diagnosing Pneumonia](https://news.stanford.edu/2017/11/15/algorithm-outperforms-radiologists-diagnosing-pneumonia/)
* [Tumor Classification](https://www.sciencebuddies.org/blog/artificial-intelligence-and-cancer-diagnosis-meet-the-2012-google-science-fair-winner)
* [Precision Medicine](https://www.forbes.com/sites/oreillymedia/2012/08/20/data-science-and-the-health-care-revolution/#33d787f129f7)
* [Deep Learning for Diabetic Retinopathy](https://www.sciencedirect.com/science/article/pii/S1877050916311929)
* [Skin Cancer Classification with Deep Learning](https://cs.stanford.edu/people/esteva/nature/)
* [Digital Phenotyping](https://www.hsph.harvard.edu/onnela-lab/)
### E. Become a TA
We are in desparate need for TAs for our new data science courses! Keep this in mind if you're looking for some extra money next year 😁
<center><img src="./teaching_computers.png"></center>