forked from k-glen/POP77001_Computer_Programming_2021
-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy pathindex.Rmd
More file actions
112 lines (79 loc) · 5.57 KB
/
index.Rmd
File metadata and controls
112 lines (79 loc) · 5.57 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
---
title: "POP77001 Computer Programming for Social Scientists"
author: "Tom Paskhalis, Department of Political Science, Trinity College Dublin"
date: "2021"
site: bookdown::bookdown_site
documentclass: book
bibliography: "bibliography.bib"
---
# About This Module {-}
This module provides foundational knowledge of computer programming concepts and software engineering practices. It introduces students to major data science programming languages and workflows, with a focus on social science data and research questions. Students will be introduced to Python and R, two principal data science programming languages. This course covers basic and intermediate programming concepts, such as object types, functions, control flow, testing and debugging. Particular emphasis will be made on data handling and analytical tasks with a focus on problems in social sciences. Homeworks will include hands-on coding exercises. In addition, students will apply their programming knowledge on a research project at the end of the module.
## Instructors {-}
- [Tom Paskhalis](mailto:tom.paskhalis@tcd.ie), *Office Hours*: Thursday 11:00-13:00 [online](https://outlook.office365.com/owa/calendar/TomPaskhalis@TCDUD.onmicrosoft.com/bookings/)
- [Martyn Egan](mailto:eganm9@tcd.ie)
## Module Meetings {-}
- 11 two-hour lectures
- Monday 11:00 in [East End Development 4/5](https://www.tcd.ie/Maps/map.php?b=233) LTEE2
- 11 one-hour tutorials
- Group 1: Wednesday 14:00 in [1 College Green](https://www.tcd.ie/Maps/map.php?b=234) 2.04
- Group 2: Thursday 10:00 in [East End Development 4/5](https://www.tcd.ie/Maps/map.php?b=233) LTEE2
- No lecture/tutorial in Week 7
| Week | Language | Topic |
|-------:|:-----------|:---------------------------------|
| 1 | - | What is computation? |
| 2 | Python | Python Basics |
| 3 | Python | Control Flow in Python |
| 4 | Python | Functions in Python |
| 5 | Python | Debugging and Testing in Python |
| 6 | Python | Data Wrangling in Python |
| 7 | - | - |
| 8 | R | Fundamentals of R Programming I |
| 9 | R | Fundamentals of R Programming II |
| 10 | R | Data Wrangling in R |
| 11 | R, Python | Complexity and Performance |
| 12 | R, Python | Web scraping |
## Prerequisites {-}
This is an introductory class and no prior experience with programming is required.
## Hardware and Software {-}
- Laptop with Windows/Mac/Linux OS (no Chrome books)
- Software:
- [**Python**](https://www.python.org/) (version 3+) - versatile programming language
- [**R**](https://cran.r-project.org/) (version 4+) - statistical programming language
- [**Jupyter**](https://jupyter.org/) - web-based interactive computational environment
- [**RStudio**](https://www.rstudio.com/) - integrated development environment
- [**Git**](https://git-scm.com/) - version control system
- [**GitHub**](https://github.com/) - git-based online platform for code hosting
## Materials {-}
The following texts provide a good introduction to Python and R programming with a focus on data analysis applications:
- Guttag, John. 2021 *Introduction to Computation and Programming Using Python: With Application to Computational Modeling and Understanding Data*. 3rd ed. Cambridge, MA: The MIT Press
- Matloff, Norman. 2011. *The Art of R Programming: A Tour of Statistical Software Design*. San Francisco, CA: No Starch Press.
- McKinney, Wes. 2017. *Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython*. 2nd ed. Sebastopol, CA: O'Reilly Media
- Wickham, Hadley. 2019. [*Advanced R*](http://adv-r.had.co.nz/). 2nd ed. Boca Raton, FL: Chapman and Hall/CRC.
- Wickham, Hadley, and Garrett Grolemund. 2017. [*R for Data Science: Import, Tidy, Transform, Visualize, and Model Data*](http://r4ds.had.co.nz/). Sebastopol, CA: O'Reilly Media.
Additional online resources:
- [Git Book](https://git-scm.com/book/en/v2)
- [Python For You and Me](https://pymbook.readthedocs.io/en/latest/)
- [Python Wikibook](https://en.wikibooks.org/wiki/Python_Programming)
- [Python 3 Documentation](https://docs.python.org/3/) (intermediate and advanced)
- [R Documentation](https://rdrr.io/)
- [R Inferno](https://www.burns-stat.com/pages/Tutor/R_inferno.pdf)
## Assessment
- 5 problem sets (50%)
- Bi-weekly programming assigments
- Due at 11:00 on Monday of weeks 3,5,7,10 and 12 on Blackboard
- Research project (50%)
- Final Python/R project demonstrating familiarity with programming concepts and ability to communicate results
- Due at 11:00 on Monday, 20 December 2021
## Assessment criteria {-}
1. ✔️ Code exists
2. ⌚ Code runs and does what it has to do
3. 📜 Code is legible (meaningful naming, comments)
4. ⚙️ Code is modular (no redundacies, use of abstractions)
5. 🏎️ Code is optimized (no needless loops, runs fast)
## Plagiarism {-}
- Plagiarising computer code is as serious as plagiarising text (see [Google LLC v. Oracle America, Inc.](https://en.wikipedia.org/wiki/Google_LLC_v._Oracle_America%2C_Inc.))
- All submitted programming assignments and final project should be done individually
- You may discuss general approaches to solutions with your peers
- But do not share or view each others code
- You can use online resources but give credit in the comments
Check the Trinity's [guide on the levels and consequences of plagiarism](https://libguides.tcd.ie/plagiarism/levels-and-consequences)