Intermediate statistics - Rutgers School of Criminal Justice (27:202:543)
Intermediate Statistics | 27:202:543 |
Tuesday, 1:00-3:40 | Room: CLJ 572 |
frank.edwards@rutgers.edu | |
Office hours Thursday, 1:00 - 3:00 | Room: CLJ 579B |
This is the course syllabus for Intermediate Statistics, Spring 2024. Continuous outcomes that meet the assumptions of ordinary least squares regression are relatively rare in the social sciences. This course focuses our attention on how to estimate regression models for discrete outcomes including binary, categorical, and count variables. These flexible tools allow us to more accurately model a wide range of outcomes.
All course communication will occur over email and canvas. Check them routinely.
Come prepared. This is a relatively small and advanced course. I expect everyone to participate actively in course discussions.
Be respectful and professional. Be mindful of the space you take up in the classroom.
Collaborate with your colleagues. I encourage you all to work together to complete assignments. However, I do expect you each to submit your own homework writeups.
A prior graduate-level course in statistics is required. This course assumes students are comfortable with multivariate linear regression, basic probability, and statistical computing.
These math camp materials from UChicago neatly cover the math you need for graduate-level statistics courses.
Jenny Bryan’s STAT 545 course at UBC provides a very comprehensive overview of programming in R and efficient data science workflows.
Rohan Alexander’s Telling Stories with Data provides a great introduction to practical data analysis with R.
All instruction will be conducted in the R statistical programming language. R is free and open-source, and can be downloaded here.
We will be using the RStudio integrated development environment. RStudio provides a powerful text editor and a range of very useful utilities.
In addition to writing code, it is a great tool for writing reports, papers, and slides using RMarkdown. This syllabus, most of my course materials, and most of my academic papers are based on Markdown. All course assignments should be completed in RMarkdown.
These recommended books are very useful, and some examples are pulled from them:
Wickham, R for Data Science
Healy, Data Visualization: A Practical Introduction
McElreath, Statistical Rethinking: A Bayesian Course with Examples in R and Stan
Course grading is based on a combination of homework assignments (50 percent) and a final project (50 percent)
All assignments will be posted and submitted on Canvas
Problem sets: I will assign weekly homework. These assignments should be completed in RMarkdown and submitted via canvas before class begins the following week.
Research project: You will write or revise a quantitative paper during the semester.
1/16 | Fundamentals: probability, regression | |
1/23 | Inference and simulation | |
1/30 | Linear regression review | |
2/6 | Binary variables and the Bernoulli distribution | |
2/13 | Linear regression (1) | |
2/20 | Logistic regression (2) | |
2/27 | Models for count data (1) | |
3/5 | Models for count data (2) | |
3/12 | Spring Break | |
3/19 | Models for categorical outcomes | |
3/26 | Advanced topics: missing data (1) | |
4/2 | Advanced topics: missing data (2) | |
4/9 | Advanced topics: multilevel models (1) | |
4/16 | Advanced topics: multilevel models (2) | |
4/23 | Advanced topics: Bayesian regression |