About the Trainings

Each class session has both interactive Modules and Walkthroughs that you will need to work through after doing the readings and watching a lecture (if applicable). These lessons are a central part of the class—they will teach you how to use R and other packages eventually leading to the tidyverse family.

Interactive training sections are provided as a courtesy by Data Camp1.

Advice

Carve out some time everyday to go through these. If you try to complete everything in one sitting, it will probably be overwhelming! However if you have familiarity with some modules, please feel free to work ahead.

Grading

The ultimate point of Data Camp is to get you familiarized with an environment that you likely have never seen or been exposed to. While you should absolutely go through each module, there is certainly no expectation that you will get everything right. In fact, the points that you incur don’t mean anything as far as how you are assessed so please use hints as needed! As with any things data science, you’ll learn by doing. If you are a polar personality type when it comes to work (i.e. primarily a perfectionist or mostly careless), then the modules will likely prove to be a challenge. It is highly unlikely that you will be able to comprehend everything by going beyond your limit or that it will just come to you so please work hard but also take breaks, swear2, and ask peers or me for help. Your score is predicated on putting in a solid effort, rather than getting it perfect because everything is probabilistic and nothing is for certain.

Data Camp Schedule

A tentative schedule is given below. The Course and Chapter names represent Data Camp titles3:

ExplorationLinkDue byRequiredCourse or Project NameChapters covered
1Week 15/22/22Introduction to RIntro to basics, Vectors, Matrices, Factors, Data Frames, Lists
5/22/22Working with Data in the TidyverseExplore your data, Tame your data, Tidy your data, Transform your data
2Week 25/29/22Introduction to the TidyverseData wrangling, Data visualization, Grouping and summarizing, Types of visualizations
5/29/22Foundations of InferenceIntroduction to ideas of inference, Completing a randomization test: gender discrimination, Hypothesis testing errors: opportunity cost, Confidence intervals
3Week 36/5/22Data Manipulation with dplyrTransforming Data with dplyr, Aggregating Data, Selecting and Transforming Data, Case Study: The babynames Dataset
6/5/22Introduction to Statistics in RSummary Statistics, Random Numbers and Probability, More Distributions and the Central Limit Theorem, Correlation and Experimental Design
4Week 46/12/22Introduction to Regression in RSimple Linear Regression, Predictions and model objects, Assessing model fit, Simple logistic regression
5Week 56/19/22Survey and Measurement Development in RPreparing to analyze survey data, Exploratory factor analysis & survey development, Confirmatory factor analysis & construct validation,
6Week 66/26/22Analyzing Survey Data in RIntroduction to survey data, Exploring categorical data, Exploring quantitative data, Modeling quantitative data
EC16/26/22Introduction to Data Visualization with ggplot2Introduction, Aesthetics, Geometries, Themes

Need Help?

While I am happy to meet face-to-face, I am not consistently in my office at the moment. It is likely easier to simply schedule Zoom session using the calendar or by notifying me on Slack by adding @Dr. Abhik Roy to your message.


  1. Please note that if you have (1) used Data Camp before and (2) are logged in with the same username, then any module that was successfully completed will not have to be done again. ↩︎

  2. and curse my name if you have to ↩︎

  3. Please note this is subject to change with notice. ↩︎