Essential R Skills for Intermediate Users - Online Course
A 3-day Livestream Seminar Taught by
Andrew MilesWednesday, May 6 —
Friday, May 8, 2026
10:00am-12:30pm (convert to your local time)
1:30pm-3:30pm
Go beyond basics to write cleaner, more effective code and enhance your research skills!
R has become the pre-eminent software package for conducting statistical analyses in a wide range of fields. However, harnessing its full potential requires moving beyond basic data manipulation and analysis and learning to use R for more of the tasks required for research in the modern world.
This course intentionally avoids addressing how to perform particular types of analyses in R—there are already many great courses that do this (such as the ones listed here). Instead, it selectively focuses on techniques and skills that will help you write better R code, simplify your research, and improve your analyses regardless of the particular data and models you use. The goal is to equip you with a suite of tools that you will be able to apply frequently across a range of projects.
Starting May 6, this seminar will be presented as a 3-day synchronous, livestream workshop via Zoom. Each day will feature two lecture sessions with hands-on exercises, separated by a 1-hour break. Live attendance is recommended for the best experience. But if you can’t join in real time, recordings will be available within 24 hours and can be accessed for four weeks after the seminar.
Closed captioning is available for all live and recorded sessions. Captions can be translated to a variety of languages including Spanish, Korean, and Italian. For more information, click here.
ECTS Equivalent Points: 1
More details about the course content
Course content can be roughly grouped into three broad skillsets. The first are “R skills”—skills that improve your ability to write effective R code. We will discuss how R finds files and how we can make this process less error prone, as well as methods for troubleshooting and debugging code. We will also learn how to write our own R functions and why doing so is usually a good idea.
The second type of skills are “research skills”—ways to use R to perform common research-related tasks. These include learning how to build flexible, publication-ready tables directly in R (goodbye spreadsheets!), learning the basics of data generation and simulation, and exploring how to bootstrap results from a variety of common analyses.
The third skillset consists of strategies for using large language models (LLMs). These AI tools are powerful additions to the modern R programmer’s toolkit because they can enhance coding workflows across a variety of tasks. Accordingly, LLM content will be spread throughout the course. We will discuss both when to use them and how to use them effectively for tasks such as debugging code, enhancing tables, and supporting simulation work.
This course is unapologetically hands-on. You are encouraged to write code along with the instructor, and to participate in the exercises interspersed through the seminar. By the end of the course, you can expect to log over a dozen hours of guided practice coding in R.
A primary aim of this course is to “upskill” your ability to work independently and flexibly in R. Consequently, particular emphasis will be placed on processes for tackling problems rather than ready-made solutions. We also will focus on the “nuts and bolts” of the various methods we discuss so that you will leave the course ready to tailor out-of-the-box functions to the specifics of your work.
Course content can be roughly grouped into three broad skillsets. The first are “R skills”—skills that improve your ability to write effective R code. We will discuss how R finds files and how we can make this process less error prone, as well as methods for troubleshooting and debugging code. We will also learn how to write our own R functions and why doing so is usually a good idea.
The second type of skills are “research skills”—ways to use R to perform common research-related tasks. These include learning how to build flexible, publication-ready tables directly in R (goodbye spreadsheets!), learning the basics of data generation and simulation, and exploring how to bootstrap results from a variety of common analyses.
The third skillset consists of strategies for using large language models (LLMs). These AI tools are powerful additions to the modern R programmer’s toolkit because they can enhance coding workflows across a variety of tasks. Accordingly, LLM content will be spread throughout the course. We will discuss both when to use them and how to use them effectively for tasks such as debugging code, enhancing tables, and supporting simulation work.
This course is unapologetically hands-on. You are encouraged to write code along with the instructor, and to participate in the exercises interspersed through the seminar. By the end of the course, you can expect to log over a dozen hours of guided practice coding in R.
A primary aim of this course is to “upskill” your ability to work independently and flexibly in R. Consequently, particular emphasis will be placed on processes for tackling problems rather than ready-made solutions. We also will focus on the “nuts and bolts” of the various methods we discuss so that you will leave the course ready to tailor out-of-the-box functions to the specifics of your work.
Computing
To participate in the hands-on exercises, you are strongly encouraged to use a computer with the most recent version of R installed. You are also encouraged to download and install RStudio, a front-end for R that makes it easier to work with. This software is free and available for Windows, Mac, and Linux platforms.
Participants are encouraged to have access to a prompt-based LLM service, such as Gemini, ChatGPT, or Claude. The course LLM content will be presented using the paid version of Gemini.
If you’d like to take this course but are concerned that you don’t know enough R, there are excellent online resources for learning the basics. Here are our recommendations.
To participate in the hands-on exercises, you are strongly encouraged to use a computer with the most recent version of R installed. You are also encouraged to download and install RStudio, a front-end for R that makes it easier to work with. This software is free and available for Windows, Mac, and Linux platforms.
Participants are encouraged to have access to a prompt-based LLM service, such as Gemini, ChatGPT, or Claude. The course LLM content will be presented using the paid version of Gemini.
If you’d like to take this course but are concerned that you don’t know enough R, there are excellent online resources for learning the basics. Here are our recommendations.
Who should register?
This course is for anyone who is already comfortable performing basic data manipulation and analysis (e.g., regression) in R. You should also be familiar with basic data handling in both base R and the tidyverse. If $, [], [[]], and %>% are all familiar to you, then you are probably good to go.
Those needing a more basic treatment are encouraged to take R for Stata Users, R for SAS Users, R for SPSS Users, or Introduction to R for Data Analysis, any of which would be good preparation for this course.
This course is for anyone who is already comfortable performing basic data manipulation and analysis (e.g., regression) in R. You should also be familiar with basic data handling in both base R and the tidyverse. If $, [], [[]], and %>% are all familiar to you, then you are probably good to go.
Those needing a more basic treatment are encouraged to take R for Stata Users, R for SAS Users, R for SPSS Users, or Introduction to R for Data Analysis, any of which would be good preparation for this course.
Seminar outline
Finding files in R
-
- Absolute and relative file paths
- RStudio projects
- The here package
Basic principles for using LLMs in R coding
Making tables
-
- Standard data and results tables
- Making custom tables
- LLMs in the table-making workflow
Programming
-
- Control structures
- Writing functions and functional coding
- Coding more complex functions
- LLM supports for programming and function-writing
Debugging code
-
- Two simple debugging strategies
- Debugging code in closed environments (e.g., functions)
- Using LLMs to debug code
Improving efficiency
-
- Testing code efficiency (speed, memory use)
- Parallel computing
Data generation
-
- Generating univariate and multivariate data
- Generating data to produce specific regression coefficients
- Using LLMs in data generation
Simulation
-
- Basic strategy
- Example 1 (core ideas): simulating sampling distributions
- Example 2 (adding complexity): power analysis
- Using LLMs to improve simulations
Bootstrapping
-
- Bootstrap inference for common regression models
- Flexible bootstrapping
Finding files in R
-
- Absolute and relative file paths
- RStudio projects
- The here package
Basic principles for using LLMs in R coding
Making tables
-
- Standard data and results tables
- Making custom tables
- LLMs in the table-making workflow
Programming
-
- Control structures
- Writing functions and functional coding
- Coding more complex functions
- LLM supports for programming and function-writing
Debugging code
-
- Two simple debugging strategies
- Debugging code in closed environments (e.g., functions)
- Using LLMs to debug code
Improving efficiency
-
- Testing code efficiency (speed, memory use)
- Parallel computing
Data generation
-
- Generating univariate and multivariate data
- Generating data to produce specific regression coefficients
- Using LLMs in data generation
Simulation
-
- Basic strategy
- Example 1 (core ideas): simulating sampling distributions
- Example 2 (adding complexity): power analysis
- Using LLMs to improve simulations
Bootstrapping
-
- Bootstrap inference for common regression models
- Flexible bootstrapping
Payment information
The fee of $995 USD includes all course materials.
PayPal and all major credit cards are accepted.
Our Tax ID number is 26-4576270.
The fee of $995 USD includes all course materials.
PayPal and all major credit cards are accepted.
Our Tax ID number is 26-4576270.