Multilevel and Mixed Models Using R - Online Course
A 3-Day Livestream Seminar Taught by
Stephen VaiseyWednesday, April 23 –
Friday, April 25, 2025
10:00am-12:30pm (convert to your local time)
1:30pm-3:30pm
Multilevel models are a class of regression models for data that have a hierarchical (or nested) structure. Common examples of such data structures are students nested within schools or classrooms, patients nested within hospitals, or survey respondents nested within countries.
Using regression techniques that ignore this hierarchical structure (such as ordinary least squares) can lead to incorrect results because such methods assume that all observations are independent. Perhaps more important, using inappropriate techniques (like pooling or aggregating) prevents researchers from asking substantively interesting questions about how processes work at different levels.
Starting April 23, we are offering this seminar as a 3-day synchronous*, livestream workshop held via the free video-conferencing software Zoom. Each day will consist of two lecture sessions which include hands-on exercises, separated by a 1-hour break. You are encouraged to join the lecture live, but will have the opportunity to view the recorded session later in the day if you are unable to attend at the scheduled time.
*We understand that finding time to participate in livestream courses can be difficult. If you prefer, you may take all or part of the course asynchronously. The video recordings will be made available within 24 hours of each session and will be accessible for four weeks after the seminar, meaning that you will get all of the class content and discussions even if you cannot participate synchronously.
Closed captioning is available for all live and recorded sessions. Captions can be translated to a variety of languages including Spanish, Korean, and Italian. For more information, click here.
More details about the course content
This seminar provides an intensive introduction to multilevel models. After a brief conceptual introduction (including a discussion of the difference between random and fixed effects), we will begin with simple variance components models that can tell us how much of the variation in a measure can be attributed to different levels of observation. We will then move on to mixed models (random effects models with fixed covariates) that allow us to ask how factors at different levels can affect an outcome.
Next, we will investigate how using random coefficients and cross-level interactions can help us discover hidden structure in our data and help us investigate how individual-level processes work differently in different contexts. We will also briefly consider how these techniques can be applied to cases where we have repeated observations of individuals or other entities over time.
Although the course will focus primarily on the continuous outcome case, we will also briefly cover how these models can easily be extended for use with categorical and limited dependent variables.
The seminar will focus on hands-on understanding and draw from examples across the social and behavioral sciences. At the conclusion of the course, you will:
-
- Know the technical and substantive difference between fixed and random effects.
- Understand random intercepts and random coefficients and when to use each one.
- Know how to combine the strengths of random-effects and fixed effects approaches into a single model.
- Know how to estimate these models and interpret the results.
Although these techniques apply to both nested and longitudinal data, in the interest of time we will focus exclusively on the nested data case. For a course focused on longitudinal data analysis, check out Longitudinal Data Analysis Using R.
This seminar provides an intensive introduction to multilevel models. After a brief conceptual introduction (including a discussion of the difference between random and fixed effects), we will begin with simple variance components models that can tell us how much of the variation in a measure can be attributed to different levels of observation. We will then move on to mixed models (random effects models with fixed covariates) that allow us to ask how factors at different levels can affect an outcome.
Next, we will investigate how using random coefficients and cross-level interactions can help us discover hidden structure in our data and help us investigate how individual-level processes work differently in different contexts. We will also briefly consider how these techniques can be applied to cases where we have repeated observations of individuals or other entities over time.
Although the course will focus primarily on the continuous outcome case, we will also briefly cover how these models can easily be extended for use with categorical and limited dependent variables.
The seminar will focus on hands-on understanding and draw from examples across the social and behavioral sciences. At the conclusion of the course, you will:
-
- Know the technical and substantive difference between fixed and random effects.
- Understand random intercepts and random coefficients and when to use each one.
- Know how to combine the strengths of random-effects and fixed effects approaches into a single model.
- Know how to estimate these models and interpret the results.
Although these techniques apply to both nested and longitudinal data, in the interest of time we will focus exclusively on the nested data case. For a course focused on longitudinal data analysis, check out Longitudinal Data Analysis Using R.
Computing
The vast majority of what you will learn in this course can be applied in any software package. However, this seminar will mostly use R for empirical examples and exercises. To replicate the instructor’s workflow in the course, you should have R and RStudio already installed on your computer when the course begins.
Basic familiarity with R is highly desirable. If you are new to R, check out Professor Vaisey’s one-hour Introduction to R video to get up to speed. But even novice R coders will be able to follow the lectures and do the exercises.
Stata notes and syntax are available upon request.
If you’d like to take this course but are concerned that you don’t know enough R, there are excellent on-line resources for learning the basics. Here are our recommendations.
The vast majority of what you will learn in this course can be applied in any software package. However, this seminar will mostly use R for empirical examples and exercises. To replicate the instructor’s workflow in the course, you should have R and RStudio already installed on your computer when the course begins.
Basic familiarity with R is highly desirable. If you are new to R, check out Professor Vaisey’s one-hour Introduction to R video to get up to speed. But even novice R coders will be able to follow the lectures and do the exercises.
Stata notes and syntax are available upon request.
If you’d like to take this course but are concerned that you don’t know enough R, there are excellent on-line resources for learning the basics. Here are our recommendations.
Who should register?
This course is for anyone who wants to learn to apply multilevel models to observational data. You should have a basic foundation in linear regression.
This course is for anyone who wants to learn to apply multilevel models to observational data. You should have a basic foundation in linear regression.
Seminar outline
- What are multilevel models?
- Hierarchical linear model motivation and notation
- Within and between variance
- Variance components models and plots
- The first distinction between “random” and “fixed” effects
- “Shrinkage” and empirical Bayes estimates of intercepts
- What are mixed models a mix of?
- Three types of R-squared
- Random coefficients/slopes
- Model selection with BIC and AIC
- Basic diagnostics
- The second distinction between “random” and “fixed” effects
- When and why (not) to use random effects models
- Testing the random effects assumption
- The “between-within” method to combine the best of RE/FE
- Centering and cross-level interactions
- A brief comparison of clustered and panel data
- Multilevel logistic regression and other limited dependent variables
- Presenting results
- What are multilevel models?
- Hierarchical linear model motivation and notation
- Within and between variance
- Variance components models and plots
- The first distinction between “random” and “fixed” effects
- “Shrinkage” and empirical Bayes estimates of intercepts
- What are mixed models a mix of?
- Three types of R-squared
- Random coefficients/slopes
- Model selection with BIC and AIC
- Basic diagnostics
- The second distinction between “random” and “fixed” effects
- When and why (not) to use random effects models
- Testing the random effects assumption
- The “between-within” method to combine the best of RE/FE
- Centering and cross-level interactions
- A brief comparison of clustered and panel data
- Multilevel logistic regression and other limited dependent variables
- Presenting results
Payment information
The fee of $995 includes all course materials.
PayPal and all major credit cards are accepted.
Our Tax ID number is 26-4576270.
The fee of $995 includes all course materials.
PayPal and all major credit cards are accepted.
Our Tax ID number is 26-4576270.