Missing Data - Online Course
A 3-Day Livestream Seminar Taught by
Paul Allison10:00am-2:00pm ET (New York time): Live lecture via Zoom
4:00pm-5:00pm ET: Live lab session via Zoom (Thursday and Friday only)
If you’re using conventional methods for handling missing data, you may be missing out. Conventional methods for missing data, like listwise deletion or regression imputation, are prone to three serious problems:
- Inefficient use of the available information, leading to low power and Type II errors.
- Biased estimates of standard errors, leading to incorrect p-values.
- Biased parameter estimates, due to failure to adjust for selectivity in missing data.
More accurate and reliable results can be obtained with maximum likelihood or multiple imputation.
Although these newer methods for handling missing data have been around for more than two decades, they have only become practical with the introduction of widely available and user friendly software.
Starting September 9, we are offering this seminar as a 3-day synchronous*, remote workshop for the first time. Each day will consist of a 4-hour live lecture held via the free video-conferencing software Zoom. You are encouraged to join the lecture live, but will have the opportunity to view the recorded session later in the day if you are unable to attend at the scheduled time.
Each lecture session will conclude with a hands-on exercise reviewing the content covered, to be completed on your own. An additional lab session will be held Thursday and Friday afternoons, where you can review the exercise results with the instructor and ask any questions.
*We understand that scheduling is difficult during this unpredictable time. If you prefer, you may take all or part of the course asynchronously. The video recordings will be made available within 24 hours of each session and will be accessible for four weeks after the seminar, meaning that you will get all of the class content and discussions even if you cannot participate synchronously.
Closed captioning is available for all live and recorded sessions.
More details about the course content
Maximum likelihood and multiple imputation have very similar statistical properties. If the assumptions are met, they are approximately unbiased and efficient. What’s remarkable is that these newer methods depend on less demanding assumptions than those required for older methods for handling missing data.
Maximum likelihood is available for logistic regression, Cox regression, and regression for count data. Multiple imputation can be used for virtually any statistical problem.
This course will cover the theory and practice of both maximum likelihood and multiple imputation. Maximum likelihood for linear models will be demonstrated with SAS, Stata, and Mplus. Multiple imputation will be demonstrated with both SAS and Stata. Slides and exercises using R are also available to participants on request.
Maximum likelihood and multiple imputation have very similar statistical properties. If the assumptions are met, they are approximately unbiased and efficient. What’s remarkable is that these newer methods depend on less demanding assumptions than those required for older methods for handling missing data.
Maximum likelihood is available for logistic regression, Cox regression, and regression for count data. Multiple imputation can be used for virtually any statistical problem.
This course will cover the theory and practice of both maximum likelihood and multiple imputation. Maximum likelihood for linear models will be demonstrated with SAS, Stata, and Mplus. Multiple imputation will be demonstrated with both SAS and Stata. Slides and exercises using R are also available to participants on request.
Computing
This is a hands-on course. To optimally benefit, you are strongly encouraged to use a computer with a recent version of SAS or Stata installed. Mplus and LEM will also be used in the section on maximum likelihood estimation, but will not be used for exercises. For those who prefer R, R slides and exercises using these packages can be downloaded from the course site.
There is now a free version of SAS, called the SAS OnDemand for Academics, that works in your web browser.
Seminar participants who are not yet ready to purchase Stata could take advantage of StataCorp’s free 30-day evaluation offer or their 30-day software return policy.
This is a hands-on course. To optimally benefit, you are strongly encouraged to use a computer with a recent version of SAS or Stata installed. Mplus and LEM will also be used in the section on maximum likelihood estimation, but will not be used for exercises. For those who prefer R, R slides and exercises using these packages can be downloaded from the course site.
There is now a free version of SAS, called the SAS OnDemand for Academics, that works in your web browser.
Seminar participants who are not yet ready to purchase Stata could take advantage of StataCorp’s free 30-day evaluation offer or their 30-day software return policy.
Who should register?
Virtually anyone who does statistical analysis can benefit from new methods for handling missing data. To optimally benefit from this course, you should have a good working knowledge of the principles and practice of linear regression, as well as elementary statistical inference. But you do not need to know matrix algebra, calculus, or likelihood theory. You should be at least moderately proficient at using one of these packages: SAS, Stata or R.
Virtually anyone who does statistical analysis can benefit from new methods for handling missing data. To optimally benefit from this course, you should have a good working knowledge of the principles and practice of linear regression, as well as elementary statistical inference. But you do not need to know matrix algebra, calculus, or likelihood theory. You should be at least moderately proficient at using one of these packages: SAS, Stata or R.
Seminar outline
- Assumptions for missing data methods
- Problems with conventional methods
- Maximum likelihood (ML)
- ML with EM algorithm
- Direct ML with Mplus, Stata and SAS
- ML for contingency tables
- Multiple Imputation (MI)
- MI under multivariate normal model
- MI with SAS and Stata
- MI with categorical and nonnormal data
- Interactions and nonlinearities
- Using auxiliary variables
- Other parametric approaches to MI
- Linear hypotheses and likelihood ratio tests
- Nonparametric and partially parametric methods
- Fully conditional models
- MI and ML for nonignorable missing data
- Assumptions for missing data methods
- Problems with conventional methods
- Maximum likelihood (ML)
- ML with EM algorithm
- Direct ML with Mplus, Stata and SAS
- ML for contingency tables
- Multiple Imputation (MI)
- MI under multivariate normal model
- MI with SAS and Stata
- MI with categorical and nonnormal data
- Interactions and nonlinearities
- Using auxiliary variables
- Other parametric approaches to MI
- Linear hypotheses and likelihood ratio tests
- Nonparametric and partially parametric methods
- Fully conditional models
- MI and ML for nonignorable missing data
Payment information
The fee of $895 includes all course materials.
PayPal and all major credit cards are accepted.
Our Tax ID number is 26-4576270.
The fee of $895 includes all course materials.
PayPal and all major credit cards are accepted.
Our Tax ID number is 26-4576270.