Skip to main content

more options



Statistical Analysis with Missing Data

Missing data are very common in all types of data sets, but most statistical procedures assume that all data are observed. The result is that most standard statistical software packages automatically drop from the analysis all observations with any missing data. This approach can lead to very low sample sizes and biased results. This workshop will discuss the mechanisms that cause missing data, advantages and disadvantages of commonly-used approaches to dealing with missing data, some new and better approaches, and how to implement them with available statitstical software.

The workshop is intended for participants who have the equivalent of two semesters of statistics and some previous experience doing data analysis. It is appropriate for faculty, research staff, and graduate students.

Topics:

1. Missing data mechanisms and why they are important
2. An overview of imputation techniques
3. Multiple Imputation
4. Maximum Likelihood and the EM Algorithm
5. Software for implementing these solutions

Participants in the workshop will:

1. Understand the processes that lead to missing data
2. Learn approaches for handling missing data and their advantages and disadvantages
3. Be able to put into practice some of these approaches

Fee:

None to members of the Cornell community

Registration: Registration is required. Since space is limited, early registration is encouraged.
Instructor: