Faculty of Medicine's internal website


Approaches to handling of missing data

Course leaders
Aleksandra Turkiewicz (PhD, statistician and epidemiologist at Clinical Epidemiology Unit, Orthpedics, IKVL),

Pär-Ola Bendahl (PhD, Associate Professor, statistician at the Department of Oncology and Pathology, IKVL),

Target group
The course is primarily intended for doctoral students at the Medical Faculty, Lund University, with research projects were missing data exists and needs to be handled. Also post-doctoral and senior researchers are welcomed to apply.

All participants must have a level of knowledge in applied statistics corresponding to the mandatory statistics courses (level I and II) at our faculty. Access to and substantial skills in one of the statistical software packages SPSS, STATA or R is also required. Additionally, participants must have good theoretical and practical knowledge of linear and logistic regression analysis. This means, that they need to be able to perform such analyses and report and interpret the results.
Before you can apply to this course you have to have taken Applied Statistics I and II OR our former course Statistical methods for Medical research.


Given autumn 2022
The course corresponds to one week full time studies.

The in-class (on Zoom) activities (mandatory) are scheduled on mornings 8 to 12 a.m.:

Wednesday, Sep 28th

Friday, Sep 30th

Wednesday, Oct 5th

Friday, Oct 7th

Wednesday, Oct 12th

Friday, Oct 14th

Course is given online, via Zoom.

Course content
The course is held in English and is based on the following three themes:

1. Introduction to missing data

  • To identify missing data
  • Potential consequences of missing data
  • Mechanisms leading to missing data
  • Overview of methods for handling of missing data

2. Multiple imputation (MI)

  • Overview of the theory behind MI
  • Method “chained equations”
  • How to build an imputation model
  • Analysis of multiply imputed data
  • Diagnostics of MI model (model validation)

3. Reporting results from analyses involving MI

  • Reporting guidelines
  • Limitations of MI

This level III course in applied statistics introduces the problem of missing data and its consequences and gives a short overview of methods for handling of missing data. Focus is on a method called multiple imputation (MI). Ideas and theory behind the method will be briefly covered - the emphasis will be on the practical applications. How to build an imputation model? How to summarize and interpret the results? What model diagnostics is needed? The course is wrapped up with training in reporting of the results after MI. Practical examples include mostly regression models for cohort studies and randomized controlled trials.

Course literature
Papers and lecture material will be made available on the course web site before the course starts. 

The course autumn 2022 is full.

For this course we have a collaboration with Gothenburg university

Site overview