Dealing with misclassification and missing data when estimating prevalence and incidence of caries experience

Authors


Emmanuel Lesaffre, L-Biostat, KU Leuven and Hasselt, Belgium
Tel.: +32 16 336896
Fax: +32 16 337015
e-mail: emmanuel.lesaffre@med.kuleuven.be

Abstract

Mutsvari T, García-Zattera MJ, Declerck D, Lesaffre E. Dealing with misclassification and missing data when estimating prevalence and incidence of caries experience. Community Dent Oral Epidemiol 2012; 40 (Suppl. 1): 28–35. © 2012 John Wiley & Sons A/S

Abstract – Objectives: The aim of this research was to estimate the prevalence and incidence of caries experience (CE) in first permanent molars while accounting for misclassification and missing data.

Methods: CE was modeled as a Hidden Markov Model in which the response variable is subject to misclassification and missingness. The proposed analysis extends that of García-Zattera et al. (Stat Med 2010;29:3103) by allowing for various patterns of missing data. The methods were illustrated using data from the Signal Tandmobiel® study, a longitudinal oral health intervention study.
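To make the modeling idea concrete, the following is a minimal sketch of how one tooth's likelihood could be computed under a two-state hidden Markov model with misclassified and partly missing scores. It is an illustration, not the authors' implementation: the function names, the constant incidence between examinations, the single examiner with fixed sensitivity and specificity, and the assumption that true CE cannot revert to caries-free are all simplifications introduced here.

```python
from typing import Optional, Sequence, Tuple


def emission(y: Optional[int], sens: float, spec: float) -> Tuple[float, float]:
    """Return P(score y | truly caries-free) and P(score y | true CE).

    A missing score (None) carries no information, so both terms are 1.
    """
    if y is None:
        return 1.0, 1.0
    if y == 1:
        return 1.0 - spec, sens      # false positive, true positive
    return spec, 1.0 - sens          # true negative, false negative


def sequence_likelihood(
    obs: Sequence[Optional[int]],
    prevalence: float,   # P(true CE at the first examination)
    incidence: float,    # P(caries-free -> CE between examinations), assumed constant
    sens: float,         # examiner sensitivity (assumed known here)
    spec: float,         # examiner specificity (assumed known here)
) -> float:
    """Forward-algorithm likelihood of one observed score sequence."""
    if not obs:
        raise ValueError("obs must contain at least one examination")
    e0, e1 = emission(obs[0], sens, spec)
    a0 = (1.0 - prevalence) * e0     # joint prob. of data so far and state 0
    a1 = prevalence * e1             # joint prob. of data so far and state 1
    for y in obs[1:]:
        e0, e1 = emission(y, sens, spec)
        # State 1 (CE) is treated as absorbing: no transition back to state 0.
        a0, a1 = a0 * (1.0 - incidence) * e0, (a0 * incidence + a1) * e1
    return a0 + a1


# Hypothetical values, not taken from the study.
print(sequence_likelihood([0, None, 1], prevalence=0.2, incidence=0.1,
                          sens=0.8, spec=0.95))
```

Because a missing score contributes a factor of 1 to both states, subjects with incomplete sequences still contribute to the likelihood instead of being dropped. In a full analysis the prevalence, incidence, sensitivity, and specificity would be estimated jointly rather than fixed as they are in this sketch.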

Results: Parameter estimates differed between models that take misclassification and missing data into account and models that do not. Unbiased parameter estimates of prevalence and incidence were obtained without the use of validation data. Models that include subjects with missing data yielded parameter estimates with smaller standard deviations than models that exclude them.
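A small numerical illustration shows why estimates can differ when misclassification is ignored. The sketch below uses the standard relation between true and apparent prevalence under imperfect sensitivity and specificity; the function name and the numbers are hypothetical and are not results from the study.

```python
def apparent_prevalence(true_prev: float, sens: float, spec: float) -> float:
    """Expected proportion scored as CE when scoring is imperfect.

    True cases are detected with probability sens; caries-free teeth are
    wrongly scored as CE with probability 1 - spec.
    """
    return sens * true_prev + (1.0 - spec) * (1.0 - true_prev)


# Hypothetical values: true prevalence 0.20, sensitivity 0.70, specificity 0.98.
# The naive estimate would center on about 0.156 instead of 0.20.
print(round(apparent_prevalence(0.20, 0.70, 0.98), 3))
```

With these assumed values, missed lesions outweigh false positives, so an analysis that takes the scores at face value would underestimate prevalence.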

Conclusions: It is important to account for misclassification to obtain less biased estimates of prevalence and incidence. In a longitudinal study subject to misclassification, validation data are not needed to estimate prevalence and incidence properly, but internal validation data, when available, can increase the efficiency of estimation. Including subjects with missing data also increases the efficiency of parameter estimation.
