Dealing with misclassification and missing data when estimating prevalence and incidence of caries experience

T Mutsvari, MJ Garcia-Zattera, D Declerck, Emmanuel Lesaffre

Research output: Contribution to journalArticleAcademicpeer-review

2 Citations (Scopus)


Objectives: The aim of this research was to estimate the prevalence and incidence of caries experience (CE) in first permanent molars while dealing with misclassification and missing of data. Methods: CE was modeled as a Hidden Markov Model in which the response variable is subject to misclassification and missingness. The proposed analysis extends that of Garcia-Zattera et al. (Stat Med 2010; 29: 3103) by allowing for various patterns of missing data. Findings were illustrated using data from the Signal Tandmobiel (R) study that is a longitudinal oral health intervention study. Results: Differences in the parameter estimates were noted between models that take into account misclassification and missing data and those that do not. Unbiased parameter estimates of prevalence and incidence were obtained without the use of validation data. Models that include subjects with missing data have smaller standard deviations than models that do not. Conclusions: It is important to account for misclassification to obtain less biased estimates of prevalence and incidence. For a proper estimation of prevalence and incidence in a longitudinal study subject to misclassification, validation data are not needed but when internal they can increase the efficiency in estimating the model. Also, including subjects with missing data increases the efficiency of estimating the parameters.
Original languageUndefined/Unknown
Pages (from-to)28-35
Number of pages8
JournalCommunity Dentistry and Oral Epidemiology
Publication statusPublished - 2012

Cite this