Patient electronic health records, viewed as continuous-time right-censored survival data, can be used to estimate adverse drug reaction risk. Temporal outcome misclassification may occur as a result of errors in follow-up. These errors can be due to a failure to observe the incidence time of the adverse event of interest (due to misdiagnosis or nonreporting, etc) or an actual misdiagnosis of a competing adverse event. As the misclassifying event is often unobservable in the original data, we apply an internal validation sampling approach to produce consistent estimation in the presence of such errors. We introduce a univariate survival model and a cause-specific hazards model in which misclassification may also manifest as a diagnosis of an alternate adverse health outcome other than that of interest. We develop a method of maximum likelihood estimation of the model parameters and establish consistency and asymptotic normality of the estimators using standard results. We also conduct simulation studies to numerically investigate the finite sample properties of these estimators and the impact of ignoring the misclassification error.

cause-specific hazards, competing risks, electronic health records, internal validation sampling, misclassified survival outcomes
dx.doi.org/10.1002/sim.7854
Statistics in Medicine
School of Mathematics and Statistics

Gravel, C.A. (Christopher A.), Dewanji, A. (Anup), Farrell, P, & Krewski, D. (Daniel). (2018). A validation sampling approach for consistent estimation of adverse drug reaction risk with misclassified right-censored survival data. Statistics in Medicine. doi:10.1002/sim.7854