A Bayesian approach for unadjudicated events in cardiovascular disease cohort studies (2506.19066v1)
Abstract: An important issue in joint modelling for outcomes and longitudinal risk factors in cohort studies is to have an accurate assessment of events. Events determined based on ICD-9 codes can be very inaccurate, in particular for cardiovascular disease (CVD) where ICD-9 codes may overestimate the frequency of CVD. Motivated by the lack of adjudicated events in the Established Populations for Epidemiologic Studies of the Elderly (EPESE) cohort, we develop methods that use a related cohort Atherosclerosis Risk in Communities (ARIC), with both ICD-9 code events and adjudicated events, to create a posterior predictive distribution of adjudicated events. The methods are based on the construction of flexible Bayesian joint models combined with a Bayesian additive regression trees to directly address the ICD-9 misclassification. We assessed the performance of our approach by simulation study and applied to ARIC data.