Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Naive Bayes machine learning approach to risk prediction using censored, time-to-event data (1404.2124v1)

Published 8 Apr 2014 in stat.ML

Abstract: Predicting an individual's risk of experiencing a future clinical outcome is a statistical task with important consequences for both practicing clinicians and public health experts. Modern observational databases such as electronic health records (EHRs) provide an alternative to the longitudinal cohort studies traditionally used to construct risk models, bringing with them both opportunities and challenges. Large sample sizes and detailed covariate histories enable the use of sophisticated machine learning techniques to uncover complex associations and interactions, but observational databases are often ``messy,'' with high levels of missing data and incomplete patient follow-up. In this paper, we propose an adaptation of the well-known Naive Bayes (NB) machine learning approach for classification to time-to-event outcomes subject to censoring. We compare the predictive performance of our method to the Cox proportional hazards model which is commonly used for risk prediction in healthcare populations, and illustrate its application to prediction of cardiovascular risk using an EHR dataset from a large Midwest integrated healthcare system.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Julian Wolfson (14 papers)
  2. Sunayan Bandyopadhyay (2 papers)
  3. Mohamed Elidrisi (4 papers)
  4. Gabriela Vazquez-Benitez (2 papers)
  5. Donald Musgrove (1 paper)
  6. Gediminas Adomavicius (9 papers)
  7. Paul Johnson (13 papers)
  8. Patrick O'Connor (1 paper)
Citations (46)