A Penalized Shared-parameter Algorithm for Estimating Optimal Dynamic Treatment Regimens (2107.07875v3)

Published 13 Jul 2021 in stat.ML and cs.LG

Abstract: A dynamic treatment regimen (DTR) is a set of decision rules to personalize treatments for an individual using their medical history. The Q-learning-based Q-shared algorithm has been used to develop DTRs that involve decision rules shared across multiple stages of intervention. We show that the existing Q-shared algorithm can suffer from non-convergence due to the use of linear models in the Q-learning setup, and identify the condition under which Q-shared fails. We develop a penalized Q-shared algorithm that not only converges in settings that violate the condition, but can outperform the original Q-shared algorithm even when the condition is satisfied. We give evidence for the proposed method in a real-world application and several synthetic simulations.

Authors (6)

Palash Ghosh (6 papers)
Trikay Nalamada (2 papers)
Shruti Agarwal (13 papers)
Maria Jahja (5 papers)
Bibhas Chakraborty (30 papers)
Xinru Wang (18 papers)

Summary

We haven't generated a summary for this paper yet.

Summarize Now

A Penalized Shared-parameter Algorithm for Estimating Optimal Dynamic Treatment Regimens (2107.07875v3)

Summary

Related Papers