Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Penalized Shared-parameter Algorithm for Estimating Optimal Dynamic Treatment Regimens (2107.07875v3)

Published 13 Jul 2021 in stat.ML and cs.LG

Abstract: A dynamic treatment regimen (DTR) is a set of decision rules to personalize treatments for an individual using their medical history. The Q-learning-based Q-shared algorithm has been used to develop DTRs that involve decision rules shared across multiple stages of intervention. We show that the existing Q-shared algorithm can suffer from non-convergence due to the use of linear models in the Q-learning setup, and identify the condition under which Q-shared fails. We develop a penalized Q-shared algorithm that not only converges in settings that violate the condition, but can outperform the original Q-shared algorithm even when the condition is satisfied. We give evidence for the proposed method in a real-world application and several synthetic simulations.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Palash Ghosh (6 papers)
  2. Trikay Nalamada (2 papers)
  3. Shruti Agarwal (13 papers)
  4. Maria Jahja (5 papers)
  5. Bibhas Chakraborty (30 papers)
  6. Xinru Wang (18 papers)

Summary

We haven't generated a summary for this paper yet.