Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Policy Learning for Optimal Dynamic Treatment Regimes with Observational Data (2404.00221v5)

Published 30 Mar 2024 in stat.ME, econ.EM, math.ST, stat.ML, and stat.TH

Abstract: Public policies and medical interventions often involve dynamics in their treatment assignments, where individuals receive a series of interventions over multiple stages. We study the statistical learning of optimal dynamic treatment regimes (DTRs) that guide the optimal treatment assignment for each individual at each stage based on the individual's evolving history. We propose a doubly robust, classification-based approach to learning the optimal DTR using observational data under the assumption of sequential ignorability. This approach learns the optimal DTR through backward induction. At each step, it constructs an augmented inverse probability weighting (AIPW) estimator of the policy value function and maximizes it to learn the optimal policy for the corresponding stage. We show that the resulting DTR can achieve an optimal convergence rate of $n{-1/2}$ for welfare regret under mild convergence conditions on estimators of the nuisance components.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (1)
  1. Shosei Sakaguchi (8 papers)

Summary

We haven't generated a summary for this paper yet.