Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Large Scale Benchmark for Individual Treatment Effect Prediction and Uplift Modeling (2111.10106v1)

Published 19 Nov 2021 in stat.ML, cs.AI, cs.LG, and stat.AP

Abstract: Individual Treatment Effect (ITE) prediction is an important area of research in machine learning which aims at explaining and estimating the causal impact of an action at the granular level. It represents a problem of growing interest in multiple sectors of application such as healthcare, online advertising or socioeconomics. To foster research on this topic we release a publicly available collection of 13.9 million samples collected from several randomized control trials, scaling up previously available datasets by a healthy 210x factor. We provide details on the data collection and perform sanity checks to validate the use of this data for causal inference tasks. First, we formalize the task of uplift modeling (UM) that can be performed with this data, along with the relevant evaluation metrics. Then, we propose synthetic response surfaces and heterogeneous treatment assignment providing a general set-up for ITE prediction. Finally, we report experiments to validate key characteristics of the dataset leveraging its size to evaluate and compare - with high statistical significance - a selection of baseline UM and ITE prediction methods.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Eustache Diemert (9 papers)
  2. Artem Betlei (3 papers)
  3. Christophe Renaudin (4 papers)
  4. Massih-Reza Amini (40 papers)
  5. Théophane Gregoir (2 papers)
  6. Thibaud Rahier (9 papers)
Citations (9)

Summary

We haven't generated a summary for this paper yet.