
ZaliQL: A SQL-Based Framework for Drawing Causal Inference from Big Data (1609.03540v2)

Published 12 Sep 2016 in cs.DB, cs.AI, cs.LG, and cs.PF

Abstract: Causal inference from observational data is a subject of active research and development in statistics and computer science. Many toolkits that depend on statistical software have been developed for this purpose. However, these toolkits do not scale to large datasets. In this paper we describe a suite of techniques for expressing causal inference tasks from observational data in SQL. This suite supports state-of-the-art methods for causal inference and runs at scale within a database engine. In addition, we introduce several optimization techniques that significantly speed up causal inference, in both the online and offline settings. We evaluate the quality and performance of our techniques through experiments on real datasets.

Authors (2)
  1. Babak Salimi (35 papers)
  2. Dan Suciu (83 papers)
Citations (1)
