Two Optimal Strategies for Active Learning of Causal Models from Interventional Data (1205.4174v3)

Published 18 May 2012 in stat.ME and cs.DM

Abstract: From observational data alone, a causal DAG is only identifiable up to Markov equivalence. Interventional data generally improves identifiability; however, the gain of an intervention strongly depends on the intervention target, that is, the intervened variables. We present active learning (that is, optimal experimental design) strategies calculating optimal interventions for two different learning goals. The first one is a greedy approach using single-vertex interventions that maximizes the number of edges that can be oriented after each intervention. The second one yields in polynomial time a minimum set of targets of arbitrary size that guarantees full identifiability. This second approach proves a conjecture of Eberhardt (2008) indicating the number of unbounded intervention targets which is sufficient and in the worst case necessary for full identifiability. In a simulation study, we compare our two active learning approaches to random interventions and an existing approach, and analyze the influence of estimation errors on the overall performance of active learning.

Citations (118)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Two Optimal Strategies for Active Learning of Causal Models from Interventional Data (1205.4174v3)

Summary

Related Papers