DAGs with No Fears: A Closer Look at Continuous Optimization for Learning Bayesian Networks (2010.09133v1)

Published 18 Oct 2020 in cs.LG and stat.ML

Abstract: This paper re-examines a continuous optimization framework dubbed NOTEARS for learning Bayesian networks. We first generalize existing algebraic characterizations of acyclicity to a class of matrix polynomials. Next, focusing on a one-parameter-per-edge setting, it is shown that the Karush-Kuhn-Tucker (KKT) optimality conditions for the NOTEARS formulation cannot be satisfied except in a trivial case, which explains a behavior of the associated algorithm. We then derive the KKT conditions for an equivalent reformulation, show that they are indeed necessary, and relate them to explicit constraints that certain edges be absent from the graph. If the score function is convex, these KKT conditions are also sufficient for local minimality despite the non-convexity of the constraint. Informed by the KKT conditions, a local search post-processing algorithm is proposed and shown to substantially and universally improve the structural Hamming distance of all tested algorithms, typically by a factor of 2 or more. Some combinations with local search are both more accurate and more efficient than the original NOTEARS.

Citations (66)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

DAGs with No Fears: A Closer Look at Continuous Optimization for Learning Bayesian Networks (2010.09133v1)

Summary

Related Papers