Learning Staged Trees from Incomplete Data (2405.18306v1)
Abstract: Staged trees are probabilistic graphical models capable of representing any class of non-symmetric independence via a coloring of its vertices. Several structural learning routines have been defined and implemented to learn staged trees from data, under the frequentist or Bayesian paradigm. They assume a data set has been observed fully and, in practice, observations with missing entries are either dropped or imputed before learning the model. Here, we introduce the first algorithms for staged trees that handle missingness within the learning of the model. To this end, we characterize the likelihood of staged tree models in the presence of missing data and discuss pseudo-likelihoods that approximate it. A structural expectation-maximization algorithm estimating the model directly from the full likelihood is also implemented and evaluated. A computational experiment showcases the performance of the novel learning algorithms, demonstrating that it is feasible to account for different missingness patterns when learning staged trees.
- N. Balov. Consistent model selection of discrete Bayesian networks from incomplete data. Electronic Journal of Statistics, 7:1047–1077, 2013.
- Refining a Bayesian network using a chain event graph. International Journal of Approximate Reasoning, 54(9):1300–1309, 2013.
- Chain event graphs for informed missingness. Bayesian Analysis, 9(1):53–76, 2014.
- The dynamic chain event graph. Electronic Journal of Statistics, 9:2130–2169, 2015.
- T. Bodewes and M. Scutari. Learning Bayesian networks from incomplete data with the node-average likelihood. International Journal of Approximate Reasoning, 138:145–160, 2021.
- The R package stagedtrees for structural learning of stratified staged trees. Journal of Statistical Software, 102:1–30, 2022.
- Staged trees for discrete longitudinal data. arXiv preprint arXiv:2401.04297, 2024.
- N. Cohen and Y. Berchenko. Normalized information criteria and model selection in the presence of missing data. Mathematics, 9(19):2474, 2021.
- Chain event graphs. CRC Press, 2018.
- Causal discovery through MAP selection of stratified chain event graphs. Electronic Journal of Statistics, 8:965–997, 2014.
- Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society Series B, 39(1):1–22, 1977.
- Using staged tree models for health data: Investigating invasive fungal infections by aspergillus and other filamentous fungi. Computational and Structural Biotechnology Journal, 24:12–22, 2024.
- bnstruct: an R package for Bayesian network structure learning in the presence of missing data. Bioinformatics, 33(8):1250–1252, 2017.
- G. Freeman and J. Q. Smith. Bayesian MAP model selection of chain event graphs. Journal of Multivariate Analysis, 102(7):1152–1165, 2011.
- N. Friedman. Learning belief networks in the presence of missing values and hidden variables. In Proceedings of the 14th International Conference on Machine Learning, pages 125–133. Morgan Kaufmann, 1997.
- N. Friedman. The Bayesian structural EM algorithm. In Proceedings of the 14th Conference on Uncertainty in Artificial Intelligence, pages 129–138, 1998.
- A differential approach for staged trees. In 13th European Conference on Symbolic and Quantitative Approaches to Reasoning with Uncertainty, pages 346–355. Springer, 2015.
- Discovery of statistical equivalence classes using computer algebra. International Journal of Approximate Reasoning, 95:167–184, 2018.
- The curved exponential family of a staged tree. Electronic Journal of Statistics, 16(1):2607–2620, 2022.
- Score equivalence for staged trees. arXiv preprint arXiv:2206.15322, 2022.
- A. Klimova and T. Rudas. On the closure of relational models. Journal of Multivariate Analysis, 143:440–452, 2016.
- A. Klimova and T. Rudas. Hierarchical Aitchison–Silvey models for incomplete binary sample spaces. Journal of Multivariate Analysis, 187:104808, 2022.
- S. L. Lauritzen. The EM algorithm for graphical association models with missing data. Computational Statistics & Data Analysis, 19(2):191–201, 1995.
- M. Leonelli. Sensitivity analysis beyond linearity. International Journal of Approximate Reasoning, 113:106–118, 2019.
- M. Leonelli and G. Varando. Highly efficient structural learning of sparse staged trees. In International Conference on Probabilistic Graphical Models, pages 193–204. PMLR, 2022.
- M. Leonelli and G. Varando. Context-specific causal discovery for categorical data using staged trees. In International Conference on Artificial Intelligence and Statistics, pages 8871–8888. PMLR, 2023.
- M. Leonelli and G. Varando. Learning and interpreting asymmetry-labeled DAGs: A case study on COVID-19 fear. Applied Intelligence, 54(2):1734–1750, 2024a.
- M. Leonelli and G. Varando. Structural learning of simple staged trees. Data Mining and Knowledge Discovery, pages 1–25, 2024b.
- Statistical analysis with missing data. John Wiley & Sons, 2019.
- D. B. Rubin. Inference and missing data. Biometrika, 63(3):581–592, 1976.
- Hard and soft EM in Bayesian network learning from incomplete data. Algorithms, 13(12):329, 2020.
- S. J. Russell and P. Norvig. Artificial intelligence: A modern approach. Pearson, 2016.
- J. L. Schafer. Multiple imputation: A primer. Statistical Methods in Medical Research, 8(1):3–15, 1999.
- Generating missing values for simulation purposes: A multivariate amputation procedure. Journal of Statistical Computation and Simulation, 88(15):2909–2930, 2018.
- M. Scutari. Learning Bayesian networks with the bnlearn R package. Journal of Statistical Software, 35:1–22, 2010.
- M. Scutari. Bayesian network models for incomplete and dynamic data. Statistica Neerlandica, 74(3):397–419, 2020.
- G. Shafer. The art of causal conjecture. MIT press, 1996.
- T. Silander and T.-Y. Leong. A dynamic programming algorithm for learning chain event graphs. In 16th International Conference on Discovery Science, pages 201–216. Springer, 2013.
- Conditional independence and chain event graphs. Artificial Intelligence, 172(1):42–68, 2008.
- Propagation using chain event graphs. In Proceedings of the Twenty-Fourth Conference on Uncertainty in Artificial Intelligence, pages 546–553, 2008.
- Staged trees and asymmetry-labeled DAGs. Metrika, pages 1–28, 2024.
- cegpy: Modelling with chain event graphs in Python. Knowledge-Based Systems, 274:110615, 2023.
- X. Yu and J. Q. Smith. Causal algebras on chain event graphs with informed missingness for system failure. Entropy, 23(10):1308, 2021.