Abstract
We introduce a novel class of labeled directed acyclic graph (LDAG) models for finite sets of discrete variables. LDAGs generalize earlier proposals for allowing local structures in the conditional probability distribution of a node, such that unrestricted label sets determine which edges can be deleted from the underlying directed acyclic graph (DAG) for a given context. Several properties of these models are derived, including a generalization of the concept of Markov equivalence classes. Efficient Bayesian learning of LDAGs is enabled by introducing an LDAG-based factorization of the Dirichlet prior for the model parameters, such that the marginal likelihood can be calculated analytically. In addition, we develop a novel prior distribution for the model structures that can appropriately penalize a model for its labeling complexity. A non-reversible Markov chain Monte Carlo algorithm combined with a greedy hill climbing approach is used for illustrating the useful properties of LDAG models for both real and synthetic data sets.
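The abstract's central idea, that a label on an edge specifies contexts in which that edge can be deleted, can be illustrated with a small sketch. The example below is not the authors' code; the variable names, the two-parent network, and the helper `merge_csi_rows` are all hypothetical, chosen only to show how a context-specific independence (CSI) label collapses rows of a conditional probability table and thereby reduces the number of free parameters.

```python
# Hypothetical illustration of a CSI label in an LDAG (not the authors' code).
# Node Y has parents A and B. A label {A=0} on the edge B -> Y states that,
# in the context A=0, Y is independent of B, so the rows
# P(Y | A=0, B=0) and P(Y | A=0, B=1) coincide and can be merged.

full_cpt = {
    # (a, b): P(Y=1 | A=a, B=b)
    (0, 0): 0.3,
    (0, 1): 0.3,   # identical to row (0, 0): CSI holds in context A=0
    (1, 0): 0.6,
    (1, 1): 0.9,
}

def merge_csi_rows(cpt, context_parent_index, context_value):
    """Collapse parent configurations that the label declares equivalent,
    yielding the reduced parameterization an LDAG label encodes."""
    merged = {}
    for config, p in cpt.items():
        if config[context_parent_index] == context_value:
            # Within the labeled context, the other parent is dropped.
            key = ("context=%d" % context_value,)
        else:
            key = config
        merged.setdefault(key, p)
    return merged

reduced = merge_csi_rows(full_cpt, context_parent_index=0, context_value=0)
print(len(full_cpt), "->", len(reduced))  # 4 -> 3 free parameters
```

This parameter reduction is what makes the Dirichlet prior factorize over merged rows rather than over all parent configurations, which in turn keeps the marginal likelihood analytically tractable, as the abstract states.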
Acknowledgments
JP and HN were supported by the Foundation of Åbo Akademi University, as part of the grant for the Center of Excellence in Optimization and Systems Engineering. JC was supported by ERC grant 239784 and AoF grant 251170. The authors would like to thank the AE and the three anonymous reviewers for their comments and suggestions, which led to a significant improvement of the original manuscript. We would also like to thank Jarno Lintusaari for his comments during the revision phase.
Responsible editor: Kristian Kersting.
Pensar, J., Nyman, H., Koski, T. et al. Labeled directed acyclic graphs: a generalization of context-specific independence in directed graphical models. Data Min Knowl Disc 29, 503–533 (2015). https://doi.org/10.1007/s10618-014-0355-0