Efficient estimation and model selection in large graphical models | Statistics and Computing Skip to main content
Log in

Efficient estimation and model selection in large graphical models

  • Papers
  • Published:
Statistics and Computing Aims and scope Submit manuscript

    We’re sorry, something doesn't seem to be working properly.

    Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.

Abstract

We develop a computationally efficient method to determine the interaction structure in a multidimensional binary sample. We use an interaction model based on orthogonal functions, and give a result on independence properties in this model. Using this result we develop an efficient approximation algorithm for estimating the parameters in a given undirected model. To find the best model, we use a heuristic search algorithm in which the structure is determined incrementally. We also give an algorithm for reconstructing the causal directions, if such exist. We demonstrate that together these algorithms are capable of discovering almost all of the true structure for a problem with 121 variables, including many of the directions.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (Japan)

Instant access to the full article PDF.

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

References

  • Badsberg, J. H. (1991) A Guide to CoCo. Technical Report 91-43, Institute for Electronic Systems, University of Aalborg.

  • Beinlich, I., Suermondt, H., Chavez, R., and Cooper, G. (1989) The ALARM monitoring system: A case study with two probabilistic inference techniques for belief networks. Technical Report KSL-88-84, Knowledge Systems Laboratory, Stanford University.

  • Cooper, G. and Herskovits, E. (1991) A Bayesian Method for the Induction of Probabilistic Networks from Data. Technical Report KSL-91-02, Knowledge Systems Laboratory, Stanford University.

  • Darroch, J. N., Lauritzen, S. L. and Speed, T. P. (1980) Markov fields and log-linear models for contingency tables. Annals of Statistics, 8, 522–39.

    Google Scholar 

  • Goodman, L. A. (1970) The multivariate analysis of qualitative data: Interaction among multiple classifications. Journal of the American Statistical Association, 65, 226–56.

    Google Scholar 

  • Haberman, S. J. (1974) The Analysis of Frequency Data. The University of Chicago Press.

  • Herskovits, E. and Cooper, G. (1990) Kutato: An Entropy-Driven System for Construction of Probabilistic Expert Systems from Databases. Technical Report KSL-90-22, Knowledge Systems Laboratory, Stanford University.

  • Lauritzen, S. L. (1982) Lectures on Contingency Tables. Aalborg University Press.

  • Lauritzen, S. L. and Spiegelhalter, D. S. (1988) Local computations with probabilities on graphical structures and their application to expert systems. Journal of the Royal Statistical Society, Series B, 50.

  • Moussouris, J. (1974) Gibbs and Markov random systems with constraints. Journal of Statistical Physics, 10, 11–33.

    Google Scholar 

  • Ott, J. and Kronmal, R. A. (1976) Some classification procedures for multivariate binary data using orthogonal functions. Journal of the American Statistical Association, 71, 391–9.

    Google Scholar 

  • Pearl, J. (1988) Probabilistic Reasoning in Intelligent Systems. Morgan Kaufmann.

  • Plackett, R. L. (1974) The Analysis of Categorical Data. London: Griffin & Co.

    Google Scholar 

  • Rissanen, J. (1983) A universal prior for integers and estimation by minimum description length. Annals of Statistics, 11, No. 2, 416–31.

    Google Scholar 

  • Spirtes, P., Glymour, C. and Scheines, R. (1992) Causation, Prediction and Search. Book manuscript. Department of Philosophy, Carnegie Mellon University.

  • Wedelin, D. (1993) Efficient Algorithms for Probabilistic Inference, Combinatorial Optimization and the Discovery of Causal Structure from Data, Ph.D. Thesis, Department of Computer Science, Chalmers University of Technology.

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wedelin, D. Efficient estimation and model selection in large graphical models. Stat Comput 6, 313–323 (1996). https://doi.org/10.1007/BF00143552

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF00143552

Keywords

Navigation