A Probability for Classification Based on the Dirichlet Process Mixture Model

Journal of Classification

Abstract

In this paper we provide an explicit probability distribution for classification when observations lie on the real line and classifications are based on their numerical ordering. The classification model is derived from the Bayesian nonparametric mixture of Dirichlet process model, with some modifications. The resulting approach more closely resembles a classical hierarchical grouping rule, in that it depends on sums of squares of neighboring values. The probabilities under the proposed model are determined numerically using a reversible Markov chain Monte Carlo (MCMC) algorithm. Numerical illustrations comparing the approach with alternative classification methods are provided.
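To make the phrase "sums of squares of neighboring values" concrete, the following is a minimal sketch of a classical sum-of-squares grouping criterion applied to ordered observations on the real line. It is not the probability model proposed in the paper. The function names (`within_ss`, `contiguous_partitions`, `best_partition`), the toy data, and the choice of exhaustive search over contiguous groupings are all assumptions made for illustration.

```python
from itertools import combinations


def within_ss(group):
    """Sum of squared deviations of a group from its own mean."""
    mean = sum(group) / len(group)
    return sum((x - mean) ** 2 for x in group)


def contiguous_partitions(sorted_values, k):
    """Yield every split of sorted_values into k non-empty contiguous groups."""
    n = len(sorted_values)
    # Choose k - 1 cut points between neighboring ordered values.
    for cuts in combinations(range(1, n), k - 1):
        bounds = (0, *cuts, n)
        yield [sorted_values[a:b] for a, b in zip(bounds, bounds[1:])]


def best_partition(values, k):
    """Return the contiguous k-group split with the smallest total within-group SS."""
    ordered = sorted(values)  # groups respect the numerical ordering
    return min(
        contiguous_partitions(ordered, k),
        key=lambda groups: sum(within_ss(g) for g in groups),
    )


# Hypothetical toy data with three visually separated clumps.
data = [1.0, 1.2, 0.9, 5.0, 5.3, 9.8, 10.1]
groups = best_partition(data, 3)
print(groups)
```

Because the groups are restricted to runs of neighboring ordered values, the search space is the set of cut-point choices rather than all set partitions. Exhaustive search is only practical for small samples, which is one reason a simulation-based procedure is attractive for the probabilistic version.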

Corresponding author

Correspondence to Stephen G. Walker.

Additional information

The first author gratefully acknowledges the Mexican Mathematical Society and the Sofia Kovalevskaia Fund, and the second author gratefully acknowledges CONACYT for Grant No. J50160-F, for allowing them to travel to the UK, where the work was completed during a visit to the University of Kent. The authors also gratefully acknowledge the comments of three referees, which have improved the paper.

Cite this article

Fuentes–García, R., Mena, R.H. & Walker, S.G. A Probability for Classification Based on the Dirichlet Process Mixture Model. J Classif 27, 389–403 (2010). https://doi.org/10.1007/s00357-010-9061-9
