Abstract
In this paper we provide an explicit probability distribution for classification purposes when observations are viewed on the real line and classifications are to be based on numerical orderings. The classification model is derived from a Bayesian nonparametric mixture of Dirichlet process model; with some modifications. The resulting approach then more closely resembles a classical hierarchical grouping rule in that it depends on sums of squares of neighboring values. The proposed probability model for classification relies on a numerical procedure based on a reversible Markov chain Monte Carlo (MCMC) algorithm for determining the probabilities. Some numerical illustrations comparing with alternative ideas for classification are provided.
Similar content being viewed by others
References
CHARALAMBIDES, C.A. (2005), Combinatorial Methods in Discrete Distributions, Hoboken, NJ: Wiley.
ESCOBAR, M.D. (1994), “Estimating Normal Means with a Dirichlet Process Prior”, Journal of the American Statistical Association, 89, 268–277.
ESCOBAR, M.D., and WEST, M. (1995), “Bayesian Density Estimation and Inference Using Mixtures”, Journal of the American Statistical Association, 90, 577–588.
GODSILL, S.J. (2001), “On the Relationship Between Markov Chain Monte Carlo Methods for Model Uncertainty”, Journal of Computational and Graphical Statistics, 10, 230–248.
GREEN, P.J. (1995), “Reversible Jump Markov Chain Monte Carlo Computation and Bayesian Model Determination”, Biometrika, 82, 711–732.
ISHWARAN, H., and JAMES, L.F. (2001), “Gibbs Sampling Methods for Stick-breaking Priors”, Journal of the American Statistical Association, 96, 161–173.
LIJOI, A., MENA, R.H., and PRÜNSTER, I. (2005), “Hierarchical Mixture Modeling with Normalized Inverse Gaussian Priors”, Journal of the American Statistical Association, 100, 1278–1291.
LIJOI, A., MENA, R.H., and PRÜNSTER, I. (2007), “Controlling the Reinforcement in Bayesian Non-parametric Mixture Models”, Journal of the Royal Statistical Society. Series B, 69, 715–740.
LIJOI, A., and PRÜNSTER, I. (2009), “Models Beyond the Dirichlet Process”, in Bayesian Nonparametrics, Eds. N.L. Hjort, C.C. Holmes, P. Müller, S.G. Walker, Cambridge University Press, in press.
LO, A. (1984), “On a Class of Bayesian Nonparametric Estimates I. Density Estimates”, Annals of Statistics, 12, 351–357.
MCGRORY, C.A. and TITTERINGTON, D.M. (2007), “Variational Approximations in Bayesian Model Selection for Finite Mixture Distributions”, Computational Statistics and Data Analysis, 51, 5352–5367.
QUINTANA, F.A. and IGLESIAS, P.L. (2003), “Bayesian Clustering and Product Partition Models”, Journal of the Royal Statistical Society, Series B, 65, 557–574.
RICHARDSON, S. and GREEN, P.J. (1997), “On Bayesian Analysis of Mixtures With an Unknown Number of Components (With Discussion)”, Journal of the Royal Statistical Society, Series B, 59, 731–792.
ROEDER, K. (1990), “Density Estimation with Confidence Sets Exemplified by Superclusters and Voids in the Galaxies”, Journal of the American Statistical Association, 85, 617–624.
SETHURAMAN, J. (1990), “A Constructive Definition of Dirichlet Priors”, Statistica Sinica, 4, 639–650.
WARD, J.H. (1990), “Hierarchical Grouping to Optimize an Objective Function”, Journal of the American Statistical Association, 58, 236–244.
Author information
Authors and Affiliations
Corresponding author
Additional information
The first author gratefully acknowledges the Mexican Mathematical Society and the Sofia Kovalevskaia Fund, and the second author gratefully acknowledges CONACYT for Grant No. J50160-F, for allowing them to travel to the UK, where the work was completed during a visit to the University of Kent. The authors gratefully acknowledge the comments of three referees which have improved the paper.
Rights and permissions
About this article
Cite this article
Fuentes–García, R., Mena, R.H. & Walker, S.G. A Probability for Classification Based on the Dirichlet Process Mixture Model. J Classif 27, 389–403 (2010). https://doi.org/10.1007/s00357-010-9061-9
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00357-010-9061-9