Abstract
To provide a parsimonious generative representation of the sequential activity of a number of individuals within a population there is a necessary tradeoff between the definition of individual specific and global representations. A linear-time algorithm is proposed that defines a distributed predictive model for finite state symbolic sequences which represent the traces of the activity of a number of individuals within a group. The algorithm is based on a straightforward generalization of latent Dirichlet allocation to time-invariant Markov chains of arbitrary order. The modelling assumption made is that the possibly heterogeneous behavior of individuals may be represented by a relatively small number of simple and common behavioral traits which may interleave randomly according to an individual-specific distribution. The results of an empirical study on three different application domains indicate that this modelling approach provides an efficient low-complexity and intuitively interpretable representation scheme which is reflected by improved prediction performance over comparable models.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Anderson, C., Domingos, P., and Weld, D. 2001. Adaptive web navigation for wireless devices. In Proceedings of the 17th International Joint Conference on Artificial Intelligence, pp. 879–884.
Attias, H. 2001. Learning in high dimension: modular mixture models. In Proc. AI and Statistics.
Blei, D.M., Ng, A.Y., and Jordan, M.I. 2003. Latent dirchlet allocation. Journal of Machine Learning Research, 3(5):993–1022.
Borges, J. and Levene, M. 1999. Data mining of user navigation patterns. WEBKDD, pp. 92–111.
Cadez, I., Heckerman, D., Meek, C., Smyth, P., and White, S. 2003. Model-based clustering and visualisation of navigation patterns on a web site. Journal of data Mining and Knowledge Discovery, 7(4):399–424.
Cover, T.M and Thomas, J.A. 1991. Elements of Information Theory. New York: Wiley.
Deshpande, M. and Karypis. to appear. Selective markov models for predicting web-page accesses. ACM Transactions on Internet Technology.
Hofmann, T. 2001. Unsupervised learning by probabilistic latent semantic analysis. Machine Learning, 42:177–196.
Hofmann, T. 2001. Learning What People (Don’t) Want. In European Conference on Machine Learning, pp. 214–225.
Frydman, H. 1984. Maximum likelihood estimation in the mover-stayer model. Journal of the American Statistical Society, 79:632–638.
Krogh, A. Hidden Markov Models in computational biology: Applications to protein modelling. Journal of Molecular Biology, 235:1501–1531.
Lee, D. and Sebastian Seung, H. 2001. Algorithms for Non-negative Matrix Factorization. In Advances in Neural Information Processing Systems 13, Todd K. Leen, Thomas G. Dietterich, and Volker. Tresp, (eds.) MIT Press, pp. 556–562.
Linton, F., Joy, D., Schaefer, H.-P. and Charron, A. 2000. OWL: A recommender system for organization-wide learning. Educational Technology & Society, 3.
Mannila, H., and Rusakov, D. 2001. Decomposing event sequences into independent components. In First SIAM Conference on Data Mining.
Manning, C.D. and Schütze, H. 1999. Foundations of Statistical Natural Language Processing. Cambridge, Massachusetts: The MIT Press.
Minka, T. and Lafferty, J. 2002. Expectation-propogation for the generative aspect model. In Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence.
Mitchell, T. 1996. Machine Learning. New York, US: McGraw-Hill.
Rabiner, L.R. 1989. A tutorial on hidden Markov models and selected applications in speech recognition. In Proc. of the IEEE, 77(2):257–285.
Raftery, A. 1985. A model for higher-order Markov chains. Journal of the Royal Statistical Society B, 47(3):528–539.
Ronning, G. 1989. Maximum likelihood estimation of Dirichlet distributions. Journal of Statistical Computation and Simulation, 32(4):215–221.
Ross, D.A. and Zemel, R.S. 2003. Multiple-cause vector quantiztion. In Advances in Neural Information Processing Systems 15, S. Becker, S. Thrun and K. Obermayer (Eds.), MIT Press, pp. 1017–1024.
Sarukkai, R. 2000. Link predicition and path analysis using markov chains. Computer Networks, 33(1–6):377–386.
Saul, L. and Pereira, F. 1997. Aggregate and mixed-order markov models for statistical language processing. In Proceedings of 2nd International Conference on Empirical Methods in Natural Language Processing, pp. 81–89.
Saul, L.K. and Jordan, M.I. 1999. Mixed memory markov models: Decomposing complex stochastic processes as mixtures of simpler ones. Machine Learning, 37:75–87.
Tino, P. and Dorffner, G. 2001. Predicting the future of discrete sequences from fractal representations of the past. Machine Learning, 45(2):187–218.
Lappalainen, H. and Miskin, J.W. 2000. Ensemble learning. In Advances in Independent Component Analysis, M. Girolami (Ed.). Springer-Verlag, pp. 75–92.
Author information
Authors and Affiliations
Corresponding author
Additional information
Mark Girolami is a Reader in the Department of Computing Science at the University of Glasgow. In 2000 he was the TEKES visiting professor at the Laboratory of Computing and Information Science in Helsinki University of Technology. In 1998 and 1999 Dr. Girolami was a research fellow at the Laboratory for Advanced Brain Signal Processing in the Brain Science Institute, RIKEN, Wako-Shi, Japan. He has been a visiting researcher at the Computational Neurobiology Laboratory (CNL) of the Salk Institute. This year (2005) he will take up an MRC funded Discipline Hopping Award in the Department of Biochemistry. Mark Girolami holds a degree in Mechanical Engineering from the University of Glasgow (1985), and a Ph.D. in Computing Science from the University of Paisley (1998). Dr. Girolami was a development engineer with IBM from 1985 until 1995 when he left to pursue an academic career.
Ata Kabán received the B.Sc. degree with honours (1999) in computer science from the University “Babes-Bolyai” of Cluj-Napoca, Romania, and the Ph.D. degree in computer science (2001) from the University of Paisley, UK. She is a lecturer in the School of Computer Science of the University of Birmingham. Her current interests concern probabilistic modelling, machine learning and their applications to automated data analysis. She has been a visiting researcher at Helsinki University of Technology (June–December 2000 and in the summer of 2003). Prior to her career in Computer Science, she received the B.A. degree in musical composition (1994) and the M.A. (1995) and Ph.D. (1999) degrees in musicology from the Music Academy “Gh. Dima” of Cluj-Napoca, Romania.
Rights and permissions
About this article
Cite this article
Girolami, M., Kabán, A. Sequential Activity Profiling: Latent Dirichlet Allocation of Markov Chains. Data Min Knowl Disc 10, 175–196 (2005). https://doi.org/10.1007/s10618-005-0362-2
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.1007/s10618-005-0362-2