Abstract
This paper focuses on the integration of acoustic and visual information for people tracking. The system presented relies on a probabilistic framework within which information from multiple sources is integrated at an intermediate stage. An advantage of the method proposed is that of using a generative approach which supports easy and robust integration of multi source information by means of sampled projection instead of triangulation. The system described has been developed in the EU funded CHIL Project research activities. Experimental results from the CLEAR evaluation workshop are reported.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
CLEAR 2006 evaluation campaign. http://www.clear-evaluation.org/
Rich transcription 2005 spring meeting recognition evaluation. http://www.nist.gov/speech/tests/rt/rt2005/spring/
Brandstein, M., Ward, D.: Microphone Arrays. Springer, Heidelberg (2001)
Comaniciu, D., Ramesh, V., Meer, P.: Real-time tracking of non-rigid objects using mean-shift. In: Int. Conf. on Computer Vision and Pattern Recognition, vol. 2, pp. 142–149 (2000)
De Mori, R.: Spoken Dialogue with Computers. Academic Press, London (1998)
Doucet, A., de Freitas, N., Gordon, N.: Sequential Monte Carlo Methods in Practice. Springer, Heidelberg (2001)
Isard, M., Blake, A.: Condensation – conditional density propagation for visual tracking. Int. Journal of Computer Vision 1(29), 5–28 (1998)
Isard, M., MacCormick, J.: BraMBLe: A Bayesian multiple-blob tracker. In: Int. Conf. of Computer Vision, vol. 2, pp. 34–41 (2003)
Lanz, O.: Approximate bayesian multibody tracking. IEEE Trans. Pattern Analysis and Machine Intelligence, to appear (2006)
Omologo, M., Svaizer, P.: Acoustic event localization using a crosspower-spectrum phase based technique. In: Int. Conf. on Acoustics, Speech, and Signal Processing, vol. 2, pp. 273–276 (1994)
Sullivan, J., Rittscher, J.: Guiding random particles by deterministic search. In: Int. Conf. of Computer Vision, vol. 1, pp. 323–330 (2001)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Brunelli, R. et al. (2007). A Generative Approach to Audio-Visual Person Tracking. In: Stiefelhagen, R., Garofolo, J. (eds) Multimodal Technologies for Perception of Humans. CLEAR 2006. Lecture Notes in Computer Science, vol 4122. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69568-4_3
Download citation
DOI: https://doi.org/10.1007/978-3-540-69568-4_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69567-7
Online ISBN: 978-3-540-69568-4
eBook Packages: Computer ScienceComputer Science (R0)