Abstract
This paper documents the acoustic source tracking system developed by TUT for the 2006 CLEAR evaluation campaign. The described system performs 3-D single person tracking based on audio data received from multiple spatially separated microphone arrays. The evaluation focuses on meeting room domain.
The system consists of four distinct stages. First stage is time delay estimation (TDE) between microphone pairs inside each array. Based on the TDE, direction of arrival (DOA) vectors are calculated for each array using a confidence metric. Source localization is done by using a selected combination of DOA estimates. The location estimate is tracked using a particle filter to reduce noise. The system is capable of locating a speaker 72 % of the time with an average accuracy of 25 cm.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Stiefelhagen, R., Garofolo, J.: CLEAR Evaluation Campaign and Workshop (2006), http://www.clear-evaluation.org/
Mostefa, D., et al.: Clear evaluation plan v.1.1 (2006), http://www.clear-evaluation.org/downloads/chil-clear-v1.1-2006-02-21.pdf
Pirinen, T.W., Pertila, P., Parviainen, M.: The TUT 2005 Source Localization System. In: Proceedings of the Rich Transcription 2005 Spring Meeting Recognition Evaluation, Royal College of Physicians, Edinburgh, UK, pp. 93–99 (2005)
Parviainen, M., Pirinen, T.W.: A Speaker Localization System for Lecture Room Environment. In: 3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (accepted for publication) (2006)
Huang, Y., Benesty, J., Elko, G.W.: Passive acoustic source localization for video camera steering. In: Proceedings of the 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP ’00), vol. 2, pp. 909–912. IEEE, Los Alamitos (2000)
Roman, N., Wang, D.L., Brown, G.J.: Location-based sound segregation. In: Proceedings of the International Conference on Acoustics Speech and Signal Processing (ICASSP’02), pp. 1013–1016 (2002)
Blumrich, R., Altmann, J.: Medium-range localisation of aircraft via triangulation. Applied Acoustics 61(1), 65–82 (2000)
Bass, H.E., et al.: Infrasound. Acoustics Today 2(1), 9–19 (2006)
Pertilä, P., Parviainen, M., Korhonen, T., Visa, A.: Moving sound source localization in large areas. In: 2005 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS 2005), pp. 745–748 (2005)
Omologo, M., Brutti, A., Svaizer, P.: Speaker Localization and Tracking - Evaluation Criteria. CHIL, v. 5.0 (2005)
Knapp, C., Carter, G.C.: The generalized correlation method for estimation of time delay. IEEE Transactions on Acoustics, Speech, and Signal Processing 24(4), 320–327 (1976)
Champagne, B., Bédard, S., Stéphenne, A.: Performance of time-delay estimation in the presence of room reverberation. IEEE Transactions on Speech and Audio Processing 4(2), 148–152 (1996)
Omologo, M., Svaizer, P.: Use of the crosspower-spectrum phase in acoustic event location. IEEE Transactions on Speech and Audio Processing 5(3), 288–292 (1997)
Varma, K., Ikuma, T., Beex, A.A.: Robust TDE-based DOA-estimation for compact audio arrays. Proceedings of the Second IEEE Sensor Array and Multichannel Signal Processing Workshop (SAM) , 214–218 (2002)
Anguera, X., Wooters, C., Peskin, B., Aguiló, M.: Robust speaker segmentation for meetings: The ICSI-SRI spring 2005 diarization system. In: Renals, S., Bengio, S. (eds.) MLMI 2005. LNCS, vol. 3869, pp. 402–414. Springer, Heidelberg (2006)
Pirinen, T.: Normalized confidence factors for robust direction of arrival estimation. In: Proceedings of the 2005 IEEE International Symposium on Circuits and Systems (ISCAS), IEEE Computer Society Press, Los Alamitos (2005)
Yli-Hietanen, J., Kalliojärvi, K., Astola, J.: Low-complexity angle of arrival estimation of wideband signals using small arrays. In: Proceedings of the 8th IEEE Signal Processing Workshop on Statistical Signal and Array Signal Processing, pp. 109–112. IEEE Computer Society Press, Los Alamitos (1996)
Hawkes, M., Nehorai, A.: Wideband Source Localization Using a Distributed Acoustic Vector-Sensor Array. IEEE Transactions on Signal Processing 51(6), 1479–1491 (2003)
Gordon, N., Salmond, D., Smith, A.: Novel approach to nonlinear/non-Gaussian Bayesian state estimation. Radar and Signal Processing, IEE Proceedings F 140(2), 107–113 (1993)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Pertilä, P., Korhonen, T., Pirinen, T., Parviainen, M. (2007). TUT Acoustic Source Tracking System 2006. In: Stiefelhagen, R., Garofolo, J. (eds) Multimodal Technologies for Perception of Humans. CLEAR 2006. Lecture Notes in Computer Science, vol 4122. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69568-4_9
Download citation
DOI: https://doi.org/10.1007/978-3-540-69568-4_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69567-7
Online ISBN: 978-3-540-69568-4
eBook Packages: Computer ScienceComputer Science (R0)