{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,1,8]],"date-time":"2025-01-08T05:37:27Z","timestamp":1736314647804,"version":"3.32.0"},"reference-count":35,"publisher":"MDPI AG","issue":"3","license":[{"start":{"date-parts":[[2022,1,28]],"date-time":"2022-01-28T00:00:00Z","timestamp":1643328000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"ANID\/FONDECYT Postdoctorado","award":["3190147"]},{"name":"ANID\/FONDECYT","award":["11180107"]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"Multiple simultaneous sound source localization (SSL) is one of the most important applications in the speech signal processing. The one-step algorithms with the advantage of low computational complexity (and low accuracy), and the two-step methods with high accuracy (and high computational complexity) are proposed for multiple SSL. In this article, a combination of one-step-based method based on the generalized eigenvalue decomposition (GEVD), and a two-step-based method based on the adaptive generalized cross-correlation (GCC) by using the phase transform\/maximum likelihood (PHAT\/ML) filters along with a novel T-shaped circular distributed microphone array (TCDMA) is proposed for 3D multiple simultaneous SSL. In addition, the low computational complexity advantage of the GCC algorithm is considered in combination with the high accuracy of the GEVD method by using the distributed microphone array to eliminate spatial aliasing and thus obtain more appropriate information. The proposed T-shaped circular distributed microphone array-based adaptive GEVD and GCC-PHAT\/ML algorithms (TCDMA-AGGPM) is compared with hierarchical grid refinement (HiGRID), temporal extension of multiple response model of sparse Bayesian learning with spherical harmonic (SH) extension (SH-TMSBL), sound field morphological component analysis (SF-MCA), and time-frequency mixture weight Bayesian nonparametric acoustical holography beamforming (TF-MW-BNP-AHB) methods based on the mean absolute estimation error (MAEE) criteria in noisy and reverberant environments on simulated and real data. The superiority of the proposed method is presented by showing the high accuracy and low computational complexity for 3D multiple simultaneous SSL.<\/jats:p>","DOI":"10.3390\/s22031011","type":"journal-article","created":{"date-parts":[[2022,1,29]],"date-time":"2022-01-29T06:43:27Z","timestamp":1643438607000},"page":"1011","source":"Crossref","is-referenced-by-count":7,"title":["3D Multiple Sound Source Localization by Proposed T-Shaped Circular Distributed Microphone Arrays in Combination with GEVD and Adaptive GCC-PHAT\/ML Algorithms"],"prefix":"10.3390","volume":"22","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6391-6863","authenticated-orcid":false,"given":"Ali","family":"Dehghan Firoozabadi","sequence":"first","affiliation":[{"name":"Department of Electricity, Universidad Tecnol\u00f3gica Metropolitana, Av. Jos\u00e9 Pedro Alessandri 1242, Santiago 7800002, Chile"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5186-2642","authenticated-orcid":false,"given":"Pablo","family":"Irarrazaval","sequence":"additional","affiliation":[{"name":"Electrical Engineering Department, Pontificia Universidad Cat\u00f3lica de Chile, Santiago 7820436, Chile"},{"name":"Biomedical Imaging Center, Pontificia Universidad Cat\u00f3lica de Chile, Santiago 7820436, Chile"},{"name":"Institute for Biological and Medical Engineering, Pontificia Universidad Cat\u00f3lica de Chile, Santiago 7820436, Chile"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2500-3294","authenticated-orcid":false,"given":"Pablo","family":"Adasme","sequence":"additional","affiliation":[{"name":"Electrical Engineering Department, Universidad de Santiago de Chile, Av. Ecuador 3519, Santiago 9170124, Chile"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5692-5673","authenticated-orcid":false,"given":"David","family":"Zabala-Blanco","sequence":"additional","affiliation":[{"name":"Department of Computing and Industries, Universidad Cat\u00f3lica del Maule, Talca 3466706, Chile"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3958-503X","authenticated-orcid":false,"given":"Pablo Palacios","family":"J\u00e1tiva","sequence":"additional","affiliation":[{"name":"Department of Electrical Engineering, Universidad de Chile, Santiago 8370451, Chile"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3461-4484","authenticated-orcid":false,"given":"Cesar","family":"Azurdia-Meza","sequence":"additional","affiliation":[{"name":"Department of Electrical Engineering, Universidad de Chile, Santiago 8370451, Chile"}]}],"member":"1968","published-online":{"date-parts":[[2022,1,28]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"7373","DOI":"10.1109\/ACCESS.2019.2963768","article-title":"Sound Source Localization Based on GCC-PHAT With Diffuseness Mask in Noisy and Reverberant Environments","volume":"8","author":"Lee","year":"2020","journal-title":"IEEE Access"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"320","DOI":"10.1109\/TASSP.1976.1162830","article-title":"The generalized correlation method for estimation of time delay","volume":"24","author":"Knapp","year":"1976","journal-title":"IEEE Trans. Acoust. Speech Signal Process."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Yao, K., Chen, J.C., and Hudson, R.E. (2002, January 13\u201317). Maximum-likelihood acoustic source localization: Experimental results. Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing, Orlando, FL, USA.","DOI":"10.1109\/ICASSP.2002.5745267"},{"key":"ref_4","unstructured":"Brandstein, M., and Ward, D. (2013). Microphone Arrays: Signal Processing Techniques and Applications, Springer."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1956","DOI":"10.1109\/TASLP.2017.2736067","article-title":"Augmented Intensity Vectors for Direction of Arrival Estimation in the Spherical Harmonic Domain","volume":"25","author":"Hafezi","year":"2017","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"1830","DOI":"10.1109\/TSP.2004.828896","article-title":"Blind Separation of Speech Mixtures via Time-Frequency Masking","volume":"52","author":"Yilmaz","year":"2004","journal-title":"IEEE Trans. Signal Process."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"2171","DOI":"10.1109\/TASLP.2016.2598319","article-title":"Estimation of the Direct-Path Relative Transfer Function for Supervised Sound-Source Localization","volume":"24","author":"Li","year":"2016","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Proces."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Hu, Y., Samarasinghe, P.N., Abhayapala, T.D., and Gannot, S. (2020, January 4\u20138). Unsupervised Multiple Source Localization Using Relative Harmonic Coefficients. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain.","DOI":"10.1109\/ICASSP40776.2020.9053656"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"1494","DOI":"10.1109\/TASLP.2014.2337846","article-title":"Localization of Multiple Speakers under High Reverberation using a Spherical Microphone Array and the Direct-Path Dominance Test","volume":"22","author":"Nadiri","year":"2014","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Hu, Y., Samarasinghe, P.N., and Abhayapala, T.D. (2019, January 20\u201323). Sound Source Localization Using Relative Harmonic Coefficients in Modal Domain. Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, NY, USA.","DOI":"10.1109\/WASPAA.2019.8937221"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"384","DOI":"10.1121\/1.428310","article-title":"Adaptive eigenvalue decomposition algorithm for passive acoustic source localization","volume":"107","author":"Benesty","year":"2000","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Sun, H., Teutsch, H., Mabande, E., and Kellermann, W. (2011, January 22\u201327). Robust localization of multiple sources in reverberant environments using EB-ESPRIT with spherical microphone arrays. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Prague, Czech Republic.","DOI":"10.1109\/ICASSP.2011.5946342"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"6407","DOI":"10.1109\/TSP.2015.2465302","article-title":"Performance Analysis of an Improved MUSIC DoA Estimator","volume":"63","author":"Vallet","year":"2015","journal-title":"IEEE Trans. Signal Process."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Liaquat, M.U., Munawar, H.S., Rahman, A., Qadir, Z., Kouzani, A.Z., and Mahmud, M.A.P. (2021). Sound Localization for Ad-Hoc Microphone Arrays. Energies, 14.","DOI":"10.3390\/en14123446"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"EL181","DOI":"10.1121\/1.5026122","article-title":"Direction of arrival estimation using nonsingular spherical ESPRIT","volume":"143","author":"Jo","year":"2018","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"279","DOI":"10.1109\/TASLP.2019.2953000","article-title":"Reflection Assisted Sound Source Localization Through a Harmonic Domain MUSIC Framework","volume":"28","author":"Birnie","year":"2020","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Williams, E.G. (1999). Fourier Acoustics: Sound Radiation and Nearfield Acoustical Holography, Academic Press.","DOI":"10.1016\/B978-012753960-7\/50007-3"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"1821","DOI":"10.1109\/TASLP.2017.2718733","article-title":"Perpendicular Cross-Spectra Fusion for Sound Source Localization with a Planar Microphone Array","volume":"25","author":"Stefanakis","year":"2017","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"2215","DOI":"10.1109\/TASLP.2018.2858932","article-title":"Multiple Sound Source Localization with Steered Response Power Density and Hierarchical Grid Refinement","volume":"26","author":"Coteli","year":"2018","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"2122","DOI":"10.1109\/TASLP.2018.2855960","article-title":"Robust Binaural Localization of a Target Sound Source by Combining Spectral Source Models and Deep Neural Networks","volume":"26","author":"Ma","year":"2018","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"7000304","DOI":"10.1109\/LSENS.2018.2890129","article-title":"Multiple Speech Sources Localization in Room Reverberant Environment Using Spherical Harmonic Sparse Bayesian Learning","volume":"3","author":"Dai","year":"2019","journal-title":"IEEE Sens. Lett."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"1241","DOI":"10.1109\/TASLP.2019.2915785","article-title":"Multiple Sound Source Counting and Localization Based on TF-Wise Spatial Spectrum Clustering","volume":"27","author":"Yang","year":"2019","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"87749","DOI":"10.1109\/ACCESS.2020.2993076","article-title":"Free-Field TDOA-AOA Sound Source Localization Using Three Soundfield Microphones","volume":"8","author":"Kraljevic","year":"2020","journal-title":"IEEE Access"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"6503413","DOI":"10.1109\/TIM.2021.3077670","article-title":"Acoustic Source Localization in a Reverberant Environment Based on Sound Field Morphological Component Analysis and Alternating Direction Method of Multipliers","volume":"70","author":"Chu","year":"2021","journal-title":"IEEE Trans. Instrum. Meas."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"1864","DOI":"10.1109\/TASLP.2021.3079809","article-title":"Indoor Multi-Speaker Localization Based on Bayesian Nonparametrics in the Circular Harmonic Domain","volume":"29","author":"SongGong","year":"2021","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"253","DOI":"10.1109\/TASLP.2020.3039569","article-title":"Multiple Source Direction of Arrival Estimations Using Relative Sound Pressure Based MUSIC","volume":"29","author":"Hu","year":"2021","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"268","DOI":"10.1109\/TASLP.2018.2877892","article-title":"CountNet: Estimating the Number of Concurrent Speakers Using Supervised Learning","volume":"27","author":"Stoter","year":"2019","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"ref_28","unstructured":"Dehghan Firoozabadi, A., and Abutalebi, H.R. (2010, January 11\u201313). SRP-ML: A Robust SRP-based speech source localization method for Noisy environments. Proceedings of the 18th Iranian Conference on Electrical Engineering (ICEE), Isfahan, Iran."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Dehghan Firoozabadi, A., Irarrazaval, P., Adasme, P., Zabala-Blanco, D., Palacios-J\u00e1tiva, P., Durney, H., Sanhueza, M., and Azurdia-Meza, C. (2021, January 23\u201327). Three-dimensional sound source localization by distributed microphone arrays. Proceedings of the 29th European Signal Processing Conference (EUSIPCO), Dublin, Ireland.","DOI":"10.23919\/EUSIPCO54536.2021.9616326"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"495250","DOI":"10.1155\/S111086570330602X","article-title":"Robust Adaptive Time Delay Estimation for Speaker Localization in Noisy and Reverberant Acoustic Environments","volume":"2003","author":"Doclo","year":"2003","journal-title":"EURASIP J. Adv. Signal Process."},{"key":"ref_31","unstructured":"Garofolo, J.S., Lamel, L.F., Fisher, W.M., Fiscus, J.G., Pallett, D.S., Dahlgren, N.L., and Zue, V. (1993). TIMIT Acoustic-Phonetic Continuous Speech Corpus LDC93S1, Linguistic Data Consortium. Available online: https:\/\/catalog.ldc.upenn.edu\/LDC93S1."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Cetin, O., and Shriberg, E. (2006, January 17\u201321). Analysis of overlaps in meetings by dialog factors, hot spots, speakers, and collection site: Insights for automatic speech recognition. Proceedings of the Interspeech, Pittsburg, PA, USA.","DOI":"10.21437\/Interspeech.2006-91"},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"943","DOI":"10.1121\/1.382599","article-title":"Image method for efficiently simulating small-room acoustics","volume":"65","author":"Allen","year":"1979","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_34","unstructured":"Momenzadeh, H. (2007). Speaker Localization Using Microphone Arrays. [Master\u2019s Thesis, Yazd University]."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Jia, M., Wu, Y., Bao, C., and Wang, J. (2018). Multiple Sound Sources Localization with Frame-by-Frame Component Removal of Statistically Dominant Source. Sensors, 18.","DOI":"10.3390\/s18113613"}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/22\/3\/1011\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,1,7]],"date-time":"2025-01-07T16:00:36Z","timestamp":1736265636000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/22\/3\/1011"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,1,28]]},"references-count":35,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2022,2]]}},"alternative-id":["s22031011"],"URL":"https:\/\/doi.org\/10.3390\/s22031011","relation":{},"ISSN":["1424-8220"],"issn-type":[{"type":"electronic","value":"1424-8220"}],"subject":[],"published":{"date-parts":[[2022,1,28]]}}}