{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,9,13]],"date-time":"2023-09-13T21:23:01Z","timestamp":1694640181913},"reference-count":23,"publisher":"Emerald","issue":"2","license":[{"start":{"date-parts":[[2005,5,1]],"date-time":"2005-05-01T00:00:00Z","timestamp":1114905600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2005,5,1]]},"abstract":"We present an approach for blind separation of acoustic sources produced from multiple speakers mixed in realistic room environments. We first transform recorded signals into the time\u2010frequency domain to make mixing become instantaneous. We then separate the sources in each frequency bin based on an independent component analysis (ICA) algorithm. For the present paper, we choose the complex version of fixedpoint iteration (CFPI), i.e. the complex version of FastICA, as the algorithm. From the separated signals in the time\u2010frequency domain, we reconstruct output\u2010separated signals in the time domain. To solve the so\u2010called permutation problem due to the indeterminacy of permutation in the standard ICA, we propose a method that applies a special property of the CFPI cost function. Generally, the cost function has several optimal points that correspond to the different permutations of the outputs. These optimal points are isolated by some non\u2010optimal regions of the cost function. In different but neighboring bins, optimal points with the same permutation are at almost the same position in the space of separation parameters. Based on this property, if an initial separation matrix for a learning process in a frequency bin is chosen equal to the final separation matrix of the learning process in the neighboring frequency bin, the learning process automatically leads us to separated signals with the same permutation as that of the neighbor frequency bin. In each bin, but except the starting one, by chosen the initial separation matrix in such a way, the permutation problem in the time domain reconstruction can be avoided. We present the results of some simulations and experiments on both artificially synthesized speech data and real\u2010world speech data, which show the effectiveness of our approach.<\/jats:p>","DOI":"10.1108\/17427370580000115","type":"journal-article","created":{"date-parts":[[2010,6,5]],"date-time":"2010-06-05T11:18:42Z","timestamp":1275736722000},"page":"89-100","source":"Crossref","is-referenced-by-count":5,"title":["Blind source separation of acoustic signals in realistic environments based on ICA in the time\u2010frequency domain"],"prefix":"10.1108","volume":"1","author":[{"given":"Shuxue","family":"Ding","sequence":"first","affiliation":[]},{"given":"Andrzej","family":"Cichocki","sequence":"additional","affiliation":[]},{"given":"Jie","family":"Huang","sequence":"additional","affiliation":[]},{"given":"Daming","family":"Wei","sequence":"additional","affiliation":[]}],"member":"140","reference":[{"key":"p_1","doi-asserted-by":"publisher","DOI":"10.1016\/0165-1684(91)90079-X"},{"key":"p_2","doi-asserted-by":"publisher","DOI":"10.1162\/neco.1995.7.6.1129"},{"key":"p_3","doi-asserted-by":"publisher","DOI":"10.1016\/0165-1684(94)90029-9"},{"key":"p_4","series-title":"Advances in Neural Information Processing Systems","first-page":"757","volume":"8","author":"Amari S.","year":"1996"},{"key":"p_5","volume-title":"Adaptive Blind Signal and Image Processing: Learning Algorithms and Applications","author":"Cichocki A.","year":"2003"},{"key":"p_6","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevLett.72.3634"},{"key":"p_7","doi-asserted-by":"publisher","DOI":"10.1137\/S0895479893259546"},{"key":"p_8","doi-asserted-by":"publisher","DOI":"10.1142\/S0129065797000239"},{"key":"p_9","unstructured":"R. H. Lambert. Multichannel blind deconvolution: FIR matrix algebra and separation of multipath mixtures. PhD dissertation,University of Southern California, Departmentof Electrical Engineering,May 1996."},{"key":"p_10","first-page":"377","volume-title":"Unsupervised Adaptive Filtering","volume":"1","author":"Lambert R. H.","year":"2000"},{"key":"p_11","unstructured":"N. Murata, S. Ikeda and A. Ziehe. An approach to blind source separation based on temporal structure of speech signals. BSIS Technical Reports, 98-92, 1998."},{"key":"p_12","unstructured":"M. Ogawa, F. Asano, S. Ikeda, H. Asoh and N. Kitawaki. Blind source separation for acoustic signals using subspace method and frequencydomain infomax. Technical Report of IEICE. EA2000-50: 15-22, 2000."},{"key":"p_13","doi-asserted-by":"publisher","DOI":"10.1049\/el:20000623"},{"key":"p_14","volume-title":"Proc. ICA'99","author":"Anem\u00fcller J.","year":"1999"},{"key":"p_15","unstructured":"J. Anem\u00fcller. Across-frequency processing in convolutive blind source separation. PhD thesis,Dept. of Physics, Universityof Oldenburg, Oldenburg, Germany,2001."},{"key":"p_16","doi-asserted-by":"publisher","DOI":"10.1142\/S1469026804001252"},{"key":"p_17","first-page":"423","volume-title":"Proc. ICA2000","author":"Barros A. K.","year":"2000"},{"key":"p_18","volume-title":"Proc. ICA'99","author":"Torkkola K.","year":"1999"},{"key":"p_19","doi-asserted-by":"publisher","DOI":"10.1109\/72.761722"},{"key":"p_20","doi-asserted-by":"publisher","DOI":"10.1142\/S0129065700000028"},{"key":"p_21","doi-asserted-by":"publisher","DOI":"10.1007\/978-94-010-0812-9"},{"key":"p_22","first-page":"261","volume-title":"Proc. Int. Workshop Independent Component Analysis and Blind Signal Separation","author":"Schobben D.","year":"1999"},{"key":"p_23","doi-asserted-by":"publisher","DOI":"10.1109\/89.841214"}],"container-title":["International Journal of Pervasive Computing and Communications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/17427370580000115\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/17427370580000115\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,3,22]],"date-time":"2022-03-22T20:51:51Z","timestamp":1647982311000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/17427370580000115\/full\/html"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2005,5,1]]},"references-count":23,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2005,5,1]]}},"alternative-id":["10.1108\/17427370580000115"],"URL":"https:\/\/doi.org\/10.1108\/17427370580000115","relation":{},"ISSN":["1742-7371"],"issn-type":[{"value":"1742-7371","type":"print"}],"subject":[],"published":{"date-parts":[[2005,5,1]]}}}