Abstract
Fuzzy clustering has been proved successful in various fields in the recent past. In this paper, we introduce fuzzy clustering algorithms into the domain of automatic speaker clustering, and present a novel fuzzy-based hierarchical speaker clustering algorithm by applying fuzzy theory into the state-of-the-art agglomerative hierarchical clustering. This method follows a bottom-up strategy, and determines the fuzzy memberships according to a membership propagation strategy, which propagates fuzzy memberships in the iterative process of hierarchical clustering. Further analysis reveals that this method is an extension of conventional hierarchical clustering algorithm. Experiment results show that our method exhibits quite competitive performances compared to conventional k-means, fuzzy c-means and agglomerative hierarchical clustering algorithms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Jin, H., Kubala, F., Schwartz, R.: Automatic Speaker Clustering. In: Proceedings of the DARPA Speech Recognition Workshop, pp. 108–111 (1997)
Liu, D., Kubala, F., Technol, B., Cambridge, M.: Online Speaker Clustering. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings (ICASSP 2003), vol. 1 (2003)
Ajmera, J., Wooters, C.: A Robust Speaker Clustering Algorithm. In: IEEE Workshop on Automatic Speech Recognition and Understanding, 2003. ASRU 2003, pp. 411–416 (2003)
Zhang, X., Gao, J., Lu, P., Yan, Y.: A Novel Speaker Clustering Algorithm via Supervised Affinity propagation. In: IEEE International Conference on Acoustics, Speech and Signal Processing, 2008. ICASSP 2008, pp. 4369–4372 (2008)
Padmanabhan, M., Bahl, L., Nahamoo, D., Picheny, M., Center, I., Heights, Y.: Speaker Clustering and Transformation for Speaker Adaptation Inspeech Recognition Systems. Speech and Audio Processing 6(1), 71–77 (1998)
Barras, C., Zhu, X., Meignier, S., Gauvain, J.: Improving Speaker Diarization. In: RT-04F Workshop (November 2004)
Reynolds, D., Torres-Carrasquillo, P.: Approaches and Applications of Audio Diarization. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005. Proceedings (ICASSP 2005), vol. 5 (2005)
Dougherty, J., Kohavi, R., Sahami, M.: Supervised and Unsupervised Discretization of Continuous Features. In: Proceedings of the Twelfth International Conference on Machine Learning, vol. 202, pp. 194–202. Morgan Kaufmann, San Francisco (1995)
Duda, R., Hart, P., Stork, D.: Pattern classification. Wiley, Chichester (2001)
Wilcox, L., Chen, F., Kimber, D., Balasubramanian, V.: Segmentation of Speech Using Speaker Identification. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, 1994. ICASSP 1994, vol. 1 (1994)
Pedrycz, W., Gomide, F.: An Introduction to Fuzzy Sets: Analysis and Design. MIT Press, Cambridge (1998)
Delacourt, P., Wellekens, C.: DISTBIC: A Speaker-Based Segmentation for Audio Data Indexing. Speech Communication 32(1-2), 111–126 (2000)
Gish, H., Schmidt, M.: Text-Independent Speaker Identification. Signal Processing Magazine 11(4), 18–32 (1994)
Stadelmann, T., Freisleben, B.: Fast and Robust Speaker Clustering Using the Earth Movers Distance and MIXMAX Models. In: Proceedings of the 31st International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, pp. 989–992 (2006)
Dembele, D., Kastner, P.: Fuzzy C-means Method for Clustering Microarray Data (2003)
Cannon, R., Dave, J., Bezdek, J.: Efficient Implementation of the Fuzzy C-Means Clustering Algorithms. IEEE Transactions on Pattern Analysis and Machine Intelligence 8(2), 248–255 (1986)
Wang, W., Lv, P., Zhao, Q., Yan, Y.: A decision-tree-based online speaker clustering. In: Martí, J., Benedí, J.M., Mendonça, A.M., Serrat, J. (eds.) IbPRIA 2007. LNCS, vol. 4477, pp. 555–562. Springer, Heidelberg (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, H., Zhang, X., Suo, H., Zhao, Q., Yan, Y. (2009). A Novel Fuzzy-Based Automatic Speaker Clustering Algorithm. In: Yu, W., He, H., Zhang, N. (eds) Advances in Neural Networks – ISNN 2009. ISNN 2009. Lecture Notes in Computer Science, vol 5552. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-01510-6_72
Download citation
DOI: https://doi.org/10.1007/978-3-642-01510-6_72
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-01509-0
Online ISBN: 978-3-642-01510-6
eBook Packages: Computer ScienceComputer Science (R0)