Abstract
We address the problem of effectively monitoring audio programs on the web. The paper tries to present how to construct such an audio program surveillance system using several state-of-the-art speech technologies. A real-world system WAPS (Web Audio Program Surveillance) is used as an example. WAPS is described in details in terms of the challenges it faces, it system architecture and its component modules. Objective evaluation of the whole WAPS is also given. Experiments show that WAPS shows satisfying performance on both artificially created data and real web data.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Koumpis, K., Renals, S.: Content-based access to spoken audio. IEEE Signal Processing Magazine 22(5), 61–69 (2005)
Chelba, C., Hazen, T., Saraclar, M.: Retrieval and browsing of spoken content. IEEE Signal Processing Magazine 25(3), 39–49 (2008)
Manuel, J., Thong, V., Moreno, P., et al.: Speechbot: An Experimental Speech-based Search Engine for Multimedia Content on the Web. IEEE Trans. on Mutimedias 3(4), 88–96 (2002)
Christopher, A., Michiel, B., Ari, B., Ciprian, C., Anastassia, D.: An audio indexing system for election video material. In: ICASSP 2009, vol. 77(2), pp. 596–599 (2003)
Rose, R.: Keyword detection in conversational speech utterances using hidden Markov model based continuous speech recognition. Computer speech & language (Print) 9(4), 309–333 (1995)
Campbell, W., Sturim, D., Reynolds, D.: Support vector machines using GMM supervectors for speaker verification. IEEE Signal Processing Letters 13(5), 308–311 (2006)
Suo, H., Li, M., Xiao, X., Zhang, X., Wang, X., Lv, P., Yan, Y.: IOA ThinkIT Speech Laboratory System Description for NIST LRE 2007. In: Workshop of NIST LRE 2007 (2007)
Hhkkan-Tur, D., Riccardi, G.: A General Algorithm For Word Graph Matrix Decomposition. In: ICASSP 2003, vol. 77(2), pp. 596–599 (2003)
Shao, J., Li, T., Zhang, Q., Zhao, Q., Yan, Y.: A One-Pass Real- Time Decoder Using Memory-Efficient State Network. IEICE Transactions on Information and Systems 91(3), 529 (2008)
Gao, J., Shao, J., Zhang, Q., Zhao, Q., Yan, Y.: Spoken Term Detection Using Dynamic Match Subword Confusion Network. In: Fourth International Conference on Natural Computation. ICNC 2008, vol. 4 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gao, J., Sun, Y., Suo, H., Zhao, Q., Yan, Y. (2009). WAPS: An Audio Program Surveillance System for Large Scale Web Data Stream. In: Liu, W., Luo, X., Wang, F.L., Lei, J. (eds) Web Information Systems and Mining. WISM 2009. Lecture Notes in Computer Science, vol 5854. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-05250-7_13
Download citation
DOI: https://doi.org/10.1007/978-3-642-05250-7_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-05249-1
Online ISBN: 978-3-642-05250-7
eBook Packages: Computer ScienceComputer Science (R0)