Abstract
Automatic indexing of multimedia documents across several different application tasks, including searching for words spoken, the detection of keywords and audio information retrieval. Thus, despite the changes made in the field of indexing speech, much remains to be done particularly for the key word search in spontaneous speech. Although the research areas of spoken words and audio retrieval has been well addressed, but still significant limitations to achieve, especially in terms of resource available today on the web.
The goal of this paper is to propose an approach for document management based multimedia indexing techniques to detect speech and keywords. We present in this article the various methods of indexing with the techniques of detection of key words. These methods derive three principal approaches from vocal indexing: the detection of key word, the detection of key words on phonetic flow (PSPL, CN,...) and the indexing containing the recognition with great vocabulary (LVR). We present, thereafter the step suggested for an approach based on the combinations of two techniques (PSPL, S-PSPL and CN, like on technique LVR.
A validation of this approach of indexing and information retrieval is in the course of validation for the field of the E-libraries.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Lee, L.S., Pan, Y.C.: Voice-Based Information Retrieval — How Far are We from the Text-Based Information Retrieval? In: Automatic Speech Recognition & Understanding, pp. 26–43 (2009)
Yao, Q., Soong, F.K., Lee, T.: Tone Enhanced Generalized Character Posterior Probability (GCPP) for Cantonese LVCSR. Comp. Speech Lang. 22, 360–373 (2008)
Chelba, C., Silva, J., Acero, A.: Soft Indexing of Speech Content for Search in Spoken Documents. Computer Speech and Language 21, 458–478 (2007)
Seide, F., Yu, P., Shi, Y.: Towards Spoken Document Retrieval for the Enterprise: Approximate Word-Lattice Indexing with Text Indexers. In: Automatic Speech Recognition & Understanding, pp. 629–634 (2007)
Hori, T., Hetherington, I.L., Hazen, T.J., Glass, J.R.: Open Vocabulary Spoken Utterance Retrieval Using Confusion Networks. In: ICASSP, pp. 73–76 (2007)
Pan, Y.C., Chang, H.L., Lee, L.S.: Subword-Based Position Specific Posterior Lattices (S-PSPL) for Indexing Speech Information. In: Interspeech, pp. 318–321 (2007)
Park, A., Hazen, T., Glass, J.: Automatic Processing of Audio Lectures for Information Retrieval: Vocabulary Selection and Language Modeling. In: Proc. ICASSP, Philadelphia, PA (2005)
Chelba, C., Acero, A.: Position Specific Posterior Lattices for Indexing Speech. In: Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL 2005), pp. 443–450. Association for Computational Linguistics, Michigan, Ann Arbor (2005)
Wessel, F., Schluter, R., Macherey, K., Ney, H.: Confidence Measures for Large Vocabulary Continuous Speech Recognition. In: SAP, vol. 9, pp. 288–298 (2001)
Mangu, L., Brill, E., Stolcke, A.: Finding Consensus in Speech Recognition: Word Error Minimization and other Applications of Confusion Networks. Computer Speech and Language 14, 373–400 (2000)
Mills. T., Pye, D., Sinclair, D., Wood, K., A Digital Photo Management System. Technical, AT&T Laboratories Cambridge, Cambridge (2000)
El Meliani, R., O’Shaughnessy, D.: Lexical Fillers for Task-Independent-Training Based Keyword Spotting and Detection of New Words. In: Proc. EUROSPEECH, pp. 2129–2133 (1995)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Issam, B., Ridda, L.M. (2012). Approaches for the Detection of the Keywords in Spoken Documents Application for the Field of E-Libraries. In: Huang, T., Zeng, Z., Li, C., Leung, C.S. (eds) Neural Information Processing. ICONIP 2012. Lecture Notes in Computer Science, vol 7666. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34478-7_25
Download citation
DOI: https://doi.org/10.1007/978-3-642-34478-7_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34477-0
Online ISBN: 978-3-642-34478-7
eBook Packages: Computer ScienceComputer Science (R0)