Abstract
Human listeners use lexical stress for word segmentation and disambiguation. We look into using lexical stress for speech recognition by examining a Dutch-language corpus. We propose that different spectral features are needed for different phonemes and that, besides vowels, consonants should be taken into account.
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
van Dalen, R.C., Wiggers, P., Rothkrantz, L.J.M. (2005). Modelling Lexical Stress. In: Matoušek, V., Mautner, P., Pavelka, T. (eds) Text, Speech and Dialogue. TSD 2005. Lecture Notes in Computer Science, vol 3658. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11551874_27
DOI: https://doi.org/10.1007/11551874_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28789-6
Online ISBN: 978-3-540-31817-0
eBook Packages: Computer Science, Computer Science (R0)