Abstract
Voice pathologies have become a social concern, as voice and speech play an important role in certain professions, and in the general population quality of life. In these last years emphasis has been placed in early pathology detection, for which classical perturbation measurements (jitter, shimmer, HNR, etc.) have been used. Going one step ahead the present work is aimed to estimate the values of the biomechanical parameters of the vocal fold system, as mass, stiffness and losses by the inversion of the vocal fold structure, which could help non only in pathology detection, but in classifying the specific patient’s pathology as well. The model structure of the vocal cord will be presented, and a method to estimate the biomechanical parameters of the cord body structure will be described. From these, deviations from normophonic cases, and unbalance between cords may be extracted to serve as pathology correlates. The relevance of deviations and unbalance in Pathology Detection is shown through Principal Component Analysis. Results for normal and pathological cases will be presented and discussed.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Akande, O.O., Murphy, P.J.: Estimation of the vocal tract transfer function with application to glottal wave analysis. Speech Communication 46(1), 1–13 (2005)
Alku, P.: An Automatic Method to Estimate the Time-Based Parameters of the Glottal Pulseform. In: ICASSP 1992, p. II/29–32 (1992)
Berry, D.A.: Mechanisms of modal and nonmodal phonation. Journal of Phonetics 29, 431–450 (2001)
Cannito, M.P., Buder, E.H., Chorna, L.B.: Spectral Amplitude Measures of Adductor Spasmodic Dysphonic Speech. Journal of Voice 19(3), 391–410 (2005)
Childers, D.G.: Speech Processing and Synthesis Toolboxes. John Wiley & Sons, New York (2000)
Dibazar, A.A., Narayanan, S.: A System for Automatic Detection of Pathological Speech. In: Proc. of the 36th Asilomar Conf. on Signals, Systems and Computers (2002)
Godino, J.I., Gómez, P.: Automatic Detection of Voice Impairments by means of Short Term Cepstral Parameters and Neural Network based Detectors. IEEE Trans. on Biomed. Eng. 51(2), 380–384 (2004)
Gómez, P., Díaz, F., Martínez, R., Godino, J.I., Álvarez, A., Rodríguez, F., Rodellar, V.: Precise Reconstruction of the Mucosal Wave for Voice Pathology Detection and Classification. In: Proceedings of the EUSIPCO 2004, pp. 297–300 (2004)
Gómez, P., Godino, J.I., Díaz, F., Álvarez, A., Martínez, R., Rodellar, V.: Biomechanical Parameter Fingerprint in the Mucosal Wave Power Spectral Density. In: Proc. of the ICSLP 2004, pp. 842–845 (2004)
Gómez, P., Godino, J.I., Álvarez, A., Martínez, R., Nieto, V., Rodellar, V.: Evidence of Glottal Source Spectral Features found in Vocal Fold Dynamics. In: Proceedings of the 2004 Int. Conf. on Acoustics, Speech and Signal Processing, ICASSP 2005, pp. 441–444 (2005)
Gómez, P., Díaz, F., Álvarez, A., Martínez, R., Rodellar, V., Fernández, R., Nieto, A., Fernández, F.J.: PCA of Perturbation Parameters in Voice Pathology Detection. In: Proc. of EUROSPEECH 2005, pp. 645–648 (2005)
Johnson, R.A., Wichern, D.W.: Applied Multivariate Statistical Analysis. Prentice-Hall, Upper Saddle River (2002)
Growing Problem for Call Centre Workers Suffering from Repetitive Voice Injury, http://www.healthandsafetytips.co.uk/News.htm
Kuo, J., Holmberg, E.B., Hillman, R.E.: Discriminating Speakers with Vocal Nodules Using Aerodynamic and Acoustic Features. In: Proc. of the ICASSP 1999, March 15-19, vol. 1, pp. 77–80 (1999)
Ritchings, T., McGillion, M., Moore, C.: Pathological voice quality assessment using artificial neural networks. Medical Engineering and Physics 24, 561–564 (2002)
Simberg, S.: Prevalence of Vocal Symptoms and Voice Disorders among Teacher Students and Teachers and a Model of Early Intervention, Ph.D. Thesis, Department of Speech Sciences, University of Helsinki, http://ethesis.helsinki.fi/
Story, B.H.: An overview of the physiology, physics and modeling of the sound source for vowels. Acoustic Sci. & Tech. 23(4), 195–206 (2002)
Story, B.H., Titze, I.R.: Voice simulation with a bodycover model of the vocal folds. J. Acoust. Soc. Am. 97, 1249–1260 (1995)
The Voice Center of Eastern Virginia Medical School, http://www.voice-center.com/larynx_ca.html
Titze, I.R.: Principles of Voice Production. Prentice-Hall, Englewood Cliffs (1994)
Titze, I.R.: The physics of small-amplitude oscillation of the vocal folds. Journal of the Acoustical Society of America 83(4), 1536–1552 (1988)
Titze, I.R., Lemke, J., Montequin, D.: Populations in the U.S. workforce who rely on voice as a primary tool of trade: A preliminary report. Journal of Voice 11, 254–259 (1997)
Varho, S., Alku, P.: A new predictive method for all-pole modelling of speech spectra with a compressed set of parameters. In: Proc. ICASSP 1999, pp. III/126–129 (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gómez, P. et al. (2006). Voice Pathology Detection by Vocal Cord Biomechanical Parameter Estimation. In: Faundez-Zanuy, M., Janer, L., Esposito, A., Satue-Villar, A., Roure, J., Espinosa-Duro, V. (eds) Nonlinear Analyses and Algorithms for Speech Processing. NOLISP 2005. Lecture Notes in Computer Science(), vol 3817. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11613107_21
Download citation
DOI: https://doi.org/10.1007/11613107_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-31257-4
Online ISBN: 978-3-540-32586-4
eBook Packages: Computer ScienceComputer Science (R0)