Transfer Learning to Detect Parkinson’s Disease from Speech In Different Languages Using Convolutional Neural Networks with Layer Freezing

Rios-Urrego, Cristian David; Vásquez-Correa, Juan Camilo; Orozco-Arroyave, Juan Rafael; Nöth, Elmar

doi:10.1007/978-3-030-58323-1_36

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12284))

Included in the following conference series:

International Conference on Text, Speech, and Dialogue

1724 Accesses

Abstract

Parkinson’s Disease is a neurodegenerative disorder characterized by motor symptoms such as resting tremor, bradykinesia, rigidity and freezing of gait. The most common symptom in speech is called hypokinetic dysarthria, where speech is characterized by monotone intensity, low pitch variability and poor prosody that tends to fade at the end of the utterance. This study proposes the classification of patients with Parkinson’s Disease and healthy controls in three different languages (Spanish, German, and Czech) using a transfer learning strategy. The process is further improved by freezing consecutive different layers of the architecture. We hypothesize that some convolutional layers characterize the disease and others the language. Therefore, when a fine-tuning in the transfer learning is performed, it is possible to find the topology that best adapts to the target language and allows an accurate detection of Parkinson’s Disease. The proposed methodology uses Convolutional Neural Networks trained with Mel-scale spectrograms. Results indicate that the fine-tuning of the neural network does not provide good performance in all languages while fine-tuning of individual layers improves the accuracy by up to 7%. In addition, the results show that Transfer Learning among languages improves the performance in up to 18% when compared to a base model used to initialize the weights of the network.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

¥17,985 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: JPY 3498; Price includes VAT (Japan)

eBook: JPY 10295; Price includes VAT (Japan)

Softcover Book: JPY 12869; Price includes VAT (Japan)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Convolutional Neural Networks and a Transfer Learning Strategy to Classify Parkinson’s Disease from Speech in Three Different Languages

Parkinson’s Disease Detection from Speech Using Convolutional Neural Networks

Parkinson’s Disease Assessment from Speech Data Using Recurrence Plot

References

Bocklet, T., et al.: Automatic evaluation of parkinson’s speech-acoustic, prosodic and voice related cues. In: Proceedings of INTERSPEECH, pp. 1149–1153 (2013)
Google Scholar
Goetz, C., et al.: Movement disorder society-sponsored revision of the unified parkinson’s disease rating scale (mds-updrs): scale presentation and clinimetric testing results. Mov. Disord. Official J. Mov. Disord. Soc. 23(15), 2129–2170 (2008)
Article Google Scholar
Hornykiewicz, O.: Biochemical aspects of parkinson’s disease. Neurology 51(2 Suppl 2), S2–S9 (1998)
Article Google Scholar
Khojasteh, P., et al.: Parkinson’s disease diagnosis based on multivariate deep features of speech signal. In: Proceedings of LSC, pp. 187–190. IEEE (2018)
Google Scholar
Kruithof, M., et al.: Object recognition using deep convolutional neural networks with complete transfer and partial frozen layers. In: Proceedings of SPIE, vol. 9995, p. 99950K. International Society for Optics and Photonics (2016)
Google Scholar
Logemann, J.A., et al.: Frequency and cooccurrence of vocal tract dysfunctions in the speech of a large sample of parkinson patients. J. Speech Lang. Hear. Res. 43(1), 47–57 (1978)
Google Scholar
McKinlay, A., et al.: A profile of neuropsychiatric problems and their relationship to quality of life for parkinson’s disease patients without dementia. Parkinsonism Relat. Disord. 14(1), 37–42 (2008)
Article Google Scholar
McNemar, Q.: Note on the sampling error of the difference between correlated proportions or percentages. Psychometrika 12(2), 153–157 (1947)
Article Google Scholar
Naseer, A., et al.: Refining parkinson’s neurological disorder identification through deep transfer learning. Neural Comput. Appl. 32(3), 839–854 (2020)
Article Google Scholar
Oquab, M., et al.: Learning and transferring mid-level image representations using convolutional neural networks. In: Proceedings of CVPR, pp. 1717–1724 (2014)
Google Scholar
Orozco-Arroyave, J.R.: Analysis of speech of people with Parkinson’s disease, vol. 41. Logos Verlag Berlin GmbH (2016)
Google Scholar
Orozco-Arroyave, J.R., et al.: New Spanish speech corpus database for the analysis of people suffering from Parkinson’s disease. In: Proceedings of LREC, pp. 342–347 (2014)
Google Scholar
Rusz, J.: Detecting speech disorders in early Parkinson’s disease by acoustic analysis (2018)
Google Scholar
Vásquez-Correa, et al.: Convolutional neural network to model articulation impairments in patients with Parkinson’s disease. In: Proceedings of INTERSPEECH, pp. 314–318 (2017)
Google Scholar
Vásquez-Correa, J.C., et al.: Convolutional neural networks and a transfer learning strategy to classify parkinson’s disease from speech in three different languages. In: Nyström, I., Hernández Heredia, Y., Milián Núñez, V. (eds.) CIARP 2019. LNCS, vol. 11896, pp. 697–706. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-33904-3_66
Chapter Google Scholar
Wang, D., Zheng, T.F.: Transfer learning for speech and language processing. In: Proceedings of APSIPA, pp. 1225–1237. IEEE (2015)
Google Scholar
Wodzinski, M., et al.: Deep learning approach to Parkinson’s disease detection using voice recordings and convolutional neural network dedicated to image classification. In: Proceedings of EMBC, pp. 717–720. IEEE (2019)
Google Scholar
Yorkston, K.M., et al.: The effect of rate control on the intelligibility and naturalness of dysarthric speech. J. Speech Hear. Disord. 55(3), 550–560 (1990)
Article Google Scholar
Yosinski, J., et al.: Understanding neural networks through deep visualization (2015). ArXiv Preprint arXiv:1506.06579
Yunusova, Y., Weismer, G.G., Lindstrom, M.J.: Classifications of vocalic segments from articulatory kinematics: healthy controls and speakers with dysarthria. J. Speech Lang. Hear. Res. (2011)
Google Scholar

Download references

Acknowledgments

The work reported here was financed by CODI from University of Antioquia by grant Number 2017–15530. This project has received funding from the European Unions Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie Grant Agreement No. 766287.

Author information

Authors and Affiliations

Faculty of Engineering, University of Antioquia UdeA, Medellín, Colombia
Cristian David Rios-Urrego, Juan Camilo Vásquez-Correa & Juan Rafael Orozco-Arroyave
Pattern Recognition Lab, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
Juan Camilo Vásquez-Correa, Juan Rafael Orozco-Arroyave & Elmar Nöth

Authors

Cristian David Rios-Urrego
View author publications
You can also search for this author in PubMed Google Scholar
Juan Camilo Vásquez-Correa
View author publications
You can also search for this author in PubMed Google Scholar
Juan Rafael Orozco-Arroyave
View author publications
You can also search for this author in PubMed Google Scholar
Elmar Nöth
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Cristian David Rios-Urrego .

Editor information

Editors and Affiliations

Faculty of Informatics, Masaryk University, Brno, Czech Republic
Petr Sojka
Faculty of Informatics, Masaryk University, Brno, Czech Republic
Ivan Kopeček
Faculty of Informatics, Masaryk University, Brno, Czech Republic
Karel Pala
Faculty of Informatics, Masaryk University, Brno, Czech Republic
Aleš Horák

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Rios-Urrego, C.D., Vásquez-Correa, J.C., Orozco-Arroyave, J.R., Nöth, E. (2020). Transfer Learning to Detect Parkinson’s Disease from Speech In Different Languages Using Convolutional Neural Networks with Layer Freezing. In: Sojka, P., Kopeček, I., Pala, K., Horák, A. (eds) Text, Speech, and Dialogue. TSD 2020. Lecture Notes in Computer Science(), vol 12284. Springer, Cham. https://doi.org/10.1007/978-3-030-58323-1_36

Download citation

DOI: https://doi.org/10.1007/978-3-030-58323-1_36
Published: 01 September 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-58322-4
Online ISBN: 978-3-030-58323-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics