Assessing Alzheimer’s Disease from Speech Using the i-vector Approach | SpringerLink
Skip to main content

Assessing Alzheimer’s Disease from Speech Using the i-vector Approach

  • Conference paper
  • First Online:
Speech and Computer (SPECOM 2019)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11658))

Included in the following conference series:

Abstract

One of the world’s chronic neuro-degenerative diseases, Alzheimer’s Disease (AD), leads its sufferers, among other symptoms, to suffer from speech difficulties. In particular, the inability to recall vocabulary which makes patients’ speech different. Furthermore, Mild Cognitive Impairment (MCI) is usually considered as a prodromal neuro-degenerative state of AD. The key to abate the progress of both disorders is their early diagnosis. However, actual ways of diagnosis are costly and quite time-consuming. In this study, we propose the extraction of features from speech through the use of the i-vector approach, by which we seek to model the speech pattern of the three mental conditions from the subjects. To the best of our knowledge, no previous studies have utilized i-vector features to assess Alzheimer’s before. These i-vectors are extracted from Mel-Frequency Cepstral Coefficients (MFCCs), then they are given to a SVM classifier in order to identify the speech in one of the following manners: AD - Alzheimer Disease, MCI - Mild Cognitive Impairment, HC - Healthy Control. We tested these i-vector features by performing a 5-fold cross-validation and we achieved an F1-score of 79.2%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
JPY 3498
Price includes VAT (Japan)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
JPY 5719
Price includes VAT (Japan)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
JPY 7149
Price includes VAT (Japan)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Baldas, V., Lampiris, C., Capsalis, C., Koutsouris, D.: Early diagnosis of Alzheimer’s type dementia using continuous speech recognition. In: Lin, J.C., Nikita, K.S. (eds.) MobiHealth 2010. LNICST, vol. 55, pp. 105–110. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-20865-2_14

    Chapter  Google Scholar 

  2. Campbell, W.M., Sturim, D.E., Reynolds, D.A.: Support vector machines using GMM supervectors for speaker verification. IEEE Signal Process. Lett. 13(5), 308–311 (2006)

    Article  Google Scholar 

  3. Cernak, M., Komaty, A., Mohammadi, A., Anjos, A., Marcel, S.: Bob speaks Kaldi. In: Proceedings of Interspeech, August 2017

    Google Scholar 

  4. Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. 2, 1–27 (2011)

    Article  Google Scholar 

  5. Dehak, N., et al.: Support vector machines and joint factor analysis for speaker verification. In: 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 4237–4240. IEEE (2009)

    Google Scholar 

  6. Dehak, N., Kenny, P.J., Dehak, R., Dumouchel, P., Ouellet, P.: Front-end factor analysis for speaker verification. IEEE Trans. Audio Speech Lang. Process. 19(4), 788–798 (2011)

    Article  Google Scholar 

  7. Folstein, M., Folstein, S., McHugh, P.: Mini-mental state: a practical method for grading the cognitive state of patients for the clinician. J. Psychiatr. Res. 12(3), 189–198 (1975)

    Article  Google Scholar 

  8. Förstl, H., Kurz, A.: Clinical features of Alzheimer’s disease. Eur. Arch. Psychiatry Clin. Neurosci. 249(6), 288–290 (1999). https://doi.org/10.1007/s004060050101

    Article  Google Scholar 

  9. Frank, E.: Effect of Alzheimer’s disease on communication function. J. S. C. Med. Assoc. 9(90), 417–23 (1994)

    Google Scholar 

  10. Fraser, K., Rudzicz, F., Graham, N., Rochon, E.: Automatic speech recognition in the diagnosis of primary progressive aphasia. In: Proceedings of SLPAT, Grenoble, France, pp. 47–54 (2013)

    Google Scholar 

  11. Fraser, K.C., et al.: Automated classification of primary progressive aphasia subtypes from narrative speech transcripts. Cortex 55, 43–60 (2014)

    Article  Google Scholar 

  12. Fraser, K.C., Rudzicz, F., Rochon, E.: Using text and acoustic features to diagnose progressive aphasia and its subtypes. In: Proceedings of Interspeech, Lyon, France, pp. 25–29 (2013)

    Google Scholar 

  13. Freedman, M., Leach, L., Kaplan, E., Winocur, G., Shulman, K., Delis, D.: Clock Drawing: A Neuropsychological Analysis. Oxford University Press, New York (1994)

    Google Scholar 

  14. Galvin, J.E., Sadowsky, C.H.: Practical guidelines for the recognition and diagnosis of dementia. J. Am. Board Fam. Med. 25(3), 367–382 (2012)

    Article  Google Scholar 

  15. Ganchev, T., Fakotakis, N., Kokkinakis, G.: Comparative evaluation of various MFCC implementations on the speaker verification task. In: Proceedings of the SPECOM, vol. 1, pp. 191–194 (2005)

    Google Scholar 

  16. García, N., Orozco-Arroyave, J.R., D’Haro, L.F., Dehak, N., Nöth, E.: Evaluation of the neurological state of people with Parkinson’s Disease using i-vectors. In: INTERSPEECH (2017)

    Google Scholar 

  17. García, N., Vásquez-Correa, J., Orozco-Arroyave, J.R., Nöth, E.: Multimodal i-vectors to detect and evaluate Parkinson’s disease. In: Proceedings of Interspeech 2018, pp. 2349–2353 (2018)

    Google Scholar 

  18. Gosztolya, G., Vincze, V., Tóth, L., Pákáski, M., Kálmán, J., Hoffmann, I.: Identifying mild cognitive impairment and mild Alzheimer’s disease based on spontaneous speech using ASR and linguistic features. Comput. Speech Lang. 53, 181–197 (2019). http://www.sciencedirect.com/science/article/pii/S088523081730342X

    Article  Google Scholar 

  19. Grzybowska, J., Kacprzak, S.: Speaker age classification and regression using i-vectors. In: INTERSPEECH, pp. 1402–1406 (2016)

    Google Scholar 

  20. Hansen, J.H.L., Hasan, T.: Speaker recognition by machines and humans: a tutorial review. IEEE Signal Process. Mag. 32(6), 74–99 (2015). https://doi.org/10.1109/MSP.2015.2462851

    Article  Google Scholar 

  21. Kenny, P.: Joint factor analysis of speaker and session variability: theory and algorithms. CRIM, Montreal, (Report) CRIM-06/08-13, vol. 14, pp. 28–29 (2005)

    Google Scholar 

  22. Lehr, M., Prud’hommeaux, E., Shafran, I., Roark, B.: Fully automated neuropsychological assessment for detecting mild cognitive impairment. In: Proceedings of Interspeech, Portland, OR, USA, pp. 1039–1042 (2012)

    Google Scholar 

  23. Madikeri, S., Dey, S., Motlicek, P., Ferras, M.: Implementation of the standard i-vector system for the Kaldi speech recognition toolkit. Idiap-RR Idiap-RR-26-2016, Idiap, October 2016

    Google Scholar 

  24. Nelson, L., Tabet, N.: Slowing the progression of Alzheimer’s disease; what works? Ageing Res. Rev. 23(B), 193–209 (2015)

    Article  Google Scholar 

  25. Rosen, W., Mohs, R., Davis, K.: A new rating scale for Alzheimer’s disease. J. Psychiatry Res. 141(11), 1356–1364 (1984)

    Google Scholar 

  26. Satt, A., Hoory, R., König, A., Aalten, P., Robert, P.H.: Speech-based automatic and robust detection of very early dementia. In: Proceedings of Interspeech, Singapore, pp. 2538–2542 (2014)

    Google Scholar 

  27. Schölkopf, B., Platt, J.C., Shawe-Taylor, J., Smola, A.J., Williamson, R.C.: Estimating the support of a high-dimensional distribution. Neural Comput. 13(7), 1443–1471 (2001)

    Article  Google Scholar 

Download references

Acknowledgments

This study was partially funded by the National Research, Development and Innovation Office of Hungary via contract NKFIH FK-124413 and by the Ministry of Human Capacities, Hungary (grant 20391-3/2018/FEKUSTRAT). László Tóth was supported by the János Bolyai Research Scholarship of the Hungarian Academy of Sciences.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to José Vicente Egas López .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Egas López, J.V., Tóth, L., Hoffmann, I., Kálmán, J., Pákáski, M., Gosztolya, G. (2019). Assessing Alzheimer’s Disease from Speech Using the i-vector Approach. In: Salah, A., Karpov, A., Potapova, R. (eds) Speech and Computer. SPECOM 2019. Lecture Notes in Computer Science(), vol 11658. Springer, Cham. https://doi.org/10.1007/978-3-030-26061-3_30

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-26061-3_30

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-26060-6

  • Online ISBN: 978-3-030-26061-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics