Abstract
One of the world’s chronic neuro-degenerative diseases, Alzheimer’s Disease (AD), leads its sufferers, among other symptoms, to suffer from speech difficulties. In particular, the inability to recall vocabulary which makes patients’ speech different. Furthermore, Mild Cognitive Impairment (MCI) is usually considered as a prodromal neuro-degenerative state of AD. The key to abate the progress of both disorders is their early diagnosis. However, actual ways of diagnosis are costly and quite time-consuming. In this study, we propose the extraction of features from speech through the use of the i-vector approach, by which we seek to model the speech pattern of the three mental conditions from the subjects. To the best of our knowledge, no previous studies have utilized i-vector features to assess Alzheimer’s before. These i-vectors are extracted from Mel-Frequency Cepstral Coefficients (MFCCs), then they are given to a SVM classifier in order to identify the speech in one of the following manners: AD - Alzheimer Disease, MCI - Mild Cognitive Impairment, HC - Healthy Control. We tested these i-vector features by performing a 5-fold cross-validation and we achieved an F1-score of 79.2%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Baldas, V., Lampiris, C., Capsalis, C., Koutsouris, D.: Early diagnosis of Alzheimer’s type dementia using continuous speech recognition. In: Lin, J.C., Nikita, K.S. (eds.) MobiHealth 2010. LNICST, vol. 55, pp. 105–110. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-20865-2_14
Campbell, W.M., Sturim, D.E., Reynolds, D.A.: Support vector machines using GMM supervectors for speaker verification. IEEE Signal Process. Lett. 13(5), 308–311 (2006)
Cernak, M., Komaty, A., Mohammadi, A., Anjos, A., Marcel, S.: Bob speaks Kaldi. In: Proceedings of Interspeech, August 2017
Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. 2, 1–27 (2011)
Dehak, N., et al.: Support vector machines and joint factor analysis for speaker verification. In: 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 4237–4240. IEEE (2009)
Dehak, N., Kenny, P.J., Dehak, R., Dumouchel, P., Ouellet, P.: Front-end factor analysis for speaker verification. IEEE Trans. Audio Speech Lang. Process. 19(4), 788–798 (2011)
Folstein, M., Folstein, S., McHugh, P.: Mini-mental state: a practical method for grading the cognitive state of patients for the clinician. J. Psychiatr. Res. 12(3), 189–198 (1975)
Förstl, H., Kurz, A.: Clinical features of Alzheimer’s disease. Eur. Arch. Psychiatry Clin. Neurosci. 249(6), 288–290 (1999). https://doi.org/10.1007/s004060050101
Frank, E.: Effect of Alzheimer’s disease on communication function. J. S. C. Med. Assoc. 9(90), 417–23 (1994)
Fraser, K., Rudzicz, F., Graham, N., Rochon, E.: Automatic speech recognition in the diagnosis of primary progressive aphasia. In: Proceedings of SLPAT, Grenoble, France, pp. 47–54 (2013)
Fraser, K.C., et al.: Automated classification of primary progressive aphasia subtypes from narrative speech transcripts. Cortex 55, 43–60 (2014)
Fraser, K.C., Rudzicz, F., Rochon, E.: Using text and acoustic features to diagnose progressive aphasia and its subtypes. In: Proceedings of Interspeech, Lyon, France, pp. 25–29 (2013)
Freedman, M., Leach, L., Kaplan, E., Winocur, G., Shulman, K., Delis, D.: Clock Drawing: A Neuropsychological Analysis. Oxford University Press, New York (1994)
Galvin, J.E., Sadowsky, C.H.: Practical guidelines for the recognition and diagnosis of dementia. J. Am. Board Fam. Med. 25(3), 367–382 (2012)
Ganchev, T., Fakotakis, N., Kokkinakis, G.: Comparative evaluation of various MFCC implementations on the speaker verification task. In: Proceedings of the SPECOM, vol. 1, pp. 191–194 (2005)
García, N., Orozco-Arroyave, J.R., D’Haro, L.F., Dehak, N., Nöth, E.: Evaluation of the neurological state of people with Parkinson’s Disease using i-vectors. In: INTERSPEECH (2017)
García, N., Vásquez-Correa, J., Orozco-Arroyave, J.R., Nöth, E.: Multimodal i-vectors to detect and evaluate Parkinson’s disease. In: Proceedings of Interspeech 2018, pp. 2349–2353 (2018)
Gosztolya, G., Vincze, V., Tóth, L., Pákáski, M., Kálmán, J., Hoffmann, I.: Identifying mild cognitive impairment and mild Alzheimer’s disease based on spontaneous speech using ASR and linguistic features. Comput. Speech Lang. 53, 181–197 (2019). http://www.sciencedirect.com/science/article/pii/S088523081730342X
Grzybowska, J., Kacprzak, S.: Speaker age classification and regression using i-vectors. In: INTERSPEECH, pp. 1402–1406 (2016)
Hansen, J.H.L., Hasan, T.: Speaker recognition by machines and humans: a tutorial review. IEEE Signal Process. Mag. 32(6), 74–99 (2015). https://doi.org/10.1109/MSP.2015.2462851
Kenny, P.: Joint factor analysis of speaker and session variability: theory and algorithms. CRIM, Montreal, (Report) CRIM-06/08-13, vol. 14, pp. 28–29 (2005)
Lehr, M., Prud’hommeaux, E., Shafran, I., Roark, B.: Fully automated neuropsychological assessment for detecting mild cognitive impairment. In: Proceedings of Interspeech, Portland, OR, USA, pp. 1039–1042 (2012)
Madikeri, S., Dey, S., Motlicek, P., Ferras, M.: Implementation of the standard i-vector system for the Kaldi speech recognition toolkit. Idiap-RR Idiap-RR-26-2016, Idiap, October 2016
Nelson, L., Tabet, N.: Slowing the progression of Alzheimer’s disease; what works? Ageing Res. Rev. 23(B), 193–209 (2015)
Rosen, W., Mohs, R., Davis, K.: A new rating scale for Alzheimer’s disease. J. Psychiatry Res. 141(11), 1356–1364 (1984)
Satt, A., Hoory, R., König, A., Aalten, P., Robert, P.H.: Speech-based automatic and robust detection of very early dementia. In: Proceedings of Interspeech, Singapore, pp. 2538–2542 (2014)
Schölkopf, B., Platt, J.C., Shawe-Taylor, J., Smola, A.J., Williamson, R.C.: Estimating the support of a high-dimensional distribution. Neural Comput. 13(7), 1443–1471 (2001)
Acknowledgments
This study was partially funded by the National Research, Development and Innovation Office of Hungary via contract NKFIH FK-124413 and by the Ministry of Human Capacities, Hungary (grant 20391-3/2018/FEKUSTRAT). László Tóth was supported by the János Bolyai Research Scholarship of the Hungarian Academy of Sciences.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Egas López, J.V., Tóth, L., Hoffmann, I., Kálmán, J., Pákáski, M., Gosztolya, G. (2019). Assessing Alzheimer’s Disease from Speech Using the i-vector Approach. In: Salah, A., Karpov, A., Potapova, R. (eds) Speech and Computer. SPECOM 2019. Lecture Notes in Computer Science(), vol 11658. Springer, Cham. https://doi.org/10.1007/978-3-030-26061-3_30
Download citation
DOI: https://doi.org/10.1007/978-3-030-26061-3_30
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-26060-6
Online ISBN: 978-3-030-26061-3
eBook Packages: Computer ScienceComputer Science (R0)