Voice Pathology Detection Using Artificial Neural Networks and Support Vector Machines Powered by a Multicriteria Optimization Algorithm

Areiza-Laverde, Henry Jhoán; Castro-Ospina, Andrés Eduardo; Peluffo-Ordóñez, Diego Hernán

doi:10.1007/978-3-030-00350-0_13

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 915))

Included in the following conference series:

Workshop on Engineering Applications

1170 Accesses
6 Citations

Abstract

Computer-aided diagnosis (CAD) systems have allowed to enhance the performance of conventional, medical diagnosis procedures in different scenarios. Particularly, in the context of voice pathology detection, the use of machine learning algorithms has proved to be a promising and suitable alternative. This work proposes the implementation of two well known classification algorithms, namely artificial neural networks (ANN) and support vector machines (SVM), optimized by particle swarm optimization (PSO) algorithm, aimed at classifying voice signals between healthy and pathologic ones. Three different configurations of the Saarbrucken voice database (SVD) are used. The effect of using balanced and unbalanced versions of this dataset is proved as well as the usefulness of the considered optimization algorithm to improve the final performance outcomes. Also, proposed approach is comparable with state-of-the-art methods.

H.J. Areiza-Laverde—This work is carried out under grants provided by Programa Nacional de Jóvenes Investigadores e Innovadores – COLCIENCIAS – Announcement 775 of 2017.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

¥17,985 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: JPY 3498; Price includes VAT (Japan)

eBook: JPY 5719; Price includes VAT (Japan)

Softcover Book: JPY 7149; Price includes VAT (Japan)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Smart Data Driven System for Pathological Voices Classification

Pathological Voice Recognition Based on Multi-feature Fusion

Voice Pathology Detection Based on Canonical Correlation Analysis Method Using Hilbert–Huang Transform and LSTM Features

Article 26 September 2024

References

Acharya, U.R., Fujita, H., Oh, S.L., Hagiwara, Y., Tan, J.H., Adam, M.: Application of deep convolutional neural network for automated detection of myocardial infarction using ecg signals. Inf. Sci. 415, 190–198 (2017)
Article Google Scholar
Al-nasheri, A., Muhammad, G., Alsulaiman, M., Ali, Z.: Investigation of voice pathology detection and classification on different frequency regions using correlation functions. J. Voice 31(1), 3–15 (2017)
Article Google Scholar
Al-nasheri, A., et al.: An investigation of multidimensional voice program parameters in three different databases for voice pathology detection and classification. J. Voice 31(1), 113–e9 (2017)
Article Google Scholar
Ali, F.: Voice recognition anatomy, processing, uses and application in C (2017)
Google Scholar
AlZubaidi, A.K., Sideseq, F.B., Faeq, A., Basil, M.: Computer aided diagnosis in digital pathology application: review and perspective approach in lung cancer classification. In: 2017 Annual Conference on New Trends in Information & Communications Technology Applications (NTICT), pp. 219–224. IEEE (2017)
Google Scholar
Barry, W., Pützer, M.: Saarbrucken voice database. Institute of Phonetics, Universität des Saarlandes (2007). http://www.stimmdatenbank.coli.uni-saarland.de
Béranger, J.: Big Data and Ethics: The Medical Datasphere. Elsevier, New York City (2016)
Google Scholar
Castro-Ospina, A., Castro-Hoyos, C., Peluffo-Ordonez, D., Castellanos-Dominguez, G.: Novel heuristic search for ventricular arrhythmia detection using normalized cut clustering. In: 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), pp. 7076–7079. IEEE (2013)
Google Scholar
Chiu, C.C., et al.: State-of-the-art speech recognition with sequence-to-sequence models. arXiv preprint arXiv:1712.01769 (2017)
Harar, P., Alonso-Hernandezy, J.B., Mekyska, J., Galaz, Z., Burget, R., Smekal, Z.: Voice pathology detection using deep learning: a preliminary study. In: 2017 International Conference and Workshop on Bioinspired Intelligence (IWOBI), pp. 1–4. IEEE (2017)
Google Scholar
Hemmerling, D., Skalski, A., Gajda, J.: Voice data mining for laryngeal pathology assessment. Comput. Biol. Med. 69, 270–276 (2016)
Article Google Scholar
Ibrahim, S., Djemal, R., Alsuwailem, A.: Electroencephalography (EEG) signal processing for epilepsy and autism spectrum disorder diagnosis. Biocybern. Biomed. Eng. 38(1), 16–26 (2018)
Article Google Scholar
Lytras, M.D., Papadopoulou, P.: Applying Big Data Analytics in Bioinformatics and Medicine. IGI Global, Pennsylvania (2017)
Google Scholar
Martínez, D., Lleida, E., Ortega, A., Miguel, A., Villalba, J.: Voice pathology detection on the Saarbrücken voice database with calibration and fusion of scores using MultiFocal toolkit. In: Torre Toledano, D., et al. (eds.) IberSPEECH 2012. CCIS, vol. 328, pp. 99–109. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-35292-8_11
Chapter Google Scholar
Mendoza, L., Peña, J., Muñoz-Bedoya, L., Velandia-Villamizar, H.: Speech subvocal signal processing using packet wavelet and neuronal network. TecnoLógicas, 655–667 (2013). https://doi.org/10.22430/22565337.371
Muhammad, G., Alhamid, M.F., Hossain, M.S., Almogren, A.S., Vasilakos, A.V.: Enhanced living by assessing voice pathology using a co-occurrence matrix. Sensors 17(2), 267 (2017)
Article Google Scholar
Muhammad, G., et al.: Voice pathology detection using interlaced derivative pattern on glottal source excitation. Biomed. Signal Process. Control 31, 156–164 (2017)
Article Google Scholar
Muhammad, G., et al.: Automatic voice pathology detection and classification using vocal tract area irregularity. Biocybern. Biomed. Eng. 36(2), 309–317 (2016)
Article Google Scholar
Orozco-Naranjo, A.J., Muñoz-Gutiérrez, P.A.: Detection of pathological and normal heartbeat using wavelet packet, support vector machines and multilayer perceptron. Tecno Lógicas 31, 73–91 (2013)
Google Scholar
Parascandolo, P., Cesario, L., Vosilla, L., Viano, G.: Computer aided diagnosis: state-of-the-art and application to musculoskeletal diseases. In: Magnenat-Thalmann, N., Ratib, O., Choi, H.F. (eds.) 3D Multiscale Physiological Human, pp. 277–296. Springer, London (2014). https://doi.org/10.1007/978-1-4471-6275-9_12
Chapter Google Scholar
Schalkoff, R.J.: Artificial Neural Networks, vol. 1. McGraw-Hill, New York (1997)
MATH Google Scholar
Schilling, R.J., Harris, S.L.: Fundamentals of Digital Signal Processing Using MATLAB. Cengage Learning, Boston (2011)
Google Scholar
Semmlow, J.L., Griffel, B.: Biosignal and Medical Image Processing. CRC Press, Boca Raton (2014)
Google Scholar
Shinohara, S., et al.: Multilingual evaluation of voice disability index using pitch rate. ASTESJ 2(3), 765–772 (2017)
Article Google Scholar
Shriberg, L.D., et al.: A diagnostic marker to discriminate childhood apraxia of speech from speech delay: II. Validity studies of the pause marker. J. Speech Lang. Hear. Res. 60(4), S1118–S1134 (2017)
Article Google Scholar
Summers, R.M.: Deep learning and computer-aided diagnosis for medical image processing: a personal perspective. In: Lu, L., Zheng, Y., Carneiro, G., Yang, L. (eds.) Deep Learning and Convolutional Neural Networks for Medical Image Computing. ACVPR, pp. 3–10. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-42999-1_1
Chapter Google Scholar
von Tscharner, V.: Time-frequency and principal-component methods for the analysis of emgs recorded during a mildly fatiguing exercise on a cycle ergometer. J. Electromyogr. Kinesiol. 12(6), 479–492 (2002)
Article Google Scholar
Vapnik, V.N.: The Nature of Statistical Learning Theory. Springer, Heidelberg (1999). https://doi.org/10.1007/978-1-4757-3264-1
Book MATH Google Scholar
Verde, L., De Pietro, G., Sannino, G.: Voice disorder identification by using machine learning techniques. IEEE Access 6, 16246–16255 (2018)
Article Google Scholar
Wojcicki, K.: HTK MFCC MATLAB. MATLAB Central File Exchange (2011)
Google Scholar

Download references

Acknowledgements

This work was partially supported by the grants provided by Programa Nacional de Jóvenes Investigadores e Innovadores – COLCIENCIAS – Announcement 775 of 2017 and the support for Instituto Tecnológico Metropolitano from Medellin-Colombia.

Also, authors specially thank the support given by the SDAS Research Group.

Author information

Authors and Affiliations

Grupo de Investigación Automática, Electrónica y Ciencias Computacionales, Instituto Tecnológico Metropolitano, Medellín, Colombia
Henry Jhoán Areiza-Laverde & Andrés Eduardo Castro-Ospina
SDAS Research Group, Yachay Tech, Urcuquí, Ecuador
Diego Hernán Peluffo-Ordóñez

Authors

Henry Jhoán Areiza-Laverde
View author publications
You can also search for this author in PubMed Google Scholar
Andrés Eduardo Castro-Ospina
View author publications
You can also search for this author in PubMed Google Scholar
Diego Hernán Peluffo-Ordóñez
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Andrés Eduardo Castro-Ospina .

Editor information

Editors and Affiliations

Department of Industrial Engineering, Universidad Distrital Francisco José de Caldas, Bogotá, Colombia
Juan Carlos Figueroa-García
Industrial Engineering Department, Univ. Distrital Francisco José de Caldas, Bogotá, Colombia
Eduyn Ramiro López-Santana
Department of Industrial Engineering, Universidad Distrital Francisco José de Caldas, Bogotá, Colombia
José Ignacio Rodriguez-Molano

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Areiza-Laverde, H.J., Castro-Ospina, A.E., Peluffo-Ordóñez, D.H. (2018). Voice Pathology Detection Using Artificial Neural Networks and Support Vector Machines Powered by a Multicriteria Optimization Algorithm. In: Figueroa-García, J., López-Santana, E., Rodriguez-Molano, J. (eds) Applied Computer Sciences in Engineering. WEA 2018. Communications in Computer and Information Science, vol 915. Springer, Cham. https://doi.org/10.1007/978-3-030-00350-0_13

Download citation

DOI: https://doi.org/10.1007/978-3-030-00350-0_13
Published: 13 September 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-00349-4
Online ISBN: 978-3-030-00350-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Voice Pathology Detection Using Artificial Neural Networks and Support Vector Machines Powered by a Multicriteria Optimization Algorithm

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Smart Data Driven System for Pathological Voices Classification

Pathological Voice Recognition Based on Multi-feature Fusion

Voice Pathology Detection Based on Canonical Correlation Analysis Method Using Hilbert–Huang Transform and LSTM Features

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Voice Pathology Detection Using Artificial Neural Networks and Support Vector Machines Powered by a Multicriteria Optimization Algorithm

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Smart Data Driven System for Pathological Voices Classification

Pathological Voice Recognition Based on Multi-feature Fusion

Voice Pathology Detection Based on Canonical Correlation Analysis Method Using Hilbert–Huang Transform and LSTM Features

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation