Abstract
With the advent of new algorithms, brain-computer interfacing has been extensively used in medical and non-medical fields. In this regard, an experiment was conducted by the authors to recognize the imagined speech, the results of which are reported in this paper. This work can act as a speech prosthesis for completely paralyzed patients who cannot communicate normally. Thirteen subjects imagined five English words (sos, stop, medicine, comehere, washroom) while their electroencephalogram (EEG) signals were recorded simultaneously. The word pairs were analyzed in six natural frequencies of the brain. The envelopes of analytical signals acquired from Hilbert transform were calculated for all the frequency bands and the resulting features were classified using seven classifiers. The maximum accuracy reached up to 88.36%. The experimental study showed that alpha and theta frequency bands were able to classify the highest amount of imagined speech with a maximum average accuracy of 72.73% and 69.41% respectively. The results were comparable to state-of-the-art methods. The findings reported in this work will encourage the research community to use non-invasive modalities like EEG for exploring more in this area.



Similar content being viewed by others
References
Agarwal P, Kumar S (2021) Transforming imagined thoughts into speech using a covariance-based subset selection method. Indian J Pure Appl Phys 59:180–183. http://nopr.niscair.res.in/handle/123456789/56517
Agarwal P, Kumar S, Singh S (2019) Closed form solutions of various window functions in fractional fourier transform domain. In: 2019 6th International conference on computing for sustainable global development (INDIACom). IEEE, New Delhi, India, pp 64–68
Agarwal P, Kale RK, Kumar M, Kumar S (2020) Silent speech classification based upon various feature extraction methods. In: 2020 7th International conference on signal processing and integrated networks (SPIN). IEEE, Noida, India, pp 16–20
Chapelle O, Vapnik V, Bousquet O, Mukherjee S (2002) Choosing multiple parameters for support vector machines. Mach Learn 46:131–159. https://doi.org/10.1023/A:1012450327387
Cooney C, Korik A, Raffaella F, Coyle D (2019) Classification of imagined spoken word-pairs using convolutional neural networks. In: Proceedings of the 8th Graz brain computer interface conference 2019: bridging science and application. Graz University of Technology, Graz, Austria, pp 338–343
DaSalla CS, Kambara H, Sato M, Koike Y (2009) Single-trial classification of vowel speech imagery using common spatial patterns. Neural Netw 22:1334–1339. https://doi.org/10.1016/j.neunet.2009.05.008
Dash D, Ferrari P, Wang J (2020) Decoding imagined and spoken phrases from non-invasive neural (MEG) signals. Front Neurosci 14:290. https://doi.org/10.3389/fnins.2020.00290
Deng S, Srinivasan R, Lappas T, D’Zmura M (2010) EEG classification of imagined syllable rhythm using Hilbert spectrum methods. J Neural Eng 7:046006. https://doi.org/10.1088/1741-2560/7/4/046006
DEWAN EM (1967) Occipital alpha rhythm eye position and lens accommodation. Nature 214:975–977. https://doi.org/10.1038/214975a0
Freund Y, Schapire R (1996) Experiments with a new boosting algorithm. In: Proceedings of the thirteenth international conference on machine learning. pp 148–156
Hinke RM, Hu X, Stillman AE et al (1993) Functional magnetic resonance imaging of Broca’s area during internal speech. NeuroReport 4:675–678
Huang NE, Attoh-Okine NO (eds) (2005) The Hilbert-Huang transform in engineering, 1st edn. CRC Press, Boca Raton
Klem GH, Lüders HO, Jasper HH, Elger C (1999) The ten-twenty electrode system of the International Federation. The international federation of clinical neurophysiology. Electroencephalogr Clin Neurophysiol Suppl 52:3–6
Kumar S (2020) Directed searching optimization-based speech enhancement technique. Fluct Noise Lett 19:2050035. https://doi.org/10.1142/S0219477520500352
Kumar P, Saini R, Roy PP et al (2018) Envisioned speech recognition using EEG sensors. Pers Ubiquit Comput 22:185–199. https://doi.org/10.1007/s00779-017-1083-4
La Vaque TJ (1999) The History of EEG Hans Berger: psychophysiologist. A historical vignette. J Neurotherapy 3:1–9. https://doi.org/10.1300/J184v03n02_01
Marple L (1999) Computing the discrete-time “analytic” signal via FFT. IEEE Trans Signal Process 47:2600–2603. https://doi.org/10.1109/78.782222
Martin S, Brunner P, Iturrate I et al (2016) Word pair classification during imagined speech using direct brain recordings. Sci Rep 6:25803. https://doi.org/10.1038/srep25803
Mohanchandra K, Saha S (2016) A communication paradigm using subvocalized speech: translating brain signals into speech. Augm Human Res 1:3. https://doi.org/10.1007/s41133-016-0001-z
Müller-Gerking J, Pfurtscheller G, Flyvbjerg H (1999) Designing optimal spatial filters for single-trial EEG classification in a movement task. Clin Neurophysiol 110:787–798. https://doi.org/10.1016/S1388-2457(98)00038-8
Nguyen CH, Karavas GK, Artemiadis P (2017) Inferring imagined speech using EEG signals: a new approach using Riemannian manifold features. J Neural Eng 15:016002. https://doi.org/10.1088/1741-2552/aa8235
Panachakel JT, Ramakrishnan AG, Ananthapadmanabha TV (2019) Decoding imagined speech using wavelet features and deep neural networks. In: 2019 IEEE 16th India council international conference (INDICON). IEEE, Rajkot, India, pp 1–4
Pawar D, Dhage S (2020) Multiclass covert speech classification using extreme learning machine. Biomed Eng Lett 10:217–226. https://doi.org/10.1007/s13534-020-00152-x
Qureshi MNI, Min B, Park H et al (2018) Multiclass classification of word imagination speech with hybrid connectivity features. IEEE Trans Biomed Eng 65:2168–2177. https://doi.org/10.1109/TBME.2017.2786251
Ramadan RA, Vasilakos AV (2017) Brain computer interface: control signals review. Neurocomputing 223:26–44. https://doi.org/10.1016/j.neucom.2016.10.024
Seiffert C, Khoshgoftaar TM, Hulse JV, Napolitano A (2008) RUSBoost: Improving classification performance when training data is skewed. In: 2008 19th International conference on pattern recognition. IEEE, Tampa, FL, USA, pp 1–4
Siuly S, Li Y, Zhang Y (2016) EEG signal analysis and classification: techniques and applications, 1st edn. Springer Nature, Cham, Switzerland
Torres-García AA, Reyes-García CA, Villaseñor-Pineda L, García-Aguilar G (2016) Implementing a fuzzy inference system in a multi-objective EEG channel selection model for imagined speech classification. Expert Syst Appl 59:1–12. https://doi.org/10.1016/j.eswa.2016.04.011
Zhao S, Rudzicz F (2015) Classifying phonological categories in imagined and articulated speech. In: 2015 IEEE International conference on acoustics, speech and signal processing (ICASSP). IEEE, South Brisbane, QLD, Australia, pp 992–996
Funding
The authors did not receive support from any organization for the submitted work.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors have no conflicts of interest to declare that are relevant to the content of this article.
Ethics approval and consent to participate.
The subjects voluntarily participated and submitted written consent for the experiment. Their identity was maintained confidential throughout the work. The approval of the work was taken from the competent authority of the institute. All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional research committee and with the 1964 Helsinki Declaration.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Agarwal, P., Kumar, S. Imagined word pairs recognition from non-invasive brain signals using Hilbert transform. Int J Syst Assur Eng Manag 13, 385–394 (2022). https://doi.org/10.1007/s13198-021-01283-9
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13198-021-01283-9