Improvement of Speech Recognition Accuracy Using Post-processing of Recognized Text

Rudzionis, Vytautas; Malukas, Ugnius; Danieliene, Renata

doi:10.1007/978-3-031-16302-9_21

Vytautas Rudzionis⁸,
Ugnius Malukas⁹ &
Renata Danieliene⁸

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1665))

Included in the following conference series:

International Conference on Information and Software Technologies

582 Accesses

Abstract

Modern deep learning-based speech recognition methods allow for achieving phenomenal speech recognition accuracy. But this requires enormous amounts of data to train. Unfortunately, developers of recognizers for less widely spoken languages are often facing the problem of scarce resources to train recognizers. The paper presents a novel method to increase recognition accuracy by post-processing of the text outputs of two different speech recognizers. The method is using machine learning to find a more likely symbol or group of symbols from two different deep learning-based recognizers. The experiments showed that the method allows increasing recognition accuracy by 3%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

¥17,985 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: JPY 3498; Price includes VAT (Japan)

eBook: JPY 5719; Price includes VAT (Japan)

Softcover Book: JPY 7149; Price includes VAT (Japan)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

English Speech Recognition Based on Deep Machine Learning Algorithm

A Deep Learning Approach to Speech Recognition of Digits

Latest Trends in Deep Learning for Automatic Speech Recognition System

References

Bourlard, H., Morgan, N.: Connectionist Speech Recognition: A Hybrid Approach. Kluwer Press, Amsterdam (1994)
Book Google Scholar
Tebelskis, J.: Speech recognition using neural networks. Ph.D. thesis, CMU (1995)
Google Scholar
Hinton, G., Osindero, S., Yee-Wee, T.: A fast learning algorithm for deep belief nets. Neural Comput. 18(7), 1527–1554 (2006)
Article MathSciNet Google Scholar
Hinton, G., et al.: Deep neural networks for acoustic modeling in speech recognition: the shared views of four research groups. IEEE Signal Process. Mag. 29(6), 82–97 (2012)
Article Google Scholar
Saon, G., Chien, J.: Large-vocabulary continuous speech recognition systems: a look at some recent advances. Signal Process. Mag. 29(6), 18–33 (2012)
Article Google Scholar
Graves, A., Jaitly, N., Mohamed, A.-R.: Hybrid speech recognition with deep bidirectional LSTM. In: Poroceedings of 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, Olomouc, Czech Republic, pp. 273–278 (2013)
Google Scholar
Hannun, A., et al.: Deep speech: scaling up end-to-end speech recognition (2014). https://arxiv.org/abs/1412.5567
Amodei, D., et al.: Deep speech 2: end-to-end speech recognition in English and mandarin. In: Proceedings of the 33rd International Conference on International Conference on Machine Learning, New York, pp. 173–182 (2016)
Google Scholar

Download references

Author information

Authors and Affiliations

Vilnius University Kaunas Faculty, Muitines 8, Kaunas, Lithuania
Vytautas Rudzionis & Renata Danieliene
Kaunas University of Technology, Studentu 50, Kaunas, Lithuania
Ugnius Malukas

Authors

Vytautas Rudzionis
View author publications
You can also search for this author in PubMed Google Scholar
Ugnius Malukas
View author publications
You can also search for this author in PubMed Google Scholar
Renata Danieliene
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Vytautas Rudzionis .

Editor information

Editors and Affiliations

Kaunas University of Technology, Kaunas, Lithuania
Audrius Lopata
Kaunas University of Technology, Kaunas, Lithuania
Daina Gudonienė
Kaunas University of Technology, Kaunas, Lithuania
Rita Butkienė

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Rudzionis, V., Malukas, U., Danieliene, R. (2022). Improvement of Speech Recognition Accuracy Using Post-processing of Recognized Text. In: Lopata, A., Gudonienė, D., Butkienė, R. (eds) Information and Software Technologies. ICIST 2022. Communications in Computer and Information Science, vol 1665. Springer, Cham. https://doi.org/10.1007/978-3-031-16302-9_21

Download citation

DOI: https://doi.org/10.1007/978-3-031-16302-9_21
Published: 06 October 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-16301-2
Online ISBN: 978-3-031-16302-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics