Pseudo Label Is Better Than Human Label

Hwang, Dongseong; Sim, Khe Chai; Huo, Zhouyuan; Strohman, Trevor

arXiv:2203.12668 (cs)

[Submitted on 22 Mar 2022 (v1), last revised 2 Jul 2022 (this version, v3)]

Abstract: State-of-the-art automatic speech recognition (ASR) systems are trained with tens of thousands of hours of labeled speech data. Human transcription is expensive and time-consuming, and factors such as the quality and consistency of the transcription can greatly affect the performance of ASR models trained on these data. In this paper, we show that we can train a strong teacher model to produce high-quality pseudo labels by utilizing recent self-supervised and semi-supervised learning techniques. Specifically, we use JUST (Joint Unsupervised/Supervised Training) and iterative noisy student teacher training to train a 600-million-parameter bi-directional teacher model. This model achieved a 4.0% word error rate (WER) on a voice search task, a relative improvement of 11.1% over a baseline. We further show that by using this strong teacher model to generate high-quality pseudo labels for training, we can achieve a 13.6% relative WER reduction (5.9% to 5.1%) for a streaming model compared to using human labels.
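The abstract reports results as WER and relative WER reduction. As a reading aid, here is a minimal sketch of how these two quantities are commonly computed. The function names and the example transcripts are hypothetical, and the paper's own scoring pipeline (text normalization, tokenization) is not described here, so this should not be read as the authors' evaluation code.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1] / len(ref)


def relative_wer_reduction(baseline_wer: float, new_wer: float) -> float:
    """Relative WER reduction in percent, going from baseline_wer to new_wer."""
    if baseline_wer <= 0:
        raise ValueError("baseline_wer must be positive")
    return 100.0 * (baseline_wer - new_wer) / baseline_wer


# Hypothetical transcripts: one dropped word out of four reference words.
print(word_error_rate("play some jazz music", "play some jazz"))  # 0.25

# Streaming model figures quoted in the abstract: 5.9% WER -> 5.1% WER.
print(f"{relative_wer_reduction(5.9, 5.1):.1f}%")  # 13.6%
```

The second call reproduces the 13.6% figure quoted in the abstract: (5.9 - 5.1) / 5.9 ≈ 0.136.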

Comments:	6 pages, 2 figures, 9 tables, Proceedings of INTERSPEECH 2022
Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL)
Cite as:	arXiv:2203.12668 [cs.LG]
	(or arXiv:2203.12668v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2203.12668

Submission history

From: Dongseong Hwang
[v1] Tue, 22 Mar 2022 00:03:13 UTC (249 KB)
[v2] Mon, 28 Mar 2022 22:59:08 UTC (249 KB)
[v3] Sat, 2 Jul 2022 01:43:30 UTC (249 KB)
