Computer Science and Information Systems 2010 Volume 7, Issue 1, Pages: 75-84
https://doi.org/10.2298/CSIS1001075T
A dynamic alignment algorithm for imperfect speech and transcript
Tao Ye (Shandong University, Department of Computer Science and Technology, Ji Nan, China + Shanghai Qitai Internet Technology Co. Ltd., Shanghai, China)
Xueqing Li (Shandong University, Department of Computer Science and Technology, Ji Nan, China)
Bian Wu (Shanghai Qitai Internet Technology Co. Ltd., Shanghai, China)
This paper presents a novel approach to aligning imperfect speech with its corresponding transcript. The algorithm starts with multi-stage sentence boundary detection in the audio, followed by a dynamic-programming-based search that finds the optimal alignment and detects mismatches at the sentence level. Experiments show promising performance compared with the traditional forced alignment approach. The proposed algorithm has already been applied to prepare multimedia content for an online English training platform.
Keywords: text-audio alignment, dynamic programming
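The abstract does not give the scoring function or the recurrence, so the following is only a minimal sketch of how a sentence-level dynamic programming alignment with mismatch detection could look, not the authors' algorithm. It assumes that each detected audio segment already has a rough text hypothesis (for example from a speech recognizer), uses word overlap as a stand-in similarity, and applies a generic global-alignment recurrence with gap moves. The names `similarity`, `align`, and the `gap` penalty value are hypothetical.

```python
from typing import List, Optional, Tuple


def similarity(a: str, b: str) -> float:
    """Jaccard word overlap in [0, 1]; an assumed stand-in for a real score."""
    wa, wb = set(a.lower().split()), set(b.lower().split())
    if not wa or not wb:
        return 0.0
    return len(wa & wb) / len(wa | wb)


def align(segments: List[str], sentences: List[str], gap: float = -0.3
          ) -> List[Tuple[Optional[int], Optional[int]]]:
    """Globally align audio segments with transcript sentences.

    Returns (segment_index, sentence_index) pairs in order. A None on
    either side marks a mismatch: speech with no transcript sentence,
    or a transcript sentence with no speech.
    """
    n, m = len(segments), len(sentences)
    score = [[0.0] * (m + 1) for _ in range(n + 1)]
    back = [[""] * (m + 1) for _ in range(n + 1)]

    # Borders: everything before the first match is skipped.
    for i in range(1, n + 1):
        score[i][0] = score[i - 1][0] + gap
        back[i][0] = "skip_seg"
    for j in range(1, m + 1):
        score[0][j] = score[0][j - 1] + gap
        back[0][j] = "skip_sent"

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # Map similarity to [-1, 1] so that poor matches cost more
            # than skipping both items (assumed scaling).
            match = 2.0 * similarity(segments[i - 1], sentences[j - 1]) - 1.0
            candidates = [
                (score[i - 1][j - 1] + match, "match"),
                (score[i - 1][j] + gap, "skip_seg"),
                (score[i][j - 1] + gap, "skip_sent"),
            ]
            score[i][j], back[i][j] = max(candidates, key=lambda c: c[0])

    # Trace back from the bottom-right corner to recover the path.
    pairs: List[Tuple[Optional[int], Optional[int]]] = []
    i, j = n, m
    while i > 0 or j > 0:
        move = back[i][j]
        if move == "match":
            pairs.append((i - 1, j - 1))
            i, j = i - 1, j - 1
        elif move == "skip_seg":
            pairs.append((i - 1, None))
            i -= 1
        else:
            pairs.append((None, j - 1))
            j -= 1
    pairs.reverse()
    return pairs


if __name__ == "__main__":
    segs = ["the cat sat on the mat", "um well you know",
            "dogs bark loudly at night"]
    sents = ["the cat sat on the mat", "birds sing in the morning",
             "dogs bark loudly at night"]
    for seg_idx, sent_idx in align(segs, sents):
        print(seg_idx, sent_idx)
```

In this sketch, pairs with a missing index are the detected mismatches; the gap penalty controls how readily a weak pairing is rejected in favor of skipping both items.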