Abstract
Compared with high sample-rate speeches, low sample-rate speeches lose all high frequency components that outrange the Nyquist frequency, which might severely impair the speeches’ sound effects. To address this problem, this paper proposes a novel High-frequency (HF) restoration method of low sample-rate speech based on Bayesian inference, which turns the restoration problem into a maximizing a posteriori estimation. With this method, the relation between high frequency components and low frequency components is first extracted from the training set. The compatibility between neighboring audio frames is also modelled by a one dimensional Markov Random Field. Then the extracted knowledge is adopted in reconstructing the original high sample-rate signal for the testing low sample-rate audio. Experiments prove the applicability and effectiveness of this method.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Braud, P.: Markov chains: Gibbs fields, Monte Carlo simulation and queues. Springer, Heidelberg (1999)
Cheng, C.I.: High-frequency compensation of low sample-rate audio files: A wavelet-based spectral excitation algorithm. In: Proc. of International Computer Music Conference (September 1997)
Ephraim, Y., Ari, H.L., Roberts, W.J.J.: A brief survey of speech enhancement. In: The Electronic Handbook. CRC Press, Boca Raton (2003)
Freeman, W.T., Jones, T.R., Pasztor, E.C.: Example-based super-resolution. IEEE Trans. on Computer Graphics and applications 22(2), 56–65 (2002)
Rabiner, L., Juang, B.H.: Fundamentals of speech recognition. Prentice-Hall, Englewood Cliffs (1993)
Vetterli, M.: A theory of multirate filter banks. IEEE Trans. on Acoustics, Speech, and Signal Processing 35(3), 356–372 (1987)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Xu, Y., Zhang, C., Lu, N. (2005). A Bayesian Method for High-Frequency Restoration of Low Sample-Rate Speech. In: Singh, S., Singh, M., Apte, C., Perner, P. (eds) Pattern Recognition and Data Mining. ICAPR 2005. Lecture Notes in Computer Science, vol 3686. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11551188_60
Download citation
DOI: https://doi.org/10.1007/11551188_60
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28757-5
Online ISBN: 978-3-540-28758-2
eBook Packages: Computer ScienceComputer Science (R0)