Artificial Bandwidth Extension of Narrowband Speech Signals for the Improvement of Perceptual Speech Communication Quality | SpringerLink
Skip to main content

Artificial Bandwidth Extension of Narrowband Speech Signals for the Improvement of Perceptual Speech Communication Quality

  • Conference paper
Communication and Networking (FGCN 2011)

Abstract

In this paper, an artificial bandwidth extension (ABE) algorithm from narrowband to wideband is proposed in order to improve the quality of narrowband speech. The proposed ABE algorithm is based on spectral band replication in the modified discrete cosine transform (MDCT) domain with no additional bits. In particular, the patch index search for the replication band is restricted so that the harmonic structure of the wideband speech is maintained after ABE. In the proposed ABE algorithm, we first determine whether the current analysis frame of speech signals is voiced or unvoiced. A harmonic spectral replication or a correlation-based replication approach is then applied for the voiced or unvoiced frame, respectively. The proposed ABE algorithm is finally embedded into the G.729 speech decoder as a post-processor. It is shown from the subjective evaluation using a MUSHRA test that the mean opinion score of the wideband speech signals extended by the proposed ABE method is measured as 75.5, which is higher of around 14% than that of narrowband speech signals.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
JPY 3498
Price includes VAT (Japan)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
JPY 5719
Price includes VAT (Japan)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
JPY 7149
Price includes VAT (Japan)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Mikko, T., Lasse, L., Anssi, R., Henri, T.: Scalable super-wideband extension for wideband coding. In: Proceedings of ICASSP, pp. 161–164 (2009)

    Google Scholar 

  2. Stephen, V.: Listener ratings of speech passbands. In: Proceedings of IEEE Workshop on Speech Coding, pp. 81–82 (1997)

    Google Scholar 

  3. Park, N.I., Kim, H.K., Jung, M.A., Lee, S.R., Choi, S.H.: Burst packet loss concealment using multiple Codebooks and comfort noise for CELP-type speech coders in wireless sensor networks. Sensors 11(5), 5323–5336 (2011)

    Article  Google Scholar 

  4. ITU-T Recommendation G.830: Subjective Performance Assessment of Telephone-band and Wideband Digital Codec (1996)

    Google Scholar 

  5. Goode, B.: Voice over internet protocol (VoIP). Proceedings of the IEEE 90(9), 1495–1517 (2002)

    Article  Google Scholar 

  6. Kosuke, T., Kei, K.: Low-complexity bandwidth extension in MDCT domain for low-bitrate speech coding. In: Proceedings of ICASSP, pp. 4145–4148 (2009)

    Google Scholar 

  7. Rogot, S., Kovesi, B., Trilling, R., Virette, D., Duc, N., Massaloux, D., Proust, S., Geiser, B., Gartner, M., Schandl, S., Taddei, H., Yang, G., Shlomot, E., Ehara, H., Yoshida, K., Vaillancourt, T., Salami, R., Lee, M.S., Kim, D.Y.: ITU-T G.729.1: an 8-32 kbit/s scalable coder interoperable with G.729 for wideband Telephony and voice over IP. In: Proceedings of ICASSP, pp. 529–532 (2007)

    Google Scholar 

  8. Jax, P., Vary, P.: On artificial bandwidth extension of telephone speech. Signal Processing 83, 1707–1719 (2003)

    Article  MATH  Google Scholar 

  9. Song, G.-B., Martynovich, P.: A study of HMM-based bandwidth extension of speech signals. Signal Processing 89, 2036–2044 (2009)

    Article  MATH  Google Scholar 

  10. Kornagel, U.: Techniques for artificial bandwidth extension of telephone speech. Signal Processing 86, 1296–1306 (2006)

    Article  MATH  Google Scholar 

  11. Pulakka, H., Laaksonen, L., Vainio, M., Pohjalainen, J., Alku, P.: Evaluation of an artificial speech bandwidth extension method in three languages. IEEE Transactions on Audio, Speech, and Language Processing 16, 1124–1137 (2008)

    Article  Google Scholar 

  12. Kim, K., Lee, M., Kang, H.: Speech bandwidth extension using temporal envelope modeling. IEEE Signal Processing Letters 15, 429–432 (2008)

    Article  Google Scholar 

  13. Murali, M.D., Karpur, D.B., Narayan, M., Kishore, J.: Artificial bandwidth extension of narrowband speech using Gaussian mixture model. In: Proceedings of ICASSP, pp. 410–412 (2011)

    Google Scholar 

  14. Tsujino, K., Kikuiri, K.: Low-complexity bandwidth extension in MDCT domain for low-bitrate speech coding. In: Proceedings of ICASSP, pp. 4145–4148 (2009)

    Google Scholar 

  15. Kondoz, A.M.: Digital Speech: Coding for Low Bit Rate Communication Systems, 2nd edn. Weiley (2004)

    Google Scholar 

  16. ITU-T Recommendation G.729: Coding of Speech at 8 kbit/s Using Conjugate-Structure Code-Excited Linear Prediction, CS-ACELP (1996)

    Google Scholar 

  17. ITU/ITU-R BS 1534: Method for Subjective Assessment of Intermediate Quality Level of Coding Systems (2001)

    Google Scholar 

  18. EBU.: Sound Quality Assessment Material Recording for Subjective Tests (1988)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Park, N.I., Lee, Y.H., Kim, H.K. (2011). Artificial Bandwidth Extension of Narrowband Speech Signals for the Improvement of Perceptual Speech Communication Quality. In: Kim, Th., et al. Communication and Networking. FGCN 2011. Communications in Computer and Information Science, vol 266. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27201-1_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-27201-1_17

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-27200-4

  • Online ISBN: 978-3-642-27201-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics