Speech-in-noise enhancement using amplification and dynamic range compression controlled by the speech intelligibility index

doi:10.1121/1.4932168

. 2015 Nov;138(5):2692-706.

doi: 10.1121/1.4932168.

Speech-in-noise enhancement using amplification and dynamic range compression controlled by the speech intelligibility index

Henning Schepker¹, Jan Rennies¹, Simon Doclo²

Affiliations

¹ Project Group Hearing, Speech and Audio Technology, Fraunhofer Institute for Digital Media Technology IDMT, D-26129 Oldenburg, Germany.
² Signal Processing Group, Department of Medical Physics and Acoustics and Cluster of Excellence Hearing4All, University of Oldenburg, D-26111 Oldenburg, Germany.

PMID: 26627746
DOI: 10.1121/1.4932168

Speech-in-noise enhancement using amplification and dynamic range compression controlled by the speech intelligibility index

Henning Schepker et al. J Acoust Soc Am. 2015 Nov.

. 2015 Nov;138(5):2692-706.

doi: 10.1121/1.4932168.

Authors

Henning Schepker¹, Jan Rennies¹, Simon Doclo²

Affiliations

¹ Project Group Hearing, Speech and Audio Technology, Fraunhofer Institute for Digital Media Technology IDMT, D-26129 Oldenburg, Germany.
² Signal Processing Group, Department of Medical Physics and Acoustics and Cluster of Excellence Hearing4All, University of Oldenburg, D-26111 Oldenburg, Germany.

PMID: 26627746
DOI: 10.1121/1.4932168

Abstract

In many speech communication applications, such as public address systems, speech is degraded by additive noise, leading to reduced speech intelligibility. In this paper a pre-processing algorithm is proposed that is capable of increasing speech intelligibility under an equal-power constraint. The proposed AdaptDRC algorithm comprises two time- and frequency-dependent stages, i.e., an amplification stage and a dynamic range compression stage that are both dependent on the Speech Intelligibility Index (SII). Experiments using two objective measures, namely, the extended SII and the short-time objective intelligibility measure (STOI), and a formal listening test were conducted to compare the AdaptDRC algorithm with a modified version of a recently proposed algorithm in three different noise conditions (stationary car noise and speech-shaped noise and non-stationary cafeteria noise). While the objective measures indicate a similar performance for both algorithms, results from the formal listening test indicate that for the two stationary noises both algorithms lead to statistically significant improvements in speech intelligibility and for the non-stationary cafeteria noise only the proposed AdaptDRC algorithm leads to statistically significant improvements. A comparison of both objective measures and results from the listening test shows high correlations, although, in general, the performance of both algorithms is overestimated.

PubMed Disclaimer

Cited by

Speech Intelligibility Prediction using Spectro-Temporal Modulation Analysis.
Edraki A, Chan WY, Jensen J, Fogerty D. Edraki A, et al. IEEE/ACM Trans Audio Speech Lang Process. 2021;29:210-225. doi: 10.1109/taslp.2020.3039929. Epub 2020 Nov 24. IEEE/ACM Trans Audio Speech Lang Process. 2021. PMID: 33748329 Free PMC article.
Acoustic Sensing Analytics Applied to Speech in Reverberation Conditions.
Odya P, Kotus J, Kurowski A, Kostek B. Odya P, et al. Sensors (Basel). 2021 Sep 21;21(18):6320. doi: 10.3390/s21186320. Sensors (Basel). 2021. PMID: 34577527 Free PMC article.

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- Silverchair Information Systems

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Speech-in-noise enhancement using amplification and dynamic range compression controlled by the speech intelligibility index

Affiliations

Speech-in-noise enhancement using amplification and dynamic range compression controlled by the speech intelligibility index

Authors

Affiliations

Abstract

Similar articles

Cited by

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources