Abstract
In order to function effectively as a means of communication, speech must be intelligible under the noisy conditions encountered in everyday life. Two types of perceptual synthesis have been reported that can reduce or cancel the effects of masking by extraneous sounds: Phonemic restoration can enhance intelligibility when segments are replaced or masked by noise, and contralateral induction can prevent mislateralization by effectively restoring speech masked at one ear when it is heard in the other. The present study reports a third type of perceptual synthesis induced by noise: enhancement of intelligibility produced by adding noise to spectral gaps. In most of the experiments, the speech stimuli consisted of two widely separated narrow bands of speech (center frequencies of 370 and 6000 Hz, each band having high-pass and low-pass slopes of 115 dB/octave meeting at the center frequency). These very narrow bands effectively reduced the available information to frequency-limited patterns of amplitude fluctuation lacking information concerning formant structure and frequency transitions. When stochastic noise was introduced into the gap separating the two speech bands, intelligibility increased for “everyday” sentences, for sentences that varied in the transitional probability of keywords, and for monosyllabic word lists. Effects produced by systematically varying noise amplitude and noise bandwidth are reported, and the implications of some of the novel effects observed are discussed.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
American National Standards Institute (1969).American national standard methods for the calculation of the articulation index [ANSI-S3.5, 1969 (R1986)]. New York: Author.
Bashford, J. A., Jr.,Meyers, M. D., Brubaker, B. S., &Warren, R. M. (1988). Illusory continuity of interrupted speech: Speech rate determines durational limits.Journal of the Acoustical Society of America,84, 1635–1638.
Bashford, J. A., Jr.,Riener, K. R., &Warren, R. M. (1992). Increasing the intelligibility of speech through multiple phonemic restorations.Perception & Psychophysics,51, 211–217.
Bashford, J. A., Jr., &Warren, R. M. (1979). Perceptual synthesis of deleted phonemes. In J. J. Wolf & D. H. Klatt (Eds.),Speech communication papers presented at the 97th meeting of the Acoustical Society of America (pp. 423–426). New York: Acoustical Society of America.
Bashford, J. A., Jr., &Warren, R. M. (1987a). Effects of spectral alternation on the intelligibility of words and sentences.Perception & Psychophysics,42, 431–438.
Bashford, J. A., Jr., &Warren, R. M. (1987b). Multiple phonemic restorations follow the rules for auditory induction.Perception & Psychophysics,42, 114–121.
Bashford, J. A., Jr.,Warren R. M., &Brown, C. A. (1996). Use of speech-modulated noise adds strong “bottom-up” cues for phonemic restoration.Perception & Psychophysics,58, 342–350.
Bregman, A. S. (1990).Auditory scene analysis: The perceptual organization of sound. Cambridge, MA: MIT Press.
Dirks, D. D., &Bower, D. (1970). The role of knowledge-based expectations in music perception: Evidence from musical restoration.Journal of Experimental Psychology: General,119, 123–144.
Egan, J. P. (1948). Articulation testing methods.Laryngoscope,58, 955–991.
Hays, W. L. (1988).Statistics. Chicago: Rinehart & Winston.
Houtgast, T. (1976). Subharmonic pitches of a pure tone at low S/N ratio.Journal of the Acoustical Society of America,60, 405–409.
Kalikow, D. N., Stevens, K. N., &Elliot, L. L. (1977). Development of a test of speech intelligibility in noise using sentence materials with controlled word predictability.Journal of the Acoustical Society of America,61, 1337–1351.
Kryter, K. D. (1960). Speech bandwidth compression through spectrum selection.Journal of the Acoustical Society of America,26, 530–538.
Miller, G. A., &Licklider, J. C. R. (1950). The intelligibility of interrupted speech with and without noise.Journal of the Acoustical Society of America,22, 167–173.
Pavlovic, C. V. (1994). Band importance functions for audiological applications.Ear & Hearing,15, 100–104.
Plomp, R. (1981). Perception of sound signals at low signal-to-noise ratio. In D. J. Getty & J. H. Howard (Eds.),Auditory and visual pattern recognition (pp. 27–35). Hillsdale, NJ: Erlbaum.
Pollack, I. (1975). Auditory informational masking [Abstract].Journal of the Acoustical Society of America,57, S5.
Powers, G. L., &Wilcox, J. C. (1977). Intelligibility of temporally interrupted speech with and without intervening noise.Journal of the Acoustical Society of America,61, 195–199.
Repp, B. H. (1992). Perceptual restoration of a “missing” speech sound: Auditory induction or illusion?Perception & Psychophysics,51, 14–32.
Samuel, A. G. (1981). Phonemic restoration: Insights from a new methodology.Journal of Experimental Psychology: General,110, 474–494.
Samuel, A. G. (1991). A further examination of attentional effects in the phonemic restoration illusion.Quarterly Journal of Experimental Psychology,43A, 679–699.
Shriberg, E. E. (1992). Perceptual restoration of filtered vowels with added noise.Language & Speech,35, 127–136.
Silverman, S. R., &Hirsh, I. J. (1955). Problems related to the use of speech in clinical audiometry.Annals of Otology, Rhinology & Laryngology,64, 1234–1245.
Studebaker, G. A., Pavlovic, C. V., &Sherbecoe, R. L. (1987). A frequency importance function for continuous discourse.Journal of the Acoustical Society of America,81, 1130–1138.
Tillman, T. W., &Carhart, R. (1966).An expanded test for speech discrimination utilizing CVC monosyllabic words (Northwestern University Auditory Test No. 6). Brooks Air Force Base, TX: USAF School of Aerospace Medicine.
Verschuure, J., &Brocaar, M. P. (1983). Intelligibility of interrupted meaningful and nonsense speech with and without intervening noise.Perception & Psychophysics,33, 232–240.
Warren, R. M. (1970). Perceptual restoration of missing speech sounds.Science,167, 392–393.
Warren, R. M. (1984). Perceptual restoration of obliterated sounds.Psychological Bulletin,96, 371–383.
Warren, R. M., &Bashford, J. A. (1976). Auditory contralateral induction: An early stage in binaural processing.Perception & Psychophysics,20, 380–386.
Warren, R. M., Bashford, J. A., Jr., &Healy, E. W. (1992). The subtractive nature of illusory continuity: Reciprocal changes in alternating sounds [Abstract].Journal of the Acoustical Society of America,91, 2334.
Warren, R. M., Bashford, J. A., Jr.,Healy, E.W., &Brubaker, B. S. (1994). Auditory induction: Reciprocal changes in alternating sounds.Perception & Psychophysics,55, 313–322.
Warren, R. M., &Obusek, C. J. (1971). Speech perception and phonemic restorations.Perception & Psychophysics,9, 358–362.
Warren, R. M., Riener, K. R., Bashford, J. A., Jr., &Brubaker, B. S. (1995). Spectral redundancy: Intelligibility of sentences heard through narrow spectral slits.Perception & Psychophysics,57, 175–182.
Warren, R. M., &Sherman, G. L. (1974). Phonemic restorations based on subsequent context.Perception & Psychophysics,16, 150–156.
Watson, C. S., Kelly, W. J., &Wroton, H. W. (1976). Factors in the discrimination of tonal patterns. II. Selective attention and learning under various levels of uncertainty.Journal of the Acoustical Society of America,60, 1176–1186.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Warren, R.M., Hainsworth, K.R., Brubaker, B.S. et al. Spectral restoration of speech: Intelligibility is increased by inserting noise in spectral gaps. Perception & Psychophysics 59, 275–283 (1997). https://doi.org/10.3758/BF03211895
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.3758/BF03211895