The perceptual organization of sine-wave speech under competitive conditions

Brian Roberts, Robert J. Summers, Peter J. Bailey

Research output: Contribution to journal, Article

Abstract

Speech comprises dynamic and heterogeneous acoustic elements, yet it is heard as a single perceptual stream even when accompanied by other sounds. The relative contributions of grouping “primitives” and of speech-specific grouping factors to the perceptual coherence of speech are unclear, and the acoustical correlates of the latter remain unspecified. The parametric manipulations possible with simplified speech signals, such as sine-wave analogues, make them attractive stimuli to explore these issues. Given that the factors governing perceptual organization are generally revealed only where competition operates, the second-formant competitor (F2C) paradigm was used, in which the listener must resist competition to optimize recognition [Remez et al., Psychol. Rev. 101, 129-156 (1994)]. Three-formant (F1+F2+F3) sine-wave analogues were derived from natural sentences and presented dichotically (one ear = F1+F2C+F3; opposite ear = F2). Different versions of F2C were derived from F2 using separate manipulations of its amplitude and frequency contours. F2Cs with time-varying frequency contours were highly effective competitors, regardless of their amplitude characteristics. In contrast, F2Cs with constant frequency contours were completely ineffective. Competitor efficacy was not due to energetic masking of F3 by F2C. These findings indicate that modulation of the frequency, but not the amplitude, contour is critical for across-formant grouping.
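
The stimulus configuration described in the abstract can be illustrated with a minimal sketch. This is not the authors' procedure. It assumes that each sine-wave formant is a single sinusoid following per-sample frequency and amplitude contours, and that a "constant" contour is the mean of the original one; the abstract specifies neither. The function names (`synth_tone`, `constant_contour`, `dichotic_stimulus`) and the toy contours are hypothetical.

```python
import math

def synth_tone(freq_hz, amp, sample_rate):
    """Synthesize one sinusoid from per-sample frequency and amplitude contours."""
    phase = 0.0
    out = []
    for f, a in zip(freq_hz, amp):
        out.append(a * math.sin(phase))
        # Accumulate phase so the tone stays continuous as frequency changes.
        phase += 2.0 * math.pi * f / sample_rate
    return out

def constant_contour(contour):
    """Flatten a contour to its mean (an assumed choice of constant value)."""
    mean = sum(contour) / len(contour)
    return [mean] * len(contour)

def mix(*signals):
    """Sum equal-length signals sample by sample."""
    return [sum(samples) for samples in zip(*signals)]

def dichotic_stimulus(f1, f2, f3, f2c, sample_rate):
    """Each formant is a (frequency_contour, amplitude_contour) pair.

    Returns (ear_a, ear_b) with ear_a = F1+F2C+F3 and ear_b = F2,
    the dichotic layout stated in the abstract.
    """
    ear_a = mix(*(synth_tone(f, a, sample_rate) for f, a in (f1, f2c, f3)))
    ear_b = synth_tone(f2[0], f2[1], sample_rate)
    return ear_a, ear_b

# Toy contours (hypothetical values, not taken from the paper).
sr = 8000
n = 800
f1 = ([500.0] * n, [1.0] * n)
f2_freq = [1200.0 + 400.0 * i / (n - 1) for i in range(n)]
f2_amp = [0.5 + 0.5 * i / (n - 1) for i in range(n)]
f2 = (f2_freq, f2_amp)
f3 = ([2500.0] * n, [0.3] * n)

# Four competitor variants: frequency contour kept or flattened,
# crossed with amplitude contour kept or flattened.
f2c_variants = {
    "varying_freq_varying_amp": (f2_freq, f2_amp),
    "varying_freq_constant_amp": (f2_freq, constant_contour(f2_amp)),
    "constant_freq_varying_amp": (constant_contour(f2_freq), f2_amp),
    "constant_freq_constant_amp": (constant_contour(f2_freq), constant_contour(f2_amp)),
}

stimuli = {
    name: dichotic_stimulus(f1, f2, f3, f2c, sr)
    for name, f2c in f2c_variants.items()
}
```

Crossing the two manipulations mirrors the abstract's contrast between time-varying and constant frequency contours, with amplitude varied independently. The time-varying competitors here simply reuse the F2 contour for brevity; the abstract does not say how the time-varying contours were transformed.
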
Original language: English
Pages (from-to): 804-817
Number of pages: 14
Journal: Journal of the Acoustical Society of America
Volume: 128
Issue number: 2
DOI: 10.1121/1.3445786
Publication status: Published - Aug 2010
Event: 157th Meeting of Acoustical Society of America - Portland, United States
Duration: 18 May 2009 to 22 May 2009

Keywords

  • speech
  • acoustic elements
  • single perceptual stream
  • parametric manipulations
  • second-formant competitor paradigm

Cite this

@article{3ad0869cd6ef4f17aa2274539c07f532,
title = "The perceptual organization of sine-wave speech under competitive conditions",
keywords = "speech, acoustic elements, single perceptual stream, parametric manipulations, second-formant competitor paradigm",
author = "Roberts, Brian and Summers, Robert J. and Bailey, Peter J.",
year = "2010",
month = "8",
doi = "10.1121/1.3445786",
language = "English",
volume = "128",
pages = "804--817",
journal = "Journal of the Acoustical Society of America",
issn = "0001-4966",
publisher = "Acoustical Society of America",
number = "2",

}

The perceptual organization of sine-wave speech under competitive conditions. / Roberts, Brian; Summers, Robert J.; Bailey, Peter J.

In: Journal of the Acoustical Society of America, Vol. 128, No. 2, 08.2010, p. 804-817.

TY - JOUR

T1 - The perceptual organization of sine-wave speech under competitive conditions

AU - Roberts, Brian

AU - Summers, Robert J.

AU - Bailey, Peter J.

PY - 2010/8

Y1 - 2010/8

KW - speech

KW - acoustic elements

KW - single perceptual stream

KW - parametric manipulations

KW - second-formant competitor paradigm

UR - http://www.scopus.com/inward/record.url?scp=77955814279&partnerID=8YFLogxK

U2 - 10.1121/1.3445786

DO - 10.1121/1.3445786

M3 - Article

VL - 128

SP - 804

EP - 817

JO - Journal of the Acoustical Society of America

JF - Journal of the Acoustical Society of America

SN - 0001-4966

IS - 2

ER -