Clarity in Motion Phase-1 Perceptual Study of Speech Intelligibility (CIM PhI PercX)

Speech Technology and Applied Research Corp.

Status

Completed

Conditions

Healthy

Treatments

Behavioral: Solo: Unmasked Speech Stimuli

Behavioral: SlideMic: Stationary sources scrubbed from a linearly moving mic stimuli

Behavioral: StatScrub: Extracted Speech--no motion stimuli

Behavioral: SlideSpch: Scrubbed Speech emitted from linearly moving speaker stimuli

Behavioral: SlideNoise: Speech Scrubbed from linearly moving and stationary noise stimuli

Behavioral: Raw: Fully masked speech--no motion stimuli

Study type

Interventional

Funder types

Other

Industry

NIH

Identifiers

NCT07020728

CIM Phase 1

1R43DC020690-01A1 (U.S. NIH Grant/Contract)

Details and patient eligibility

About

Participants received a bilateral pure-tone hearing screen administered by the research team. All potential participants who failed the hearing screen were provided with information about its meaning and referral for further audiological testing.

Participants who passed the hearing screen and other inclusion criteria were divided into 6 groups, each of which were presented with 144 stimuli equally distributed among processing conditions. Listeners choose a comfortable listening level using supplied headphones and were able to control the rate of presentation. Following a short practice session, listeners were be asked to transcribe each target sentence. The intelligibility of each stimulus was estimated by determining the mean percentage of content words correctly transcribed. After transcription, listeners were asked for two qualitative judgments: (1) the "clarity" of the stimulus, and (2) the "listening effort" involved. The quality of each stimulus was estimated by the median quality judgment, and the effort likewise. Listening sessions were located in a quiet room and presentation was controlled by the Superlab presentation software program.

Collectively, the estimates of word intelligibility, clarity, and listening effort under the different conditions shed light on the effectiveness with which the tested algorithm preserves listener intelligibility with acceptable effort and quality.

Full description

Participants received a bilateral pure-tone hearing screen administered by a clinically trained member of the research team. The threshold criteria was 20 dB SPL at 250 and 500 Hz, and 25 dB SPL at 1000, 2000, 4000, and 8000 Hz. All potential participants who failed the hearing screen were provided with information about its meaning and referred for further audiological testing.

Participants who passed the hearing screen and other inclusion criteria were divided into 6 groups, each of which was presented with 144 stimuli equally distributed among processing conditions (Pristine Non-Moving Speech plus QoS Levels 1-5). Listeners self-selected a comfortable listening level using supplied headphones and were able to control the rate of presentation. Following a short practice session, listeners were asked to transcribe the target sentences, and the intelligibility of each stimulus was estimated by determining the mean percentage of content words correctly transcribed. After transcription, listeners were also asked for two qualitative judgments using a visual analog scale: (1) the "clarity" of the stimulus, and (2) the "listening effort" involved. These measures are sensitive to situations where listeners manage to extract the uttered words from the signal, but with increasing difficulty. The quality of each stimulus was estimated by the median quality judgment, and the effort likewise. Listening sessions took place in a quiet room and presentation was controlled by the Superlab presentation software program.

The Stimuli consisted of audio recordings of target spondaic words embedded in a carrier sentence produced by a male and a female native speaker of American English recorded under quiet conditions. Each stimulus presented to the listeners for identification was either unmasked pristine speech or speech that had have been processed in one of five ways with different mixtures of noise and sensor movement. The latter are identified as QoS Levels 1-5. Each type of processing was expected to have a different effect on the underlying probability that the listener would be able to correctly identify the spoken word, the effort required to do so, and the quality of the presented recording. Data on familiarity ratings for the target spondaic words and relative intelligibility of the two speakers under different conditions of noise masking has been previously reported.

Collectively, the estimates of word intelligibility, clarity, and listening effort under the different conditions is expected to shed light on the effectiveness with which the tested algorithm preserves listener intelligibility with acceptable effort and quality.

Enrollment

72 patients

Sex

All

Ages

18+ years old

Volunteers

Accepts Healthy Volunteers

Inclusion criteria

Inclusion criteria will include adult native speakers of American English with auditory thresholds within age-normal limits, defined as passing a pure tone screening test of 20 dB SPL at 250 and 500 Hz, and 25 dB SPL at 1000, 2000, 4000, and 8000 Hz. Exclusion criteria will include self-report of hearing difficulties and failure to pass hearing screen.

No special populations requiring special protections will be utilized in this study. Potential participants who fail the hearing screen will be provided with results and appropriate clinical referral as per the IRB protocol.

Exclusion criteria

Exclusion criteria will include self-report of hearing difficulties and failure to pass hearing screen.

Trial design

Primary purpose

Basic Science

Allocation

Randomized

Interventional model

Crossover Assignment

Masking

Single Blind

72 participants in 12 patient groups

Group 1A

Experimental group

Description:

Subjects in this Group listened and responded to a unique set of stimulus/treatment pairs.

Treatment:

Behavioral: SlideNoise: Speech Scrubbed from linearly moving and stationary noise stimuli

Behavioral: Raw: Fully masked speech--no motion stimuli

Behavioral: SlideSpch: Scrubbed Speech emitted from linearly moving speaker stimuli

Behavioral: StatScrub: Extracted Speech--no motion stimuli

Behavioral: SlideMic: Stationary sources scrubbed from a linearly moving mic stimuli

Behavioral: Solo: Unmasked Speech Stimuli

Group 1B

Experimental group

Description:

Subjects in this Group listened and responded to a unique set of stimulus/treatment pairs.

Treatment:

Behavioral: SlideNoise: Speech Scrubbed from linearly moving and stationary noise stimuli

Behavioral: Raw: Fully masked speech--no motion stimuli

Behavioral: SlideSpch: Scrubbed Speech emitted from linearly moving speaker stimuli

Behavioral: StatScrub: Extracted Speech--no motion stimuli

Behavioral: SlideMic: Stationary sources scrubbed from a linearly moving mic stimuli

Behavioral: Solo: Unmasked Speech Stimuli

Group 2A

Experimental group

Description:

Subjects in this Group listened and responded to a unique set of stimulus/treatment pairs.

Treatment:

Behavioral: SlideNoise: Speech Scrubbed from linearly moving and stationary noise stimuli

Behavioral: Raw: Fully masked speech--no motion stimuli

Behavioral: SlideSpch: Scrubbed Speech emitted from linearly moving speaker stimuli

Behavioral: StatScrub: Extracted Speech--no motion stimuli

Behavioral: SlideMic: Stationary sources scrubbed from a linearly moving mic stimuli

Behavioral: Solo: Unmasked Speech Stimuli

Group 2B

Experimental group

Description:

Subjects in this Group listened and responded to a unique set of stimulus/treatment pairs.

Treatment:

Behavioral: SlideNoise: Speech Scrubbed from linearly moving and stationary noise stimuli

Behavioral: Raw: Fully masked speech--no motion stimuli

Behavioral: SlideSpch: Scrubbed Speech emitted from linearly moving speaker stimuli

Behavioral: StatScrub: Extracted Speech--no motion stimuli

Behavioral: SlideMic: Stationary sources scrubbed from a linearly moving mic stimuli

Behavioral: Solo: Unmasked Speech Stimuli

Group 3A

Experimental group

Description:

Subjects in this Group listened and responded to a unique set of stimulus/treatment pairs.

Treatment:

Behavioral: SlideNoise: Speech Scrubbed from linearly moving and stationary noise stimuli

Behavioral: Raw: Fully masked speech--no motion stimuli

Behavioral: SlideSpch: Scrubbed Speech emitted from linearly moving speaker stimuli

Behavioral: StatScrub: Extracted Speech--no motion stimuli

Behavioral: SlideMic: Stationary sources scrubbed from a linearly moving mic stimuli

Behavioral: Solo: Unmasked Speech Stimuli

Group 3B

Experimental group

Description:

Subjects in this Group listened and responded to a unique set of stimulus/treatment pairs.

Treatment:

Behavioral: SlideNoise: Speech Scrubbed from linearly moving and stationary noise stimuli

Behavioral: Raw: Fully masked speech--no motion stimuli

Behavioral: SlideSpch: Scrubbed Speech emitted from linearly moving speaker stimuli

Behavioral: StatScrub: Extracted Speech--no motion stimuli

Behavioral: SlideMic: Stationary sources scrubbed from a linearly moving mic stimuli

Behavioral: Solo: Unmasked Speech Stimuli

Group 4A

Experimental group

Description:

Subjects in this Group listened and responded to a unique set of stimulus/treatment pairs.

Treatment:

Behavioral: SlideNoise: Speech Scrubbed from linearly moving and stationary noise stimuli

Behavioral: Raw: Fully masked speech--no motion stimuli

Behavioral: SlideSpch: Scrubbed Speech emitted from linearly moving speaker stimuli

Behavioral: StatScrub: Extracted Speech--no motion stimuli

Behavioral: SlideMic: Stationary sources scrubbed from a linearly moving mic stimuli

Behavioral: Solo: Unmasked Speech Stimuli

Group 4B

Experimental group

Description:

Subjects in this Group listened and responded to a unique set of stimulus/treatment pairs.

Treatment:

Behavioral: SlideNoise: Speech Scrubbed from linearly moving and stationary noise stimuli

Behavioral: Raw: Fully masked speech--no motion stimuli

Behavioral: SlideSpch: Scrubbed Speech emitted from linearly moving speaker stimuli

Behavioral: StatScrub: Extracted Speech--no motion stimuli

Behavioral: SlideMic: Stationary sources scrubbed from a linearly moving mic stimuli

Behavioral: Solo: Unmasked Speech Stimuli

Group 5A

Experimental group

Description:

Subjects in this Group listened and responded to a unique set of stimulus/treatment pairs.

Treatment:

Behavioral: SlideNoise: Speech Scrubbed from linearly moving and stationary noise stimuli

Behavioral: Raw: Fully masked speech--no motion stimuli

Behavioral: SlideSpch: Scrubbed Speech emitted from linearly moving speaker stimuli

Behavioral: StatScrub: Extracted Speech--no motion stimuli

Behavioral: SlideMic: Stationary sources scrubbed from a linearly moving mic stimuli

Behavioral: Solo: Unmasked Speech Stimuli

Group 5B

Experimental group

Description:

Subjects in this Group listened and responded to a unique set of stimulus/treatment pairs.

Treatment:

Behavioral: SlideNoise: Speech Scrubbed from linearly moving and stationary noise stimuli

Behavioral: Raw: Fully masked speech--no motion stimuli

Behavioral: SlideSpch: Scrubbed Speech emitted from linearly moving speaker stimuli

Behavioral: StatScrub: Extracted Speech--no motion stimuli

Behavioral: SlideMic: Stationary sources scrubbed from a linearly moving mic stimuli

Behavioral: Solo: Unmasked Speech Stimuli

Group 6A

Experimental group

Description:

Subjects in this Group listened and responded to a unique set of stimulus/treatment pairs.

Treatment:

Behavioral: SlideNoise: Speech Scrubbed from linearly moving and stationary noise stimuli

Behavioral: Raw: Fully masked speech--no motion stimuli

Behavioral: SlideSpch: Scrubbed Speech emitted from linearly moving speaker stimuli

Behavioral: StatScrub: Extracted Speech--no motion stimuli

Behavioral: SlideMic: Stationary sources scrubbed from a linearly moving mic stimuli

Behavioral: Solo: Unmasked Speech Stimuli

Group 6B

Experimental group

Description:

Subjects in this Group listened and responded to a unique set of stimulus/treatment pairs.

Treatment:

Behavioral: SlideNoise: Speech Scrubbed from linearly moving and stationary noise stimuli

Behavioral: Raw: Fully masked speech--no motion stimuli

Behavioral: SlideSpch: Scrubbed Speech emitted from linearly moving speaker stimuli

Behavioral: StatScrub: Extracted Speech--no motion stimuli

Behavioral: SlideMic: Stationary sources scrubbed from a linearly moving mic stimuli

Behavioral: Solo: Unmasked Speech Stimuli

Trial contacts and locations

Data sourced from clinicaltrials.gov

Clinical trials

Find clinical trials Trials by location

Research sites

Find research sites Learn about CTV for professionals

Resources

Contact CTV support

Legal

Privacy Notice Terms