Natural speech representations in the human brain during a cocktail party

Kiretmitçi, İbrahim

Natural speech representations in the human brain during a cocktail party

Files

10418802.pdf (10.39 MB)

Date

2021-08

Authors

Kiretmitçi, İbrahim

Advisor

Çukur, Tolga

BUIR Usage Stats

4
views

56
downloads

Abstract

Humans are remarkably adept in selectively listening to a desired speaker in a crowded environment, while ﬁltering out non-target speakers in the background. Attention is key to solving this diﬃcult cocktail-party task, yet a detailed char-acterization of attentional eﬀects on speech representations is lacking. It remains unclear across what levels of speech features and how much attentional modula-tion occurs in each brain area during the cocktail-party task. Besides, it should be clariﬁed whether unattended speech is represented in cortex during selective listening and if so, at what feature levels its representations are maintained. To address these questions, we recorded whole-brain blood-oxygen-level-dependent (BOLD) responses while subjects either passively listened to single-speaker stories, or selectively attended to a male or a female speaker in temporally-overlaid stories in separate experiments. Spectral, articulatory, and semantic models of the natural stories were constructed to enable comprehensive assessments on the hierarchy of speech features. Intrinsic selectivity proﬁles were identiﬁed via vox-elwise models ﬁt to passive listening responses. Attentional modulations were then quantiﬁed based on model predictions for attended and unattended stories in the cocktail-party task. We ﬁnd that acoustic representations are conﬁned to the early auditory cortex whereas linguistic representations are broadly distributed across cortex, that attention causes broad modulations at multiple levels of speech representations (articulatory and semantic) while growing stronger towards later stages of processing, and that unattended speech is represented up to the semantic level in parabelt auditory cortex. These results provide insights on speech perception and attentional mechanisms that underlie the ability to selectively listen to a desired speaker in noisy multi-speaker environments.

Keywords

Functional magnetic resonance imaging (fMRI), Cocktail-party, Dorsal and ventral stream, Encoding model, Natural speech

Degree Discipline

Neuroscience

Degree Level

Doctoral

Degree Name

Ph.D. (Doctor of Philosophy)

Permalink

http://hdl.handle.net/11693/76514

Collections

Graduate School of Engineering and Science

Language

English

Type

Thesis

Full item page

Natural speech representations in the human brain during a cocktail party

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type

Natural speech representations in the human brain during a cocktail party

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Share

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type