Modeling speech transcriptions for automatic assessment of depression severity

Kaynak, Ergün Batuhan

Available

The embargo period has ended, and this item is now available.

Files

B161316.pdf (1.04 MB)

Date

2022-09

Authors

Kaynak, Ergün Batuhan

Advisor

Dibeklioğlu, Hamdi

Abstract

It is true that everyone has bad days from time to time. Unfortunately, for people suffering from depression, every day is a constant battle for motivation to do even the simplest of things, all the while dealing with hopelessness, physical and emotional fatigue, and sadness. Considering the ever-increasing number of people suffering from this disease, the necessity for automated depression severity assessment systems is profound. These systems can be used in treatment procedures, and the findings provided by learned models can help us better understand the dynamics of depression. To help address this illness, we propose a modular deep learning pipeline that uses speech transcripts as input for depression severity prediction. Through our pipeline, we investigate the role of popular deep learning architectures in creating representations for depression assessment. To extend the depression assessment literature on the text modality, we provide a thorough analysis of sentence statistics and their effects on model training. We also present an investigation regarding the use of sentiment information for depression assessment. Evaluation of the proposed architectures is performed on the publicly available Extended Distress Analysis Interview Corpus (E-DAIC) dataset. Through the results and discussions, we show that informative representations for depression assessment can be obtained without exploiting the temporal dynamics between sentences. Our proposed non-temporal model outperforms the state of the art by 8.8% in terms of Concordance Correlation Coefficient (CCC). In light of our findings on trained models and data statistics, we discuss how recurrent structures can have a bias toward certain sequence lengths during training and that shorter sentences can be more informative during inference. Our experimental results suggest that, contrary to previous literature, relying on semantic information rather than sentiment information may be more reliable for depression assessment.
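
For readers unfamiliar with the evaluation metric named in the abstract, the following is a minimal sketch of Lin's Concordance Correlation Coefficient, CCC = 2·cov(x, y) / (var(x) + var(y) + (mean(x) − mean(y))²). It is not taken from the thesis: the function name `concordance_cc` and the sample values are hypothetical, and population (biased) moments are assumed, which may differ from the thesis's implementation.

```python
from statistics import fmean


def concordance_cc(y_true, y_pred):
    """Lin's concordance correlation coefficient (population moments assumed)."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")

    mean_t = fmean(y_true)
    mean_p = fmean(y_pred)

    # Population variances and covariance (divide by n, not n - 1).
    var_t = fmean([(t - mean_t) ** 2 for t in y_true])
    var_p = fmean([(p - mean_p) ** 2 for p in y_pred])
    cov = fmean([(t - mean_t) * (p - mean_p) for t, p in zip(y_true, y_pred)])

    denom = var_t + var_p + (mean_t - mean_p) ** 2
    if denom == 0:
        # Both inputs are constant and identical; the ratio is undefined.
        raise ValueError("CCC is undefined for identical constant inputs")
    return 2 * cov / denom


# Made-up severity scores, for illustration only.
labels = [1, 2, 3, 4]
shifted_predictions = [2, 3, 4, 5]
score = concordance_cc(labels, shifted_predictions)
```

Unlike Pearson correlation, CCC penalizes a constant offset between predictions and labels: the shifted predictions above are perfectly correlated with the labels, yet their CCC is below 1.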

Keywords

Depression severity assessment, Speech transcription analysis, Text analysis, Deep learning

Degree Discipline

Computer Engineering

Degree Level

Master's

Degree Name

MS (Master of Science)

Permalink

http://hdl.handle.net/11693/110549

Collections

Graduate School of Engineering and Science

Language

English

Type

Thesis
