Modeling speech transcriptions for automatic assessment of depression severity

buir.advisorDibeklioğlu, Hamdi
dc.contributor.authorKaynak, Ergün Batuhan
dc.date.accessioned2022-09-20T11:14:22Z
dc.date.available2022-09-20T11:14:22Z
dc.date.copyright2022-09
dc.date.issued2022-09
dc.date.submitted2022-09-19
dc.departmentDepartment of Computer Engineeringen_US
dc.descriptionCataloged from PDF version of article.en_US
dc.descriptionThesis (Master's): Bilkent University, Department of Computer Engineering, İhsan Doğramacı Bilkent University, 2022.en_US
dc.descriptionIncludes bibliographical references (leaves 55-66).en_US
dc.description.abstractIt is true that everyone has bad days from time to time. Unfortunately, for peo-ple suffering from depression, every day is a constant battle for motivation to do even the simplest of things, all the while dealing with hopelessness, physical and emotional fatigue, and sadness. Considering the ever-increasing number of people suffering from this disease, the necessity for automated depression severity assess-ment systems is profound. These systems can be used in treatment procedures, and the findings provided from learned models can help us better understand the dynamics of depression. To help in the solution to this illness, we propose a modular deep learning pipeline that uses speech transcripts as input for depression severity prediction. Through our pipeline, we investigate the role of popular deep learning archi-tectures in creating representations for depression assessment. To extend the depression assessment literature on text modality, we provide a thorough anal-ysis of sentence statistics and their effects on model training. We also present an investigation regarding the use of sentiment information for depression assess-ment. Evaluation of the proposed architectures is performed on the publicly available Extended Distress Analysis Interview Corpus dataset (E-DAIC). Through the results and discussions, we show that informative representations for depression assessment can be obtained without exploiting the temporal dynamics between sentences. Our proposed non-temporal model outperforms the state of the art by %8.8 in terms of Concordance Correlation Coefficient (CCC). In light of our findings on trained models and data statistics, we discuss how recurrent structures can have a bias toward certain sequence lengths during training and that shorter sentences can be more informative during inference. Our experimental results suggest that relying on semantic information rather than sentiment information, contrary to previous literature, may be more reliable for depression assessment.en_US
dc.description.degreeM.S.en_US
dc.description.statementofresponsibilityby Ergün Batuhan Kaynaken_US
dc.embargo.release2023-03-13
dc.format.extentxv, 66 leaves : illustrations (color), charts ; 30 cm.en_US
dc.identifier.itemidB161316
dc.identifier.urihttp://hdl.handle.net/11693/110549
dc.language.isoEnglishen_US
dc.publisherBilkent Universityen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectDepression severity assessmenten_US
dc.subjectSpeech transcription analysisen_US
dc.subjectText analysisen_US
dc.subjectDeep learningen_US
dc.titleModeling speech transcriptions for automatic assessment of depression severityen_US
dc.title.alternativeDepresyon şiddeti değerlendirmesi için konuşma çevriyazılarının modellenmesien_US
dc.typeThesisen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
B161316.pdf
Size:
1.04 MB
Format:
Adobe Portable Document Format
Description:
Full printable version
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.69 KB
Format:
Item-specific license agreed upon to submission
Description: