Natural language processing for defining linguistic features in schizophrenia: a sample from Turkish speakers

buir.contributor.authorÇabuk, Tuğçe
buir.contributor.authorSevim, Nurullah
buir.contributor.authorKoç, Aykut
buir.contributor.authorToulopoulou, Timothea
buir.contributor.orcidKoç, Aykut|0000-0002-6348-2663
dc.citation.epage189
dc.citation.spage183
dc.citation.volumeNumber266
dc.contributor.authorÇabuk, Tuğçe
dc.contributor.authorSevim, Nurullah
dc.contributor.authorMutlu, Emre
dc.contributor.authorYağcıoğlu, A. Elif Anıl
dc.contributor.authorKoç, Aykut
dc.contributor.authorToulopoulou, Timothea
dc.date.accessioned2025-02-27T13:10:23Z
dc.date.available2025-02-27T13:10:23Z
dc.date.issued2024-04
dc.departmentDepartment of Psychology
dc.departmentAysel Sabuncu Brain Research Center (BAM)
dc.departmentDepartment of Electrical and Electronics Engineering
dc.departmentNational Magnetic Resonance Research Center (UMRAM)
dc.description.abstractNatural language processing (NLP) provides fast and accurate extraction of features related to the language of schizophrenia. We utilized NLP methods to test the hypothesis that schizophrenia is associated with altered linguistic features in Turkish, a non-Indo-European language, compared to controls. We also explored whether these possible altered linguistic features were language-dependent or -independent. We extracted and compared speech in schizophrenia (SZ, N = 38) and healthy well-matched control (HC, N = 38) participants using NLP. The analysis was conducted in two parts. In the first one, mean sentence length, total completed words, moving average type-token ratio to measure the lexical diversity, and first-person singular pronoun usage were calculated. In the second one, we used parts-of-speech tagging (POS) and Word2Vec in schizophrenia and control. We found that SZ had lower mean sentence length and moving average type-token ratio but higher use of first-person singular pronoun. All these significant results were correlated with the Thought and Language Disorder Scale score. The POS approach demonstrated that SZ used fewer coordinating conjunctions. Our methodology using Word2Vec detected that SZ had higher semantic similarity than HC and K-Means could differentiate between SZ and HC into two distinct groups with high accuracy, 86.84 %. Our findings showed that altered linguistic features in SZ are mostly language-independent. They are promising to describe language patterns in schizophrenia which proposes that NLP measurements may allow for rapid and objective measurements of linguistic features.
dc.embargo.release2025-04
dc.identifier.doi10.1016/j.schres.2024.02.026
dc.identifier.eissn1573-2509
dc.identifier.issn0920-9964
dc.identifier.urihttps://hdl.handle.net/11693/116947
dc.language.isoEnglish
dc.publisherElsevier BV
dc.relation.isversionofhttps://dx.doi.org/10.1016/j.schres.2024.02.026
dc.rightsCC BY-NC-ND 4.0 (Attribution-NonCommercial-NoDerivatives 4.0 International)
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/
dc.source.titleSchizophrenia Research
dc.subjectLinguistics
dc.subjectSpeech
dc.subjectNatural language processing
dc.subjectSchizophrenia
dc.subjectTurkish
dc.subjectMachine learning
dc.subjectPsychosis
dc.titleNatural language processing for defining linguistic features in schizophrenia: a sample from Turkish speakers
dc.typeArticle

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Natural_language_processing_for_defining_linguistic_features_in_schizophrenia_a_sample_from_Turkish_speakers.pdf
Size:
1.01 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description:

Version History

Now showing 1 - 2 of 2
VersionDateSummary
2025-02-27 16:30:09
The latest version
1*
2025-02-27 16:10:23
A Correction to this article is available
* Selected version