Noun phrase chunker for Turkish using dependency parser

buir.advisorUlusoy, Özgür
dc.contributor.authorKutlu, Mücahid
dc.date.accessioned2016-01-08T18:14:13Z
dc.date.available2016-01-08T18:14:13Z
dc.date.issued2010
dc.departmentDepartment of Computer Engineeringen_US
dc.descriptionAnkara : The Department of Computer Engineering and the Institute of Engineering and Science of Bilkent University, 2010.en_US
dc.descriptionThesis (Master's) -- Bilkent University, 2010.en_US
dc.descriptionIncludes bibliographical references leaves 89-97.en_US
dc.description.abstractNoun phrase chunking is a sub-category of shallow parsing that can be used for many natural language processing tasks. In this thesis, we propose a noun phrase chunker system for Turkish texts. We use a weighted constraint dependency parser to represent the relationship between sentence components and to determine noun phrases. The dependency parser uses a set of hand-crafted rules which can combine morphological and semantic information for constraints. The rules are suitable for handling complex noun phrase structures because of their flexibility. The developed dependency parser can be easily used for shallow parsing of all phrase types by changing the employed rule set. The lack of reliable human tagged datasets is a significant problem for natural language studies about Turkish. Therefore, we constructed the first noun phrase dataset for Turkish. According to our evaluation results, our noun phrase chunker gives promising results on this dataset. The correct morphological disambiguation of words is required for the correctness of the dependency parser. Therefore, in this thesis, we propose a hybrid morphological disambiguation technique which combines statistical information, hand-crafted grammar rules, and transformation based learning rules. We have also constructed a dataset for testing the performance of our disambiguation system. According to tests, the disambiguation system is highly effective.en_US
dc.description.degreeM.S.en_US
dc.description.statementofresponsibilityKutlu, Mücahiden_US
dc.format.extentxiii, 124 leavesen_US
dc.identifier.itemidB122447
dc.identifier.urihttp://hdl.handle.net/11693/15149
dc.language.isoEnglishen_US
dc.publisherBilkent Universityen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectNatural Language Processingen_US
dc.subjectNoun Phrase Chunkeren_US
dc.subjectTurkishen_US
dc.subjectShallow Parsingen_US
dc.subjectMorphological Disambiguationen_US
dc.subject.lccQA76.9.N38 K88 2010en_US
dc.subject.lcshNatural language processing (Computer science)en_US
dc.subject.lcshComputational linguistics.en_US
dc.subject.lcshParsing (Computer grammar)en_US
dc.titleNoun phrase chunker for Turkish using dependency parseren_US
dc.typeThesisen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
0005003.pdf
Size:
1.26 MB
Format:
Adobe Portable Document Format