Improving the precision of example-based machine translation by learning from user feedback
buir.advisor | Çiçekli, İlyas | |
dc.contributor.author | Daybelge, Turhan Osman | |
dc.date.accessioned | 2016-01-08T18:02:05Z | |
dc.date.available | 2016-01-08T18:02:05Z | |
dc.date.issued | 2007 | |
dc.description | Ankara : The Department of Computer Engineering and the Institute of Engineering and Science of Bilkent University, 2007. | en_US |
dc.description | Thesis (Master's) -- Bilkent University, 2007. | en_US |
dc.description | Includes bibliographical references leaves 110-113 | en_US |
dc.description.abstract | Example-Based Machine Translation (EBMT) is a corpus based approach to Machine Translation (MT), that utilizes the translation by analogy concept. In our EBMT system, translation templates are extracted automatically from bilingual aligned corpora, by substituting the similarities and differences in pairs of translation examples with variables. As this process is done on the lexical-level forms of the translation examples, and words in natural language texts are often morphologically ambiguous, a need for morphological disambiguation arises. Therefore, we present here a rule-based morphological disambiguator for Turkish. In earlier versions of the discussed system, the translation results were solely ranked using confidence factors of the translation templates. In this study, however, we introduce an improved ranking mechanism that dynamically learns from user feedback. When a user, such as a professional human translator, submits his evaluation of the generated translation results, the system learns “contextdependent co-occurrence rules” from this feedback. The newly learned rules are later consulted, while ranking the results of the following translations. Through successive translation-evaluation cycles, we expect that the output of the ranking mechanism complies better with user expectations, listing the more preferred results in higher ranks. The evaluation of our ranking method, using the precision value at top 1, 3 and 5 results and the BLEU metric, is also presented. | en_US |
dc.description.provenance | Made available in DSpace on 2016-01-08T18:02:05Z (GMT). No. of bitstreams: 1 0003359.pdf: 2605482 bytes, checksum: 2f1f7f2839732691a06849c1e70e41a9 (MD5) | en |
dc.description.statementofresponsibility | Daybelge, Turhan Osman | en_US |
dc.format.extent | xiv, 144 leaves, ill, tables | en_US |
dc.identifier.itemid | BILKUTUPB103951 | |
dc.identifier.uri | http://hdl.handle.net/11693/14558 | |
dc.language.iso | English | en_US |
dc.rights | info:eu-repo/semantics/openAccess | en_US |
dc.subject | Example-Based Machine Translation | en_US |
dc.subject | Learning from User Feedback | en_US |
dc.subject | Morphological Disambiguation | en_US |
dc.subject.lcc | P309 .D39 2007 | en_US |
dc.subject.lcsh | Machine translating. | en_US |
dc.subject.lcsh | Computational linguistic. | en_US |
dc.title | Improving the precision of example-based machine translation by learning from user feedback | en_US |
dc.type | Thesis | en_US |
thesis.degree.discipline | Computer Engineering | |
thesis.degree.grantor | Bilkent University | |
thesis.degree.level | Master's | |
thesis.degree.name | MS (Master of Science) |
Files
Original bundle
1 - 1 of 1