Browsing by Subject "Linguistics--Data processing."
Now showing 1 - 7 of 7
Results Per Page
Sort Options
Item Open Access An ATN grammar for Turkish(Bilkent University, 1993) Demir, CoşkunSyntactic parsing i.s an iinporta.nt step in a.ny natural language processing system. Augmented Transition Networks (A'l'Ns) are procedural mechanisms which have been one of the earliest and most common paradigms for parsing natural language. ATNs have the generative power of a Turing machine and were first popularized by Woods in 1970. This thesis presents our efforts in developing an ATN grammar for a subset of Turkish including simple and complex sentences. There are five networks in our grammar: the sentence (S) network, which includes the sentence structures that falls in our scope, the noun phrase (NP) network, the adverbial phrase (ADVP) network and finally the clause (CLAUSE) and gerund (GERUND) networks for handling complex sentences. We present results from parsing a large number of Turkish sentences.Item Open Access Design and implementation of a tactical generator for Turkish, a free constituent order language(Bilkent University, 1996) Hakkani, Dilek ZeynepThis thesis describes a tactical generator for Turkish, a free constituent order language, in which the order of the constituents may change according to the information structure of the sentences to be generated. In the absence of any information regarding the information structure of a sentence (i.e., topic, focus, background, etc.), the constituents of the sentence obey a default order, but the order is almost freely changeable, depending on the constraints of the text flow or discourse. We have used a recursively structured finite state machine for handling the changes in constituent order, implemented as a right-linear grammar backbone. Our implementation environment is the GenKit system, developed at Carnegie Mellon University-Center for Machine Translation. Morphological realization has been implemented using an external morphological analysis/generation component which performs concrete morpheme selection and handles morphographemic processes.Item Open Access Design and implementation of a verb lexicon and verb sense disambiguator for Turkish(Bilkent University, 1994) Yılmaz, OkanThe lexicon has a crucial role in all natural language processing systems and has special importance in machine translation systems. This thesis presents the design and implementation of a verb lexicon and a verb sense disambigua- tor for Turkish. The lexicon contains only verbs because verbs encode events in sentences and play the most important role in natural language processing systems, especially in parsing (syntactic analyzing) and machine translation. The verb sense disambiguator uses the information stored in the verb lexicon that we developed. The main purpose of this tool is to disambiguate senses of verbs having several meanings, some of which are idiomatic. We also present a tool implemented in Lucid Common Lisp under X-Windows for adding, accessing, modifying, and removing entries of the lexicon, and a semantic concept ontology containing semantic features of commonly used Turkish nouns.Item Open Access A lexical-functional grammar for Turkish(Bilkent University, 1993) Güngördü, ZelalNatural language processing is a research area which is becoming increasingly popular each day for both academic and commercial reasons. Syntactic parsing underlies most of the applications in natural language processing. Although there have been comprehensive studies of Turkish syntax from a linguistic perspective, this is one of the first attempts for investigating it extensively from a computational point of view. In this thesis, a lexical-functional grammar for Turkish syntax is presented. Our current work deals with regular Turkish sentences that are structurally simple or complex.Item Open Access A performatory analysis of the overt use of the predicate "true"(Bilkent University, 2013) Şenol, Mahmut BurakThe de ationary theory has been one of the most in uential theories of truth in contemporary philosophy. This theory holds that there is no property of truth at all, and that overt uses of the predicate \true" in our sentences are redundant, having absolutely no e ect on what we express. However, all hypothetical examples used by de ationary theorists in exemplifying the theory, in papers, books, have been taken out of context. Thus, there is no way to examine and analyze what the predicate adds to the sentence within context. We oppose this theory not on philosophical grounds, but on empirical grounds, with an \ordinary language philosophy" approach. We computationally collect 7610 occurrences of overt uses of the predicate \true" in the form \it is true that", from 10 in uential periodicals (newspapers and a magazine) published in the United States. We classify and annotate these examples with respect to coordinating and subordinating conjunctions' positions they contain. We investigate contextual relations of the proposition following the phrase \it is true that" with its surrounding propositions. We encounter 34 di erent syntactical patterns. We propose that in some occurrences of overt uses of the predicate \true", existence of the predicate makes an emphasis, performs an action in the same manner as a performatory verb does. We provide ordinary language appearances of overt uses of the predicate \true", which have been used in linguistically reliable media and constitute pragmatic `counter-examples' to the de ationary theory of truth.Item Open Access Tagging and morphological disambiguation of Turkish text(Bilkent University, 1994) Kuruöz, İlkerA part-of-speech (POS) tagger is a system that uses various sources of information to assign possibly unique POS to words. Automatic text tagging is an important component in higher level analysis of text corpora. Its output can also be used in many natural language processing applications. In languages like Turkish or Finnish, with agglutinative morphology, morphological disambiguation is a very crucial process in tagging as the structures of many lexical forms are morphologically ambiguous. This thesis present a POS tagger for Turkish text based on a full-scale two-level specification of Turkish morphology. The tagger is augmented with a multi-word and idiomatic construct recognizer, and most importantly morphological disambiguator based on local lexical neighborhood constraints, heuristics and limited amount of statistical information. The tagger also has additional functionality for statistics compilation and fine tuning of the morphological analyzer, such as logging erroneous morphological parses, commonly used roots, etc. Test results indicate that the tagger can tag about 97/% to 99% of the texts accurately with very minimal user intervention. Furthermore for sentences morphologically disambiguated with the tagger, an LFG parser developed for Turkish, on the average, generates 50% less ambiguous parses almost 2.5 times faster.Item Open Access Turkish text generation with systemic-functional grammar(Bilkent University, 1996) Korkmaz, TurgayNatural Language Generation (NLG) is roughly decomposed into two stages: text planning, and text generation. In the text planning stage, the semantic description of the text is produced from the conceptual inputs. Then, the text generation system transforms this semantic description into an actual text. This thesis focuses on the design and implementation of a Turkish text generation system rather than text planning. To develop a text generator, we need a linguistic theory that describes the resources of the desired natural language, and also a software tool that represents and performs these linguistic resources in a computational environment. In this thesis, in order to carry out the mentioned requirements, we have used a functional linguistic theory called Systemic-Functional Grammar (SFG), and the FUF text generation system as a software tool. The ultimate text generation system takes the semantic description of the text sentence by sentence, and then produces a morphological description for each lexical constituent of the sentence. The morphological descriptions are worded by a Turkish morphological generator. Because of our concentration on the text generation, we have not considered the details of the text planning. Hence, we assume that the semantic description of the text is produced and lexicalized by an application (currently given by hand).