Improving educational search and question answering

buir.advisorUlusoy, Özgür
dc.contributor.authorYılmaz, Tolga
dc.date.accessioned2016-08-29T07:36:14Z
dc.date.available2016-08-29T07:36:14Z
dc.date.copyright2016-06
dc.date.issued2016-06
dc.date.submitted2016-08-12
dc.descriptionCataloged from PDF version of article.en_US
dc.descriptionThesis (M.S.): Bilkent University, Department of Computer Engineering, İhsan Doğramacı Bilkent University, 2016.en_US
dc.descriptionIncludes bibliographical references (leaves 74-85).en_US
dc.description.abstractStudents use general web search engines (GSEs) as their primary source of research while trying to find answers to school related questions. Although GSEs are highly relevant for the general population, they may return results that are out of education context. Another rising trend; social community question answering websites (CQ&A) are the secondary choice for students who try to get answers from other peers online. We focus on discovering possible improvements on educational search by leveraging both of the two information sources. The first part of our work involves Q&A websites. In order to gain contextual and behavioral insights, we extract the content of a commonly used educational Q&A website with a scraper we implement. We analyze the content in terms of user behavior and try to understand to what extent the educational Q&A differs from the general purpose Q&A. In the second part, we implement a classifier for educational questions. This classifier is built by an ensemble method that employs several regular learning algorithms and retrieval based ones that utilize external resources. We also build a query expander to facilitate classification. We further improve the classification using search engine results. In the third part, in order to find out whether search engine ranking can be improved in the education domain using the classification model, we collect and label a set of query results retrieved from a GSE. We propose five ad-hoc methods to improve search ranking based on the idea that the query-document category relation is an indicator of relevance. We evaluate these methods on various query sets and show that some of the methods significantly improve the rankings in the education domain. In the last part, we focus on educational spell checking. In educational search systems, it is common for users to make spelling mistakes. Actual query logs of two commercial search engines in the education domain are analyzed in terms of spelling mistakes using 5 well-known spell correction software that are not education specific and lack the terms that are used in the education field. It is shown that by extending the spell-check dictionary of one of them, even with a small-sized education oriented word-list, one can improve the precision, recall and F1 values of a spell-checker.en_US
dc.description.provenanceSubmitted by Betül Özen (ozen@bilkent.edu.tr) on 2016-08-29T07:36:14Z No. of bitstreams: 1 10118598.pdf: 968630 bytes, checksum: aa414ab1cec88d130eb2b9562c5f6612 (MD5)en
dc.description.provenanceMade available in DSpace on 2016-08-29T07:36:14Z (GMT). No. of bitstreams: 1 10118598.pdf: 968630 bytes, checksum: aa414ab1cec88d130eb2b9562c5f6612 (MD5) Previous issue date: 2016-08en
dc.description.statementofresponsibilityby Tolga Yılmaz.en_US
dc.format.extentxv, 86 leaves : illustrations (some color).en_US
dc.identifier.itemidB119849
dc.identifier.urihttp://hdl.handle.net/11693/32174
dc.language.isoEnglishen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectEducationen_US
dc.subjectClassificationen_US
dc.subjectSearch Engine Rankingen_US
dc.subjectSpell Checkersen_US
dc.subjectSocial Q&Aen_US
dc.titleImproving educational search and question answeringen_US
dc.title.alternativeEğitsel arama ve soru cevaplandırmanın geliştirilmesien_US
dc.typeThesisen_US
thesis.degree.disciplineComputer Engineering
thesis.degree.grantorBilkent University
thesis.degree.levelMaster's
thesis.degree.nameMS (Master of Science)

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
10118598.pdf
Size:
945.93 KB
Format:
Adobe Portable Document Format
Description:
Full printable version

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: