Regression by selecting best feature(s)

buir.advisorGüvenir, Halil Altay
dc.contributor.authorAydın, Tolga
dc.date.accessioned2016-01-08T20:17:39Z
dc.date.available2016-01-08T20:17:39Z
dc.date.issued2000
dc.descriptionAnkara : Department of Computer Engineering and the Institute of Engineering and Science of Bilkent Univ., 2000.en_US
dc.descriptionThesis (Master's) -- Bilkent University, 2000.en_US
dc.descriptionIncludes bibliographical references leaves 75-78.en_US
dc.description.abstractTwo new machine learning methods, Regression by Selecting Best Feature Projections (RSBFP) and Regression by Selecting Best Features (RSBF), are presented for regression problems. These methods heavily make use of least squares regression to induce eager, parametric and context-sensitive models. Famous regression approaches of machine learning and statistics literature such as DART, MARS, RULE and kNN can not construct models that are both predictive and have reasonable training and/or querying time durations. We developed RSBFP and RSBF to fill the gap in the literature for a regression method having higher predictive accuracy and faster training and querying time durations. RSBFP constructs a decision list consisting of simple linear regression lines belonging to linear features and/or categorical feature segments. RSBF is the extended version of RSBFP such that the decision list consists of both simple, belonging to categorical feature segments, and/or multiple, belonging to linear features, linear regression lines. A relevancy heuristic has been developed to determine the features involved in the multiple regression lines. It is shown that the proposed methods are robust to irrelevant features, missing feature values and target feature noise, which make them suitable prediction tools for real-world databases. In terms of robustness, RSBFP and RSBF give better results when compared to other famous regression methods.en_US
dc.description.provenanceMade available in DSpace on 2016-01-08T20:17:39Z (GMT). No. of bitstreams: 1 1.pdf: 78510 bytes, checksum: d85492f20c2362aa2bcf4aad49380397 (MD5)en
dc.description.statementofresponsibilityAydın, Tolgaen_US
dc.format.extentxv, 78 leavesen_US
dc.identifier.itemidBILKUTUPB053302
dc.identifier.urihttp://hdl.handle.net/11693/18249
dc.language.isoEnglishen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectRegressionen_US
dc.subjectFunction approximationen_US
dc.subjectFeature projectionsen_US
dc.subject.lccQA278.2 .A93 2000en_US
dc.subject.lcshRegression analysis.en_US
dc.subject.lcshRegression analysis--Data processing.en_US
dc.titleRegression by selecting best feature(s)en_US
dc.typeThesisen_US
thesis.degree.disciplineComputer Engineering
thesis.degree.grantorBilkent University
thesis.degree.levelMaster's
thesis.degree.nameMS (Master of Science)

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
0008390.pdf
Size:
3.26 MB
Format:
Adobe Portable Document Format
Description:
Full printable version