Regression by selecting best feature(s)
buir.advisor | Güvenir, Halil Altay | |
dc.contributor.author | Aydın, Tolga | |
dc.date.accessioned | 2016-01-08T20:17:39Z | |
dc.date.available | 2016-01-08T20:17:39Z | |
dc.date.issued | 2000 | |
dc.description | Ankara : Department of Computer Engineering and the Institute of Engineering and Science of Bilkent Univ., 2000. | en_US |
dc.description | Thesis (Master's) -- Bilkent University, 2000. | en_US |
dc.description | Includes bibliographical references leaves 75-78. | en_US |
dc.description.abstract | Two new machine learning methods, Regression by Selecting Best Feature Projections (RSBFP) and Regression by Selecting Best Features (RSBF), are presented for regression problems. These methods heavily make use of least squares regression to induce eager, parametric and context-sensitive models. Famous regression approaches of machine learning and statistics literature such as DART, MARS, RULE and kNN can not construct models that are both predictive and have reasonable training and/or querying time durations. We developed RSBFP and RSBF to fill the gap in the literature for a regression method having higher predictive accuracy and faster training and querying time durations. RSBFP constructs a decision list consisting of simple linear regression lines belonging to linear features and/or categorical feature segments. RSBF is the extended version of RSBFP such that the decision list consists of both simple, belonging to categorical feature segments, and/or multiple, belonging to linear features, linear regression lines. A relevancy heuristic has been developed to determine the features involved in the multiple regression lines. It is shown that the proposed methods are robust to irrelevant features, missing feature values and target feature noise, which make them suitable prediction tools for real-world databases. In terms of robustness, RSBFP and RSBF give better results when compared to other famous regression methods. | en_US |
dc.description.provenance | Made available in DSpace on 2016-01-08T20:17:39Z (GMT). No. of bitstreams: 1 1.pdf: 78510 bytes, checksum: d85492f20c2362aa2bcf4aad49380397 (MD5) | en |
dc.description.statementofresponsibility | Aydın, Tolga | en_US |
dc.format.extent | xv, 78 leaves | en_US |
dc.identifier.itemid | BILKUTUPB053302 | |
dc.identifier.uri | http://hdl.handle.net/11693/18249 | |
dc.language.iso | English | en_US |
dc.rights | info:eu-repo/semantics/openAccess | en_US |
dc.subject | Regression | en_US |
dc.subject | Function approximation | en_US |
dc.subject | Feature projections | en_US |
dc.subject.lcc | QA278.2 .A93 2000 | en_US |
dc.subject.lcsh | Regression analysis. | en_US |
dc.subject.lcsh | Regression analysis--Data processing. | en_US |
dc.title | Regression by selecting best feature(s) | en_US |
dc.type | Thesis | en_US |
thesis.degree.discipline | Computer Engineering | |
thesis.degree.grantor | Bilkent University | |
thesis.degree.level | Master's | |
thesis.degree.name | MS (Master of Science) |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- 0008390.pdf
- Size:
- 3.26 MB
- Format:
- Adobe Portable Document Format
- Description:
- Full printable version