Online anomaly detection with kernel density estimators
buir.advisor | Kozat, Süleyman Serdar | |
dc.contributor.author | Kerpiççi, Mine | |
dc.date.accessioned | 2019-08-02T08:02:16Z | |
dc.date.available | 2019-08-02T08:02:16Z | |
dc.date.copyright | 2019-07 | |
dc.date.issued | 2019-07 | |
dc.date.submitted | 2019-07-29 | |
dc.description | Cataloged from PDF version of article. | en_US |
dc.description | Thesis (M.S.) : Bilkent University, Department of Electrical and Electronics Engineering, İhsan Doğramacı Bilkent University, 2019. | en_US |
dc.description | Includes bibliographical references (leaves 40-44). | en_US |
dc.description.abstract | We study online anomaly detection in an unsupervised framework and introduce an algorithm to detect the anomalies in sequential data. We first sequentially learn the density for the observed data with a novel kernel based hierarchical approach for which we also provide a regret bound in a competitive manner against an exponentially large class of estimators. In our approach, we use a binary partitioning tree and apply the nonparametric Kernel Density Estimation (KDE) method at each node of the introduced tree. The use of the partitioning tree allows us not only to generate a large class of estimators of size doubly exponential in the depth that we compete against in estimating the density, but also to hierarchically organize the class to obtain a computationally efficient final estimation. Moreover, we do not assume any underlying distribution for the data so that our algorithm can work for data coming from any unknown arbitrarily complex distribution. Note that the end-to-end processing in our work is truly online. For this, we exploit a random Fourier kernel expansion for sequentially exact kernel evaluations without a repetitive access to past data. Our algorithm learns not only the optimal partitioning of the observation space but also the optimal bandwidth, which is locally tuned for the optimal partition. Thus, we solve the bandwidth selection problem in KDE methods in a highly novel and computationally efficient way. Finally, as the data density is sequentially being learned in the stream, we compare the estimated density with a threshold to detect the anomalies. We also adaptively learn the threshold in time to achieve the optimal threshold. In our experiments with synthetic and real datasets, we illustrate significant performance improvements achieved by our method against the state-of-the-art anomaly detection algorithms. | en_US |
dc.description.provenance | Submitted by Betül Özen (ozen@bilkent.edu.tr) on 2019-08-02T08:02:16Z No. of bitstreams: 1 Mine Kerpicci - Thesis.pdf: 814903 bytes, checksum: 38114070a2ba051814eac87d6a25c5ac (MD5) | en |
dc.description.provenance | Made available in DSpace on 2019-08-02T08:02:16Z (GMT). No. of bitstreams: 1 Mine Kerpicci - Thesis.pdf: 814903 bytes, checksum: 38114070a2ba051814eac87d6a25c5ac (MD5) Previous issue date: 2019-07 | en |
dc.description.statementofresponsibility | by Mine Kerpiççi | en_US |
dc.embargo.release | 2020-01-29 | |
dc.format.extent | xi, 44 leaves : charts (some color) ; 30 cm. | en_US |
dc.identifier.itemid | B160106 | |
dc.identifier.uri | http://hdl.handle.net/11693/52290 | |
dc.language.iso | English | en_US |
dc.rights | info:eu-repo/semantics/openAccess | en_US |
dc.subject | Online anomaly detection | en_US |
dc.subject | Kernel density estimation | en_US |
dc.subject | Bandwidth selection | en_US |
dc.subject | Regret analysis | en_US |
dc.title | Online anomaly detection with kernel density estimators | en_US |
dc.title.alternative | Çekirdek yoğunluk tahmincileri ile çevrimiçi anomali tespiti | en_US |
dc.type | Thesis | en_US |
thesis.degree.discipline | Electrical and Electronic Engineering | |
thesis.degree.grantor | Bilkent University | |
thesis.degree.level | Master's | |
thesis.degree.name | MS (Master of Science) |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Mine Kerpicci - Thesis.pdf
- Size:
- 795.8 KB
- Format:
- Adobe Portable Document Format
- Description:
- Full printable version
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.71 KB
- Format:
- Item-specific license agreed upon to submission
- Description: