Online minimax optimal density estimation and anomaly detection in nonstationary environments

buir.advisorKozat, Süleyman Serdar
dc.contributor.authorGökcesu, Kaan
dc.date.accessioned2017-08-29T06:08:31Z
dc.date.available2017-08-29T06:08:31Z
dc.date.copyright2017-07
dc.date.issued2017-07
dc.date.submitted2017-08-28
dc.descriptionCataloged from PDF version of article.en_US
dc.descriptionIncludes bibliographical references (leaves 64-71).en_US
dc.description.abstractOnline anomaly detection has attracted signi cant attention in recent years due to its applications in network monitoring, cybersecurity, surveillance and sensor failure. To this end, we introduce an algorithm that sequentially processes data to detect anomalies in time series. Our algorithm consists of two stages: density estimation and anomaly detection. First, we construct a probability density function to model the normal data. Then, we threshold the density of the newly observed data to detect anomalies. We approach this problem from an information theoretic perspective and, for the rst time in the literature, propose minimax optimal schemes for both stages to create an optimal anomaly detection algorithm in a strong deterministic sense. For the rst stage, we introduce an online density estimator that is minimax optimal for general nonstationary exponential-family of distributions without any assumptions on the observation sequence. Our algorithm does not require a priori knowledge of the time horizon, the drift of the underlying distribution or the time instances the parameters of the source changes. Our results are guaranteed to hold in an individual sequence manner. For the second stage, we propose an online threshold selection scheme that has logarithmic performance bounds against the best threshold chosen in hindsight. Our complete algorithm adaptively updates its parameters in a truly sequential manner to achieve log-linear regrets in both stages. Because of its universal prediction perspective on its density estimation, our anomaly detection algorithm can be used in unsupervised, semi-supervised and supervised manner. Through synthetic and real life experiments, we demonstrate substantial performance gains with respect to the state-of-the-art.en_US
dc.description.provenanceSubmitted by Betül Özen (ozen@bilkent.edu.tr) on 2017-08-29T06:08:30Z No. of bitstreams: 1 thesis-gokcesu.pdf: 1651109 bytes, checksum: f54e78774acf940f87f6536b7dc5408f (MD5)en
dc.description.provenanceMade available in DSpace on 2017-08-29T06:08:31Z (GMT). No. of bitstreams: 1 thesis-gokcesu.pdf: 1651109 bytes, checksum: f54e78774acf940f87f6536b7dc5408f (MD5) Previous issue date: 2017-08en
dc.description.statementofresponsibilityby Kaan Gökcesu.en_US
dc.embargo.release2020-08-25
dc.format.extentix, 71 leaves : charts (some color) ; 29 cm.en_US
dc.identifier.itemidB156121
dc.identifier.urihttp://hdl.handle.net/11693/33561
dc.language.isoEnglishen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectAnomaly Detectionen_US
dc.subjectTime Seriesen_US
dc.subjectOnline Learningen_US
dc.subjectDensity Estimationen_US
dc.subjectMinimax Optimalen_US
dc.subjectNonstationaryen_US
dc.titleOnline minimax optimal density estimation and anomaly detection in nonstationary environmentsen_US
dc.title.alternativeDurağan olmayan ortamlarda çevrimiçi minimaks optimal yoğunluk tahmini ve anomali tespitien_US
dc.typeThesisen_US
thesis.degree.disciplineElectrical and Electronic Engineering
thesis.degree.grantorBilkent University
thesis.degree.levelMaster's
thesis.degree.nameMS (Master of Science)

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
thesis-gokcesu.pdf
Size:
1.57 MB
Format:
Adobe Portable Document Format
Description:
Full printable version

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: