Anomaly detection in time series

Poyraz, Onur

View/Open

File_10251493 (12.96Mb)

Date

2019

Author

Poyraz, Onur

Metadata

Show full item record

Abstract

Olağandışılık Sezimi (AD), mevcut veri kümesinin diğer gözlemleriyle örtüşmeyen gözlemlerin sezimlenmesidir. Olağandışılık türleri ile bunların veri kümesi içindeki oluşumları belirlenmeye çalışılır. Öte yandan, zaman serisi yapıları zaman içinde gelişen devingen yapılara sahiptir ve bu tür yapılarda gözlemler önceki gözlemlere bağlıdır. Bu tez, zaman serisi yapıları altındaki olağandışılık sezimi sürecine odaklanmaktadır. Bu problem her zaman kolay değildir, çünkü olağandışılığın tanımı, sürecin devingen yapısı bağlamında değişebilir ve sistem içerisindeki olağandışılık sezimi işlemi gözlemlerdeki yüksek gürültülerle karışabilir.Bu tezde, akış verilerinin alt kümelerinde meydana gelene olağandışılıkları belirlemeye çalışıyoruz. Bunu yaparken, sistemdeki olağandışılıkları hatalı gözlemlerden ayırt etmek istiyoruz. Bu nedenle verilerdeki toplu olağandışılıkları araştırmaktayız. Bu özelliklere sahip olan zaman serisinde olağandışılık sezimi (ADTS) için hem istatistiksel çıkarım yöntemleri hem de derin öğrenme yaklaşımları öneriyoruz. Derin öğrenme yaklaşımları olarak Tekrarlayan Sinir Ağları (RNN) ve Uzun Kısa Süreli Bellek'i (LSTM) kullanırken, Gaussian karışım modelini (GMM) ve özelleştirilmiş bir gizli Markov modelini (HMM) istatistiksel yaklaşımlar olarak kullanıyoruz. GMM'ler hariç, yukarıda önerilen modellerde veri kümelerinin sıralı yapılarını dikkate alıyoruz. Yöntemlerimizi, Borusan rüzgar türbinleri verilerine uyguluyoruz ve model sonuçlarını bu veri kümesi üzerinde yaptığımız deneylerle karşılaştırıyoruz.

Anomaly detection (AD) is the discovery of the observations which does not conform with the rest of the observations. The types of anomalies and their occurrences that exist in the data set are tried to be determined. On the other hand, time series structures have dynamic structures, which are evolving over time, and in such structures, observations will be affected by previous observations. This thesis focuses on the anomaly detection process under time series structures. This problem is not always straightforward because the definition of anomaly could change with the context of the dynamic structure and anomaly detection process in the system could interfere with the intense noises at the observations.In this thesis, we try to identify anomalies in the sub-sequences of the streaming data. When doing so, we also want to discriminate the anomalies in the system with the faulty observations. Therefore we investigate collective anomalies in the data. We propose both statistical inference methods and deep learning approaches for such type of anomaly detection in time series (ADTS) problem. We use a Gaussian mixture model (GMM) and a customized hidden Markov model (HMM) as statistical approaches, while we use Recurrent Neural Networks (RNN) and Long Short-Term Memories (LSTM) as deep learning approaches. Except for GMMs, we take into account the sequential structures of data sets in the models proposed above. We apply our methodologies to the Borusan wind turbines data and we compare the model results with the experiments we performed on this dataset.

URI

https://acikbilim.yok.gov.tr/handle/20.500.12812/72376

Collections

TEZLER

Except where otherwise noted, this item's license is described as info:eu-repo/semantics/openAccess