Autoencoder algorithms for time series anomaly detection

Κωνσταντοπούλου, Ευαγγελία

Αλγόριθμοι autoencoder για ανίχνευση ανωμαλιών σε χρονοσειρές

Μεταπτυχιακή διπλωματική εργασία

Συγγραφέας

Κωνσταντοπούλου, Ευαγγελία

Ημερομηνία

2024-09-02

Επιβλέπων

Κάντζος, Δημήτριος

Konstantopoulou Evangelia - MSc Thesis, Autoencoder Algorithms for Time Series Anomaly Detection_DD.pdf (5.862Mb)

Λέξεις-κλειδιά

Time series ; Anomaly detection ; Autoencoders ; Semi-supervised learning ; Χρονικές σειρές ; Ανίχνευση ανωμαλιών ; Αυτόματοι κωδικοποιητές ; Ημι-εποπτευόμενη μάθηση

Περίληψη

Time series data capture the behavior of variables over time and as such they are of core importance in a broad variety of scientific disciplines and applications including finance, industry, healthcare and networks, among others. In many use cases, it is crucial to ensure that the observed variable behavior corresponds to a historical normality. When deviations from the norm occur, it might signify a problem or an event of interest, so detecting them is important in order to take appropriate actions. In this thesis the problem of time series anomaly detection is tackled from a data-driven approach, focusing on the use of Autoencoders. Methodologically, the problem is addressed using semi-supervised learning, where all models are trained solely on normal data. The underlying assumption is that a model trained to capture only normal data behavior will underperform when fed with data anomalies. Several Autoencoders are developed and compared to other commonly used anomaly detection methods to assess their ability. Different constraints were are applied to each Autoencoder to evaluate how addressing their tendency to overfit training data towards the identity mapping, can influence their performance. Additionally, a custom Autoencoder-based scheme, named “Fusion Autoencoder”, is designed and implemented to account for past input data, i.e., possible memory effects. During the detection stage, a sliding window is employed which enables to smooth model outputs and produce confidence intervals for models that do not natively support it. All the models are evaluated using the UCR Anomaly Archive, a carefully constructed dataset free from common dataset flaws, ensuring reliable results. Model performance is evaluated using three different categories of metrics to provide an exhaustive assessment over multiple different aspects of their performance. To bring out the strengths and limitations of each model, the conducted experiments focus on two main axes: overall performance comparison between the different models and performance on different types of anomalies. Although all of the implemented autoencoders performed better than the other models, the simplest one, the “Vanilla” autoencoder, outperforms the rest by a significant margin.

Αριθμός σελίδων

Σχολή

Σχολή Μηχανικών

Ακαδημαϊκό Τμήμα

Τμήμα Ηλεκτρολόγων και Ηλεκτρονικών Μηχανικών
Τμήμα Μηχανικών Βιομηχανικής Σχεδίασης και Παραγωγής

Τίτλος Προγράμματος Μεταπτυχιακών Σπουδών

Τεχνητή Νοημοσύνη και Βαθιά Μάθηση

Γλώσσα

Αγγλικά

URI

https://polynoe.lib.uniwa.gr/xmlui/handle/11400/7574
http://dx.doi.org/10.26265/polynoe-7406

Συλλογή

Μεταπτυχιακές διπλωματικές εργασίες - Τεχνητή Νοημοσύνη και Βαθιά Μάθηση

Εμφάνιση πλήρους εγγραφής

Αναφορά Δημιουργού - Μη Εμπορική Χρήση - Παρόμοια Διανομή 4.0 Διεθνές

Εκτός από όπου επισημαίνεται κάτι διαφορετικό, το τεκμήριο διανέμεται με την ακόλουθη άδεια:
Αναφορά Δημιουργού - Μη Εμπορική Χρήση - Παρόμοια Διανομή 4.0 Διεθνές