Interpretable Anomaly Detection in Event Sequences via Sequence Matching and Visual Comparison

Shunan Guo, Zhuochen Jin, Qing Chen, David Gotz, Hongyuan Zha, Nan Cao

View presentation:2022-10-21T14:12:00ZGMT-0600Change your timezone on the schedule page
2022-10-21T14:12:00Z
Exemplar figure, described by caption below
The user interface of the visual analytics system, which consists of seven key views to support comparison-based visual anomaly detection: (1) anomaly overview, (2) similarity view, (3) reconstruction view, (4) anomalous sequence view, (5) normal sequence view with two variants(5A, 5B) in cluster mode and sequence mode, respectively, (6) anomalous record view and (7) similar record list.

Prerecorded Talk

The live footage of the talk, including the Q&A, can be viewed on the session page, Comparisons.

Fast forward
Keywords

Anomaly detection, Data models, Data visualizations, Task analysis, Sequences, Heart, Diabetes

Abstract

Anomaly detection is a common analytical task that aims to identify rare cases that differ from the typical cases that make up the majority of a dataset. When analyzing event sequence data, the task of anomaly detection can be complex because the sequential and temporal nature of such data results in diverse definitions and flexible forms of anomalies. This, in turn, increases the difficulty in interpreting detected anomalies. In this paper, we propose a visual analytic approach for detecting anomalous sequences in an event sequence dataset via an unsupervised anomaly detection algorithm based on Variational AutoEncoders. We further compare the anomalous sequences with their reconstructions and with the normal sequences through a sequence matching algorithm to identify event anomalies. A visual analytics system is developed to support interactive exploration and interpretations of anomalies through novel visualization designs that facilitate the comparison between anomalous sequences and normal sequences. Finally, we quantitatively evaluate the performance of our anomaly detection algorithm, demonstrate the effectiveness of our system through case studies, and report feedback collected from study participants.