Volume 41, Issue 3 pp. 271-282
High Dimensional Data

Exploring Multivariate Event Sequences with an Interactive Similarity Builder

Shaobin Xu

Shaobin Xu

College of Computer Science and Technology, Jilin University, China

Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, China

Search for more papers by this author
Minghui Sun

Minghui Sun

College of Computer Science and Technology, Jilin University, China

Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, China

Search for more papers by this author
Zhengtai Zhang

Zhengtai Zhang

College of Computer Science and Technology, Jilin University, China

Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, China

Search for more papers by this author
Hao Xue

Hao Xue

College of Computer Science and Technology, Jilin University, China

Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, China

Search for more papers by this author
First published: 12 August 2022

Abstract

Similarity-based exploration is an effective method in knowledge discovery. Faced with multivariate event sequence data (MVES), developing a satisfactory similarity measurement for a specific question is challenging because of the heterogeneity introduced by numerous attributes with different data formats, coupled with their associations. Additionally, the absence of effective validation feedback makes judging the goodness of a measurement scheme a time-consuming and error-prone procedure. To free analysts from tedious programming to concentrate on the exploration of MVES data, this paper introduces an interactive similarity builder, where analysts can use visual building blocks for assembling similarity measurements in a drag-and-drop and incremental fashion. Based on the builder, we further propose a visual analytics framework that provides multi-granularity visual validations for measurement schemes and supports a recursive workflow for refining the focus set. We illustrate the power of our prototype through a case study and a user study with real-world datasets. Results suggest that the system improves the efficiency of developing similarity measurements and the usefulness of exploring MVES data.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.