Volume 41, Issue 3 pp. 271-282

High Dimensional Data

Exploring Multivariate Event Sequences with an Interactive Similarity Builder

Shaobin Xu,

Shaobin Xu

College of Computer Science and Technology, Jilin University, China

Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, China

Search for more papers by this author

Minghui Sun,

Minghui Sun

College of Computer Science and Technology, Jilin University, China

Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, China

Search for more papers by this author

Zhengtai Zhang,

Zhengtai Zhang

College of Computer Science and Technology, Jilin University, China

Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, China

Search for more papers by this author

Hao Xue,

Hao Xue

College of Computer Science and Technology, Jilin University, China

Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, China

Search for more papers by this author

Shaobin Xu,

Shaobin Xu

College of Computer Science and Technology, Jilin University, China

Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, China

Search for more papers by this author

Minghui Sun,

Minghui Sun

College of Computer Science and Technology, Jilin University, China

Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, China

Search for more papers by this author

Zhengtai Zhang,

Zhengtai Zhang

College of Computer Science and Technology, Jilin University, China

Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, China

Search for more papers by this author

Hao Xue,

Hao Xue

College of Computer Science and Technology, Jilin University, China

Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, China

Search for more papers by this author

First published: 12 August 2022

https://doi.org/10.1111/cgf.14539

Share a link

Email
Wechat
Bluesky

Abstract

Similarity-based exploration is an effective method in knowledge discovery. Faced with multivariate event sequence data (MVES), developing a satisfactory similarity measurement for a specific question is challenging because of the heterogeneity introduced by numerous attributes with different data formats, coupled with their associations. Additionally, the absence of effective validation feedback makes judging the goodness of a measurement scheme a time-consuming and error-prone procedure. To free analysts from tedious programming to concentrate on the exploration of MVES data, this paper introduces an interactive similarity builder, where analysts can use visual building blocks for assembling similarity measurements in a drag-and-drop and incremental fashion. Based on the builder, we further propose a visual analytics framework that provides multi-granularity visual validations for measurement schemes and supports a recursive workflow for refining the focus set. We illustrate the power of our prototype through a case study and a user study with real-world datasets. Results suggest that the system improves the efficiency of developing similarity measurements and the usefulness of exploring MVES data.

Supporting Information

References

Angelini M., Santucci G., Schumann H., Schulz H.-J.: A review and characterization of progressive visual analytics. In Informatics (2018), vol. 5, Multidisciplinary Digital Publishing Institute, p. 31. 10
Google Scholar
Borg I., Groenen P. J.: Modern multidimensional scaling: Theory and applications. Springer Science & Business Media, 2005. 6
Google Scholar
Bostock M., Ogievetsky V., Heer J.: D³ data-driven documents. IEEE transactions on visualization and computer graphics 17, 12 (2011), 2301–2309. 5
10.1109/TVCG.2011.185
PubMed Web of Science® Google Scholar
Bach B., Shi C., Heulot N., Madhyastha T., Grabowski T., Dragicevic P.: Time curves: Folding time to visualize patterns of temporal evolution in data. IEEE transactions on visualization and computer graphics 22, 1 (2015), 559–568. 3
10.1109/TVCG.2015.2467851
Web of Science® Google Scholar
Bernard J., Wilhelm N., Scherer M., May T., Schreck T.: Timeseriespaths: Projection-based explorative analysis of multivariate time series data. 3
Google Scholar
Cavallo M., Demiralp Ç.: Clustrophile 2: Guided visual clustering analysis. IEEE transactions on visualization and computer graphics 25, 1 (2018), 267–276. 2
10.1109/TVCG.2018.2864477
Web of Science® Google Scholar
Cappers B. C., Meessen P. N., Etalle S., Van Wijk J. J.: Eventpad: Rapid malware analysis and reverse engineering using visual analytics. In 2018 IEEE Symposium on Visualization for Cyber Security (VizSec) (2018), IEEE, pp. 1–8. 3, 6, 7
Google Scholar
Cappers B. C., van Wijk J. J.: Exploring multivariate event sequences using rules, aggregations, and selections. IEEE transactions on visualization and computer graphics 24, 1 (2017), 532–541. 3, 6, 7
10.1109/TVCG.2017.2745278
PubMed Web of Science® Google Scholar
Chen Y., Xu P., Ren L.: Sequence synopsis: Optimize visual summary of temporal event data. IEEE transactions on visualization and computer graphics 24, 1 (2017), 45–55. 7
10.1109/TVCG.2017.2745083
PubMed Web of Science® Google Scholar
Chen Q., Yue X., Plantaz X., Chen Y., Shi C., Pong T.-C., Qu H.: Viseq: Visual analytics of learning sequence in massive open online courses. IEEE transactions on visualization and computer graphics 26, 3 (2018), 1622–1636. 2
10.1109/TVCG.2018.2872961
PubMed Web of Science® Google Scholar
Du F., Plaisant C., Spring N., Shneiderman B.: Eventaction: Visual analytics for temporal event sequence recommendation. In 2016 IEEE Conference on Visual Analytics Science and Technology (VAST) (2016), IEEE, pp. 61–70. 2
Google Scholar
Espadoto M., Martins R. M., Kerren A., Hirata N. S., Telea A. C.: Toward a quantitative survey of dimension reduction techniques. IEEE transactions on visualization and computer graphics 27, 3 (2019), 2153–2173. 2
10.1109/TVCG.2019.2944182
Web of Science® Google Scholar
Eick C. F., Zeidat N., Zhao Z.: Supervised clustering-algorithms and benefits. In 16Th IEEE international conference on tools with artificial intelligence (2004), IEEE, pp. 774–776. 6
Google Scholar
Fujiwara T., Chou J.-K., Shilpika S., Xu P., Ren L., Ma K.-L.: An incremental dimensionality reduction method for visualizing streaming multidimensional data. IEEE transactions on visualization and computer graphics 26, 1 (2019), 418–428. 6
10.1109/TVCG.2019.2934433
PubMed Web of Science® Google Scholar
Gower J. C., Dijksterhuis G. B., et al.: Procrustes problems, vol. 30. Oxford University Press on Demand, 2004. 6
Google Scholar
Guo R., Fujiwara T., Li Y., Lima K. M., Sen S., Tran N. K., Ma K.-L.: Comparative visual analytics for assessing medical records with sequence embedding. Visual Informatics 4, 2 (2020), 72–85. 2, 3
10.1016/j.visinf.2020.04.001
Google Scholar
Guo Y., Guo S., Jin Z., Kaul S., Gotz D., Cao N.: Survey on visual analysis of event sequence data. IEEE Transactions on Visualization and Computer Graphics (2021). 2
Google Scholar
Guo S., Jin Z., Gotz D., Du F., Zha H., Cao N.: Visual progression analysis of event sequence data. IEEE transactions on visualization and computer graphics 25, 1 (2018), 417–426. 2, 6
10.1109/TVCG.2018.2864885
Web of Science® Google Scholar
Ge T., Lee B., Wang Y.: Cast: Authoring data-driven chart animations. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems (2021), pp. 1–15. 3
Google Scholar
Johnson A., Bulgarelli L., Pollard T., Horng S., Celi L. A., Mark R.: Mimic-iv (version 1.0). PhysioNet (2021). 9
Google Scholar
Jin Z., Cui S., Guo S., Gotz D., Sun J., Cao N.: Carepre: An intelligent clinical decision assistance system. ACM Transactions on Computing for Healthcare 1, 1 (2020), 1–20. 2
10.1145/3344258
Google Scholar
Javed W., McDonnel B., Elmqvist N.: Graphical perception of multiple time series. IEEE transactions on visualization and computer graphics 16, 6 (2010), 927–934. 3
10.1109/TVCG.2010.162
PubMed Web of Science® Google Scholar
Jönsson D., Steneteg P., Sundén E., Englund R., Kottravel S., Falk M., Ynnerman A., Hotz I., Ropinski T.: Inviwo—a visualization system with usage abstraction levels. IEEE transactions on visualization and computer graphics 26, 11 (2019), 3241–3254. 3
10.1109/TVCG.2019.2920639
PubMed Web of Science® Google Scholar
Kodinariya T. M., Makwana P. R.: Review on determining number of cluster in k-means clustering. International Journal 1, 6 (2013), 90–95. 6
Google Scholar
Keogh E., Ratanamahatana C. A.: Exact indexing of dynamic time warping. Knowledge and information systems 7, 3 (2005), 358–386. 4
10.1007/s10115-004-0154-9
Web of Science® Google Scholar
Levenshtein V. I., et al.: Binary codes capable of correcting deletions, insertions, and reversals. In Soviet physics doklady (1966), vol. 10, Soviet Union, pp. 707–710. 8
Google Scholar
Law P.-M., Liu Z., Malik S., Basole R. C.: Maqui: Interweaving queries and pattern mining for recursive event sequence exploration. IEEE transactions on visualization and computer graphics 25, 1 (2018), 396–406. 2
Google Scholar
Liu Y., Li Z., Xiong H., Gao X., Wu J.: Understanding of internal clustering validation measures. In 2010 IEEE international conference on data mining (2010), IEEE, pp. 911–916. 2
Google Scholar
Loorak M. H., Perin C., Kamal N., Hill M., Carpendale S.: Timespan: Using visualization to explore temporal multidimensional data of stroke patients. IEEE transactions on visualization and computer graphics 22, 1 (2015), 409–418. 3
10.1109/TVCG.2015.2467325
PubMed Web of Science® Google Scholar
Liu D., Xu P., Ren L.: Tpflow: Progressive partition and multidimensional pattern extraction for large-scale spatio-temporal data analysis. IEEE transactions on visualization and computer graphics 25, 1 (2018), 1–11. 7
10.1109/TVCG.2018.2865018
Web of Science® Google Scholar
MeVis Medical Solutions AG: Mevislab. Accessed: 2022-02-17. URL: https://www.mevislab.de. 3
Google Scholar
Mannila H., Ronkainen P.: Similarity of event sequences. In Proceedings of TIME'97: 4th International Workshop on Temporal Representation and Reasoning (1997), IEEE, pp. 136–139. 2
Google Scholar
Micallef L., Schulz H.-J., Angelini M., Aupetit M., Chang R., Kohlhammer J., Perer A., Santucci G.: The human user in progressive visual analytics. In Eurovis (short papers) (2019), pp. 19–23. 10
Google Scholar
Mirzargar M., Whitaker R. T., Kirby R. M.: Curve boxplot: Generalization of boxplot for ensembles of curves. IEEE transactions on visualization and computer graphics 20, 12 (2014), 2654–2663. 7
10.1109/TVCG.2014.2346455
PubMed Web of Science® Google Scholar
Salvador S., Chan P.: Toward accurate dynamic time warping in linear time and space. Intelligent Data Analysis 11, 5 (2007), 561–580. 5
10.3233/IDA-2007-11508
Web of Science® Google Scholar
Stolper C. D., Perer A., Gotz D.: Progressive visual analytics: User-driven visual exploration of in-progress analytics. IEEE Transactions on Visualization and Computer Graphics 20, 12 (2014), 1653–1662. 10
10.1109/TVCG.2014.2346574
PubMed Web of Science® Google Scholar
Thompson J. R., Liu Z., Stasko J.: Data animator: Authoring expressive animated data graphics. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems (2021), pp. 1–18. 3
Google Scholar
Van der Maaten L., Hinton G.: Visualizing data using t-sne. Journal of machine learning research 9, 11 (2008). 6
Google Scholar
Van Der Maaten L., Postma E., Van den Herik J., et al.: Dimensionality reduction: a comparative. J Mach Learn Res 10, 66–71 (2009), 13. 6
Google Scholar
Wenskovitch J., Crandell I., Ramakrishnan N., House L., North C.: Towards a systematic combination of dimension reduction and clustering in visual analytics. IEEE transactions on visualization and computer graphics 24, 1 (2017), 131–141. 6
10.1109/TVCG.2017.2745258
PubMed Web of Science® Google Scholar
Wu J., Guo Z., Wang Z., Xu Q., Wu Y.: Visual analytics of multivariate event sequence data in racquet sports. In 2020 IEEE Conference on Visual Analytics Science and Technology (VAST) (2020), IEEE, pp. 36–47. 2, 3, 7
Google Scholar
Wu J., Liu D., Guo Z., Xu Q., Wu Y.: Tacticflow: Visual analytics of ever-changing tactics in racket sports. IEEE Transactions on Visualization and Computer Graphics (2021). 3
Google Scholar
Wongsuphasawat K., Plaisant C., Taieb-Maimon M., Shneiderman B.: Querying event sequences by exact match or similarity search: Design and empirical evaluation. Interacting with computers 24, 2 (2012), 55–68. 2, 3
10.1016/j.intcom.2012.01.003
PubMed Web of Science® Google Scholar
Waser J., Ribicic H., Fuchs R., Hirsch C., Schindler B., Bloschl G., Groller E.: Nodes on ropes: A comprehensive data and control flow for steering ensemble simulations. IEEE transactions on visualization and computer graphics 17, 12 (2011), 1872–1881. 3
10.1109/TVCG.2011.225
PubMed Web of Science® Google Scholar
Wongsuphasawat K., Shneiderman B.: Finding comparable temporal categorical records: A similarity measure with an interactive visualization. In 2009 IEEE Symposium on Visual Analytics Science and Technology (2009), IEEE, pp. 27–34. 2, 3
Google Scholar
Zgraggen E., Drucker S. M., Fisher D., DeLine R.: (sl qu) eries: Visual regular expressions for querying and exploring event sequences. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems (2015), pp. 2683–2692. 3
Google Scholar

Volume41, Issue3

June 2022

Pages 271-282

Filename	Description
cgf14539-sup-0001-S1.pdf232.9 KB	Supporting Information
cgf14539-sup-0002-S1.mp468.2 MB	Supporting Information

Exploring Multivariate Event Sequences with an Interactive Similarity Builder

Abstract

Supporting Information

References

References

Information

About Wiley Online Library

Help & Support

Opportunities

Connect with Wiley

Exploring Multivariate Event Sequences with an Interactive Similarity Builder

Abstract

Supporting Information

References

References

Related

Information