Exploring Multivariate Event Sequences with an Interactive Similarity Builder
Shaobin Xu
College of Computer Science and Technology, Jilin University, China
Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, China
Search for more papers by this authorMinghui Sun
College of Computer Science and Technology, Jilin University, China
Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, China
Search for more papers by this authorZhengtai Zhang
College of Computer Science and Technology, Jilin University, China
Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, China
Search for more papers by this authorHao Xue
College of Computer Science and Technology, Jilin University, China
Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, China
Search for more papers by this authorShaobin Xu
College of Computer Science and Technology, Jilin University, China
Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, China
Search for more papers by this authorMinghui Sun
College of Computer Science and Technology, Jilin University, China
Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, China
Search for more papers by this authorZhengtai Zhang
College of Computer Science and Technology, Jilin University, China
Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, China
Search for more papers by this authorHao Xue
College of Computer Science and Technology, Jilin University, China
Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, China
Search for more papers by this authorAbstract
Similarity-based exploration is an effective method in knowledge discovery. Faced with multivariate event sequence data (MVES), developing a satisfactory similarity measurement for a specific question is challenging because of the heterogeneity introduced by numerous attributes with different data formats, coupled with their associations. Additionally, the absence of effective validation feedback makes judging the goodness of a measurement scheme a time-consuming and error-prone procedure. To free analysts from tedious programming to concentrate on the exploration of MVES data, this paper introduces an interactive similarity builder, where analysts can use visual building blocks for assembling similarity measurements in a drag-and-drop and incremental fashion. Based on the builder, we further propose a visual analytics framework that provides multi-granularity visual validations for measurement schemes and supports a recursive workflow for refining the focus set. We illustrate the power of our prototype through a case study and a user study with real-world datasets. Results suggest that the system improves the efficiency of developing similarity measurements and the usefulness of exploring MVES data.
Supporting Information
Filename | Description |
---|---|
cgf14539-sup-0001-S1.pdf232.9 KB | Supporting Information |
cgf14539-sup-0002-S1.mp468.2 MB | Supporting Information |
Please note: The publisher is not responsible for the content or functionality of any supporting information supplied by the authors. Any queries (other than missing content) should be directed to the corresponding author for the article.
References
- Angelini M., Santucci G., Schumann H., Schulz H.-J.: A review and characterization of progressive visual analytics. In Informatics (2018), vol. 5, Multidisciplinary Digital Publishing Institute, p. 31. 10
- Borg I., Groenen P. J.: Modern multidimensional scaling: Theory and applications. Springer Science & Business Media, 2005. 6
- Bostock M., Ogievetsky V., Heer J.: D3 data-driven documents. IEEE transactions on visualization and computer graphics 17, 12 (2011), 2301–2309. 5
- Bach B., Shi C., Heulot N., Madhyastha T., Grabowski T., Dragicevic P.: Time curves: Folding time to visualize patterns of temporal evolution in data. IEEE transactions on visualization and computer graphics 22, 1 (2015), 559–568. 3
- Bernard J., Wilhelm N., Scherer M., May T., Schreck T.: Timeseriespaths: Projection-based explorative analysis of multivariate time series data. 3
- Cavallo M., Demiralp Ç.: Clustrophile 2: Guided visual clustering analysis. IEEE transactions on visualization and computer graphics 25, 1 (2018), 267–276. 2
- Cappers B. C., Meessen P. N., Etalle S., Van Wijk J. J.: Eventpad: Rapid malware analysis and reverse engineering using visual analytics. In 2018 IEEE Symposium on Visualization for Cyber Security (VizSec) (2018), IEEE, pp. 1–8. 3, 6, 7
- Cappers B. C., van Wijk J. J.: Exploring multivariate event sequences using rules, aggregations, and selections. IEEE transactions on visualization and computer graphics 24, 1 (2017), 532–541. 3, 6, 7
- Chen Y., Xu P., Ren L.: Sequence synopsis: Optimize visual summary of temporal event data. IEEE transactions on visualization and computer graphics 24, 1 (2017), 45–55. 7
- Chen Q., Yue X., Plantaz X., Chen Y., Shi C., Pong T.-C., Qu H.: Viseq: Visual analytics of learning sequence in massive open online courses. IEEE transactions on visualization and computer graphics 26, 3 (2018), 1622–1636. 2
- Du F., Plaisant C., Spring N., Shneiderman B.: Eventaction: Visual analytics for temporal event sequence recommendation. In 2016 IEEE Conference on Visual Analytics Science and Technology (VAST) (2016), IEEE, pp. 61–70. 2
- Espadoto M., Martins R. M., Kerren A., Hirata N. S., Telea A. C.: Toward a quantitative survey of dimension reduction techniques. IEEE transactions on visualization and computer graphics 27, 3 (2019), 2153–2173. 2
- Eick C. F., Zeidat N., Zhao Z.: Supervised clustering-algorithms and benefits. In 16Th IEEE international conference on tools with artificial intelligence (2004), IEEE, pp. 774–776. 6
- Fujiwara T., Chou J.-K., Shilpika S., Xu P., Ren L., Ma K.-L.: An incremental dimensionality reduction method for visualizing streaming multidimensional data. IEEE transactions on visualization and computer graphics 26, 1 (2019), 418–428. 6
- Gower J. C., Dijksterhuis G. B., et al.: Procrustes problems, vol. 30. Oxford University Press on Demand, 2004. 6
- Guo R., Fujiwara T., Li Y., Lima K. M., Sen S., Tran N. K., Ma K.-L.: Comparative visual analytics for assessing medical records with sequence embedding. Visual Informatics 4, 2 (2020), 72–85. 2, 3
10.1016/j.visinf.2020.04.001 Google Scholar
- Guo Y., Guo S., Jin Z., Kaul S., Gotz D., Cao N.: Survey on visual analysis of event sequence data. IEEE Transactions on Visualization and Computer Graphics (2021). 2
- Guo S., Jin Z., Gotz D., Du F., Zha H., Cao N.: Visual progression analysis of event sequence data. IEEE transactions on visualization and computer graphics 25, 1 (2018), 417–426. 2, 6
- Ge T., Lee B., Wang Y.: Cast: Authoring data-driven chart animations. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems (2021), pp. 1–15. 3
- Johnson A., Bulgarelli L., Pollard T., Horng S., Celi L. A., Mark R.: Mimic-iv (version 1.0). PhysioNet (2021). 9
- Jin Z., Cui S., Guo S., Gotz D., Sun J., Cao N.: Carepre: An intelligent clinical decision assistance system. ACM Transactions on Computing for Healthcare 1, 1 (2020), 1–20. 2
10.1145/3344258 Google Scholar
- Javed W., McDonnel B., Elmqvist N.: Graphical perception of multiple time series. IEEE transactions on visualization and computer graphics 16, 6 (2010), 927–934. 3
- Jönsson D., Steneteg P., Sundén E., Englund R., Kottravel S., Falk M., Ynnerman A., Hotz I., Ropinski T.: Inviwo—a visualization system with usage abstraction levels. IEEE transactions on visualization and computer graphics 26, 11 (2019), 3241–3254. 3
- Kodinariya T. M., Makwana P. R.: Review on determining number of cluster in k-means clustering. International Journal 1, 6 (2013), 90–95. 6
- Keogh E., Ratanamahatana C. A.: Exact indexing of dynamic time warping. Knowledge and information systems 7, 3 (2005), 358–386. 4
- Levenshtein V. I., et al.: Binary codes capable of correcting deletions, insertions, and reversals. In Soviet physics doklady (1966), vol. 10, Soviet Union, pp. 707–710. 8
- Law P.-M., Liu Z., Malik S., Basole R. C.: Maqui: Interweaving queries and pattern mining for recursive event sequence exploration. IEEE transactions on visualization and computer graphics 25, 1 (2018), 396–406. 2
- Liu Y., Li Z., Xiong H., Gao X., Wu J.: Understanding of internal clustering validation measures. In 2010 IEEE international conference on data mining (2010), IEEE, pp. 911–916. 2
- Loorak M. H., Perin C., Kamal N., Hill M., Carpendale S.: Timespan: Using visualization to explore temporal multidimensional data of stroke patients. IEEE transactions on visualization and computer graphics 22, 1 (2015), 409–418. 3
- Liu D., Xu P., Ren L.: Tpflow: Progressive partition and multidimensional pattern extraction for large-scale spatio-temporal data analysis. IEEE transactions on visualization and computer graphics 25, 1 (2018), 1–11. 7
- MeVis Medical Solutions AG: Mevislab. Accessed: 2022-02-17. URL: https://www.mevislab.de. 3
- Mannila H., Ronkainen P.: Similarity of event sequences. In Proceedings of TIME'97: 4th International Workshop on Temporal Representation and Reasoning (1997), IEEE, pp. 136–139. 2
- Micallef L., Schulz H.-J., Angelini M., Aupetit M., Chang R., Kohlhammer J., Perer A., Santucci G.: The human user in progressive visual analytics. In Eurovis (short papers) (2019), pp. 19–23. 10
- Mirzargar M., Whitaker R. T., Kirby R. M.: Curve boxplot: Generalization of boxplot for ensembles of curves. IEEE transactions on visualization and computer graphics 20, 12 (2014), 2654–2663. 7
- Salvador S., Chan P.: Toward accurate dynamic time warping in linear time and space. Intelligent Data Analysis 11, 5 (2007), 561–580. 5
- Stolper C. D., Perer A., Gotz D.: Progressive visual analytics: User-driven visual exploration of in-progress analytics. IEEE Transactions on Visualization and Computer Graphics 20, 12 (2014), 1653–1662. 10
- Thompson J. R., Liu Z., Stasko J.: Data animator: Authoring expressive animated data graphics. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems (2021), pp. 1–18. 3
- Van der Maaten L., Hinton G.: Visualizing data using t-sne. Journal of machine learning research 9, 11 (2008). 6
- Van Der Maaten L., Postma E., Van den Herik J., et al.: Dimensionality reduction: a comparative. J Mach Learn Res 10, 66–71 (2009), 13. 6
- Wenskovitch J., Crandell I., Ramakrishnan N., House L., North C.: Towards a systematic combination of dimension reduction and clustering in visual analytics. IEEE transactions on visualization and computer graphics 24, 1 (2017), 131–141. 6
- Wu J., Guo Z., Wang Z., Xu Q., Wu Y.: Visual analytics of multivariate event sequence data in racquet sports. In 2020 IEEE Conference on Visual Analytics Science and Technology (VAST) (2020), IEEE, pp. 36–47. 2, 3, 7
- Wu J., Liu D., Guo Z., Xu Q., Wu Y.: Tacticflow: Visual analytics of ever-changing tactics in racket sports. IEEE Transactions on Visualization and Computer Graphics (2021). 3
- Wongsuphasawat K., Plaisant C., Taieb-Maimon M., Shneiderman B.: Querying event sequences by exact match or similarity search: Design and empirical evaluation. Interacting with computers 24, 2 (2012), 55–68. 2, 3
- Waser J., Ribicic H., Fuchs R., Hirsch C., Schindler B., Bloschl G., Groller E.: Nodes on ropes: A comprehensive data and control flow for steering ensemble simulations. IEEE transactions on visualization and computer graphics 17, 12 (2011), 1872–1881. 3
- Wongsuphasawat K., Shneiderman B.: Finding comparable temporal categorical records: A similarity measure with an interactive visualization. In 2009 IEEE Symposium on Visual Analytics Science and Technology (2009), IEEE, pp. 27–34. 2, 3
- Zgraggen E., Drucker S. M., Fisher D., DeLine R.: (sl qu) eries: Visual regular expressions for querying and exploring event sequences. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems (2015), pp. 2683–2692. 3