Volume 35, Issue 11 e2444
SPECIAL ISSUE - TECHNOLOGY PAPER

Supporting maintenance and testing for AI functions of mobile apps based on user reviews: An empirical study on plant identification apps

Chuanqi Tao

Corresponding Author

Chuanqi Tao

College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing, China

Ministry Key Laboratory for Safety-Critical Software Development and Verification, Nanjing University of Aeronautics and Astronautics, Nanjing, China

Collaborative Innovation Center of Novel Software Technology and Industrialization, Nanjing University, Nanjing, China

State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, China

Correspondence

Chuanqi Tao, College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing, China.

Email: [email protected]

Search for more papers by this author
Hongjing Guo

Hongjing Guo

College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing, China

Search for more papers by this author
Jingxuan Zhang

Jingxuan Zhang

College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing, China

Ministry Key Laboratory for Safety-Critical Software Development and Verification, Nanjing University of Aeronautics and Astronautics, Nanjing, China

Search for more papers by this author
Zhiqiu Huang

Zhiqiu Huang

College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing, China

Ministry Key Laboratory for Safety-Critical Software Development and Verification, Nanjing University of Aeronautics and Astronautics, Nanjing, China

Search for more papers by this author
First published: 27 February 2022

Abstract

Despite the tremendous development of artificial intelligence (AI)-based mobile apps, they suffer from quality issues. Data-driven AI software poses challenges for maintenance and quality assurance. Metamorphic testing has been successfully adopted to AI software. However, most previous studies require testers to manually identify metamorphic relations in an ad hoc and arbitrary manner, thereby encountering difficulties in reflecting real-world usage scenarios. Previous work showed that information available in user reviews is effective for maintenance and testing tasks. Yet, there is a lack of studies leveraging reviews to facilitate AI function maintenance and testing activities. This paper proposes METUR, a novel approach to supporting maintenance and testing for AI functions based on reviews. Firstly, METUR automatically classifies reviews that can be exploited for supporting AI function maintenance and evolution activities. Then, it identifies test contexts from reviews in the usage scenario category. METUR instantiates the metamorphic relation pattern for deriving concrete metamorphic relations based on test contexts. The follow-up test dataset is constructed for conducting metamorphic testing. Empirical studies on plant identification apps indicate that METUR effectively categorizes reviews that are related to AI functions. METUR is feasible and effective in detecting inconsistent behaviors by using the metamorphic relations constructed based on reviews.

CONFLICT OF INTEREST

The authors declare no potential conflict of interests.

DATA AVAILABILITY STATEMENT

The data and code that support the findings of this study are openly available in GitHub at https://github.com/TestingAIGroup/METUR.59

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.