Volume 37, Issue 5 e12582
ORIGINAL ARTICLE

Domain problem-solving expert identification in community question answering

Weizhao Tang

Weizhao Tang

School of Computer Science, Fudan University, Shanghai, China

Shanghai Key Laboratory of Data Science, Shanghai, China

Shanghai Institute of Intelligent Electronics & Systems, Shanghai, China

Search for more papers by this author
Tun Lu

Corresponding Author

Tun Lu

School of Computer Science, Fudan University, Shanghai, China

Shanghai Key Laboratory of Data Science, Shanghai, China

Shanghai Institute of Intelligent Electronics & Systems, Shanghai, China

Correspondence

Tun Lu, School of Computer Science, Fudan University, Shanghai, 200433, China.

Email: [email protected]

Hansu Gu, Microsoft Inc., Seattle, WA, 98052.

Email: [email protected]

Search for more papers by this author
Hansu Gu

Corresponding Author

Hansu Gu

Microsoft Inc., Seattle, Washington, USA

Correspondence

Tun Lu, School of Computer Science, Fudan University, Shanghai, 200433, China.

Email: [email protected]

Hansu Gu, Microsoft Inc., Seattle, WA, 98052.

Email: [email protected]

Search for more papers by this author
Peng Zhang

Peng Zhang

School of Computer Science, Fudan University, Shanghai, China

Shanghai Key Laboratory of Data Science, Shanghai, China

Shanghai Institute of Intelligent Electronics & Systems, Shanghai, China

Search for more papers by this author
Ning Gu

Ning Gu

School of Computer Science, Fudan University, Shanghai, China

Shanghai Key Laboratory of Data Science, Shanghai, China

Shanghai Institute of Intelligent Electronics & Systems, Shanghai, China

Search for more papers by this author
First published: 08 June 2020
Citations: 3

Funding information: National Natural Science Foundation of China, Grant/Award Numbers: 61902075, 61932007

Abstract

Question-Answering (Q&A) services provide internet users with platforms to exchange knowledge and ideas. The development of Q&A sites, or Community Question Answering (CQA), mainly depends on the high-quality content continuously contributed by users with high-level expertise, who can be recognized as experts. Expert finding is an important task for the authorities of Q&A communities to encourage commitment. In a highly competitive market environment, CQA managers have to take measures to retain and nurture users, especially superior contributors. However, current expertise scoring techniques adopted in CQA often give much credit to very active users and fail to identify real experts. This study aims to develop a robust and practical expert identification framework for Q&A communities, by combining well-designed expertise scoring technique and probabilistic clustering model. With regard to expert identification, a numerical metric of users' expertise is developed as the optimal expert finding strategy, and a clustering algorithm based on Gaussian-Gamma mixture model (GGMM) is proposed to efficiently distinguish experts from nonexperts. In the experiments, the proposed method is applied to real-world datasets collected from subcommunities of Stack Exchange Q&A networks. Results obtained from comparative experiments show that our method achieves better performance than the state-of-the-art methods and demonstrate the effectiveness of the proposed framework. The analysis shows that the framework which combines the proposed expertise scoring technique and Gaussian–Gamma mixture clustering model is capable of detecting excellent domain problem-solving experts who exhibit both domain interest and expertise.

CONFLICTS OF INTEREST

None.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.