Volume 29, Issue 18 e4201
RESEARCH ARTICLE

Parallelization of skyline probability computation over uncertain preferences

Haoyang Zhu

Haoyang Zhu

College of Computer, National University of Defense Technology, Changsha, Hunan, China

Search for more papers by this author
Peidong Zhu

Corresponding Author

Peidong Zhu

College of Computer, National University of Defense Technology, Changsha, Hunan, China

Department of Math and Computer Science, Changsha University, Changsha, China

Correspondence

Peidong Zhu, College of Computer, National University of Defense Technology, Changsha, China.

Email: [email protected]

Search for more papers by this author
Xiaoyong Li

Xiaoyong Li

College of Computer, National University of Defense Technology, Changsha, Hunan, China

Academy of Ocean Science and Engineering, National University of Defense Technology, Hunan, China

Search for more papers by this author
Qiang Liu

Qiang Liu

College of Computer, National University of Defense Technology, Changsha, Hunan, China

Search for more papers by this author
Peng Xun

Peng Xun

College of Computer, National University of Defense Technology, Changsha, Hunan, China

Search for more papers by this author
First published: 07 July 2017
Citations: 5

Summary

Query processing over uncertain preferences is very common in real-life situations, because many times, we cannot model users' preferences as strict partial orders. In this paper, we investigate skyline queries over uncertain preferences. The latest state-of-the-art algorithm, called Usky-base algorithm, makes significant advances. However, it still needs to be perfected in 2 aspects. (1) Theoretic analysis: The correctness of the algorithm is not fully verified. (2) Efficiency: Due to the heavy calculation introduced by adopting inclusion-exclusion principle to express the skyline probability, it needs massive time when computing skyline probabilities for large data sets. To address the above 2 concerns, we first review the Usky-base algorithm and lemmas it based on. Then we propose a novel parallel algorithm, called Parallel-sky, to compute skyline probability of a given object. Moreover, we propose an adding algorithm and a deleting algorithm to deal with dynamic scenarios where new objects are added in and outdated objects are deleted out. Furthermore, we extend our algorithm from computing skyline probability of a given object to all objects in a data set. We conduct extensive experiments on real and synthetic data sets to validate the effectiveness and efficiency of our proposals.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.