Volume 42, Issue 8 e70095
ORIGINAL ARTICLE

Active Object Detection Using a Novel Network and Partial Prior Information

Jianyu Wang

Jianyu Wang

Faculty of Robot Science and Engineering, Northeastern University, Shenyang, China

Key Laboratory of Opto-Electronic Information Processing, Chinese Academy of Sciences, Shenyang, China

Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang, China

Search for more papers by this author
Feng Zhu

Corresponding Author

Feng Zhu

Key Laboratory of Opto-Electronic Information Processing, Chinese Academy of Sciences, Shenyang, China

Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang, China

Correspondence:

Feng Zhu ([email protected])

Search for more papers by this author
Qun Wang

Qun Wang

Key Laboratory of Opto-Electronic Information Processing, Chinese Academy of Sciences, Shenyang, China

Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang, China

University of Chinese Academy of Sciences, Beijing, China

Search for more papers by this author
Pengfei Zhao

Pengfei Zhao

Key Laboratory of Opto-Electronic Information Processing, Chinese Academy of Sciences, Shenyang, China

Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang, China

University of Chinese Academy of Sciences, Beijing, China

Search for more papers by this author
First published: 07 July 2025

ABSTRACT

Active object detection (AOD) enables a system to actively adjust camera parameters or plan the next viewpoint to improve detection accuracy when the current visual input is insufficient. However, most existing AOD methods assume that the target object is visible from the initial viewpoint, which is often unrealistic and reduces task efficiency. To address this limitation, we propose a novel AOD framework that leverages partial prior information to enhance detection performance and task efficiency. Specifically, we construct an extensible prior information library that describes large and easily identifiable adjacent objects (Adj-objects) that are spatially related to the target. This allows the system to initiate AOD based on the presence of an Adj-object, even when the target is initially out of view. Our approach incorporates a duelling deep Q-learning network (Duelling-DQN) with a newly designed reward function to effectively utilise prior information. Additionally, we introduce a viewpoint storage scheme to support fast retrieval and transition between viewpoints. We evaluate the proposed method on the Active Vision Dataset (AVD) and compare it with several state-of-the-art (SOTA) approaches. The experimental results show that our method achieves a superior average success rate of 81.3%, demonstrating its effectiveness in overcoming the initial state limitations of traditional AOD tasks.

Conflicts of Interest

The authors declare no conflicts of interest.

Data Availability Statement

The data that support the findings of this study are available from the corresponding author upon reasonable request.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.