Active object detection (AOD) enables a system to actively adjust camera parameters or plan the next viewpoint to improve detection accuracy when the current visual input is insufficient. However, most existing AOD methods assume that the target object is visible from the initial viewpoint, which is often unrealistic and reduces task efficiency. To address this limitation, we propose a novel AOD framework that leverages partial prior information to enhance detection performance and task efficiency. Specifically, we construct an extensible prior information library that describes large and easily identifiable adjacent objects (Adj-objects) that are spatially related to the target. This allows the system to initiate AOD based on the presence of an Adj-object, even when the target is initially out of view. Our approach incorporates a duelling deep Q-learning network (Duelling-DQN) with a newly designed reward function to effectively utilise prior information. Additionally, we introduce a viewpoint storage scheme to support fast retrieval and transition between viewpoints. We evaluate the proposed method on the Active Vision Dataset (AVD) and compare it with several state-of-the-art (SOTA) approaches. The experimental results show that our method achieves a superior average success rate of 81.3%, demonstrating its effectiveness in overcoming the initial state limitations of traditional AOD tasks.
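As a rough illustration of the duelling architecture named above, the sketch below shows the standard duelling aggregation, in which a state value V(s) and per-action advantages A(s, a) are combined as Q(s, a) = V(s) + A(s, a) - mean(A). It is a minimal sketch, not the network used in this work: the function names, the three example actions, and the numeric values are all hypothetical, and the reward function and prior information library are not modelled.

```python
from typing import List


def dueling_q_values(state_value: float, advantages: List[float]) -> List[float]:
    """Combine V(s) and A(s, a) into Q(s, a) using mean-advantage centring.

    Generic duelling aggregation only; the inputs would normally come from
    the value and advantage streams of a neural network (assumed here).
    """
    if not advantages:
        raise ValueError("advantages must contain at least one action")
    mean_advantage = sum(advantages) / len(advantages)
    # Subtracting the mean keeps V and A identifiable: the Q-values average to V(s).
    return [state_value + a - mean_advantage for a in advantages]


def greedy_action(q_values: List[float]) -> int:
    """Return the index of the action with the highest Q-value."""
    return max(range(len(q_values)), key=q_values.__getitem__)


# Hypothetical example with three camera actions (e.g. forward, left, right).
q = dueling_q_values(2.0, [1.0, 0.0, -1.0])
best = greedy_action(q)
```

Centring the advantages on their mean means the Q-values average to the state value, which is why the two streams can be learned separately.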

Conflicts of Interest

The authors declare no conflicts of interest.

Open Research

Data Availability Statement

The data that support the findings of this study are available from the corresponding author upon reasonable request.


Volume 42, Issue 8

August 2025

e70095

Active Object Detection Using a Novel Network and Partial Prior Information
