Active Object Detection Using a Novel Network and Partial Prior Information
Jianyu Wang
Faculty of Robot Science and Engineering, Northeastern University, Shenyang, China
Key Laboratory of Opto-Electronic Information Processing, Chinese Academy of Sciences, Shenyang, China
Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang, China
Search for more papers by this authorCorresponding Author
Feng Zhu
Key Laboratory of Opto-Electronic Information Processing, Chinese Academy of Sciences, Shenyang, China
Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang, China
Correspondence:
Feng Zhu ([email protected])
Search for more papers by this authorQun Wang
Key Laboratory of Opto-Electronic Information Processing, Chinese Academy of Sciences, Shenyang, China
Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang, China
University of Chinese Academy of Sciences, Beijing, China
Search for more papers by this authorPengfei Zhao
Key Laboratory of Opto-Electronic Information Processing, Chinese Academy of Sciences, Shenyang, China
Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang, China
University of Chinese Academy of Sciences, Beijing, China
Search for more papers by this authorJianyu Wang
Faculty of Robot Science and Engineering, Northeastern University, Shenyang, China
Key Laboratory of Opto-Electronic Information Processing, Chinese Academy of Sciences, Shenyang, China
Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang, China
Search for more papers by this authorCorresponding Author
Feng Zhu
Key Laboratory of Opto-Electronic Information Processing, Chinese Academy of Sciences, Shenyang, China
Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang, China
Correspondence:
Feng Zhu ([email protected])
Search for more papers by this authorQun Wang
Key Laboratory of Opto-Electronic Information Processing, Chinese Academy of Sciences, Shenyang, China
Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang, China
University of Chinese Academy of Sciences, Beijing, China
Search for more papers by this authorPengfei Zhao
Key Laboratory of Opto-Electronic Information Processing, Chinese Academy of Sciences, Shenyang, China
Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang, China
University of Chinese Academy of Sciences, Beijing, China
Search for more papers by this authorABSTRACT
Active object detection (AOD) enables a system to actively adjust camera parameters or plan the next viewpoint to improve detection accuracy when the current visual input is insufficient. However, most existing AOD methods assume that the target object is visible from the initial viewpoint, which is often unrealistic and reduces task efficiency. To address this limitation, we propose a novel AOD framework that leverages partial prior information to enhance detection performance and task efficiency. Specifically, we construct an extensible prior information library that describes large and easily identifiable adjacent objects (Adj-objects) that are spatially related to the target. This allows the system to initiate AOD based on the presence of an Adj-object, even when the target is initially out of view. Our approach incorporates a duelling deep Q-learning network (Duelling-DQN) with a newly designed reward function to effectively utilise prior information. Additionally, we introduce a viewpoint storage scheme to support fast retrieval and transition between viewpoints. We evaluate the proposed method on the Active Vision Dataset (AVD) and compare it with several state-of-the-art (SOTA) approaches. The experimental results show that our method achieves a superior average success rate of 81.3%, demonstrating its effectiveness in overcoming the initial state limitations of traditional AOD tasks.
Conflicts of Interest
The authors declare no conflicts of interest.
Open Research
Data Availability Statement
The data that support the findings of this study are available from the corresponding author upon reasonable request.
References
- Al-Sharman, M. K., Y. Zweiri, M. A. K. Jaradat, R. Al-Husari, D. Gan, and L. D. Seneviratne. 2020. “Deep-Learning-Based Neural Network Training for State Estimation Enhancement: Application to Attitude Estimation.” IEEE Transactions on Instrumentation and Measurement 69, no. 1: 24–34. https://doi.org/10.1109/TIM.2019.2895495.
- Ammirato, P., P. Poirson, E. Park, J. Košecká, and A. C. Berg. 2017. “ A Dataset for Developing and Benchmarking Active Vision.” In 2017 IEEE International Conference on Robotics and Automation (ICRA), 1378–1385. IEEE. https://doi.org/10.1109/ICRA.2017.7989164.
10.1109/ICRA.2017.7989164 Google Scholar
- Chang, A., A. Dai, T. Funkhouser, et al. 2017. “ Matterport3D: Learning From RGB-D Data in Indoor Environments.” In 2017 International Conference on 3D Vision (3DV), 667–676. IEEE. https://doi.org/10.1109/3DV.2017.00081.
10.1109/3DV.2017.00081 Google Scholar
- Chen, W., and N. Rojas. 2024. “TraKDis: A Transformer-Based Knowledge Distillation Approach for Visual Reinforcement Learning With Application to Cloth Manipulation.” IEEE Robotics and Automation Letters 9, no. 3: 2455–2462. https://doi.org/10.1109/LRA.2024.3358750.
- Chen, X., H. Peng, D. Wang, H. Lu, and H. Hu. 2023. “ SeqTrack: Sequence to Sequence Learning for Visual Object Tracking.” In 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 14572–14581. IEEE. https://doi.org/10.1109/CVPR52729.2023.01400.
10.1109/CVPR52729.2023.01400 Google Scholar
- Cui, Y., B. Hou, Q. Wu, B. Ren, S. Wang, and L. Jiao. 2022. “Remote Sensing Object Tracking With Deep Reinforcement Learning Under Occlusion.” IEEE Transactions on Geoscience and Remote Sensing 60: 1–13. https://doi.org/10.1109/TGRS.2021.3096809.
- Deng, J., W. Dong, R. Socher, L. J. Li, L. Kai, and F. F. Li. 2009. “ ImageNet: A Large-Scale Hierarchical Image Database.” In 2009 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 248–255. IEEE. https://doi.org/10.1109/CVPR.2009.5206848.
10.1109/CVPR.2009.5206848 Google Scholar
- Denzler, J., and C. Brown. 2002. “Information Theoretic Sensor Data Selection for Active Object Recognition and State Estimation.” IEEE Transactions on Pattern Analysis and Machine Intelligence 24, no. 2: 145–157. https://doi.org/10.1109/34.982896.
- Diederik, K., and B. Jimmy. 2014. “Adam: A Method for Stochastic Optimization.” arXiv Preprint. arXiv:1412.6980. https://arxiv.org/abs/1412.6980.
- Dionigi, A., A. Devo, L. Guiducci, and G. Costante. 2022. “E-VAT: An Asymmetric End-to-End Approach to Visual Active Exploration and Tracking.” IEEE Robotics and Automation Letters 7, no. 2: 4259–4266. https://doi.org/10.1109/LRA.2022.3150866.
- Du, H., L. Li, Z. Huang, and X. Yu. 2023. “ Object-Goal Visual Navigation via Effective Exploration of Relations Among Historical Navigation States.” In 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2563–2573. IEEE. https://doi.org/10.1109/CVPR52729.2023.00252.
10.1109/CVPR52729.2023.00252 Google Scholar
- Everingham, M., L. V. Gool, C. Williams, J. Winn, and A. Zisserman. 2010. “The PASCAL Visual Object Classes (VOC) Challenge.” International Journal of Computer Vision 88, no. 2: 303–338. https://doi.org/10.1007/s11263-009-0275-4.
- Fang, F., W. Liang, Y. Wu, Q. Xu, and J.-H. Lim. 2022. “Self-Supervised Reinforcement Learning for Active Object Detection.” IEEE Robotics and Automation Letters 7, no. 2: 10224–10231. https://doi.org/10.1109/LRA.2022.3193019.
10.1109/LRA.2022.3193019 Google Scholar
- Fang, F., Q. Xu, N. Gauthier, L. Li, and J.-H. Lim. 2021. “ Enhancing Multi-Step Action Prediction for Active Object Detection.” In 2021 IEEE International Conference on Image Processing (ICIP), 2189–2193. IEEE. https://doi.org/10.1109/ICIP42928.2021.9506078.
10.1109/ICIP42928.2021.9506078 Google Scholar
- Fang, Q., X. Xu, X. Wang, and Y. Zeng. 2022. “Target-Driven Visual Navigation in Indoor Scenes Using Reinforcement Learning and Imitation Learning.” CAAI Transactions on Intelligence Technology 7, no. 2: 167–176. https://doi.org/10.1049/cit2.12043.
- Gao, C., S. Liu, J. Chen, et al. 2024. “Room-Object Entity Prompting and Reasoning for Embodied Referring Expression.” IEEE Transactions on Pattern Analysis and Machine Intelligence 46, no. 2: 994–1010. https://doi.org/10.1109/TPAMI.2023.3326851.
- Girshick, R. 2015. “ Fast R-CNN.” In Proceedings of the IEEE International Conference on Computer Vision (ICCV), 1440–1448. IEEE. https://doi.org/10.1109/ICCV.2015.169.
10.1109/ICCV.2015.169 Google Scholar
- Girshick, R., J. Donahue, T. Darrell, and J. Malik. 2014. “ Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation.” In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 580–587. IEEE. https://doi.org/10.1109/CVPR.2014.81.
10.1109/CVPR.2014.81 Google Scholar
- Grau, V., A. Mewes, M. Alcaniz, R. Kikinis, and S. Warfield. 2004. “Improved Watershed Transform for Medical Image Segmentation Using Prior Information.” IEEE Transactions on Medical Imaging 23, no. 4: 447–458. https://doi.org/10.1109/TMI.2004.824224.
- Han, H., A. K. Jain, F. Wang, S. G. Shan, and X. L. Chen. 2018. “Heterogeneous Face Attribute Estimation: A Deep Multi-Task Learning Approach.” IEEE Transactions on Pattern Analysis and Machine Intelligence 40, no. 11: 2597–2609. https://doi.org/10.1109/TPAMI.2017.2738004.
- Han, X., H. Liu, F. Sun, and D. Yang. 2018. “ Active Object Detection Using Double DQN and Prioritized Experience Replay.” In 2018 International Joint Conference on Neural Networks (IJCNN), 1–7. IEEE. https://doi.org/10.1109/IJCNN.2018.8489296.
10.1109/IJCNN.2018.8489296 Google Scholar
- Han, X., H. Liu, F. Sun, and X. Zhang. 2019. “Active Object Detection With Multistep Action Prediction Using Deep Q-Network.” IEEE Transactions on Industrial Informatics 15, no. 6: 3723–3731. https://doi.org/10.1109/TII.2019.2890849.
- He, K., X. Zhang, S. Ren, and J. Sun. 2016. “ Deep Residual Learning for Image Recognition.” In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 770–778. IEEE. https://doi.org/10.1109/CVPR.2016.90.
10.1109/CVPR.2016.90 Google Scholar
- Hou, Y., J. Li, and M. Chen 2024. “Self-Supervised Antipodal Grasp Learning With Fine-Grained Grasp Quality Feedback in Clutter.” IEEE Transactions on Industrial Electronics 71, no. 4: 3853–3861. https://doi.org/10.1109/TIE.2023.3274854.
- Huber, M. F., T. Dencker, M. Roschani, and J. Beyerer. 2012. “ Bayesian Active Object Recognition via Gaussian Process Regression.” In 2012 15th International Conference on Information Fusion, 1718–1725. IEEE.
- Khan, A. H., M. S. Nawaz, and A. Dengel. 2023. “ Localized Semantic Feature Mixers for Efficient Pedestrian Detection in Autonomous Driving.” In 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 5476–5485. IEEE. https://doi.org/10.1109/CVPR52729.2023.00530.
10.1109/CVPR52729.2023.00530 Google Scholar
- Kivrak, H., F. Cakmak, H. Kose, and S. Yavuz. 2021. “Social Navigation Framework for Assistive Robots in Human Inhabited Unknown Environments.” Engineering Science and Technology, an International Journal 24, no. 2: 284–298. https://doi.org/10.1016/j.jestch.2020.08.008.
- Kong, Y. Z., F. Zhu, H. B. Sun, Z. Y. Lin, and Q. Wang. 2022. “A Generic View Planning System Based on Formal Expression of Perception Tasks.” Entropy 24, no. 5: 578. https://doi.org/10.3390/e24050578.
- Lehnert, C., D. Tsai, A. Eriksson, and C. McCool. 2019. “ 3D Move to See: Multi-Perspective Visual Servoing Towards the Next Best View Within Unstructured and Occluded Environments.” In 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 3890–3897. IEEE. https://doi.org/10.1109/IROS40897.2019.8967918.
10.1109/IROS40897.2019.8967918 Google Scholar
- Li, Y., Y. Ma, X. Huo, and X. Wu. 2022. “Remote Object Navigation for Service Robots Using Hierarchical Knowledge Graph in Human-Centered Environments.” Intelligent Service Robotics 15, no. 4: 459–473. https://doi.org/10.1007/s11370-022-00428-4.
- Lin, J., Y. Wang, Z. Miao, H. Zhong, and R. Fierro. 2022. “Low-Complexity Control for Vision-Based Landing of Quadrotor UAV on Unknown Moving Platform.” IEEE Transactions on Industrial Informatics 18, no. 8: 5348–5358. https://doi.org/10.1109/TII.2021.3129486.
- Lin, T.-Y., M. Maire, S. Belongie, et al. 2014. “ Microsoft COCO: Common Objects in Context.” In Computer Vision – ECCV 2014, 740–755. Springer. https://doi.org/10.1007/978-3-319-10602-1_48.
10.1007/978-3-319-10602-1_48 Google Scholar
- Liu, S., G. Tian, Y. Zhang, M. Zhang, and S. Liu. 2022. “Active Object Detection Based on a Novel Deep Q-Learning Network and Long-Term Learning Strategy for the Service Robot.” IEEE Transactions on Industrial Electronics 69, no. 6: 5984–5993. https://doi.org/10.1109/TIE.2021.3090707.
- Liu, W., D. Anguelov, D. Erhan, et al. 2016. “ SSD: Single Shot MultiBox Detector.” In Computer Vision – ECCV 2016, 21–37. Springer. https://doi.org/10.1007/978-3-319-46448-0_2.
10.1007/978-3-319-46448-0_2 Google Scholar
- Long, Y., X. Li, W. Cai, and H. Dong. 2024. “ Discuss Before Moving: Visual Language Navigation via Multiexpert Discussions.” In 2024 IEEE International Conference on Robotics and Automation (ICRA), 17380–17387. IEEE. https://doi.org/10.1109/ICRA57147.2024.10611565.
10.1109/ICRA57147.2024.10611565 Google Scholar
- Lorbach, M., S. Höfer, and O. Brock. 2014. “ Prior-Assisted Propagation of Spatial Information for Object Search.” In 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2904–2909. IEEE. https://doi.org/10.1109/IROS.2014.6942962.
10.1109/IROS.2014.6942962 Google Scholar
- Luo, W., P. Sun, F. Zhong, W. Liu, T. Zhang, and Y. Wang. 2020. “End-to-End Active Object Tracking and Its Real-World Deployment via Reinforcement Learning.” IEEE Transactions on Pattern Analysis and Machine Intelligence 42, no. 6: 1317–1332. https://doi.org/10.1109/TPAMI.2019.2899570.
- Lyu, Y., Y. Shi, and X. Zhang. 2022. “Improving Targetdriven Visual Navigation With Attention on 3D Spatial Relationships.” Neural Processing Letters 54, no. 5: 3979–3998. https://doi.org/10.1007/s11063-022-10796-8.
10.1007/s11063-022-10796-8 Google Scholar
- Majumdar, A., K. Yadav, S. Arnaud, et al. 2023. “ Where Are We in the Search for an Artificial Visual Cortex for Embodied Intelligence?” In Advances in Neural Information Processing Systems (NeurIPS 2023), vol. 36, 655–677. Curran Associates, Inc.
- Meda, K. C., S. S. Milla, and B. S. Rostad. 2021. “Artificial Intelligence Research Within Reach: An Object Detection Model to Identify Rickets on Pediatric Wrist Radiographs.” Pediatric Radiology 51: 782–791. https://doi.org/10.1007/s00247-020-04895-8.
- Morrison, D., P. Corke, and J. Leitner. 2019. “ Multi-View Picking: Next-Best-View Reaching for Improved Grasping in Clutter.” In 2019 International Conference on Robotics and Automation (ICRA), 8762–8768. IEEE. https://doi.org/10.1109/ICRA.2019.8793805.
10.1109/ICRA.2019.8793805 Google Scholar
- Paletta, L., and A. Pinz. 2000. “Active Object Recognition by View Integration and Reinforcement Learning.” Robotics and Autonomous Systems 31, no. 30: 71–86. https://doi.org/10.1016/S0921-8890(99)00079-2.
10.1016/S0921-8890(99)00079-2 Google Scholar
- Patrício, D. I., and R. Rieder. 2018. “Computer Vision and Artificial Intelligence in Precision Agriculture for Grain Crops: A Systematic Review.” Computers and Electronics in Agriculture 153: 69–81. https://doi.org/10.1016/j.compag.2018.08.001.
- Piotr, J., D. Brian, W. Victor, and J. R. R. Service. 1995. “ Active Object Detection Using Color.” In Research in Computer and Robot Vision, 37–53. World Scientific Publishing. https://doi.org/10.1142/9789812812483_0003.
- Prasanth, K. D., B. M. Naga Praveen, and P. Rajalakshmi. 2024. “Stereo Vision Based Object Detection for Autonomous Navigation in Space Environments.” Acta Astronautica 218: 326–329. https://doi.org/10.1016/j.actaastro.2024.02.032.
10.1016/j.actaastro.2024.02.032 Google Scholar
- Redmon, J., S. Divvala, R. Girshick, and A. Farhadi. 2016. “ You Only Look Once: Unified, Real-Time Object Detection.” In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 779–788. IEEE. https://doi.org/10.1109/CVPR.2016.91.
10.1109/CVPR.2016.91 Google Scholar
- Reza Loghmani, M., B. Caputo, and M. Vincze. 2018. “ Recognizing Objects In-the-Wild: Where Do we Stand?” In 2018 IEEE International Conference on Robotics and Automation (ICRA), 2170–2177. IEEE. https://doi.org/10.1109/ICRA.2018.8460985.
10.1109/ICRA.2018.8460985 Google Scholar
- Samani, E. U., and A. G. Banerjee. 2024. “Persistent Homology Meets Object Unity: Object Recognition in Clutter.” IEEE Transactions on Robotics 40: 886–902. https://doi.org/10.1109/TRO.2023.3343994.
- Samma, H., and S. El-Ferik. 2024. “Autonomous UAV Visual Navigation Using an Improved Deep Reinforcement Learning.” IEEE Access 12: 79967–79977. https://doi.org/10.1109/ACCESS.2024.3409780.
- Schaul, T., J. Quan, I. Antonoglou, and D. Silver. 2016. “Prioritized Experience Replay.” arXiv Preprint. arXiv:1511.05952. https://arxiv.org/abs/1511.05952.
- Shen, Y. F., H. L. Zhou, J. T. Li, F. J. Jian, and D. S. Jayas. 2018. “Detection of Stored-Grain Insects Using Deep Learning.” Computers and Electronics in Agriculture 145: 319–325. https://doi.org/10.1016/j.compag.2017.11.039.
- Shim, V. A., M. Yuan, and B. H. Tan. 2017. “ Automatic Object Searching by a Mobile Robot With Single RGB-D Camera.” In 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 56–62. IEEE. https://doi.org/10.1109/APSIPA.2017.8282002.
10.1109/APSIPA.2017.8282002 Google Scholar
- Shin, S. Y., and C. Kim. 2015. “Human-Like Motion Generation and Control for Humanoid's Dual Arm Object Manipulation.” IEEE Transactions on Industrial Electronics 62, no. 4: 2265–2276. https://doi.org/10.1109/TIE.2014.2353017.
- Taioli, F., S. Rosa, A. Castellini, et al. 2024. “ Mind the Error! Detection and Localization of Instruction Errors in Vision-and-Language Navigation.” In 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 12993–13000. IEEE. https://doi.org/10.1109/IROS58592.2024.10801822.
10.1109/IROS58592.2024.10801822 Google Scholar
- Tavakoli, A., F. Pardo, and P. Kormushev. 2018. “Action Branching Architectures for Deep Reinforcement Learning.” Proceedings of the AAAI Conference on Artificial Intelligence 32, no. 1: 1–9. https://doi.org/10.1609/aaai.v32i1.11798.
10.1609/aaai.v32i1.11798 Google Scholar
- Van Hasselt, H., A. Guez, and D. Silver. 2016. “Deep Reinforcement Learning With Double Q-Learning.” Proceedings of the AAAI Conference on Artificial Intelligence 30, no. 1: 2094–2100. https://doi.org/10.1609/aaai.v30i1.10295.
10.1609/aaai.v30i1.10295 Google Scholar
- Vidit, V., M. Engilberge, and M. Salzmann. 2023. “ CLIP the Gap: A Single Domain Generalization Approach for Object Detection.” In 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 3219–3229. IEEE. https://doi.org/10.1109/CVPR52729.2023.00314.
10.1109/CVPR52729.2023.00314 Google Scholar
- Volodymyr, M., K. Koray, S. David, et al. 2013. “Playing ATARI With Deep Reinforcement Learning.” arXiv Preprint. arXiv:1312.5602. https://arxiv.org/abs/1312.5602.
- Wang, A., H. Chen, L. Liu, et al. 2024. “ YOLOv10: Real-Time End-To-End Object Detection.” In Advances in Neural Information Processing Systems (NeurIPS 2024), vol. 37, 107984–108011. Curran Associates, Inc.
- Wang, A., G. Tian, Y. Wang, and Z. Li. 2025. “Move to See More: Approaching Object With Partial Occlusion Using Large Multimodal Model and Active Object Detection.” IET Cyber-Systems and Robotics 7, no. 1: e70008. https://doi.org/10.1049/csy2.70008.
- Wang, Z., T. Schaul, M. Hessel, H. V. Hasselt, M. Lanctot, and N. de Freitas. 2016. “ Dueling Network Architectures for Deep Reinforcement Learning.” In Proceedings of the 33rd International Conference on Machine Learning (PMLR), vol. 48, 1995–2003. PMLR.
- Wu, J., Z. Jin, A. Liu, L. Yu, and F. Yang. 2022. “A Survey of Learning-Based Control of Robotic Visual Servoing Systems.” Journal of the Franklin Institute 359, no. 1: 556–577. https://doi.org/10.1016/j.jfranklin.2021.11.009.
- Xing, H., S. Gao, Y. Wang, X. Wei, H. Tang, and W. Zhang. 2023. “Go Closer to See Better: Camouflaged Object Detection via Object Area Amplification and Figure-Ground Conversion.” IEEE Transactions on Circuits and Systems for Video Technology 33, no. 10: 5444–5457. https://doi.org/10.1109/TCSVT.2023.3255304.
- Xu, Q., F. Fang, N. Gauthier, et al. 2021. “ Towards Efficient Multiview Object Detection With Adaptive Action Prediction.” In 2021 IEEE International Conference on Robotics and Automation (ICRA), 13423–13429. IEEE. https://doi.org/10.1109/ICRA48506.2021.9561388.
10.1109/ICRA48506.2021.9561388 Google Scholar
- Yang, N., F. Lu, B. Yu, F. Yao, D. Zhang, and G. Tian. 2023. “ Service Robot Active Object Detection Based on Spatial Exploration Using Deep Recurrent Q-Learning Network.” In 2023 IEEE International Conference on Robotics and Biomimetics (ROBIO), 1–6. IEEE. https://doi.org/10.1109/ROBIO58561.2023.10354931.
- Yu, X., S. Liu, S. Zhang, W. He, and H. Huang. 2024. “Adaptive Neural Network Force Tracking Control of Flexible Joint Robot With an Uncertain Environment.” IEEE Transactions on Industrial Electronics 71, no. 6: 5941–5949. https://doi.org/10.1109/TIE.2023.3290250.
- Yuankai, Q., W. Qi, A. Peter, et al. 2020. “ REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments.” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 9982–9991. IEEE. https://doi.org/10.1109/CVPR42600.2020.01000.
- Zaenker, T., C. Smitt, C. McCool, and M. Bennewitz. 2021. “ Viewpoint Planning for Fruit Size and Position Estimation.” In 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 3271–3277. IEEE. https://doi.org/10.1109/IROS51168.2021.9636701.
10.1109/IROS51168.2021.9636701 Google Scholar
- Zeng, Z., A. Röfer, and O. C. Jenkins. 2020. “ Semantic Linking Maps for Active Visual Object Search.” In 2020 IEEE International Conference on Robotics and Automation (ICRA), 1984–1990. IEEE. https://doi.org/10.1109/ICRA40945.2020.9196830.
10.1109/ICRA40945.2020.9196830 Google Scholar
- Zhang, G. L., S. M. Jia, D. S. Zeng, and Z. L. Zheng. 2018. “ Object Detection and Grabbing Based on Machine Vision for Service Robot.” In 2018 IEEE 9th Annual Information Technology, Electronics and Mobile Communication Conference, 89–94. Springer. https://doi.org/10.1109/IEMCON.2018.8615062.
10.1109/IEMCON.2018.8615062 Google Scholar
- Zhang, H., H. Liu, D. Guo, and F. Sun. 2017. “ From Foot to Head: Active Face Finding Using Deep Q-Learning.” In 2017 IEEE International Conference on Image Processing (ICIP), 1862–1866. IEEE. https://doi.org/10.1109/ICIP.2017.8296604.
10.1109/ICIP.2017.8296604 Google Scholar
- Zhong, F., X. Bi, Y. Zhang, W. Zhang, and Y. Wang. 2023. “RSPT: Reconstruct Surroundings and Predict Trajectory for Generalizable Active Object Tracking.” Proceedings of the AAAI Conference on Artificial Intelligence (AAAI) 37, no. 3: 3705–3714. https://doi.org/10.1609/aaai.v37i3.25482.
10.1609/aaai.v37i3.25482 Google Scholar
- Zohar, O., K.-C. Wang, and S. Yeung. 2023. “ PROB: Probabilistic Objectness for Open World Object Detection.” In 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 11444–11453. IEEE. https://doi.org/10.1109/CVPR52729.2023.01101.
10.1109/CVPR52729.2023.01101 Google Scholar
- Zou, Z. X., K. Chen, Z. W. Shi, Y. Guo, and J. Ye. 2023. “Object Detection in 20 Years: A Survey.” Proceedings of the IEEE 111, no. 3: 257–276. https://doi.org/10.1109/JPROC.2023.3238524.