Monocular 3D Object Localization using 2D Estimates for Industrial Robot Vision System

Thanh Nguyen Canh; Du Trinh Ngoc; Xiem HoangVan

doi:10.14313/jamris-2025-025

Monocular 3D Object Localization using 2D Estimates for Industrial Robot Vision System

Authors

Thanh Nguyen Canh VNU - University of Engineering and Technology, Viet Nam
Du Trinh Ngoc VNU - University of Engineering and Technology,Viet Nam
Xiem HoangVan Vietnam National University - University of Engineering and Technology, Viet Nam
https://orcid.org/0000-0002-7524-6529

Keywords: Object Localization, Camera Calibration, Machine (Robot) Vision System, Industrial robotics

Abstract

3D Object Localization has been emerging as one of the main challenges in Machine Vision tasks. In this paper, we proposed a novel 3D object localization method, leveraging a blend of deep learning techniques primarily rooted in object detection, post-image processing, and pose estimation algorithms. Our approach involves 3D calibration methods tailored for low-cost industrial robotics systems, requiring only a single 2D image input. Initially, object detection is performed using the You Only Look Once (YOLO) model, followed by an R-CNN model for segmenting the object into two distinct parts, i.e., the top face and the remainder. Subsequently, the center of the top face is served as an initialization position, and being refined with a novel calibration algorithm. Experimental results demonstrate a notable reduction in localization error by 87.65% when compared to existing methodologies.

Downloads

Published

09.09.2025

Issue

ISSUE 3/2025

Section

Articles

How to Cite

Nguyen Canh, T., Trinh Ngoc, D., & HoangVan, X. (2025). Monocular 3D Object Localization using 2D Estimates for Industrial Robot Vision System. Journal of Automation, Mobile Robotics and Intelligent Systems, 19(3), 53-65. https://doi.org/10.14313/jamris-2025-025

Download Citation

BibTeX

How to Cite

Download Citation

BibTeX

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

The authors wishing to publish in JAMRIS journal must sign the license to publish upon paper acceptance. The license governs in detail the commercial and non-commercial use of papers published by our journal and determines user and author rights.

Monocular 3D Object Localization using 2D Estimates for Industrial Robot Vision System

Authors

Abstract

Downloads

How to Cite

Information