2D-Driven 3D Object Detection in RGB-D Images
dc.contributor.author | Lahoud, Jean | |
dc.contributor.author | Ghanem, Bernard | |
dc.date.accessioned | 2018-03-11T06:54:09Z | |
dc.date.available | 2018-03-11T06:54:09Z | |
dc.date.issued | 2017-12-25 | |
dc.identifier.citation | Lahoud J, Ghanem B (2017) 2D-Driven 3D Object Detection in RGB-D Images. 2017 IEEE International Conference on Computer Vision (ICCV). Available: http://dx.doi.org/10.1109/ICCV.2017.495. | |
dc.identifier.doi | 10.1109/ICCV.2017.495 | |
dc.identifier.uri | http://hdl.handle.net/10754/627233 | |
dc.description.abstract | In this paper, we present a technique that places 3D bounding boxes around objects in an RGB-D scene. Our approach makes best use of the 2D information to quickly reduce the search space in 3D, benefiting from state-of-the-art 2D object detection techniques. We then use the 3D information to orient, place, and score bounding boxes around objects. We independently estimate the orientation for every object, using previous techniques that utilize normal information. Object locations and sizes in 3D are learned using a multilayer perceptron (MLP). In the final step, we refine our detections based on object class relations within a scene. When compared to state-of-the-art detection methods that operate almost entirely in the sparse 3D domain, extensive experiments on the well-known SUN RGB-D dataset [29] show that our proposed method is much faster (4.1s per image) in detecting 3D objects in RGB-D images and performs better (3 mAP higher) than the state-of-the-art method that is 4.7 times slower and comparably to the method that is two orders of magnitude slower. This work hints at the idea that 2D-driven object detection in 3D should be further explored, especially in cases where the 3D input is sparse. | |
dc.description.sponsorship | This work was supported by the King Abdullah University of Science and Technology (KAUST) Office of Sponsored Research. | |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | |
dc.relation.url | http://ieeexplore.ieee.org/document/8237757/ | |
dc.title | 2D-Driven 3D Object Detection in RGB-D Images | |
dc.type | Conference Paper | |
dc.contributor.department | Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division | |
dc.contributor.department | Electrical Engineering Program | |
dc.contributor.department | Visual Computing Center (VCC) | |
dc.identifier.journal | 2017 IEEE International Conference on Computer Vision (ICCV) | |
dc.conference.date | 2017-10-22 to 2017-10-29 | |
dc.conference.name | 16th IEEE International Conference on Computer Vision, ICCV 2017 | |
dc.conference.location | Venice, ITA | |
kaust.person | Lahoud, Jean | |
kaust.person | Ghanem, Bernard | |
dc.date.published-online | 2017-12-25 | |
dc.date.published-print | 2017-10 |
This item appears in the following Collection(s)
-
Conference Papers
-
Electrical and Computer Engineering Program
For more information visit: https://cemse.kaust.edu.sa/ece -
Visual Computing Center (VCC)
-
Computer, Electrical and Mathematical Science and Engineering (CEMSE) Division
For more information visit: https://cemse.kaust.edu.sa/