Name:
KAUST_Official_Thesis___Jichen_Lu.pdf
Size:
37.24Mb
Format:
PDF
Description:
MS Thesis
Embargo End Date:
2024-05-14
Type
ThesisAuthors
Lu, Jichen
Advisors
Wonka, Peter
Committee members
Hadwiger, Markus
Michels, Dominik L.

Program
Computer ScienceDate
2023-05Embargo End Date
2024-05-14Permanent link to this record
http://hdl.handle.net/10754/691660
Metadata
Show full item recordAccess Restrictions
At the time of archiving, the student author of this thesis opted to temporarily restrict access to it. The full text of this thesis will become available to the public after the expiration of the embargo on 2024-05-14.Abstract
3D Indoor room layout estimation refers to the reconstruction of a 3D layout of a room from a single RGB panoramic image. In this work, we focus on rooms that fit the Manhattan assumption; for each corner of the room, all three composed planes are orthogonal to each other. This paper proposes a new network for indoor room 3D layout estimation based on detection tasks and bin regression, utilizing the encoder-decoder architecture to embed feature and boundary query tokens separately. We also add the deformable attention mechanism to enable our network to extract information from muti-scale features, significantly increasing performance. Besides, several different loss terms have also been experimented with and compared with each other. The proposed network has been trained and tested on the Zillow dataset. Compared with previous SOTA work, our network has surpassed the previous 3D reconstruction accuracy with fewer parameters and fewer training epochs.Citation
Lu, J. (2023). 3D Room Layout Estimation from One Single RGB Panorama [KAUST Research Repository]. https://doi.org/10.25781/KAUST-KUK6Pae974a485f413a2113503eed53cd6c53
10.25781/KAUST-KUK6P