Progressive learning of 3D reconstruction network from 2D GAN data

buir.contributor.authorDündar, Ayşegül
buir.contributor.orcidDündar, Ayşegül|0000-0003-2014-6325
dc.citation.epage804
dc.citation.issueNumber2
dc.citation.spage793
dc.citation.volumeNumber46
dc.contributor.authorDündar, Ayşegül
dc.contributor.authorGao, Jun
dc.contributor.authorTao, Andrew
dc.contributor.authorCatanzaro, Bryan
dc.date.accessioned2025-02-27T06:05:16Z
dc.date.available2025-02-27T06:05:16Z
dc.date.issued2024-02
dc.departmentDepartment of Computer Engineering
dc.description.abstractThis paper presents a method to reconstruct high-quality textured 3D models from single images. Current methods rely on datasets with expensive annotations; multi-view images and their camera parameters. Our method relies on GAN generated multi-view image datasets which have a negligible annotation cost. However, they are not strictly multi-view consistent and sometimes GANs output distorted images. This results in degraded reconstruction qualities. In this work, to overcome these limitations of generated datasets, we have two main contributions which lead us to achieve state-of-the-art results on challenging objects: 1) A robust multi-stage learning scheme that gradually relies more on the models own predictions when calculating losses and 2) A novel adversarial learning pipeline with online pseudo-ground truth generations to achieve fine details. Our work provides a bridge from 2D supervisions of GAN models to 3D reconstruction models and removes the expensive annotation efforts. We show significant improvements over previous methods whether they were trained on GAN generated multi-view images or on real images with expensive annotations.
dc.identifier.doi10.1109/TPAMI.2023.3324806
dc.identifier.eissn1939-3539
dc.identifier.issn0162-8828
dc.identifier.urihttps://hdl.handle.net/11693/116883
dc.language.isoEnglish
dc.publisherIEEE
dc.relation.isversionofhttps://dx.doi.org/10.1109/TPAMI.2023.3324806
dc.rightsCC BY-NC-ND 4.0 DEED (Attribution-NonCommercial-NoDerivatives 4.0 International)
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/
dc.source.titleIEEE Transactions on Pattern Analysis and Machine Intelligence
dc.subject3D reconstruction
dc.subject3D texture learning
dc.subjectGenerative adversarial networks
dc.subjectSingle-image inference
dc.titleProgressive learning of 3D reconstruction network from 2D GAN data
dc.typeArticle

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Progressive_Learning_of_3D_Reconstruction_Network_From_2D_GAN_Data.pdf
Size:
4.99 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: