Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering

24 Citations · #350 of 2387 papers in ECCV 2024 · 7 Authors · 4 Data Points

Abstract

The correct insertion of virtual objects into images of real-world scenes requires a deep understanding of the scene's lighting, geometry, and materials, as well as of the image formation process. While recent large-scale diffusion models have shown strong generative and inpainting capabilities, we find that current models do not sufficiently "understand" the scene shown in a single picture to generate consistent lighting effects (shadows, bright reflections, etc.) while preserving the identity and details of the composited object. We propose using a personalized large diffusion model as guidance for a physically based inverse rendering process. Our method recovers scene lighting and tone-mapping parameters, allowing the photorealistic composition of arbitrary virtual objects into single frames or videos of indoor or outdoor scenes. Our physically based pipeline further enables automatic material and tone-mapping refinement.

Citation History

Jan 26, 2026: 0
Jan 27, 2026: 0
Jan 27, 2026: 0
Feb 1, 2026: 24 (+24)