Wenbo Hu

12

Papers

421

Total Citations

1

Affiliations

Affiliations

UCLA

Papers (12)

BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions

Pixel-GS Density Control with Pixel-aware Gradient for 3D Gaussian Splatting

TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models

MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models

Mani-GS: Gaussian Splatting Manipulation with Triangular Mesh

GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors

Inverse Rendering of Glossy Objects via the Neural Plenoptic Function and Radiance Fields

NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors

NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images

Verbalized Representation Learning for Interpretable Few-Shot Generalization

Spotting the Unseen: Reciprocal Consensus Network Guided by Visual Archetypes

DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos