Wenbo Hu
12
Papers
421
Total Citations
1
Affiliations
Affiliations
UCLA
Papers (12)
BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions
AAAI 2024arXiv
190
citations
Pixel-GS Density Control with Pixel-aware Gradient for 3D Gaussian Splatting
ECCV 2024
96
citations
TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models
ICCV 2025
35
citations
MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models
ICLR 2025arXiv
29
citations
Mani-GS: Gaussian Splatting Manipulation with Triangular Mesh
CVPR 2025arXiv
23
citations
GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
ICCV 2025arXiv
19
citations
Inverse Rendering of Glossy Objects via the Neural Plenoptic Function and Radiance Fields
CVPR 2024
16
citations
NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors
ICCV 2025
6
citations
NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images
CVPR 2025
6
citations
Verbalized Representation Learning for Interpretable Few-Shot Generalization
ICCV 2025
1
citations
Spotting the Unseen: Reciprocal Consensus Network Guided by Visual Archetypes
AAAI 2024
0
citations
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
CVPR 2025
0
citations