Jiahao Wang

20
Papers
128
Total Citations

Papers (20)

Structure-Aware Sparse-View X-ray 3D Reconstruction

CVPR 2024
75
citations

Universal Segmentation at Arbitrary Granularity with Language Instruction

CVPR 2024
30
citations

CoopTrack: Exploring End-to-End Learning for Efficient Cooperative Sequential Perception

ICCV 2025
8
citations

SpotActor: Training-Free Layout-Controlled Consistent Image Generation

AAAI 2025
6
citations

SAUI: Scale-Aware Unseen Imagineer for Zero-Shot Object Detection

AAAI 2024
4
citations

SceneCrafter: Controllable Multi-View Driving Scene Editing

CVPR 2025
3
citations

Stepping Out of Similar Semantic Space for Open-Vocabulary Segmentation

ICCV 2025 (arXiv)
2
citations

Enhancing the Outcome Reward-based RL Training of MLLMs with Self-Consistency Sampling

NeurIPS 2025
0
citations

IWRN: A Robust Blind Watermarking Method for Artwork Image Copyright Protection Against Noise Attack

AAAI 2025
0
citations

ViLT-CLIP: Video and Language Tuning CLIP with Multimodal Prompt Learning and Scenario-guided Optimization

AAAI 2024
0
citations

CRA-PCN: Point Cloud Completion with Intra- and Inter-level Cross-Resolution Transformers

AAAI 2024
0
citations

RepKPU: Point Cloud Upsampling with Kernel Point Representation and Deformation

CVPR 2024
0
citations

PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models

CVPR 2025
0
citations

RobustLight: Improving Robustness via Diffusion Reinforcement Learning for Traffic Signal Control

ICML 2025
0
citations

Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content

CVPR 2025
0
citations

Mamba-Reg: Vision Mamba Also Needs Registers

CVPR 2025
0
citations

Towards Precise Scaling Laws for Video Diffusion Transformers

CVPR 2025
0
citations

Imbalance in Balance: Online Concept Balancing in Generation Models

ICCV 2025
0
citations

DynamicID: Zero-Shot Multi-ID Image Personalization with Flexible Facial Editability

ICCV 2025
0
citations

LiT: Delving into a Simple Linear Diffusion Transformer for Image Generation

ICCV 2025
0
citations