Jiahao Wang
28
Papers
136
Total Citations
Papers (28)
Structure-Aware Sparse-View X-ray 3D Reconstruction
CVPR 2024 (arXiv)
75
citations
Universal Segmentation at Arbitrary Granularity with Language Instruction
CVPR 2024 (arXiv)
30
citations
CoopTrack: Exploring End-to-End Learning for Efficient Cooperative Sequential Perception
ICCV 2025 (arXiv)
8
citations
DynamicID: Zero-Shot Multi-ID Image Personalization with Flexible Facial Editability
ICCV 2025 (arXiv)
8
citations
SpotActor: Training-Free Layout-Controlled Consistent Image Generation
AAAI 2025 (arXiv)
6
citations
SAUI: Scale-Aware Unseen Imagineer for Zero-Shot Object Detection
AAAI 2024
4
citations
SceneCrafter: Controllable Multi-View Driving Scene Editing
CVPR 2025 (arXiv)
3
citations
Stepping Out of Similar Semantic Space for Open-Vocabulary Segmentation
ICCV 2025 (arXiv)
2
citations
PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models
CVPR 2025 (arXiv)
0
citations
Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content
CVPR 2025
0
citations
Mamba-Reg: Vision Mamba Also Needs Registers
CVPR 2025
0
citations
Imbalance in Balance: Online Concept Balancing in Generation Models
ICCV 2025 (arXiv)
0
citations
LiT: Delving into a Simple Linear Diffusion Transformer for Image Generation
ICCV 2025 (arXiv)
0
citations
Enhancing the Outcome Reward-based RL Training of MLLMs with Self-Consistency Sampling
NeurIPS 2025 (arXiv)
0
citations
IWRN: A Robust Blind Watermarking Method for Artwork Image Copyright Protection Against Noise Attack
AAAI 2025
0
citations
ViLT-CLIP: Video and Language Tuning CLIP with Multimodal Prompt Learning and Scenario-guided Optimization
AAAI 2024
0
citations
CRA-PCN: Point Cloud Completion with Intra- and Inter-level Cross-Resolution Transformers
AAAI 2024 (arXiv)
0
citations
RepKPU: Point Cloud Upsampling with Kernel Point Representation and Deformation
CVPR 2024
0
citations
RobustLight: Improving Robustness via Diffusion Reinforcement Learning for Traffic Signal Control
ICML 2025
0
citations
Learning Adaptive Warping for Real-World Rolling Shutter Correction
CVPR 2022 (arXiv)
0
citations
Accelerating Neural Network Optimization Through an Automated Control Theory Lens
CVPR 2022
0
citations
RIFormer: Keep Your Vision Backbone Effective but Removing Token Mixer
CVPR 2023
0
citations
Memory-and-Anticipation Transformer for Online Action Understanding
ICCV 2023 (arXiv)
0
citations
Animal3D: A Comprehensive Dataset of 3D Animal Pose and Shape
ICCV 2023 (arXiv)
0
citations
SAGA: Stochastic Whole-Body Grasping with Contact
ECCV 2022
0
citations
Global Spectral Filter Memory Network for Video Object Segmentation
ECCV 2022
0
citations
Adder Attention for Vision Transformer
NeurIPS 2021
0
citations
Towards Precise Scaling Laws for Video Diffusion Transformers
CVPR 2025 (arXiv)
0
citations