Yu Wang

49
Papers
563
Total Citations
1
Affiliations

Affiliations

University of California, San Diego

Papers (49)

Knowledge Graph Prompting for Multi-Document Question Answering

AAAI 2024arXiv
231
citations

Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector

ECCV 2024arXiv
48
citations

ParCo: Part-Coordinating Text-to-Motion Synthesis

ECCV 2024arXiv
43
citations

MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization

ECCV 2024arXiv
36
citations

V2Meow: Meowing to the Visual Beat via Video-to-Music Generation

AAAI 2024arXiv
23
citations

FrameFusion: Combining Similarity and Importance for Video Token Reduction on Large Vision Language Models

ICCV 2025arXiv
22
citations

PrPSeg: Universal Proposition Learning for Panoramic Renal Pathology Segmentation

CVPR 2024arXiv
21
citations

Exploring Diverse Representations for Open Set Recognition

AAAI 2024arXiv
18
citations

KD-DETR: Knowledge Distillation for Detection Transformer with Consistent Distillation Points Sampling

CVPR 2024arXiv
16
citations

Every Node Is Different: Dynamically Fusing Self-Supervised Tasks for Attributed Graph Clustering

AAAI 2024arXiv
16
citations

ASIGN: An Anatomy-aware Spatial Imputation Graphic Network for 3D Spatial Transcriptomics

CVPR 2025arXiv
13
citations

Dynamic Sub-graph Distillation for Robust Semi-supervised Continual Learning

AAAI 2024arXiv
11
citations

MBQ: Modality-Balanced Quantization for Large Vision-Language Models

CVPR 2025arXiv
10
citations

ReinFlow: Fine-tuning Flow Matching Policy with Online Reinforcement Learning

NeurIPS 2025arXiv
10
citations

DiTFastAttnV2: Head-wise Attention Compression for Multi-Modality Diffusion Transformers

ICCV 2025arXiv
10
citations

Towards Trustworthy Knowledge Graph Reasoning: An Uncertainty Aware Perspective

AAAI 2025arXiv
10
citations

R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing

NeurIPS 2025arXiv
9
citations

When Visual Grounding Meets Gigapixel-level Large-scale Scenes: Benchmark and Approach

CVPR 2024
7
citations

Holistic Semantic Representation for Navigational Trajectory Generation

AAAI 2025arXiv
3
citations

AnyTalk: Multi-modal Driven Multi-domain Talking Head Generation

AAAI 2025
2
citations

Take the Bull by the Horns: Learning to Segment Hard Samples

CVPR 2025
1
citations

PEINR: A Physics-enhanced Implicit Neural Representation for High-Fidelity Flow Field Reconstruction

ICML 2025
1
citations

DLFR-Gen: Diffusion-based Video Generation with Dynamic Latent Frame Rate

ICCV 2025
1
citations

Probabilistic Prompt Distribution Learning for Animal Pose Estimation

CVPR 2025
1
citations

DEPTHOR: Depth Enhancement from a Practical Light-Weight dToF Sensor and RGB Image

ICCV 2025arXiv
0
citations

SVDC: Consistent Direct Time-of-Flight Video Depth Completion with Frequency Selective Fusion

CVPR 2025
0
citations

Reducing Class-wise Confusion for Incremental Learning with Disentangled Manifolds

CVPR 2025
0
citations

Continual SFT Matches Multimodal RLHF with Negative Supervision

CVPR 2025
0
citations

Rethinking the Upsampling Process in Light Field Super-Resolution with Spatial-Epipolar Implicit Image Function

ICCV 2025
0
citations

Long-Tailed Classification with Multi-Granularity Semantics

ICCV 2025
0
citations

HoliTracer: Holistic Vectorization of Geographic Objects from Large-Size Remote Sensing Imagery

ICCV 2025arXiv
0
citations

Assessing Safety Risks and Quantization-aware Safety Patching for Quantized Large Language Models

ICML 2025arXiv
0
citations

Zero-Sum vs. Positive-Sum: Effects of Inter-team Competition Modes and Haptic Feedback on Team Flow in Multi-team VR

ISMAR 2025
0
citations

SuperJunction: Learning-Based Junction Detection for Retinal Image Registration

AAAI 2024
0
citations

Semi-supervised Learning of Dynamical Systems with Neural Ordinary Differential Equations: A Teacher-Student Model Approach

AAAI 2024arXiv
0
citations

Self-Updatable Large Language Models by Integrating Context into Model Parameters

ICLR 2025arXiv
0
citations

Accelerate Multi-Agent Reinforcement Learning in Zero-Sum Games with Subgame Curriculum Learning

AAAI 2024
0
citations

FlashEval: Towards Fast and Accurate Evaluation of Text-to-image Diffusion Generative Models

CVPR 2024
0
citations

H2GFormer: Horizontal-to-Global Voxel Transformer for 3D Semantic Scene Completion

AAAI 2024
0
citations

Enhancing Contrastive Learning Inspired by the Philosophy of “The Blind Men and the Elephant”

AAAI 2025
0
citations

Unified Generation, Reconstruction, and Representation: Generalized Diffusion with Adaptive Latent Encoding-Decoding

ICML 2024
0
citations

Interaction-based Retrieval-augmented Diffusion Models for Protein-specific 3D Molecule Generation

ICML 2024
0
citations

MEMORYLLM: Towards Self-Updatable Large Language Models

ICML 2024arXiv
0
citations

Position: Towards Implicit Prompt For Text-To-Image Models

ICML 2024
0
citations

Socialized Learning: Making Each Other Better Through Multi-Agent Collaboration

ICML 2024
0
citations

Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game

ICML 2024
0
citations

Evaluating Quantized Large Language Models

ICML 2024
0
citations

Open-Set Graph Domain Adaptation via Separate Domain Alignment

AAAI 2024
0
citations

High-Dimensional Bayesian Optimization via Semi-Supervised Learning with Optimized Unlabeled Data Sampling

ICML 2024
0
citations