Yan Yan
24
Papers
44
Total Citations
Papers (24)
Token Transformation Matters: Towards Faithful Post-hoc Explanation for Vision Transformer
CVPR 2024
16
citations
Efficient Multitask Dense Predictor via Binarization
CVPR 2024
6
citations
InfantAgent-Next: A Multimodal Generalist Agent for Automated Computer Interaction
NeurIPS 2025
5
citations
Federated Partial Label Learning with Local-Adaptive Augmentation and Regularization
AAAI 2024
5
citations
CaO2: Rectifying Inconsistencies in Diffusion-Based Dataset Distillation
ICCV 2025
5
citations
Efficient Multimodal Dataset Distillation via Generative Models
NeurIPS 2025
2
citations
Enhancing Dance-to-Music Generation via Negative Conditioning Latent Diffusion Model
CVPR 2025
2
citations
Orientation-anchored Hyper-Gaussian for 4D Reconstruction from Casual Videos
NeurIPS 2025arXiv
2
citations
ConformalSAM: Unlocking the Potential of Foundational Segmentation Models in Semi-Supervised Semantic Segmentation with Conformal Prediction
ICCV 2025
1
citations
High-Order Structure Based Middle-Feature Learning for Visible-Infrared Person Re-identification
AAAI 2024arXiv
0
citations
Versatile Navigation Under Partial Observability via Value-guided Diffusion Policy
CVPR 2024
0
citations
BlockGCN: Redefine Topology Awareness for Skeleton-Based Action Recognition
CVPR 2024
0
citations
On the Faithfulness of Vision Transformer Explanations
CVPR 2024
0
citations
Enhancing Post-training Quantization Calibration through Contrastive Learning
CVPR 2024
0
citations
DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture
CVPR 2025
0
citations
The Pitfalls and Promise of Conformal Inference Under Adversarial Attacks
ICML 2024
0
citations
Distilling Long-tailed Datasets
CVPR 2025
0
citations
QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning
ICCV 2025
0
citations
LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models
ICCV 2025
0
citations
MaskSAM: Auto-prompt SAM with Mask Classification for Volumetric Medical Image Segmentation
ICCV 2025
0
citations
You Are Your Own Best Teacher: Achieving Centralized-level Performance in Federated Learning under Heterogeneous and Long-tailed Data
ICCV 2025
0
citations
Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning
ICCV 2025
0
citations
Language Decoupling with Fine-grained Knowledge Guidance for Referring Multi-object Tracking
ICCV 2025
0
citations
WaveFormer: Wavelet Transformer for Noise-Robust Video Inpainting
AAAI 2024
0
citations