Yan Yan

24
Papers
44
Total Citations

Papers (24)

Token Transformation Matters: Towards Faithful Post-hoc Explanation for Vision Transformer

CVPR 2024
16
citations

Efficient Multitask Dense Predictor via Binarization

CVPR 2024
6
citations

InfantAgent-Next: A Multimodal Generalist Agent for Automated Computer Interaction

NeurIPS 2025
5
citations

Federated Partial Label Learning with Local-Adaptive Augmentation and Regularization

AAAI 2024
5
citations

CaO2: Rectifying Inconsistencies in Diffusion-Based Dataset Distillation

ICCV 2025
5
citations

Efficient Multimodal Dataset Distillation via Generative Models

NeurIPS 2025
2
citations

Enhancing Dance-to-Music Generation via Negative Conditioning Latent Diffusion Model

CVPR 2025
2
citations

Orientation-anchored Hyper-Gaussian for 4D Reconstruction from Casual Videos

NeurIPS 2025, arXiv
2
citations

ConformalSAM: Unlocking the Potential of Foundational Segmentation Models in Semi-Supervised Semantic Segmentation with Conformal Prediction

ICCV 2025
1
citation

High-Order Structure Based Middle-Feature Learning for Visible-Infrared Person Re-identification

AAAI 2024, arXiv
0
citations

Versatile Navigation Under Partial Observability via Value-guided Diffusion Policy

CVPR 2024
0
citations

BlockGCN: Redefine Topology Awareness for Skeleton-Based Action Recognition

CVPR 2024
0
citations

On the Faithfulness of Vision Transformer Explanations

CVPR 2024
0
citations

Enhancing Post-training Quantization Calibration through Contrastive Learning

CVPR 2024
0
citations

DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture

CVPR 2025
0
citations

The Pitfalls and Promise of Conformal Inference Under Adversarial Attacks

ICML 2024
0
citations

Distilling Long-tailed Datasets

CVPR 2025
0
citations

QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning

ICCV 2025
0
citations

LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models

ICCV 2025
0
citations

MaskSAM: Auto-prompt SAM with Mask Classification for Volumetric Medical Image Segmentation

ICCV 2025
0
citations

You Are Your Own Best Teacher: Achieving Centralized-level Performance in Federated Learning under Heterogeneous and Long-tailed Data

ICCV 2025
0
citations

Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning

ICCV 2025
0
citations

Language Decoupling with Fine-grained Knowledge Guidance for Referring Multi-object Tracking

ICCV 2025
0
citations

WaveFormer: Wavelet Transformer for Noise-Robust Video Inpainting

AAAI 2024
0
citations