Fan Yang
26
Papers
242
Total Citations
Papers (26)
RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval
NeurIPS 2025 · arXiv
83
citations
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning
NeurIPS 2025 · arXiv
56
citations
Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models
ECCV 2024
30
citations
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers
ICLR 2025
23
citations
MagicArticulate: Make Your 3D Models Articulation-Ready
CVPR 2025
16
citations
CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation
CVPR 2025
12
citations
Geometry-Guided Domain Generalization for Monocular 3D Object Detection
AAAI 2024
10
citations
HEIE: MLLM-Based Hierarchical Explainable AIGC Image Implausibility Evaluator
CVPR 2025
10
citations
Oracle-MoE: Locality-preserving Routing in the Oracle Space for Memory-constrained Large Language Model Inference
ICML 2025
1
citations
Libra-Merging: Importance-redundancy and Pruning-merging Trade-off for Acceleration Plug-in in Large Vision-Language Model
CVPR 2025
1
citations
MExD: An Expert-Infused Diffusion Model for Whole-Slide Image Classification
CVPR 2025
0
citations
Contrasting Adversarial Perturbations: The Space of Harmless Perturbations
AAAI 2025
0
citations
3DHumanEdit: Multi-modal Body Part-aware Conditioning Information Integration for 3D Human Manipulation
AAAI 2025
0
citations
An Effective Augmented Lagrangian Method for Fine-Grained Multi-View Optimization
AAAI 2024
0
citations
Implicit Modeling of Non-rigid Objects with Cross-Category Signals
AAAI 2024 · arXiv
0
citations
Multi-Modal Disordered Representation Learning Network for Description-Based Person Search
AAAI 2024
0
citations
Sparse Bayesian Deep Learning for Cross Domain Medical Image Reconstruction
AAAI 2024
0
citations
Causal-Driven Skill Prerequisite Structure Discovery
AAAI 2024
0
citations
AttriHuman-3D: Editable 3D Human Avatar Generation with Attribute Decomposition and Indexing
CVPR 2024
0
citations
FlowDiffuser: Advancing Optical Flow Estimation with Diffusion Models
CVPR 2024
0
citations
Sculpt3D: Multi-View Consistent Text-to-3D Generation with Sparse 3D Prior
CVPR 2024
0
citations
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens
ICML 2024
0
citations
TVE: Learning Meta-attribution for Transferable Vision Explainer
ICML 2024
0
citations
The Source Image is the Best Attention for Infrared and Visible Image Fusion
ICCV 2025
0
citations
Finite-Time Convergence and Sample Complexity of Actor-Critic Multi-Objective Reinforcement Learning
ICML 2024
0
citations
Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring
ICCV 2025
0
citations