Fan Yang

26
Papers
242
Total Citations

Papers (26)

RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval

NeurIPS 2025arXiv
83
citations

ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning

NeurIPS 2025arXiv
56
citations

Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models

ECCV 2024
30
citations

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solver

ICLR 2025
23
citations

MagicArticulate: Make Your 3D Models Articulation-Ready

CVPR 2025
16
citations

CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation

CVPR 2025
12
citations

Geometry-Guided Domain Generalization for Monocular 3D Object Detection

AAAI 2024
10
citations

HEIE: MLLM-Based Hierarchical Explainable AIGC Image Implausibility Evaluator

CVPR 2025
10
citations

Oracle-MoE: Locality-preserving Routing in the Oracle Space for Memory-constrained Large Language Model Inference

ICML 2025
1
citations

Libra-Merging: Importance-redundancy and Pruning-merging Trade-off for Acceleration Plug-in in Large Vision-Language Model

CVPR 2025
1
citations

MExD: An Expert-Infused Diffusion Model for Whole-Slide Image Classification

CVPR 2025
0
citations

Contrasting Adversarial Perturbations: The Space of Harmless Perturbations

AAAI 2025
0
citations

3DHumanEdit: Multi-modal Body Part-aware Conditioning Information Integration for 3D Human Manipulation

AAAI 2025
0
citations

An Effective Augmented Lagrangian Method for Fine-Grained Multi-View Optimization

AAAI 2024
0
citations

Implicit Modeling of Non-rigid Objects with Cross-Category Signals

AAAI 2024arXiv
0
citations

Multi-Modal Disordered Representation Learning Network for Description-Based Person Search

AAAI 2024
0
citations

Sparse Bayesian Deep Learning for Cross Domain Medical Image Reconstruction

AAAI 2024
0
citations

Causal-Driven Skill Prerequisite Structure Discovery

AAAI 2024
0
citations

AttriHuman-3D: Editable 3D Human Avatar Generation with Attribute Decomposition and Indexing

CVPR 2024
0
citations

FlowDiffuser: Advancing Optical Flow Estimation with Diffusion Models

CVPR 2024
0
citations

Sculpt3D: Multi-View Consistent Text-to-3D Generation with Sparse 3D Prior

CVPR 2024
0
citations

LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens

ICML 2024
0
citations

TVE: Learning Meta-attribution for Transferable Vision Explainer

ICML 2024
0
citations

The Source Image is the Best Attention for Infrared and Visible Image Fusion

ICCV 2025
0
citations

Finite-Time Convergence and Sample Complexity of Actor-Critic Multi-Objective Reinforcement Learning

ICML 2024
0
citations

Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring

ICCV 2025
0
citations