Qi Chen

18
Papers
152
Total Citations

Papers (18)

RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval

NeurIPS 2025
81
citations

WebVLN: Vision-and-Language Navigation on Websites

AAAI 2024arXiv
19
citations

CREAD: A Classification-Restoration Framework with Error Adaptive Discretization for Watch Time Prediction in Video Recommender Systems

AAAI 2024arXiv
15
citations

PairAug: What Can Augmented Image-Text Pairs Do for Radiology?

CVPR 2024
12
citations

From Words to Structured Visuals: A Benchmark and Framework for Text-to-Diagram Generation and Editing

CVPR 2025
11
citations

Attention-Driven GUI Grounding: Leveraging Pretrained Multimodal Large Language Models Without Fine-Tuning

AAAI 2025
9
citations

IPDN: Image-enhanced Prompt Decoding Network for 3D Referring Expression Segmentation

AAAI 2025
3
citations

Dual Energy-Based Model with Open-World Uncertainty Estimation for Out-of-distribution Detection

CVPR 2025
1
citations

Separation of Powers: On Segregating Knowledge from Observation in LLM-enabled Knowledge-based Visual Question Answering

CVPR 2025
1
citations

Seeing the Trees for the Forest: Rethinking Weakly-Supervised Medical Visual Grounding

ICCV 2025
0
citations

VQTalker: Towards Multilingual Talking Avatars Through Facial Motion Tokenization

AAAI 2025
0
citations

OVG-HQ: Online Video Grounding with Hybrid-modal Queries

ICCV 2025
0
citations

Enhancing Large Language Model Performance with Gradient-Based Parameter Selection

AAAI 2025
0
citations

Scaling Tumor Segmentation: Best Lessons from Real and Synthetic Data

ICCV 2025
0
citations

Bitrate-Controlled Diffusion for Disentangling Motion and Content in Video

ICCV 2025
0
citations

Training-Free Class Purification for Open-Vocabulary Semantic Segmentation

ICCV 2025
0
citations

G-NeRF: Geometry-enhanced Novel View Synthesis from Single-View Images

CVPR 2024
0
citations

Towards Generalizable Tumor Synthesis

CVPR 2024
0
citations