Xiu Su
13
Papers
58
Total Citations
Papers (13)
VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection
CVPR 2025
54
citations
L-MTP: Leap Multi-Token Prediction Beyond Adjacent Context for Large Language Models
NeurIPS 2025
3
citations
Perturbating, Tuning, and Collaborating: Harnessing Vision Foundation Models for Single Domain Generalization on Medical Imaging
AAAI 2025
1
citations
Detecting Any instruction-to-answer interaction relationship:Universal Instruction-to-Answer Navigator for Med-VQA
ICML 2024
0
citations
BCNet: Searching for Network Width With Bilaterally Coupled Network
CVPR 2021arXiv
0
citations
Re-mine, Learn and Reason: Exploring the Cross-modal Semantic Correlations for Language-guided HOI detection
ICCV 2023arXiv
0
citations
ScaleNet: Searching for the Model to Scale
ECCV 2022
0
citations
ViTAS: Vision Transformer Architecture Search
ECCV 2022
0
citations
Prioritized Architecture Sampling With Monto-Carlo Tree Search
CVPR 2021arXiv
0
citations
CounterPC: Counterfactual Feature Realignment for Unsupervised Domain Adaptation on Point Clouds
ICCV 2025
0
citations
Seeing Beyond Noise: Joint Graph Structure Evaluation and Denoising for Multimodal Recommendation
AAAI 2025
0
citations
Searching for Better Spatio-temporal Alignment in Few-Shot Action Recognition
NeurIPS 2022
0
citations
Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundation Models
NeurIPS 2023
0
citations