Si Liu
20
Papers
154
Total Citations
Papers (20)
VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection
CVPR 2025
54
citations
FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis
ECCV 2024arXiv
51
citations
Mixture Compressor for Mixture-of-Experts LLMs Gains More
ICLR 2025arXiv
22
citations
Controllable Navigation Instruction Generation with Chain of Thought Prompting
ECCV 2024arXiv
16
citations
UAV-Flow Colosseo: A Real-World Benchmark for Flying-on-a-Word UAV Imitation Learning
NeurIPS 2025
8
citations
FlexDrive: Toward Trajectory Flexibility in Driving Scene Gaussian Splatting Reconstruction and Rendering
CVPR 2025
2
citations
CoST: Efficient Collaborative Perception From Unified Spatiotemporal Perspective
ICCV 2025
1
citations
GaussianPainter: Painting Point Cloud into 3D Gaussians with Normal Guidance
AAAI 2025
0
citations
Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training
CVPR 2024
0
citations
SAFDNet: A Simple and Effective Network for Fully Sparse 3D Object Detection
CVPR 2024
0
citations
EASE-DETR: Easing the Competition among Object Queries
CVPR 2024
0
citations
Communication-Efficient Collaborative Perception via Information Filling with Codebook
CVPR 2024
0
citations
LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding
CVPR 2025
0
citations
Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection
CVPR 2024
0
citations
Generative Map Priors for Collaborative BEV Semantic Segmentation
CVPR 2025
0
citations
Revisiting Audio-Visual Segmentation with Vision-Centric Transformer
CVPR 2025
0
citations
Instruction-Oriented Preference Alignment for Enhancing Multi-Modal Comprehension Capability of MLLMs
ICCV 2025
0
citations
CycleVAR: Repurposing Autoregressive Model for Unsupervised One-Step Image Translation
ICCV 2025
0
citations
Video2BEV: Transforming Drone Videos to BEVs for Video-based Geo-localization
ICCV 2025
0
citations
Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation
AAAI 2025
0
citations