Si Liu

20
Papers
154
Total Citations

Papers (20)

VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection

CVPR 2025
54
citations

FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis

ECCV 2024arXiv
51
citations

Mixture Compressor for Mixture-of-Experts LLMs Gains More

ICLR 2025arXiv
22
citations

Controllable Navigation Instruction Generation with Chain of Thought Prompting

ECCV 2024arXiv
16
citations

UAV-Flow Colosseo: A Real-World Benchmark for Flying-on-a-Word UAV Imitation Learning

NeurIPS 2025
8
citations

FlexDrive: Toward Trajectory Flexibility in Driving Scene Gaussian Splatting Reconstruction and Rendering

CVPR 2025
2
citations

CoST: Efficient Collaborative Perception From Unified Spatiotemporal Perspective

ICCV 2025
1
citations

GaussianPainter: Painting Point Cloud into 3D Gaussians with Normal Guidance

AAAI 2025
0
citations

Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training

CVPR 2024
0
citations

SAFDNet: A Simple and Effective Network for Fully Sparse 3D Object Detection

CVPR 2024
0
citations

EASE-DETR: Easing the Competition among Object Queries

CVPR 2024
0
citations

Communication-Efficient Collaborative Perception via Information Filling with Codebook

CVPR 2024
0
citations

LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding

CVPR 2025
0
citations

Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection

CVPR 2024
0
citations

Generative Map Priors for Collaborative BEV Semantic Segmentation

CVPR 2025
0
citations

Revisiting Audio-Visual Segmentation with Vision-Centric Transformer

CVPR 2025
0
citations

Instruction-Oriented Preference Alignment for Enhancing Multi-Modal Comprehension Capability of MLLMs

ICCV 2025
0
citations

CycleVAR: Repurposing Autoregressive Model for Unsupervised One-Step Image Translation

ICCV 2025
0
citations

Video2BEV: Transforming Drone Videos to BEVs for Video-based Geo-localization

ICCV 2025
0
citations

Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation

AAAI 2025
0
citations