Shiji Song

35
Papers
180
Total Citations

Papers (35)

GSVA: Generalized Segmentation via Multimodal Large Language Models

CVPR 2024
127
citations

Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis

CVPR 2024
28
citations

DyFADet: Dynamic Feature Aggregation for Temporal Action Detection

ECCV 2024
21
citations

GridMix: Exploring Spatial Modulation for Neural Fields in PDE Modeling

ICLR 2025
4
citations

Resolution Adaptive Networks for Efficient Inference

CVPR 2020arXiv
0
citations

CondenseNet V2: Sparse Feature Reactivation for Deep Networks

CVPR 2021arXiv
0
citations

3D Object Detection With Pointformer

CVPR 2021arXiv
0
citations

Vision Transformer With Deformable Attention

CVPR 2022arXiv
0
citations

On the Integration of Self-Attention and Convolution

CVPR 2022arXiv
0
citations

Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding

CVPR 2022
0
citations

Exploring the Equivalence of Siamese Self-Supervised Learning via a Unified Gradient Framework

CVPR 2022arXiv
0
citations

Zero-Shot Generative Model Adaptation via Image-Specific Prompt Learning

CVPR 2023arXiv
0
citations

Slide-Transformer: Hierarchical Vision Transformer With Local Self-Attention

CVPR 2023
0
citations

Adaptive Focus for Efficient Video Recognition

ICCV 2021arXiv
0
citations

Towards Learning Spatially Discriminative Feature Representations

ICCV 2021arXiv
0
citations

FLatten Transformer: Vision Transformer using Focused Linear Attention

ICCV 2023arXiv
0
citations

Dynamic Perceiver for Efficient Visual Recognition

ICCV 2023arXiv
0
citations

Adaptive Rotated Convolution for Rotated Object Detection

ICCV 2023arXiv
0
citations

EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones

ICCV 2023arXiv
0
citations

AdaFocusV3: On Unified Spatial-Temporal Dynamic Video Recognition

ECCV 2022
0
citations

Learning to Weight Samples for Dynamic Early-Exiting Networks

ECCV 2022
0
citations

ActiveNeRF: Learning Where to See with Uncertainty Estimation

ECCV 2022
0
citations

Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment

CVPR 2025arXiv
0
citations

EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance

CVPR 2025
0
citations

CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning

CVPR 2025
0
citations

Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models

CVPR 2024
0
citations

Implicit Semantic Data Augmentation for Deep Networks

NeurIPS 2019
0
citations

Regularized Anderson Acceleration for Off-Policy Deep Reinforcement Learning

NeurIPS 2019
0
citations

Glance and Focus: a Dynamic Approach to Reducing Spatial Redundancy in Image Classification

NeurIPS 2020
0
citations

Not All Images are Worth 16x16 Words: Dynamic Transformers for Efficient Image Recognition

NeurIPS 2021
0
citations

Efficient Knowledge Distillation from Model Checkpoints

NeurIPS 2022
0
citations

Contrastive Language-Image Pre-Training with Knowledge Graphs

NeurIPS 2022
0
citations

Latency-aware Spatial-wise Dynamic Networks

NeurIPS 2022
0
citations

Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning

NeurIPS 2023
0
citations

Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL

NeurIPS 2023
0
citations