Shiji Song
35
Papers
180
Total Citations
Papers (35)
GSVA: Generalized Segmentation via Multimodal Large Language Models
CVPR 2024
127
citations
Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis
CVPR 2024
28
citations
DyFADet: Dynamic Feature Aggregation for Temporal Action Detection
ECCV 2024
21
citations
GridMix: Exploring Spatial Modulation for Neural Fields in PDE Modeling
ICLR 2025
4
citations
Resolution Adaptive Networks for Efficient Inference
CVPR 2020 (arXiv)
0
citations
CondenseNet V2: Sparse Feature Reactivation for Deep Networks
CVPR 2021 (arXiv)
0
citations
3D Object Detection With Pointformer
CVPR 2021 (arXiv)
0
citations
Vision Transformer With Deformable Attention
CVPR 2022 (arXiv)
0
citations
On the Integration of Self-Attention and Convolution
CVPR 2022 (arXiv)
0
citations
Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
CVPR 2022
0
citations
Exploring the Equivalence of Siamese Self-Supervised Learning via a Unified Gradient Framework
CVPR 2022 (arXiv)
0
citations
Zero-Shot Generative Model Adaptation via Image-Specific Prompt Learning
CVPR 2023 (arXiv)
0
citations
Slide-Transformer: Hierarchical Vision Transformer With Local Self-Attention
CVPR 2023
0
citations
Adaptive Focus for Efficient Video Recognition
ICCV 2021 (arXiv)
0
citations
Towards Learning Spatially Discriminative Feature Representations
ICCV 2021 (arXiv)
0
citations
FLatten Transformer: Vision Transformer using Focused Linear Attention
ICCV 2023 (arXiv)
0
citations
Dynamic Perceiver for Efficient Visual Recognition
ICCV 2023 (arXiv)
0
citations
Adaptive Rotated Convolution for Rotated Object Detection
ICCV 2023 (arXiv)
0
citations
EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones
ICCV 2023 (arXiv)
0
citations
AdaFocusV3: On Unified Spatial-Temporal Dynamic Video Recognition
ECCV 2022
0
citations
Learning to Weight Samples for Dynamic Early-Exiting Networks
ECCV 2022
0
citations
ActiveNeRF: Learning Where to See with Uncertainty Estimation
ECCV 2022
0
citations
Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment
CVPR 2025 (arXiv)
0
citations
EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance
CVPR 2025
0
citations
CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning
CVPR 2025
0
citations
Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models
CVPR 2024
0
citations
Implicit Semantic Data Augmentation for Deep Networks
NeurIPS 2019
0
citations
Regularized Anderson Acceleration for Off-Policy Deep Reinforcement Learning
NeurIPS 2019
0
citations
Glance and Focus: a Dynamic Approach to Reducing Spatial Redundancy in Image Classification
NeurIPS 2020
0
citations
Not All Images are Worth 16x16 Words: Dynamic Transformers for Efficient Image Recognition
NeurIPS 2021
0
citations
Efficient Knowledge Distillation from Model Checkpoints
NeurIPS 2022
0
citations
Contrastive Language-Image Pre-Training with Knowledge Graphs
NeurIPS 2022
0
citations
Latency-aware Spatial-wise Dynamic Networks
NeurIPS 2022
0
citations
Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning
NeurIPS 2023
0
citations
Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL
NeurIPS 2023
0
citations