Siyuan Li
52
Papers
415
Total Citations
Papers (52)
MogaNet: Multi-order Gated Aggregation Network
ICLR 2024
125
citations
Make RepVGG Greater Again: A Quantization-Aware Approach
AAAI 2024arXiv
65
citations
Matching Anything by Segmenting Anything
CVPR 2024
49
citations
MAPE-PPI: Towards Effective and Efficient Protein-Protein Interaction Prediction via Microenvironment-Aware Protein Embedding
ICLR 2024
41
citations
FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio
CVPR 2024
18
citations
SemiReward: A General Reward Model for Semi-supervised Learning
ICLR 2024
18
citations
CBGBench: Fill in the Blank of Protein-Molecule Complex Binding Graph
ICLR 2025
16
citations
Prior-guided Hierarchical Harmonization Network for Efficient Image Dehazing
AAAI 2025
16
citations
From Words to Structured Visuals: A Benchmark and Framework for Text-to-Diagram Generation and Editing
CVPR 2025
11
citations
Hybrid Global-Local Representation with Augmented Spatial Guidance for Zero-Shot Referring Image Segmentation
CVPR 2025arXiv
10
citations
SLAck: Semantic, Location, and Appearance Aware Open-Vocabulary Tracking
ECCV 2024
8
citations
Safe Planner: Empowering Safety Awareness in Large Pre-Trained Models for Robot Task Planning
AAAI 2025
7
citations
MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization
CVPR 2025arXiv
6
citations
One2Any: One-Reference 6D Pose Estimation for Any Object
CVPR 2025
5
citations
Towards Homogeneous Lexical Tone Decoding from Heterogeneous Intracranial Recordings
ICLR 2025
5
citations
MeToken: Uniform Micro-environment Token Boosts Post-Translational Modification Prediction
ICLR 2025arXiv
4
citations
DaCapo: Score Distillation as Stacked Bridge for Fast and High-quality 3D Editing
CVPR 2025
4
citations
3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection
ICCV 2025
3
citations
AlphaFold Database Debiasing for Robust Inverse Folding
NeurIPS 2025
2
citations
Rep-MTL: Unleashing the Power of Representation-level Task Saliency for Multi-Task Learning
ICCV 2025
1
citations
Multi-View 3D Point Tracking
ICCV 2025
1
citations
Tracking Every Thing in the Wild
ECCV 2022
0
citations
AutoMix: Unveiling the Power of Mixup for Stronger Classifiers
ECCV 2022
0
citations
Single Image Deraining: A Comprehensive Benchmark Analysis
CVPR 2019
0
citations
Video-Bench: Human-Aligned Video Generation Benchmark
CVPR 2025
0
citations
Dual-branch Graph Feature Learning for NLOS Imaging
AAAI 2025
0
citations
SkillTree: Explainable Skill-Based Deep Reinforcement Learning for Long-Horizon Control Tasks
AAAI 2025
0
citations
3515 Protein 3D Graph Structure Learning for Robust Structure-Based Protein Property Prediction
AAAI 2024
0
citations
Robust Visual Imitation Learning with Inverse Dynamics Representations
AAAI 2024
0
citations
UniDepth: Universal Monocular Metric Depth Estimation
CVPR 2024
0
citations
VQDNA: Unleashing the Power of Vector Quantization for Multi-Species Genomic Sequence Modeling
ICML 2024arXiv
0
citations
Short-Long Convolutions Help Hardware-Efficient Linear Attention to Focus on Long Sequences
ICML 2024
0
citations
Re-Dock: Towards Flexible and Realistic Molecular Docking with Diffusion Bridge
ICML 2024
0
citations
Learning to Predict Mutational Effects of Protein-Protein Interactions by Microenvironment-aware Hierarchical Prompt Learning
ICML 2024
0
citations
UniK3D: Universal Camera Monocular 3D Estimation
CVPR 2025
0
citations
Deformation-Aware Unpaired Image Translation for Pose Estimation on Laboratory Animals
CVPR 2020arXiv
0
citations
Style Transformer for Image Inversion and Editing
CVPR 2022arXiv
0
citations
UMT: Unified Multi-Modal Transformers for Joint Video Moment Retrieval and Highlight Detection
CVPR 2022arXiv
0
citations
Hyperspherical Consistency Regularization
CVPR 2022arXiv
0
citations
CVT-SLR: Contrastive Visual-Textual Transformation for Sign Language Recognition With Variational Alignment
CVPR 2023
0
citations
Temporal Attention Unit: Towards Efficient Spatiotemporal Predictive Learning
CVPR 2023arXiv
0
citations
OVTrack: Open-Vocabulary Multiple Object Tracking
CVPR 2023arXiv
0
citations
Cascade-DETR: Delving into High-Quality Universal Object Detection
ICCV 2023
0
citations
DLME: Deep Local-Flatness Manifold Embedding
ECCV 2022
0
citations
Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards
NeurIPS 2019
0
citations
Offline Reinforcement Learning with Reverse Model-based Imagination
NeurIPS 2021
0
citations
Divide and Contrast: Source-free Domain Adaptation via Adaptive Contrastive Learning
NeurIPS 2022
0
citations
CUP: Critic-Guided Policy Reuse
NeurIPS 2022
0
citations
Functional-Group-Based Diffusion for Pocket-Specific Molecule Generation and Elaboration
NeurIPS 2023
0
citations
Harnessing Hard Mixed Samples with Decoupled Regularizer
NeurIPS 2023
0
citations
Understanding the Limitations of Deep Models for Molecular property prediction: Insights and Solutions
NeurIPS 2023
0
citations
OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive Learning
NeurIPS 2023
0
citations