Siyuan Li

52
Papers
415
Total Citations

Papers (52)

MogaNet: Multi-order Gated Aggregation Network

ICLR 2024
125
citations

Make RepVGG Greater Again: A Quantization-Aware Approach

AAAI 2024arXiv
65
citations

Matching Anything by Segmenting Anything

CVPR 2024
49
citations

MAPE-PPI: Towards Effective and Efficient Protein-Protein Interaction Prediction via Microenvironment-Aware Protein Embedding

ICLR 2024
41
citations

FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio

CVPR 2024
18
citations

SemiReward: A General Reward Model for Semi-supervised Learning

ICLR 2024
18
citations

CBGBench: Fill in the Blank of Protein-Molecule Complex Binding Graph

ICLR 2025
16
citations

Prior-guided Hierarchical Harmonization Network for Efficient Image Dehazing

AAAI 2025
16
citations

From Words to Structured Visuals: A Benchmark and Framework for Text-to-Diagram Generation and Editing

CVPR 2025
11
citations

Hybrid Global-Local Representation with Augmented Spatial Guidance for Zero-Shot Referring Image Segmentation

CVPR 2025arXiv
10
citations

SLAck: Semantic, Location, and Appearance Aware Open-Vocabulary Tracking

ECCV 2024
8
citations

Safe Planner: Empowering Safety Awareness in Large Pre-Trained Models for Robot Task Planning

AAAI 2025
7
citations

MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization

CVPR 2025arXiv
6
citations

One2Any: One-Reference 6D Pose Estimation for Any Object

CVPR 2025
5
citations

Towards Homogeneous Lexical Tone Decoding from Heterogeneous Intracranial Recordings

ICLR 2025
5
citations

MeToken: Uniform Micro-environment Token Boosts Post-Translational Modification Prediction

ICLR 2025arXiv
4
citations

DaCapo: Score Distillation as Stacked Bridge for Fast and High-quality 3D Editing

CVPR 2025
4
citations

3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection

ICCV 2025
3
citations

AlphaFold Database Debiasing for Robust Inverse Folding

NeurIPS 2025
2
citations

Rep-MTL: Unleashing the Power of Representation-level Task Saliency for Multi-Task Learning

ICCV 2025
1
citations

Multi-View 3D Point Tracking

ICCV 2025
1
citations

Tracking Every Thing in the Wild

ECCV 2022
0
citations

AutoMix: Unveiling the Power of Mixup for Stronger Classifiers

ECCV 2022
0
citations

Single Image Deraining: A Comprehensive Benchmark Analysis

CVPR 2019
0
citations

Video-Bench: Human-Aligned Video Generation Benchmark

CVPR 2025
0
citations

Dual-branch Graph Feature Learning for NLOS Imaging

AAAI 2025
0
citations

SkillTree: Explainable Skill-Based Deep Reinforcement Learning for Long-Horizon Control Tasks

AAAI 2025
0
citations

3515 Protein 3D Graph Structure Learning for Robust Structure-Based Protein Property Prediction

AAAI 2024
0
citations

Robust Visual Imitation Learning with Inverse Dynamics Representations

AAAI 2024
0
citations

UniDepth: Universal Monocular Metric Depth Estimation

CVPR 2024
0
citations

VQDNA: Unleashing the Power of Vector Quantization for Multi-Species Genomic Sequence Modeling

ICML 2024arXiv
0
citations

Short-Long Convolutions Help Hardware-Efficient Linear Attention to Focus on Long Sequences

ICML 2024
0
citations

Re-Dock: Towards Flexible and Realistic Molecular Docking with Diffusion Bridge

ICML 2024
0
citations

Learning to Predict Mutational Effects of Protein-Protein Interactions by Microenvironment-aware Hierarchical Prompt Learning

ICML 2024
0
citations

UniK3D: Universal Camera Monocular 3D Estimation

CVPR 2025
0
citations

Deformation-Aware Unpaired Image Translation for Pose Estimation on Laboratory Animals

CVPR 2020arXiv
0
citations

Style Transformer for Image Inversion and Editing

CVPR 2022arXiv
0
citations

UMT: Unified Multi-Modal Transformers for Joint Video Moment Retrieval and Highlight Detection

CVPR 2022arXiv
0
citations

Hyperspherical Consistency Regularization

CVPR 2022arXiv
0
citations

CVT-SLR: Contrastive Visual-Textual Transformation for Sign Language Recognition With Variational Alignment

CVPR 2023
0
citations

Temporal Attention Unit: Towards Efficient Spatiotemporal Predictive Learning

CVPR 2023arXiv
0
citations

OVTrack: Open-Vocabulary Multiple Object Tracking

CVPR 2023arXiv
0
citations

Cascade-DETR: Delving into High-Quality Universal Object Detection

ICCV 2023
0
citations

DLME: Deep Local-Flatness Manifold Embedding

ECCV 2022
0
citations

Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards

NeurIPS 2019
0
citations

Offline Reinforcement Learning with Reverse Model-based Imagination

NeurIPS 2021
0
citations

Divide and Contrast: Source-free Domain Adaptation via Adaptive Contrastive Learning

NeurIPS 2022
0
citations

CUP: Critic-Guided Policy Reuse

NeurIPS 2022
0
citations

Functional-Group-Based Diffusion for Pocket-Specific Molecule Generation and Elaboration

NeurIPS 2023
0
citations

Harnessing Hard Mixed Samples with Decoupled Regularizer

NeurIPS 2023
0
citations

Understanding the Limitations of Deep Models for Molecular property prediction: Insights and Solutions

NeurIPS 2023
0
citations

OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive Learning

NeurIPS 2023
0
citations