Xin Chen

51
Papers
365
Total Citations

Papers (51)

Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models

CVPR 2024
108
citations

Plug-In Diffusion Model for Sequential Recommendation

AAAI 2024, arXiv
69
citations

OMG: Towards Open-vocabulary Motion Generation via Mixture of Controllers

CVPR 2024
47
citations

SUTrack: Towards Simple and Unified Single Object Tracking

AAAI 2025
37
citations

Exploring Enhanced Contextual Information for Video-Level Object Tracking

AAAI 2025
27
citations

MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D

CVPR 2025, arXiv
19
citations

CycleINR: Cycle Implicit Neural Representation for Arbitrary-Scale Volumetric Super-Resolution of Medical Data

CVPR 2024
17
citations

Two-stream Beats One-stream: Asymmetric Siamese Network for Efficient Visual Tracking

AAAI 2025
15
citations

X-Dancer: Expressive Music to Human Dance Video Generation

ICCV 2025
9
citations

Learning Safety Constraints for Large Language Models

ICML 2025
7
citations

MikuDance: Animating Character Art with Mixed Motion Dynamics

ICCV 2025
4
citations

CohEx: A Generalized Framework for Cohort Explanation

AAAI 2025
2
citations

Learning Dynamic Collaborative Network for Semi-supervised 3D Vessel Segmentation

CVPR 2025
2
citations

DoDo-Code: an Efficient Levenshtein Distance Embedding-based Code for 4-ary IDS Channel

NeurIPS 2025
1
citations

Efficient Motion Prompt Learning for Robust Visual Tracking

ICML 2025
1
citations

End-to-End 3D Dense Captioning With Vote2Cap-DETR

CVPR 2023, arXiv
0
citations

Devil Is in the Queries: Advancing Mask Transformers for Real-World Medical Image Segmentation and Out-of-Distribution Localization

CVPR 2023, arXiv
0
citations

Text-Visual Prompting for Efficient 2D Temporal Video Grounding

CVPR 2023, arXiv
0
citations

Progressive Differentiable Architecture Search: Bridging the Depth Gap Between Search and Evaluation

ICCV 2019
0
citations

Enhancing Low Light Videos by Exploring High Sensitivity Camera Noise

ICCV 2019
0
citations

Exploring Geometry-Aware Contrast and Clustering Harmonization for Self-Supervised 3D Object Detection

ICCV 2021
0
citations

Exploring Lightweight Hierarchical Vision Transformers for Efficient Visual Tracking

ICCV 2023, arXiv
0
citations

A Large-Scale Outdoor Multi-Modal Dataset and Benchmark for Novel View Synthesis and Implicit Scene Reconstruction

ICCV 2023, arXiv
0
citations

CancerUniT: Towards a Single Unified Model for Effective Detection, Segmentation, and Diagnosis of Eight Major Cancers Using a Large Collection of CT Scans

ICCV 2023
0
citations

Fan-Beam Binarization Difference Projection (FB-BDP): A Novel Local Object Descriptor for Fine-Grained Leaf Image Retrieval

ICCV 2023
0
citations

Circumventing Outliers of AutoAugment with Knowledge Distillation

ECCV 2020
0
citations

CATCH: Context-based Meta Reinforcement Learning for Transferrable Architecture Search

ECCV 2020
0
citations

Cornerformer: Purifying Instances for Corner-Based Detectors

ECCV 2022
0
citations

Contrastive Deep Supervision

ECCV 2022
0
citations

Visual Prompt Multi-Modal Tracking

CVPR 2023, arXiv
0
citations

ESCNet: Edge-Semantic Collaborative Network for Camouflaged Object Detection

ICCV 2025
0
citations

ERL-MPP: Evolutionary Reinforcement Learning with Multi-head Puzzle Perception for Solving Large-scale Jigsaw Puzzles of Eroded Gaps

AAAI 2025
0
citations

PM-INR: Prior-Rich Multi-Modal Implicit Large-Scale Scene Neural Representation

AAAI 2024
0
citations

REGLO: Provable Neural Network Repair for Global Robustness Properties

AAAI 2024
0
citations

LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning

CVPR 2024
0
citations

Sparse Photometric 3D Face Reconstruction Guided by Morphable Models

CVPR 2018, arXiv
0
citations

Robustness Verification of Classification Deep Neural Networks via Linear Programming

CVPR 2019
0
citations

TransNAS-Bench-101: Improving Transferability and Generalizability of Cross-Task Neural Architecture Search

CVPR 2021
0
citations

ChallenCap: Monocular 3D Capture of Challenging Human Performances Using Multi-Modal References

CVPR 2021, arXiv
0
citations

Transformer Tracking

CVPR 2021, arXiv
0
citations

Arch-Graph: Acyclic Architecture Relation Predictor for Task-Transferable Neural Architecture Search

CVPR 2022
0
citations

Wavelet Knowledge Distillation: Towards Efficient Image-to-Image Translation

CVPR 2022, arXiv
0
citations

Executing Your Commands via Motion Diffusion in Latent Space

CVPR 2023, arXiv
0
citations

SeqTrack: Sequence to Sequence Learning for Visual Object Tracking

CVPR 2023, arXiv
0
citations

Online Optimal Control with Linear Dynamics and Predictions: Algorithms and Regret Analysis

NeurIPS 2019
0
citations

Biased Stochastic First-Order Methods for Conditional Stochastic Optimization and Applications in Meta Learning

NeurIPS 2020
0
citations

Graph Stochastic Neural Networks for Semi-supervised Learning

NeurIPS 2020
0
citations

On the Bias-Variance-Cost Tradeoff of Stochastic Optimization

NeurIPS 2021
0
citations

PDF: Point Diffusion Implicit Function for Large-scale Scene Neural Representation

NeurIPS 2023
0
citations

MotionGPT: Human Motion as a Foreign Language

NeurIPS 2023
0
citations

Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation

NeurIPS 2023
0
citations