Chen Li

53
Papers
143
Total Citations

Papers (53)

ST-LLM: Large Language Models Are Effective Temporal Learners

ECCV 2024
124
citations

TopoCellGen: Generating Histopathology Cell Topology with a Diffusion Model

CVPR 2025
11
citations

AugDETR: Improving Multi-scale Learning for Detection Transformer

ECCV 2024
4
citations

Detecting Adversarial Data Using Perturbation Forgery

CVPR 2025
2
citations

TreeSBA: Tree-Transformer for Self-Supervised Sequential Brick Assembly

ECCV 2024
1
citations

DAMap: Distance-aware MapNet for High Quality HD Map Construction

ICCV 2025
1
citations

IMoRe: Implicit Program-Guided Reasoning for Human Motion Q&A

ICCV 2025
0
citations

Morph: A Motion-free Physics Optimization Framework for Human Motion Generation

ICCV 2025
0
citations

Text-guided Visual Prompt DINO for Generic Segmentation

ICCV 2025
0
citations

RemDet: Rethinking Efficient Model Design for UAV Object Detection

AAAI 2025
0
citations

Mamba YOLO: A Simple Baseline for Object Detection with State Space Model

AAAI 2025
0
citations

SwitchTab: Switched Autoencoders Are Effective Tabular Learners

AAAI 2024
0
citations

GxVAEs: Two Joint VAEs Generate Hit Molecules from Gene Expression Profiles

AAAI 2024
0
citations

InstructDiffusion: A Generalist Modeling Interface for Vision Tasks

CVPR 2024
0
citations

BT-Adapter: Video Conversation is Feasible Without Video Instruction Tuning

CVPR 2024
0
citations

DiSR-NeRF: Diffusion-Guided View-Consistent Super-Resolution NeRF

CVPR 2024
0
citations

Practical Measurements of Translucent Materials with Inter-Pixel Translucency Prior

CVPR 2024
0
citations

ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image Classification

CVPR 2024
0
citations

Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models

CVPR 2024
0
citations

Multi-View Attentive Contextualization for Multi-View 3D Object Detection

CVPR 2024
0
citations

ESCAPE: Encoding Super-keypoints for Category-Agnostic Pose Estimation

CVPR 2024
0
citations

Closely Interactive Human Reconstruction with Proxemics and Physics-Guided Adaption

CVPR 2024
0
citations

Towards Generalization beyond Pointwise Learning: A Unified Information-theoretic Perspective

ICML 2024
0
citations

Simulating Makeup Through Physics-Based Manipulation of Intrinsic Image Layers

CVPR 2015
0
citations

Specular Highlight Removal in Facial Images

CVPR 2017
0
citations

Radiometric Calibration From Faces in Images

CVPR 2017
0
citations

Convolutional Sequence to Sequence Model for Human Dynamics

CVPR 2018
0
citations

MMFace: A Multi-Metric Regression Network for Unconstrained Face Reconstruction

CVPR 2019
0
citations

Generating Multiple Hypotheses for 3D Human Pose Estimation With Mixture Density Network

CVPR 2019
0
citations

Viewport Proposal CNN for 360° Video Quality Assessment

CVPR 2019
0
citations

From Synthetic to Real: Unsupervised Domain Adaptation for Animal Pose Estimation

CVPR 2021
0
citations

Distribution Consistent Neural Architecture Search

CVPR 2022
0
citations

Computing Wasserstein-p Distance Between Images With Linear Cost

CVPR 2022
0
citations

DLFormer: Discrete Latent Transformer for Video Inpainting

CVPR 2022
0
citations

ScarceNet: Animal Pose Estimation With Scarce Annotations

CVPR 2023
0
citations

NeRF-DS: Neural Radiance Fields for Dynamic Specular Objects

CVPR 2023
0
citations

Weak-Shot Object Detection Through Mutual Knowledge Transfer

CVPR 2023
0
citations

Efficient Diffusion Training via Min-SNR Weighting Strategy

ICCV 2023
0
citations

DETR Does Not Need Multi-Scale or Locality Design

ICCV 2023
0
citations

All in Tokens: Unifying Output Space of Visual Tasks via Soft Token

ICCV 2023
0
citations

Unleashing the Potential of Spiking Neural Networks with Dynamic Confidence

ICCV 2023
0
citations

Weakly-supervised 3D Pose Transfer with Keypoints

ICCV 2023
0
citations

A Simple Approach and Benchmark for 21,000-Category Object Detection

ECCV 2022
0
citations

Hierarchical Feature Embedding for Visual Tracking

ECCV 2022
0
citations

Overcoming the Trade-Off Between Accuracy and Plausibility in 3D Hand Shape Reconstruction

CVPR 2023
0
citations

Stacking Brick by Brick: Aligned Feature Isolation for Incremental Face Forgery Detection

CVPR 2025
0
citations

WeGen: A Unified Model for Interactive Multimodal Generation as We Chat

CVPR 2025
0
citations

Reconstructing Close Human Interaction with Appearance and Proxemics Reasoning

CVPR 2025
0
citations

Unleashing High-Quality Image Generation in Diffusion Sampling Using Second-Order Levenberg-Marquardt-Langevin

ICCV 2025
0
citations

Learning Efficient and Generalizable Human Representation with Human Gaussian Model

ICCV 2025
0
citations

Coarse-to-fine Animal Pose and Shape Estimation

NeurIPS 2021
0
citations

GNeSF: Generalizable Neural Semantic Fields

NeurIPS 2023
0
citations

Formulating Discrete Probability Flow Through Optimal Transport

NeurIPS 2023
0
citations