Lei Li
49
Papers
1,388
Total Citations
4
Affiliations
Affiliations
Peking UniversityThe University of Hong KongUniversity of VirginiaCarnegie Mellon University
Papers (49)
Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
CVPR 2025
858
citations
Provable Robust Watermarking for AI-Generated Text
ICLR 2024
271
citations
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark for Large Language Models
ICLR 2025
135
citations
VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models
ECCV 2024
49
citations
GenZI: Zero-Shot 3D Human-Scene Interaction Generation
CVPR 2024
36
citations
Temporal Reasoning Transfer from Text to Video
ICLR 2025arXiv
20
citations
3D Neural Edge Reconstruction
CVPR 2024
13
citations
Position-Aware Guided Point Cloud Completion with CLIP Model
AAAI 2025
2
citations
Leveraging Local Patch Alignment to Seam-cutting for Large Parallax Image Stitching
ICCV 2025
2
citations
Wav2Sem: Plug-and-Play Audio Semantic Decoupling for 3D Speech-Driven Facial Animation
CVPR 2025
2
citations
An Efficient and Accurate Dynamic Sparse Training Framework Based on Parameter-Freezing
AAAI 2025
0
citations
Ada-Retrieval: An Adaptive Multi-Round Retrieval Paradigm for Sequential Recommendations
AAAI 2024arXiv
0
citations
PPDiff: Diffusing in Hybrid Sequence-Structure Space for Protein-Protein Complex Design
ICML 2025
0
citations
DIS-CO: Discovering Copyrighted Content in VLMs Training Data
ICML 2025
0
citations
DE-COP: Detecting Copyrighted Content in Language Models Training Data
ICML 2024
0
citations
Sparse Model Inversion: Efficient Inversion of Vision Transformers for Data-Free Applications
ICML 2024
0
citations
SurfPro: Functional Protein Design Based on Continuous Surface
ICML 2024
0
citations
Generative Enzyme Design Guided by Functionally Important Sites and Small-Molecule Substrates
ICML 2024
0
citations
Unified Visual-Semantic Embeddings: Bridging Vision and Language With Structured Meaning Representations
CVPR 2019
0
citations
End-to-End Learning Local Multi-View Descriptors for 3D Point Clouds
CVPR 2020arXiv
0
citations
PointDSC: Robust Point Cloud Registration Using Deep Spatial Consistency
CVPR 2021arXiv
0
citations
Sparse R-CNN: End-to-End Object Detection With Learnable Proposals
CVPR 2021
0
citations
Scale-Aware Automatic Augmentation for Object Detection
CVPR 2021arXiv
0
citations
Locate Then Segment: A Strong Pipeline for Referring Image Segmentation
CVPR 2021arXiv
0
citations
Dense Contrastive Learning for Self-Supervised Visual Pre-Training
CVPR 2021arXiv
0
citations
Progressive Domain Expansion Network for Single Domain Generalization
CVPR 2021arXiv
0
citations
Generalizable Local Feature Pre-Training for Deformable Shape Analysis
CVPR 2023arXiv
0
citations
VaTeX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research
ICCV 2019
0
citations
SVD: A Large-Scale Short Video Dataset for Near-Duplicate Video Retrieval
ICCV 2019
0
citations
SOLO: Segmenting Objects by Locations
ECCV 2020
0
citations
Human Motion Instruction Tuning
CVPR 2025
0
citations
VL-RewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models
CVPR 2025
0
citations
CountLLM: Towards Generalizable Repetitive Action Counting via Large Language Model
CVPR 2025
0
citations
MeshArt: Generating Articulated Meshes with Structure-Guided Transformers
CVPR 2025
0
citations
LT3SD: Latent Trees for 3D Scene Diffusion
CVPR 2025
0
citations
Is Meta-Learning Out? Rethinking Unsupervised Few-Shot Classification with Limited Entropy
ICCV 2025
0
citations
DiffuMatch: Category-Agnostic Spectral Diffusion Priors for Robust Non-rigid Shape Matching
ICCV 2025
0
citations
MeshPad: Interactive Sketch-Conditioned Artist-Reminiscent Mesh Generation and Editing
ICCV 2025
0
citations
AU-Blendshape for Fine-grained Stylized 3D Facial Expression Manipulation
ICCV 2025
0
citations
To Supervise or Not to Supervise: Understanding and Addressing the Key Challenges of Point Cloud Transfer Learning
ECCV 2024arXiv
0
citations
BRITS: Bidirectional Recurrent Imputation for Time Series
NeurIPS 2018
0
citations
Kernelized Bayesian Softmax for Text Generation
NeurIPS 2019
0
citations
SOLOv2: Dynamic and Fast Instance Segmentation
NeurIPS 2020
0
citations
Duplex Sequence-to-Sequence Learning for Reversible Machine Translation
NeurIPS 2021
0
citations
Decoupling Knowledge from Memorization: Retrieval-augmented Prompt Learning
NeurIPS 2022
0
citations
Learning Multi-resolution Functional Maps with Spectral Attention for Robust Shape Matching
NeurIPS 2022
0
citations
Statistical Knowledge Assessment for Large Language Models
NeurIPS 2023arXiv
0
citations
ALGO: Synthesizing Algorithmic Programs with Generated Oracle Verifiers
NeurIPS 2023
0
citations
FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation
NeurIPS 2023
0
citations