28
Papers
1,388
Total Citations
4
Affiliations

Affiliations

Peking UniversityThe University of Hong KongUniversity of VirginiaCarnegie Mellon University

Papers (28)

Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

CVPR 2025
858
citations

Provable Robust Watermarking for AI-Generated Text

ICLR 2024
271
citations

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark for Large Language Models

ICLR 2025
135
citations

VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models

ECCV 2024
49
citations

GenZI: Zero-Shot 3D Human-Scene Interaction Generation

CVPR 2024
36
citations

Temporal Reasoning Transfer from Text to Video

ICLR 2025arXiv
20
citations

3D Neural Edge Reconstruction

CVPR 2024
13
citations

Leveraging Local Patch Alignment to Seam-cutting for Large Parallax Image Stitching

ICCV 2025arXiv
2
citations

Position-Aware Guided Point Cloud Completion with CLIP Model

AAAI 2025
2
citations

Wav2Sem: Plug-and-Play Audio Semantic Decoupling for 3D Speech-Driven Facial Animation

CVPR 2025
2
citations

An Efficient and Accurate Dynamic Sparse Training Framework Based on Parameter-Freezing

AAAI 2025
0
citations

Ada-Retrieval: An Adaptive Multi-Round Retrieval Paradigm for Sequential Recommendations

AAAI 2024arXiv
0
citations

PPDiff: Diffusing in Hybrid Sequence-Structure Space for Protein-Protein Complex Design

ICML 2025
0
citations

DIS-CO: Discovering Copyrighted Content in VLMs Training Data

ICML 2025
0
citations

DE-COP: Detecting Copyrighted Content in Language Models Training Data

ICML 2024
0
citations

Sparse Model Inversion: Efficient Inversion of Vision Transformers for Data-Free Applications

ICML 2024
0
citations

SurfPro: Functional Protein Design Based on Continuous Surface

ICML 2024
0
citations

LT3SD: Latent Trees for 3D Scene Diffusion

CVPR 2025
0
citations

Generative Enzyme Design Guided by Functionally Important Sites and Small-Molecule Substrates

ICML 2024
0
citations

VL-RewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models

CVPR 2025
0
citations

CountLLM: Towards Generalizable Repetitive Action Counting via Large Language Model

CVPR 2025
0
citations

MeshArt: Generating Articulated Meshes with Structure-Guided Transformers

CVPR 2025
0
citations

Human Motion Instruction Tuning

CVPR 2025
0
citations

Is Meta-Learning Out? Rethinking Unsupervised Few-Shot Classification with Limited Entropy

ICCV 2025
0
citations

DiffuMatch: Category-Agnostic Spectral Diffusion Priors for Robust Non-rigid Shape Matching

ICCV 2025
0
citations

MeshPad: Interactive Sketch-Conditioned Artist-Reminiscent Mesh Generation and Editing

ICCV 2025
0
citations

AU-Blendshape for Fine-grained Stylized 3D Facial Expression Manipulation

ICCV 2025
0
citations

To Supervise or Not to Supervise: Understanding and Addressing the Key Challenges of Point Cloud Transfer Learning

ECCV 2024arXiv
0
citations