49
Papers
1,388
Total Citations
4
Affiliations

Affiliations

Peking UniversityThe University of Hong KongUniversity of VirginiaCarnegie Mellon University

Papers (49)

Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

CVPR 2025
858
citations

Provable Robust Watermarking for AI-Generated Text

ICLR 2024
271
citations

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark for Large Language Models

ICLR 2025
135
citations

VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models

ECCV 2024
49
citations

GenZI: Zero-Shot 3D Human-Scene Interaction Generation

CVPR 2024
36
citations

Temporal Reasoning Transfer from Text to Video

ICLR 2025arXiv
20
citations

3D Neural Edge Reconstruction

CVPR 2024
13
citations

Position-Aware Guided Point Cloud Completion with CLIP Model

AAAI 2025
2
citations

Leveraging Local Patch Alignment to Seam-cutting for Large Parallax Image Stitching

ICCV 2025
2
citations

Wav2Sem: Plug-and-Play Audio Semantic Decoupling for 3D Speech-Driven Facial Animation

CVPR 2025
2
citations

An Efficient and Accurate Dynamic Sparse Training Framework Based on Parameter-Freezing

AAAI 2025
0
citations

Ada-Retrieval: An Adaptive Multi-Round Retrieval Paradigm for Sequential Recommendations

AAAI 2024arXiv
0
citations

PPDiff: Diffusing in Hybrid Sequence-Structure Space for Protein-Protein Complex Design

ICML 2025
0
citations

DIS-CO: Discovering Copyrighted Content in VLMs Training Data

ICML 2025
0
citations

DE-COP: Detecting Copyrighted Content in Language Models Training Data

ICML 2024
0
citations

Sparse Model Inversion: Efficient Inversion of Vision Transformers for Data-Free Applications

ICML 2024
0
citations

SurfPro: Functional Protein Design Based on Continuous Surface

ICML 2024
0
citations

Generative Enzyme Design Guided by Functionally Important Sites and Small-Molecule Substrates

ICML 2024
0
citations

Unified Visual-Semantic Embeddings: Bridging Vision and Language With Structured Meaning Representations

CVPR 2019
0
citations

End-to-End Learning Local Multi-View Descriptors for 3D Point Clouds

CVPR 2020arXiv
0
citations

PointDSC: Robust Point Cloud Registration Using Deep Spatial Consistency

CVPR 2021arXiv
0
citations

Sparse R-CNN: End-to-End Object Detection With Learnable Proposals

CVPR 2021
0
citations

Scale-Aware Automatic Augmentation for Object Detection

CVPR 2021arXiv
0
citations

Locate Then Segment: A Strong Pipeline for Referring Image Segmentation

CVPR 2021arXiv
0
citations

Dense Contrastive Learning for Self-Supervised Visual Pre-Training

CVPR 2021arXiv
0
citations

Progressive Domain Expansion Network for Single Domain Generalization

CVPR 2021arXiv
0
citations

Generalizable Local Feature Pre-Training for Deformable Shape Analysis

CVPR 2023arXiv
0
citations

VaTeX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research

ICCV 2019
0
citations

SVD: A Large-Scale Short Video Dataset for Near-Duplicate Video Retrieval

ICCV 2019
0
citations

SOLO: Segmenting Objects by Locations

ECCV 2020
0
citations

Human Motion Instruction Tuning

CVPR 2025
0
citations

VL-RewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models

CVPR 2025
0
citations

CountLLM: Towards Generalizable Repetitive Action Counting via Large Language Model

CVPR 2025
0
citations

MeshArt: Generating Articulated Meshes with Structure-Guided Transformers

CVPR 2025
0
citations

LT3SD: Latent Trees for 3D Scene Diffusion

CVPR 2025
0
citations

Is Meta-Learning Out? Rethinking Unsupervised Few-Shot Classification with Limited Entropy

ICCV 2025
0
citations

DiffuMatch: Category-Agnostic Spectral Diffusion Priors for Robust Non-rigid Shape Matching

ICCV 2025
0
citations

MeshPad: Interactive Sketch-Conditioned Artist-Reminiscent Mesh Generation and Editing

ICCV 2025
0
citations

AU-Blendshape for Fine-grained Stylized 3D Facial Expression Manipulation

ICCV 2025
0
citations

To Supervise or Not to Supervise: Understanding and Addressing the Key Challenges of Point Cloud Transfer Learning

ECCV 2024arXiv
0
citations

BRITS: Bidirectional Recurrent Imputation for Time Series

NeurIPS 2018
0
citations

Kernelized Bayesian Softmax for Text Generation

NeurIPS 2019
0
citations

SOLOv2: Dynamic and Fast Instance Segmentation

NeurIPS 2020
0
citations

Duplex Sequence-to-Sequence Learning for Reversible Machine Translation

NeurIPS 2021
0
citations

Decoupling Knowledge from Memorization: Retrieval-augmented Prompt Learning

NeurIPS 2022
0
citations

Learning Multi-resolution Functional Maps with Spectral Attention for Robust Shape Matching

NeurIPS 2022
0
citations

Statistical Knowledge Assessment for Large Language Models

NeurIPS 2023arXiv
0
citations

ALGO: Synthesizing Algorithmic Programs with Generated Oracle Verifiers

NeurIPS 2023
0
citations

FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation

NeurIPS 2023
0
citations