Ze Liu

45
Papers
1,367
Total Citations

Papers (45)

SpinQuant: LLM Quantization with Learned Rotations

ICLR 2025 · arXiv
248
citations

Advancing LLM Reasoning Generalists with Preference Trees

ICLR 2025 · arXiv
179
citations

VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents

ICLR 2025 · arXiv
121
citations

MMTEB: Massive Multilingual Text Embedding Benchmark

ICLR 2025 · arXiv
74
citations

APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay

NeurIPS 2025 · arXiv
71
citations

TC4D: Trajectory-Conditioned Text-to-4D Generation

ECCV 2024 · arXiv
64
citations

Large Motion Model for Unified Multi-Modal Motion Generation

ECCV 2024 · arXiv
61
citations

Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering

ECCV 2024 · arXiv
53
citations

ComboVerse: Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance

ECCV 2024 · arXiv
45
citations

Video World Models with Long-term Spatial Memory

NeurIPS 2025 · arXiv
41
citations

On the expressiveness and spectral bias of KANs

ICLR 2025 · arXiv
40
citations

Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents

ICLR 2025 · arXiv
39
citations

Scaling RL to Long Videos

NeurIPS 2025 · arXiv
38
citations

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

NeurIPS 2025 · arXiv
35
citations

FairMT-Bench: Benchmarking Fairness for Multi-turn Dialogue in Conversational LLMs

ICLR 2025 · arXiv
35
citations

Fast-in-Slow: A Dual-System VLA Model Unifying Fast Manipulation within Slow Reasoning

NeurIPS 2025
27
citations

Multi-Agent Collaboration via Evolving Orchestration

NeurIPS 2025 · arXiv
25
citations

Probe before You Talk: Towards Black-box Defense against Backdoor Unalignment for Large Language Models

ICLR 2025 · arXiv
21
citations

Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets

ICLR 2025 · arXiv
19
citations

Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion

ICLR 2025 · arXiv
18
citations

Image-level Memorization Detection via Inversion-based Inference Perturbation

ICLR 2025
15
citations

ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer

ECCV 2024 · arXiv
13
citations

IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation

ECCV 2024 · arXiv
11
citations

Unleashing Hour-Scale Video Training for Long Video-Language Understanding

NeurIPS 2025 · arXiv
10
citations

Node-Time Conditional Prompt Learning in Dynamic Graphs

ICLR 2025 · arXiv
9
citations

VLA-OS: Structuring and Dissecting Planning Representations and Paradigms in Vision-Language-Action Models

NeurIPS 2025 · arXiv
8
citations

Bridging the Gap between Database Search and De Novo Peptide Sequencing with SearchNovo

ICLR 2025
6
citations

Compliant Residual DAgger: Improving Real-World Contact-Rich Manipulation with Human Corrections

NeurIPS 2025 · arXiv
5
citations

SMI-Editor: Edit-based SMILES Language Model with Fragment-level Supervision

ICLR 2025 · arXiv
5
citations

ParamMute: Suppressing Knowledge-Critical FFNs for Faithful Retrieval-Augmented Generation

NeurIPS 2025 · arXiv
4
citations

Towards A Generalist Code Embedding Model Based On Massive Data Synthesis

NeurIPS 2025 · arXiv
4
citations

The Overthinker's DIET: Cutting Token Calories with DIfficulty-AwarE Training

NeurIPS 2025 · arXiv
4
citations

Sparse Refinement for Efficient High-Resolution Semantic Segmentation

ECCV 2024 · arXiv
3
citations

ArchCAD-400K: A Large-Scale CAD drawings Dataset and New Baseline for Panoptic Symbol Spotting

NeurIPS 2025 · arXiv
2
citations

VideoLucy: Deep Memory Backtracking for Long Video Understanding

NeurIPS 2025 · arXiv
2
citations

SEBRA: Debiasing through Self-Guided Bias Ranking

ICLR 2025 · arXiv
2
citations

Hallucination at a Glance: Controlled Visual Edits and Fine-Grained Multimodal Learning

NeurIPS 2025 · arXiv
2
citations

PRING: Rethinking Protein-Protein Interaction Prediction from Pairs to Graphs

NeurIPS 2025 · arXiv
2
citations

Nabla-R2D3: Effective and Efficient 3D Diffusion Alignment with 2D Rewards

NeurIPS 2025 · arXiv
2
citations

MomentSeeker: A Task-Oriented Benchmark For Long-Video Moment Retrieval

NeurIPS 2025 · arXiv
2
citations

A Secure Image Watermarking Framework with Statistical Guarantees via Adversarial Attacks on Secret Key Networks

ECCV 2024
1
citation

Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time

ECCV 2024 · arXiv
1
citation

HetSyn: Versatile Timescale Integration in Spiking Neural Networks via Heterogeneous Synapses

NeurIPS 2025 · arXiv
0
citations

Luminance-Aware Statistical Quantization: Unsupervised Hierarchical Learning for Illumination Enhancement

NeurIPS 2025 · arXiv
0
citations

DetectiumFire: A Comprehensive Multi-modal Dataset Bridging Vision and Language for Fire Understanding

NeurIPS 2025 · arXiv
0
citations