Ze Liu
45
Papers
1,367
Total Citations
Papers (45)
SpinQuant: LLM Quantization with Learned Rotations
ICLR 2025arXiv
248
citations
Advancing LLM Reasoning Generalists with Preference Trees
ICLR 2025arXiv
179
citations
VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents
ICLR 2025arXiv
121
citations
MMTEB: Massive Multilingual Text Embedding Benchmark
ICLR 2025arXiv
74
citations
APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay
NeurIPS 2025arXiv
71
citations
TC4D: Trajectory-Conditioned Text-to-4D Generation
ECCV 2024arXiv
64
citations
Large Motion Model for Unified Multi-Modal Motion Generation
ECCV 2024arXiv
61
citations
Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering
ECCV 2024arXiv
53
citations
ComboVerse: Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance
ECCV 2024arXiv
45
citations
Video World Models with Long-term Spatial Memory
NeurIPS 2025arXiv
41
citations
On the expressiveness and spectral bias of KANs
ICLR 2025arXiv
40
citations
Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents
ICLR 2025arXiv
39
citations
Scaling RL to Long Videos
NeurIPS 2025arXiv
38
citations
Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective
NeurIPS 2025arXiv
35
citations
FairMT-Bench: Benchmarking Fairness for Multi-turn Dialogue in Conversational LLMs
ICLR 2025arXiv
35
citations
Fast-in-Slow: A Dual-System VLA Model Unifying Fast Manipulation within Slow Reasoning
NeurIPS 2025
27
citations
Multi-Agent Collaboration via Evolving Orchestration
NeurIPS 2025arXiv
25
citations
Probe before You Talk: Towards Black-box Defense against Backdoor Unalignment for Large Language Models
ICLR 2025arXiv
21
citations
Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets
ICLR 2025arXiv
19
citations
Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion
ICLR 2025arXiv
18
citations
Image-level Memorization Detection via Inversion-based Inference Perturbation
ICLR 2025
15
citations
ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer
ECCV 2024arXiv
13
citations
IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation
ECCV 2024arXiv
11
citations
Unleashing Hour-Scale Video Training for Long Video-Language Understanding
NeurIPS 2025arXiv
10
citations
Node-Time Conditional Prompt Learning in Dynamic Graphs
ICLR 2025arXiv
9
citations
VLA-OS: Structuring and Dissecting Planning Representations and Paradigms in Vision-Language-Action Models
NeurIPS 2025arXiv
8
citations
Bridging the Gap between Database Search and \emph{De Novo} Peptide Sequencing with SearchNovo
ICLR 2025
6
citations
Compliant Residual DAgger: Improving Real-World Contact-Rich Manipulation with Human Corrections
NeurIPS 2025arXiv
5
citations
SMI-Editor: Edit-based SMILES Language Model with Fragment-level Supervision
ICLR 2025arXiv
5
citations
ParamMute: Suppressing Knowledge-Critical FFNs for Faithful Retrieval-Augmented Generation
NeurIPS 2025arXiv
4
citations
Towards A Generalist Code Embedding Model Based On Massive Data Synthesis
NeurIPS 2025arXiv
4
citations
The Overthinker's DIET: Cutting Token Calories with DIfficulty-AwarE Training
NeurIPS 2025arXiv
4
citations
Sparse Refinement for Efficient High-Resolution Semantic Segmentation
ECCV 2024arXiv
3
citations
ArchCAD-400K: A Large-Scale CAD drawings Dataset and New Baseline for Panoptic Symbol Spotting
NeurIPS 2025arXiv
2
citations
VideoLucy: Deep Memory Backtracking for Long Video Understanding
NeurIPS 2025arXiv
2
citations
SEBRA : Debiasing through Self-Guided Bias Ranking
ICLR 2025arXiv
2
citations
Hallucination at a Glance: Controlled Visual Edits and Fine-Grained Multimodal Learning
NeurIPS 2025arXiv
2
citations
PRING: Rethinking Protein-Protein Interaction Prediction from Pairs to Graphs
NeurIPS 2025arXiv
2
citations
Nabla-R2D3: Effective and Efficient 3D Diffusion Alignment with 2D Rewards
NeurIPS 2025arXiv
2
citations
MomentSeeker: A Task-Oriented Benchmark For Long-Video Moment Retrieval
NeurIPS 2025arXiv
2
citations
A Secure Image Watermarking Framework with Statistical Guarantees via Adversarial Attacks on Secret Key Networks
ECCV 2024
1
citations
Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time
ECCV 2024arXiv
1
citations
HetSyn: Versatile Timescale Integration in Spiking Neural Networks via Heterogeneous Synapses
NeurIPS 2025arXiv
0
citations
Luminance-Aware Statistical Quantization: Unsupervised Hierarchical Learning for Illumination Enhancement
NeurIPS 2025arXiv
0
citations
DetectiumFire: A Comprehensive Multi-modal Dataset Bridging Vision and Language for Fire Understanding
NeurIPS 2025arXiv
0
citations