Zhu

23
Papers
851
Total Citations

Papers (23)

MobileNetV4: Universal Models for the Mobile Ecosystem

ECCV 2024arXiv
407
citations

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

NeurIPS 2025arXiv
118
citations

The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning

NeurIPS 2025arXiv
74
citations

ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation

ICLR 2025arXiv
65
citations

Building Interactable Replicas of Complex Articulated Objects via Gaussian Splatting

ICLR 2025arXiv
38
citations

Merging LoRAs like Playing LEGO: Pushing the Modularity of LoRA to Extremes Through Rank-Wise Clustering

ICLR 2025arXiv
33
citations

$\text{D}_{2}\text{O}$: Dynamic Discriminative Operations for Efficient Long-Context Inference of Large Language Models

ICLR 2025
22
citations

SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training

ICLR 2025arXiv
15
citations

EmoTalk3D: High-Fidelity Free-View Synthesis of Emotional 3D Talking Head

ECCV 2024arXiv
14
citations

OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models

ECCV 2024arXiv
13
citations

Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging

NeurIPS 2025arXiv
13
citations

NetMoE: Accelerating MoE Training through Dynamic Sample Placement

ICLR 2025
11
citations

UniCoTT: A Unified Framework for Structural Chain-of-Thought Distillation

ICLR 2025
7
citations

RTV-Bench: Benchmarking MLLM Continuous Perception, Understanding and Reasoning through Real-Time Video

NeurIPS 2025arXiv
5
citations

WebRPG: Automatic Web Rendering Parameters Generation for Visual Presentation

ECCV 2024arXiv
5
citations

DexFlyWheel: A Scalable and Self-improving Data Generation Framework for Dexterous Manipulation

NeurIPS 2025arXiv
3
citations

SEBRA : Debiasing through Self-Guided Bias Ranking

ICLR 2025arXiv
2
citations

Scaling Instruction-tuned LLMs to Million-token Contexts via Hierarchical Synthetic Data Generation

ICLR 2025arXiv
2
citations

Rotated Orthographic Projection for Self-Supervised 3D Human Pose Estimation

ECCV 2024
2
citations

VimoRAG: Video-based Retrieval-augmented 3D Motion Generation for Motion Language Models

NeurIPS 2025arXiv
1
citations

Blackbox Model Provenance via Palimpsestic Membership Inference

NeurIPS 2025arXiv
1
citations

AANet: Virtual Screening under Structural Uncertainty via Alignment and Aggregation

NeurIPS 2025arXiv
0
citations

Collaborating Vision, Depth, and Thermal Signals for Multi-Modal Tracking: Dataset and Algorithm

NeurIPS 2025arXiv
0
citations