Weizhu Chen
12
Papers
415
Total Citations
Papers (12)
LoftQ: LoRA-Fine-Tuning-aware Quantization for Large Language Models
ICLR 2024 (arXiv)
194 citations
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
ICLR 2025 (arXiv)
115 citations
Key-Point-Driven Data Synthesis with Its Enhancement on Mathematical Reasoning
AAAI 2025 (arXiv)
63 citations
Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective
ICLR 2024 (arXiv)
16 citations
SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning
NeurIPS 2025 (arXiv)
14 citations
MTL-LoRA: Low-Rank Adaptation for Multi-Task Learning
AAAI 2025 (arXiv)
13 citations
Scalable Learning to Optimize: A Learned Optimizer Can Train Big Models
ECCV 2022
0 citations
Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
NeurIPS 2021 (arXiv)
0 citations
Meet in the Middle: A New Pre-training Paradigm
NeurIPS 2023 (arXiv)
0 citations
In-Context Learning Unlocked for Diffusion Models
NeurIPS 2023 (arXiv)
0 citations
AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation
NeurIPS 2023 (arXiv)
0 citations
Patch Diffusion: Faster and More Data-Efficient Training of Diffusion Models
NeurIPS 2023 (arXiv)
0 citations