Sanjeev Arora
Papers: 7
Total Citations: 78
Papers (7)
Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization
ICLR 2025 (arXiv)
47 citations
Instruct-SkillMix: A Powerful Pipeline for LLM Instruction Tuning
ICLR 2025 (arXiv)
18 citations
Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?
ICML 2025
9 citations
A Quadratic Synchronization Rule for Distributed Deep Learning
ICLR 2024
4 citations
Trainable Transformer in Transformer
ICML 2024
0 citations
LESS: Selecting Influential Data for Targeted Instruction Tuning
ICML 2024
0 citations
Language Models as Science Tutors
ICML 2024
0 citations