Aldo Pacchiano
19 Papers
8 Total Citations
Papers (19)
Second Order Bounds for Contextual Bandits with Function Approximation
ICLR 2025, arXiv
7 citations
Multiple-policy Evaluation via Density Estimation
ICML 2025, arXiv
1 citation
Principled Fine-tuning of LLMs from User-Edits: A Medley of Preference, Supervision, and Reward
NeurIPS 2025, arXiv
0 citations
Provable Interactive Learning with Hindsight Instruction Feedback
ICML 2024, arXiv
0 citations
Ridge Rider: Finding Diverse Solutions by Following Eigenvectors of the Hessian
NeurIPS 2020, arXiv
0 citations
Model Selection in Contextual Stochastic Bandit Problems
NeurIPS 2020, arXiv
0 citations
Effective Diversity in Population Based Reinforcement Learning
NeurIPS 2020, arXiv
0 citations
Near Optimal Policy Optimization via REPS
NeurIPS 2021, arXiv
0 citations
On the Theory of Reinforcement Learning with Once-per-Episode Feedback
NeurIPS 2021, arXiv
0 citations
Neural Pseudo-Label Optimism for the Bank Loan Problem
NeurIPS 2021, arXiv
0 citations
Tactical Optimism and Pessimism for Deep Reinforcement Learning
NeurIPS 2021, arXiv
0 citations
Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection
NeurIPS 2021, arXiv
0 citations
Best of Both Worlds Model Selection
NeurIPS 2022, arXiv
0 citations
Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity
NeurIPS 2022, arXiv
0 citations
Learning General World Models in a Handful of Reward-Free Deployments
NeurIPS 2022, arXiv
0 citations
Experiment Planning with Function Approximation
NeurIPS 2023, arXiv
0 citations
Anytime Model Selection in Linear Bandits
NeurIPS 2023, arXiv
0 citations
Supervised Pretraining Can Learn In-Context Reinforcement Learning
NeurIPS 2023, arXiv
0 citations
A Unified Model and Dimension for Interactive Estimation
NeurIPS 2023, arXiv
0 citations