Aldo Pacchiano

19
Papers
8
Total Citations

Papers (19)

Second Order Bounds for Contextual Bandits with Function Approximation

ICLR 2025arXiv
7
citations

Multiple-policy Evaluation via Density Estimation

ICML 2025arXiv
1
citations

Principled Fine-tuning of LLMs from User-Edits: A Medley of Preference, Supervision, and Reward

NeurIPS 2025arXiv
0
citations

Provable Interactive Learning with Hindsight Instruction Feedback

ICML 2024arXiv
0
citations

Ridge Rider: Finding Diverse Solutions by Following Eigenvectors of the Hessian

NeurIPS 2020arXiv
0
citations

Model Selection in Contextual Stochastic Bandit Problems

NeurIPS 2020arXiv
0
citations

Effective Diversity in Population Based Reinforcement Learning

NeurIPS 2020arXiv
0
citations

Near Optimal Policy Optimization via REPS

NeurIPS 2021arXiv
0
citations

On the Theory of Reinforcement Learning with Once-per-Episode Feedback

NeurIPS 2021arXiv
0
citations

Neural Pseudo-Label Optimism for the Bank Loan Problem

NeurIPS 2021arXiv
0
citations

Tactical Optimism and Pessimism for Deep Reinforcement Learning

NeurIPS 2021arXiv
0
citations

Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection

NeurIPS 2021arXiv
0
citations

Best of Both Worlds Model Selection

NeurIPS 2022arXiv
0
citations

Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity

NeurIPS 2022arXiv
0
citations

Learning General World Models in a Handful of Reward-Free Deployments

NeurIPS 2022arXiv
0
citations

Experiment Planning with Function Approximation

NeurIPS 2023arXiv
0
citations

Anytime Model Selection in Linear Bandits

NeurIPS 2023arXiv
0
citations

Supervised Pretraining Can Learn In-Context Reinforcement Learning

NeurIPS 2023arXiv
0
citations

A Unified Model and Dimension for Interactive Estimation

NeurIPS 2023arXiv
0
citations