Andrea Zanette
11 Papers
3 Total Citations
Papers (11)
Accelerating Unbiased LLM Evaluation via Synthetic Feedback
ICML 2025
3 citations
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL
ICML 2024
0 citations
Limiting Extrapolation in Linear Approximate Value Iteration
NeurIPS 2019
0 citations
Almost Horizon-Free Structure-Aware Best Policy Identification with a Generative Model
NeurIPS 2019
0 citations
Provably Efficient Reward-Agnostic Navigation with Linear Value Iteration
NeurIPS 2020
0 citations
Provable Benefits of Actor-Critic Methods for Offline Reinforcement Learning
NeurIPS 2021
0 citations
Design of Experiments for Stochastic Contextual Linear Bandits
NeurIPS 2021
0 citations
Bellman Residual Orthogonalization for Offline Reinforcement Learning
NeurIPS 2022
0 citations
Policy Finetuning in Reinforcement Learning via Design of Experiments using Offline Data
NeurIPS 2023
0 citations
Problem Dependent Reinforcement Learning Bounds Which Can Identify Bandit Structure in MDPs
ICML 2018
0 citations
Tighter Problem-Dependent Regret Bounds in Reinforcement Learning without Domain Knowledge using Value Function Bounds
ICML 2019
0 citations