Nathan Kallus
8
Papers
57
Total Citations
Papers (8)
Provable Offline Preference-Based Reinforcement Learning
ICLR 2024
39
citations
$Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training
NeurIPS 2025, arXiv
10
citations
Value-Guided Search for Efficient Chain-of-Thought Reasoning
NeurIPS 2025
7
citations
GST-UNet: A Neural Framework for Spatiotemporal Causal Inference with Time-Varying Confounding
NeurIPS 2025
1
citation
Switching the Loss Reduces the Cost in Batch Reinforcement Learning
ICML 2024
0
citations
Peeking with PEAK: Sequential, Nonparametric Composite Hypothesis Tests for Means of Multiple Data Streams
ICML 2024
0
citations
Inferring the Long-Term Causal Effects of Long-Term Treatments from Short-Term Experiments
ICML 2024
0
citations
More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning
ICML 2024
0
citations