Nathan Kallus
32
Papers
57
Total Citations
Papers (32)
Provable Offline Preference-Based Reinforcement Learning
ICLR 2024
39
citations
$Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training
NeurIPS 2025
10
citations
Value-Guided Search for Efficient Chain-of-Thought Reasoning
NeurIPS 2025
7
citations
GST-UNet: A Neural Framework for Spatiotemporal Causal Inference with Time-Varying Confounding
NeurIPS 2025
1
citations
Estimating Structural Disparities for Face Models
CVPR 2022arXiv
0
citations
Peeking with PEAK: Sequential, Nonparametric Composite Hypothesis Tests for Means of Multiple Data Streams
ICML 2024
0
citations
Inferring the Long-Term Causal Effects of Long-Term Treatments from Short-Term Experiments
ICML 2024
0
citations
More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning
ICML 2024
0
citations
Switching the Loss Reduces the Cost in Batch Reinforcement Learning
ICML 2024
0
citations
Assessing Disparate Impact of Personalized Interventions: Identifiability and Bounds
NeurIPS 2019
0
citations
Doubly Robust Off-Policy Value and Gradient Estimation for Deterministic Policies
NeurIPS 2020
0
citations
Confounding-Robust Policy Evaluation in Infinite-Horizon Reinforcement Learning
NeurIPS 2020
0
citations
Control Variates for Slate Off-Policy Evaluation
NeurIPS 2021
0
citations
Risk Minimization from Adaptively Collected Data: Guarantees for Supervised and Policy Learning
NeurIPS 2021
0
citations
Post-Contextual-Bandit Inference
NeurIPS 2021
0
citations
Provably Efficient Reinforcement Learning in Partially Observable Dynamical Systems
NeurIPS 2022
0
citations
What's the Harm? Sharp Bounds on the Fraction Negatively Affected by Treatment
NeurIPS 2022
0
citations
The Implicit Delta Method
NeurIPS 2022
0
citations
The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning
NeurIPS 2023
0
citations
Offline Minimax Soft-Q-learning Under Realizability and Partial Coverage
NeurIPS 2023
0
citations
Future-Dependent Value-Based Off-Policy Evaluation in POMDPs
NeurIPS 2023
0
citations
Recursive Partitioning for Personalization using Observational Data
ICML 2017
0
citations
Residual Unfairness in Fair Machine Learning from Prejudiced Data
ICML 2018
0
citations
Classifying Treatment Responders Under Causal Effect Monotonicity
ICML 2019
0
citations
Confounding-Robust Policy Improvement
NeurIPS 2018
0
citations
Removing Hidden Confounding by Experimental Grounding
NeurIPS 2018
0
citations
Balanced Policy Evaluation and Learning
NeurIPS 2018
0
citations
Causal Inference with Noisy and Missing Covariates via Matrix Factorization
NeurIPS 2018
0
citations
Deep Generalized Method of Moments for Instrumental Variable Analysis
NeurIPS 2019
0
citations
Intrinsically Efficient, Stable, and Bounded Off-Policy Evaluation for Reinforcement Learning
NeurIPS 2019
0
citations
The Fairness of Risk Scores Beyond Classification: Bipartite Ranking and the XAUC Metric
NeurIPS 2019
0
citations
Policy Evaluation with Latent Confounders via Optimal Balance
NeurIPS 2019
0
citations