Nathan Kallus

32
Papers
57
Total Citations

Papers (32)

Provable Offline Preference-Based Reinforcement Learning

ICLR 2024
39
citations

$Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training

NeurIPS 2025
10
citations

Value-Guided Search for Efficient Chain-of-Thought Reasoning

NeurIPS 2025
7
citations

GST-UNet: A Neural Framework for Spatiotemporal Causal Inference with Time-Varying Confounding

NeurIPS 2025
1
citations

Estimating Structural Disparities for Face Models

CVPR 2022arXiv
0
citations

Peeking with PEAK: Sequential, Nonparametric Composite Hypothesis Tests for Means of Multiple Data Streams

ICML 2024
0
citations

Inferring the Long-Term Causal Effects of Long-Term Treatments from Short-Term Experiments

ICML 2024
0
citations

More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning

ICML 2024
0
citations

Switching the Loss Reduces the Cost in Batch Reinforcement Learning

ICML 2024
0
citations

Assessing Disparate Impact of Personalized Interventions: Identifiability and Bounds

NeurIPS 2019
0
citations

Doubly Robust Off-Policy Value and Gradient Estimation for Deterministic Policies

NeurIPS 2020
0
citations

Confounding-Robust Policy Evaluation in Infinite-Horizon Reinforcement Learning

NeurIPS 2020
0
citations

Control Variates for Slate Off-Policy Evaluation

NeurIPS 2021
0
citations

Risk Minimization from Adaptively Collected Data: Guarantees for Supervised and Policy Learning

NeurIPS 2021
0
citations

Post-Contextual-Bandit Inference

NeurIPS 2021
0
citations

Provably Efficient Reinforcement Learning in Partially Observable Dynamical Systems

NeurIPS 2022
0
citations

What's the Harm? Sharp Bounds on the Fraction Negatively Affected by Treatment

NeurIPS 2022
0
citations

The Implicit Delta Method

NeurIPS 2022
0
citations

The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning

NeurIPS 2023
0
citations

Offline Minimax Soft-Q-learning Under Realizability and Partial Coverage

NeurIPS 2023
0
citations

Future-Dependent Value-Based Off-Policy Evaluation in POMDPs

NeurIPS 2023
0
citations

Recursive Partitioning for Personalization using Observational Data

ICML 2017
0
citations

Residual Unfairness in Fair Machine Learning from Prejudiced Data

ICML 2018
0
citations

Classifying Treatment Responders Under Causal Effect Monotonicity

ICML 2019
0
citations

Confounding-Robust Policy Improvement

NeurIPS 2018
0
citations

Removing Hidden Confounding by Experimental Grounding

NeurIPS 2018
0
citations

Balanced Policy Evaluation and Learning

NeurIPS 2018
0
citations

Causal Inference with Noisy and Missing Covariates via Matrix Factorization

NeurIPS 2018
0
citations

Deep Generalized Method of Moments for Instrumental Variable Analysis

NeurIPS 2019
0
citations

Intrinsically Efficient, Stable, and Bounded Off-Policy Evaluation for Reinforcement Learning

NeurIPS 2019
0
citations

The Fairness of Risk Scores Beyond Classification: Bipartite Ranking and the XAUC Metric

NeurIPS 2019
0
citations

Policy Evaluation with Latent Confounders via Optimal Balance

NeurIPS 2019
0
citations