🧬Reinforcement Learning

Imitation Learning

Learning from demonstrations

100 papers666 total citations

Compare with other topics

Feb '24 — Jan '26266 papers

Top Conferences

ICLR: 29 CVPR: 20 AAAI: 18 ICCV: 11 NeurIPS: 9 ECCV: 6

Top Papers

#1

AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials

Yiheng Xu, Dunjie Lu, Zhennan Shen et al.

Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer

Yu Deng, Duomin Wang, Baoyuan Wang

Towards Diverse Behaviors: A Benchmark for Imitation Learning with Human Demonstrations

Xiaogang Jia, Denis Blessing, Xinkai Jiang et al.

LMAct: A Benchmark for In-Context Imitation Learning with Long Multimodal Demonstrations

Anian Ruoss, Fabio Pardo, Harris Chan et al.

eTag: Class-Incremental Learning via Embedding Distillation and Task-Oriented Generation

Libo Huang, Yan Zeng, Chuanguang Yang et al.

CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction

Zhefei Gong, Pengxiang Ding, Shangke Lyu et al.

Instant Policy: In-Context Imitation Learning via Graph Diffusion

Vitalis Vosylius, Edward Johns

Deep SE(3)-Equivariant Geometric Reasoning for Precise Placement Tasks

Ben Eisner, Yi Yang, Todor Davchev et al.

Domain Prompt Learning with Quaternion Networks

Qinglong Cao, Zhengqin Xu, Yuntian Chen et al.

DiffAIL: Diffusion Adversarial Imitation Learning

Bingzheng Wang, Guoqiang Wu, Teng Pang et al.

AAAI 2024arXiv:2312.06348

imitation learningadversarial imitation learningdiffusion modelsreward function learning+4

20

citations

#11

Motion Prior Knowledge Learning with Homogeneous Language Descriptions for Moving Infrared Small Target Detection

Shengjia Chen, Luping Ji, Weiwei Duan et al.

Are Human-generated Demonstrations Necessary for In-context Learning?

Rui Li, Guoyin Wang, Jiwei Li

ET-SEED: EFFICIENT TRAJECTORY-LEVEL SE(3) EQUIVARIANT DIFFUSION POLICY

Chenrui Tie, Yue Chen, Ruihai Wu et al.

Learning Intra-view and Cross-view Geometric Knowledge for Stereo Matching

Rui Gong, Weide Liu, ZAIWANG GU et al.

SketchINR: A First Look into Sketches as Implicit Neural Representations

Hmrishav Bandyopadhyay, Ayan Kumar Bhunia, Pinaki Nath Chowdhury et al.

RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation

Mingfei Han, Liang Ma, Kamila Zhumakhanova et al.

SkillMimic: Learning Basketball Interaction Skills from Demonstrations

Yinhuai Wang, Qihan Zhao, Runyi Yu et al.

Real Appearance Modeling for More General Deepfake Detection

Jiahe Tian, Yu Cai, Xi Wang et al.

AdaManip: Adaptive Articulated Object Manipulation Environments and Policy Learning

Yuanfei Wang, Xiaojie Zhang, Ruihai Wu et al.

ICLR 2025arXiv:2502.11124

articulated object manipulationadaptive manipulation policy3d visual diffusionimitation learning+4

12

citations

#20

Touch in the Wild: Learning Fine-Grained Manipulation with a Portable Visuo-Tactile Gripper

Xinyue Zhu, Binghao Huang, Yunzhu Li

Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning

Baoqi Pei, Yifei Huang, Jilan Xu et al.

Gazing Into Missteps: Leveraging Eye-Gaze for Unsupervised Mistake Detection in Egocentric Videos of Skilled Human Activities

Michele Mazzamuto, Antonino Furnari, Yoichi Sato et al.

Circumventing Shortcuts in Audio-visual Deepfake Detection Datasets with Unsupervised Learning

Stefan Smeu, Dragos-Alexandru Boldisor, Dan Oneata et al.

ProMotion: Prototypes As Motion Learners

Yawen Lu, Dongfang Liu, Qifan Wang et al.

BASKET: A Large-Scale Video Dataset for Fine-Grained Skill Estimation

Yulu Pan, Ce Zhang, Gedas Bertasius

ZOOM: Learning Video Mirror Detection with Extremely-Weak Supervision

Ke Xu, Tsun Wai Siu, Rynson W.H. Lau

UAV-Flow Colosseo: A Real-World Benchmark for Flying-on-a-Word UAV Imitation Learning

Xiangyu Wang, Donglin Yang, Yue Liao et al.

Task-Adaptive Saliency Guidance for Exemplar-free Class Incremental Learning

Xialei Liu, Jiang-Tian Zhai, Andrew Bagdanov et al.

Instruction-based Image Manipulation by Watching How Things Move

Mingdeng Cao, Xuaner Zhang, Yinqiang Zheng et al.

Mimic In-Context Learning for Multimodal Tasks

Yuchu Jiang, Jiale Fu, chenduo hao et al.

Efficient Active Imitation Learning with Random Network Distillation

Emilien Biré, Anthony Kobanda, Ludovic Denoyer et al.

SymDPO: Boosting In-Context Learning of Large Multimodal Models with Symbol Demonstration Direct Preference Optimization

Hongrui Jia, Chaoya Jiang, Haiyang Xu et al.

Noise-assisted Prompt Learning for Image Forgery Detection and Localization

Dong Li, Jiaying Zhu, Xueyang Fu et al.

Planning from Imagination: Episodic Simulation and Episodic Memory for Vision-and-Language Navigation

Yiyuan Pan, Yunzhe Xu, Zhe Liu et al.

DualCP: Rehearsal-Free Domain-Incremental Learning via Dual-Level Concept Prototype

Qiang Wang, Yuhang He, Songlin Dong et al.

A Smooth Sea Never Made a Skilled SAILOR: Robust Imitation via Learning to Search

Arnav Kumar Jain, Vibhakar Mohta, Subin Kim et al.

NeurIPS 2025arXiv:2506.05294

behavioral cloningimitation learningworld modelreward model+3

6

citations

#37

Bidirectional Decoding: Improving Action Chunking via Guided Test-Time Sampling

Yuejiang Liu, Jubayer Hamid, Annie Xie et al.

ICLR 2025arXiv:2408.17355

action chunkingbidirectional decodingrobot learninghuman demonstrations+3

6

citations

#38

Inverse Reinforcement Learning by Estimating Expertise of Demonstrators

Mark Beliaev, Ramtin Pedarsani

CLIP-driven View-aware Prompt Learning for Unsupervised Vehicle Re-identification

Jiyang Xu, Qi Wang, Xin Xiong et al.

Unsupervised Object Interaction Learning with Counterfactual Dynamics Models

Jongwook Choi, Sungtae Lee, Xinyu Wang et al.

GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation

Ning Gao, Yilun Chen, Shuai Yang et al.

SORREL: Suboptimal-Demonstration-Guided Reinforcement Learning for Learning to Branch

Shengyu Feng, Yiming Yang

Latent Policy Barrier: Learning Robust Visuomotor Policies by Staying In-Distribution

Zhanyi Sun, Shuran Song

PRIMAL: Physically Reactive and Interactive Motor Model for Avatar Learning

Yan Zhang, Yao Feng, Alpár Cseke et al.

Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for Reinforcement Learning

Qi Wang, Zhipeng Zhang, Baao Xie et al.

VA-AR: Learning Velocity-Aware Action Representations with Mixture of Window Attention

Jiangning Wei, Lixiong Qin, Bo Yu et al.

Channel Consistency Prior and Self-Reconstruction Strategy Based Unsupervised Image Deraining

Guanglu Dong, Tianheng Zheng, Yuanzhouhan Cao et al.

Generating Traffic Scenarios via In-Context Learning to Learn Better Motion Planner

Aizierjiang Aiersilan

Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning

Jun Li, Jinpeng Wang, Chaolei Tan et al.

Direct Post-Training Preference Alignment for Multi-Agent Motion Generation Model Using Implicit Feedback from Pre-training Demonstrations

Thomas Tian, Kratarth Goel

ICLR 2025arXiv:2503.20105

preference alignmentmotion generationmulti-agent simulationimplicit feedback+3

4

citations

#51

Reward-free World Models for Online Imitation Learning

Shangzhe Li, Zhiao Huang, Hao Su

DUEL: Duplicate Elimination on Active Memory for Self-Supervised Class-Imbalanced Learning

Won-Seok Choi, Hyundo Lee, Dong-Sig Han et al.

AAAI 2024arXiv:2402.08963

self-supervised learningclass-imbalanced learningactive memoryduplicate elimination+3

3

citations

#53

Transfer Learning of Real Image Features with Soft Contrastive Loss for Fake Image Detection

Ziyou Liang, Weifeng Liu, Run Wang et al.

HaHeAE: Learning Generalisable Joint Representations of Human Hand and Head Movements in Extended Reality

Zhiming Hu, Guanhua Zhang, Zheming Yin et al.

Morphing Tokens Draw Strong Masked Image Models

Taekyung Kim, Byeongho Heo, Dongyoon Han

A Lesson in Splats: Teacher-Guided Diffusion for 3D Gaussian Splats Generation with 2D Supervision

Chensheng Peng, Ido Sobol, Masayoshi Tomizuka et al.

Beyond-Expert Performance with Limited Demonstrations: Efficient Imitation Learning with Double Exploration

Heyang Zhao, Xingrui Yu, David Bossens et al.

AutoCGP: Closed-Loop Concept-Guided Policies from Unlabeled Demonstrations

Pei Zhou, Ruizhe Liu, Qian Luo et al.

Differentiable Rule Induction from Raw Sequence Inputs

Kun Gao, Katsumi Inoue, Yongzhi Cao et al.

Demonstration Selection for In-Context Learning via Reinforcement Learning

Xubin Wang, Jianfei Wu, Yuan Yichen et al.

PseudoMapTrainer: Learning Online Mapping without HD Maps

Christian Löwens, Thorben Funke, Jingchao Xie et al.

Imitation Learning from a Single Temporally Misaligned Video

William Huey, Yuki (Huaxiaoyue) Wang, Anne Wu et al.

Self-Reinforcing Prototype Evolution with Dual-Knowledge Cooperation for Semi-Supervised Lifelong Person Re-Identification

Kunlun Xu, Fan Zhuo, Jiangmeng Li et al.

Forecasting Bimanual Object Manipulation Sequences from Unimanual Observations

Haziq Razali, Yiannis Demiris

Markov Balance Satisfaction Improves Performance in Strictly Batch Offline Imitation Learning

Rishabh Agrawal, Nathan Dahlin, Rahul Jain et al.

S²DN: Learning to Denoise Unconvincing Knowledge for Inductive Knowledge Graph Completion

Tengfei Ma, Yujie Chen, Liang Wang et al.

Attend and Enrich: Enhanced Visual Prompt for Zero-Shot Learning

Man Liu, Huihui Bai, Feng Li et al.

RDD: Retrieval-Based Demonstration Decomposer for Planner Alignment in Long-Horizon Tasks

Mingxuan Yan, Yuping Wang, Zechun Liu et al.

TreeSBA: Tree-Transformer for Self-Supervised Sequential Brick Assembly

Mengqi GUO, Chen Li, Yuyang Zhao et al.

Learning to Build by Building Your Own Instructions

Aaron Walsman, Muru Zhang, Adam Fishman et al.

MoSiC: Optimal-Transport Motion Trajectory for Dense Self-Supervised Learning

Mohammadreza Salehi, Shashanka Venkataramanan, Ioana Simion et al.

ICCV 2025arXiv:2506.08694

self-supervised learningmotion trajectory clusteringoptimal transportdense representation learning+4

1

citations

#72

Beyond the Data Imbalance: Employing the Heterogeneous Datasets for Vehicle Maneuver Prediction

Hyeongseok Jeon, Sanmin Kim, Abi Rahman Syamil et al.

Zero-Shot Offline Imitation Learning via Optimal Transport

Thomas Rupf, Marco Bagatella, Nico Gürtler et al.

Action-Constrained Imitation Learning

Chia-Han Yeh, Tse-Sheng Nan, Risto Vuorio et al.

Joint Reward and Policy Learning with Demonstrations and Human Feedback Improves Alignment

Chenliang Li, Siliang Zeng, Zeyi Liao et al.

Imitation Beyond Expectation Using Pluralistic Stochastic Dominance

Ali Farajzadeh, Danyal Saeed, Syed M Abbas et al.

Imitation Learning from Observation with Automatic Discount Scheduling

Yuyang Liu, Weijun Dong, Yingdong Hu et al.

Memory-Consistent Neural Networks for Imitation Learning

Kaustubh Sridhar, Souradeep Dutta, Dinesh Jayaraman et al.

PN-GAIL: Leveraging Non-optimal Information from Imperfect Demonstrations

Qiang Liu, Huiqiao Fu, Kaiqiang Tang et al.

Grounding Language Plans in Demonstrations Through Counterfactual Perturbations

Yanwei Wang, Johnson (Tsun-Hsuan) Wang, Jiayuan Mao et al.

Fast Imitation via Behavior Foundation Models

Matteo Pirotta, Andrea Tirinzoni, Ahmed Touati et al.

Subtask-Aware Visual Reward Learning from Segmented Demonstrations

Changyeon Kim, Minho Heo, Doohyun Lee et al.

Videos are Sample-Efficient Supervisions: Behavior Cloning from Videos via Latent Representations

Xin Liu, Haoran Li, Dongbin Zhao

Demonstration-Regularized RL

Daniil Tiapkin, Denis Belomestny, Daniele Calandriello et al.

Learning to Act from Actionless Videos through Dense Correspondences

Po-Chen Ko, Jiayuan Mao, Yilun Du et al.

Reverse Forward Curriculum Learning for Extreme Sample and Demo Efficiency

Stone Tao, Arth Shukla, Tse-kai Chan et al.

Student-Informed Teacher Training

Nico Messikommer, Jiaxu Xing, Elie Aljalbout et al.

Learning from Demonstrations via Capability-Aware Goal Sampling

Yuanlin Duan, Yuning Wang, Wenjie Qiu et al.

Behaviour Distillation

Andrei Lupu, Chris Lu, Jarek Liesen et al.

Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion

Aleksandar Jevtić, Christoph Reich, Felix Wimbauer et al.

Rapid Selection and Ordering of In-Context Demonstrations via Prompt Embedding Clustering

Kha Pham, Hung Le, Man Ngo et al.

GenH2R: Learning Generalizable Human-to-Robot Handover via Scalable Simulation Demonstration and Imitation

Zifan Wang, Junyu Chen, Ziqing Chen et al.

MIRE: Matched Implicit Neural Representations

Dhananjaya Jayasundara, Heng Zhao, Demetrio Labate et al.

Incremental Object Keypoint Learning

Mingfu Liang, Jiahuan Zhou, Xu Zou et al.

LASIL: Learner-Aware Supervised Imitation Learning For Long-term Microscopic Traffic Simulation

Ke Guo, Zhenwei Miao, Wei Jing et al.

Data Scaling Laws in Imitation Learning for Robotic Manipulation

Fanqi Lin, Yingdong Hu, Pingyue Sheng et al.

Learning Parameterized Skills from Demonstrations

Vedant Gupta, Haotian Fu, Calvin Luo et al.

DOLLAR: Few-Step Video Generation via Distillation and Latent Reward Optimization

Zihan Ding, Chi Jin, Difan Liu et al.

From Gaze to Movement: Predicting Visual Attention for Autonomous Driving Human-Machine Interaction based on Programmatic Imitation Learning

Yexin Huang, Yongbin Lin, Lishengsa Yue et al.

Learning Physics From Video: Unsupervised Physical Parameter Estimation for Continuous Dynamical Systems

Alejandro Castañeda Garcia, Jan Warchocki, Jan van Gemert et al.

CVPR 2025

—

not collected

Imitation Learning

Top Conferences

Related Topics (Reinforcement Learning)

Top Papers

AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials

Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer

Towards Diverse Behaviors: A Benchmark for Imitation Learning with Human Demonstrations

LMAct: A Benchmark for In-Context Imitation Learning with Long Multimodal Demonstrations

eTag: Class-Incremental Learning via Embedding Distillation and Task-Oriented Generation

CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction

Instant Policy: In-Context Imitation Learning via Graph Diffusion

Deep SE(3)-Equivariant Geometric Reasoning for Precise Placement Tasks

Domain Prompt Learning with Quaternion Networks

DiffAIL: Diffusion Adversarial Imitation Learning

Motion Prior Knowledge Learning with Homogeneous Language Descriptions for Moving Infrared Small Target Detection

Are Human-generated Demonstrations Necessary for In-context Learning?

ET-SEED: EFFICIENT TRAJECTORY-LEVEL SE(3) EQUIVARIANT DIFFUSION POLICY

Learning Intra-view and Cross-view Geometric Knowledge for Stereo Matching

SketchINR: A First Look into Sketches as Implicit Neural Representations

RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation

SkillMimic: Learning Basketball Interaction Skills from Demonstrations

Real Appearance Modeling for More General Deepfake Detection

AdaManip: Adaptive Articulated Object Manipulation Environments and Policy Learning

Touch in the Wild: Learning Fine-Grained Manipulation with a Portable Visuo-Tactile Gripper

Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning

Gazing Into Missteps: Leveraging Eye-Gaze for Unsupervised Mistake Detection in Egocentric Videos of Skilled Human Activities

Circumventing Shortcuts in Audio-visual Deepfake Detection Datasets with Unsupervised Learning

ProMotion: Prototypes As Motion Learners

BASKET: A Large-Scale Video Dataset for Fine-Grained Skill Estimation

ZOOM: Learning Video Mirror Detection with Extremely-Weak Supervision

UAV-Flow Colosseo: A Real-World Benchmark for Flying-on-a-Word UAV Imitation Learning

Task-Adaptive Saliency Guidance for Exemplar-free Class Incremental Learning

Instruction-based Image Manipulation by Watching How Things Move

Mimic In-Context Learning for Multimodal Tasks

Efficient Active Imitation Learning with Random Network Distillation

SymDPO: Boosting In-Context Learning of Large Multimodal Models with Symbol Demonstration Direct Preference Optimization

Noise-assisted Prompt Learning for Image Forgery Detection and Localization

Planning from Imagination: Episodic Simulation and Episodic Memory for Vision-and-Language Navigation

DualCP: Rehearsal-Free Domain-Incremental Learning via Dual-Level Concept Prototype

A Smooth Sea Never Made a Skilled SAILOR: Robust Imitation via Learning to Search

Bidirectional Decoding: Improving Action Chunking via Guided Test-Time Sampling

Inverse Reinforcement Learning by Estimating Expertise of Demonstrators

CLIP-driven View-aware Prompt Learning for Unsupervised Vehicle Re-identification

Unsupervised Object Interaction Learning with Counterfactual Dynamics Models

GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation

SORREL: Suboptimal-Demonstration-Guided Reinforcement Learning for Learning to Branch

Latent Policy Barrier: Learning Robust Visuomotor Policies by Staying In-Distribution

PRIMAL: Physically Reactive and Interactive Motor Model for Avatar Learning

Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for Reinforcement Learning

VA-AR: Learning Velocity-Aware Action Representations with Mixture of Window Attention

Channel Consistency Prior and Self-Reconstruction Strategy Based Unsupervised Image Deraining

Generating Traffic Scenarios via In-Context Learning to Learn Better Motion Planner

Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning

Direct Post-Training Preference Alignment for Multi-Agent Motion Generation Model Using Implicit Feedback from Pre-training Demonstrations

Reward-free World Models for Online Imitation Learning

DUEL: Duplicate Elimination on Active Memory for Self-Supervised Class-Imbalanced Learning

Transfer Learning of Real Image Features with Soft Contrastive Loss for Fake Image Detection

HaHeAE: Learning Generalisable Joint Representations of Human Hand and Head Movements in Extended Reality

Morphing Tokens Draw Strong Masked Image Models

A Lesson in Splats: Teacher-Guided Diffusion for 3D Gaussian Splats Generation with 2D Supervision

Beyond-Expert Performance with Limited Demonstrations: Efficient Imitation Learning with Double Exploration

AutoCGP: Closed-Loop Concept-Guided Policies from Unlabeled Demonstrations

Differentiable Rule Induction from Raw Sequence Inputs

Demonstration Selection for In-Context Learning via Reinforcement Learning

PseudoMapTrainer: Learning Online Mapping without HD Maps

Imitation Learning from a Single Temporally Misaligned Video

Self-Reinforcing Prototype Evolution with Dual-Knowledge Cooperation for Semi-Supervised Lifelong Person Re-Identification

Forecasting Bimanual Object Manipulation Sequences from Unimanual Observations

Markov Balance Satisfaction Improves Performance in Strictly Batch Offline Imitation Learning

S²DN: Learning to Denoise Unconvincing Knowledge for Inductive Knowledge Graph Completion

Attend and Enrich: Enhanced Visual Prompt for Zero-Shot Learning

RDD: Retrieval-Based Demonstration Decomposer for Planner Alignment in Long-Horizon Tasks

TreeSBA: Tree-Transformer for Self-Supervised Sequential Brick Assembly

Learning to Build by Building Your Own Instructions

MoSiC: Optimal-Transport Motion Trajectory for Dense Self-Supervised Learning

Beyond the Data Imbalance: Employing the Heterogeneous Datasets for Vehicle Maneuver Prediction

Zero-Shot Offline Imitation Learning via Optimal Transport

Action-Constrained Imitation Learning

Joint Reward and Policy Learning with Demonstrations and Human Feedback Improves Alignment

Imitation Beyond Expectation Using Pluralistic Stochastic Dominance