🧬Reinforcement Learning

Imitation Learning

Learning from demonstrations

100 papers666 total citations
Compare with other topics
Feb '24 β€” Jan '26266 papers
Also includes: imitation learning, learning from demonstrations, behavioral cloning, inverse rl

Top Papers

#1

AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials

Yiheng Xu, Dunjie Lu, Zhennan Shen et al.

ICLR 2025
50
citations
#2

Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer

Yu Deng, Duomin Wang, Baoyuan Wang

ECCV 2024
45
citations
#3

Towards Diverse Behaviors: A Benchmark for Imitation Learning with Human Demonstrations

Xiaogang Jia, Denis Blessing, Xinkai Jiang et al.

ICLR 2024
42
citations
#4

LMAct: A Benchmark for In-Context Imitation Learning with Long Multimodal Demonstrations

Anian Ruoss, Fabio Pardo, Harris Chan et al.

ICML 2025
27
citations
#5

eTag: Class-Incremental Learning via Embedding Distillation and Task-Oriented Generation

Libo Huang, Yan Zeng, Chuanguang Yang et al.

AAAI 2024
26
citations
#6

CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction

Zhefei Gong, Pengxiang Ding, Shangke Lyu et al.

ICCV 2025
23
citations
#7

Instant Policy: In-Context Imitation Learning via Graph Diffusion

Vitalis Vosylius, Edward Johns

ICLR 2025
23
citations
#8

Deep SE(3)-Equivariant Geometric Reasoning for Precise Placement Tasks

Ben Eisner, Yi Yang, Todor Davchev et al.

ICLR 2024
22
citations
#9

Domain Prompt Learning with Quaternion Networks

Qinglong Cao, Zhengqin Xu, Yuntian Chen et al.

CVPR 2024
22
citations
#10

DiffAIL: Diffusion Adversarial Imitation Learning

Bingzheng Wang, Guoqiang Wu, Teng Pang et al.

AAAI 2024arXiv:2312.06348
imitation learningadversarial imitation learningdiffusion modelsreward function learning+4
20
citations
#11

Motion Prior Knowledge Learning with Homogeneous Language Descriptions for Moving Infrared Small Target Detection

Shengjia Chen, Luping Ji, Weiwei Duan et al.

AAAI 2025
16
citations
#12

Are Human-generated Demonstrations Necessary for In-context Learning?

Rui Li, Guoyin Wang, Jiwei Li

ICLR 2024
15
citations
#13

ET-SEED: EFFICIENT TRAJECTORY-LEVEL SE(3) EQUIVARIANT DIFFUSION POLICY

Chenrui Tie, Yue Chen, Ruihai Wu et al.

ICLR 2025
15
citations
#14

Learning Intra-view and Cross-view Geometric Knowledge for Stereo Matching

Rui Gong, Weide Liu, ZAIWANG GU et al.

CVPR 2024
14
citations
#15

SketchINR: A First Look into Sketches as Implicit Neural Representations

Hmrishav Bandyopadhyay, Ayan Kumar Bhunia, Pinaki Nath Chowdhury et al.

CVPR 2024
13
citations
#16

RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation

Mingfei Han, Liang Ma, Kamila Zhumakhanova et al.

CVPR 2025
12
citations
#17

SkillMimic: Learning Basketball Interaction Skills from Demonstrations

Yinhuai Wang, Qihan Zhao, Runyi Yu et al.

CVPR 2025
12
citations
#18

Real Appearance Modeling for More General Deepfake Detection

Jiahe Tian, Yu Cai, Xi Wang et al.

ECCV 2024
12
citations
#19

AdaManip: Adaptive Articulated Object Manipulation Environments and Policy Learning

Yuanfei Wang, Xiaojie Zhang, Ruihai Wu et al.

ICLR 2025arXiv:2502.11124
articulated object manipulationadaptive manipulation policy3d visual diffusionimitation learning+4
12
citations
#20

Touch in the Wild: Learning Fine-Grained Manipulation with a Portable Visuo-Tactile Gripper

Xinyue Zhu, Binghao Huang, Yunzhu Li

NeurIPS 2025
11
citations
#21

Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning

Baoqi Pei, Yifei Huang, Jilan Xu et al.

ICLR 2025
11
citations
#22

Gazing Into Missteps: Leveraging Eye-Gaze for Unsupervised Mistake Detection in Egocentric Videos of Skilled Human Activities

Michele Mazzamuto, Antonino Furnari, Yoichi Sato et al.

CVPR 2025
10
citations
#23

Circumventing Shortcuts in Audio-visual Deepfake Detection Datasets with Unsupervised Learning

Stefan Smeu, Dragos-Alexandru Boldisor, Dan Oneata et al.

CVPR 2025
9
citations
#24

ProMotion: Prototypes As Motion Learners

Yawen Lu, Dongfang Liu, Qifan Wang et al.

CVPR 2024
9
citations
#25

BASKET: A Large-Scale Video Dataset for Fine-Grained Skill Estimation

Yulu Pan, Ce Zhang, Gedas Bertasius

CVPR 2025
9
citations
#26

ZOOM: Learning Video Mirror Detection with Extremely-Weak Supervision

Ke Xu, Tsun Wai Siu, Rynson W.H. Lau

AAAI 2024
9
citations
#27

UAV-Flow Colosseo: A Real-World Benchmark for Flying-on-a-Word UAV Imitation Learning

Xiangyu Wang, Donglin Yang, Yue Liao et al.

NeurIPS 2025
8
citations
#28

Task-Adaptive Saliency Guidance for Exemplar-free Class Incremental Learning

Xialei Liu, Jiang-Tian Zhai, Andrew Bagdanov et al.

CVPR 2024
8
citations
#29

Instruction-based Image Manipulation by Watching How Things Move

Mingdeng Cao, Xuaner Zhang, Yinqiang Zheng et al.

CVPR 2025
8
citations
#30

Mimic In-Context Learning for Multimodal Tasks

Yuchu Jiang, Jiale Fu, chenduo hao et al.

CVPR 2025
8
citations
#31

Efficient Active Imitation Learning with Random Network Distillation

Emilien BirΓ©, Anthony Kobanda, Ludovic Denoyer et al.

ICLR 2025
7
citations
#32

SymDPO: Boosting In-Context Learning of Large Multimodal Models with Symbol Demonstration Direct Preference Optimization

Hongrui Jia, Chaoya Jiang, Haiyang Xu et al.

CVPR 2025
7
citations
#33

Noise-assisted Prompt Learning for Image Forgery Detection and Localization

Dong Li, Jiaying Zhu, Xueyang Fu et al.

ECCV 2024
6
citations
#34

Planning from Imagination: Episodic Simulation and Episodic Memory for Vision-and-Language Navigation

Yiyuan Pan, Yunzhe Xu, Zhe Liu et al.

AAAI 2025
6
citations
#35

DualCP: Rehearsal-Free Domain-Incremental Learning via Dual-Level Concept Prototype

Qiang Wang, Yuhang He, Songlin Dong et al.

AAAI 2025
6
citations
#36

A Smooth Sea Never Made a Skilled SAILOR: Robust Imitation via Learning to Search

Arnav Kumar Jain, Vibhakar Mohta, Subin Kim et al.

NeurIPS 2025arXiv:2506.05294
behavioral cloningimitation learningworld modelreward model+3
6
citations
#37

Bidirectional Decoding: Improving Action Chunking via Guided Test-Time Sampling

Yuejiang Liu, Jubayer Hamid, Annie Xie et al.

ICLR 2025arXiv:2408.17355
action chunkingbidirectional decodingrobot learninghuman demonstrations+3
6
citations
#38

Inverse Reinforcement Learning by Estimating Expertise of Demonstrators

Mark Beliaev, Ramtin Pedarsani

AAAI 2025
6
citations
#39

CLIP-driven View-aware Prompt Learning for Unsupervised Vehicle Re-identification

Jiyang Xu, Qi Wang, Xin Xiong et al.

AAAI 2025
5
citations
#40

Unsupervised Object Interaction Learning with Counterfactual Dynamics Models

Jongwook Choi, Sungtae Lee, Xinyu Wang et al.

AAAI 2024
5
citations
#41

GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation

Ning Gao, Yilun Chen, Shuai Yang et al.

CVPR 2025
5
citations
#42

SORREL: Suboptimal-Demonstration-Guided Reinforcement Learning for Learning to Branch

Shengyu Feng, Yiming Yang

AAAI 2025
5
citations
#43

Latent Policy Barrier: Learning Robust Visuomotor Policies by Staying In-Distribution

Zhanyi Sun, Shuran Song

NeurIPS 2025
5
citations
#44

PRIMAL: Physically Reactive and Interactive Motor Model for Avatar Learning

Yan Zhang, Yao Feng, AlpΓ‘r Cseke et al.

ICCV 2025
5
citations
#45

Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for Reinforcement Learning

Qi Wang, Zhipeng Zhang, Baao Xie et al.

ICCV 2025
4
citations
#46

VA-AR: Learning Velocity-Aware Action Representations with Mixture of Window Attention

Jiangning Wei, Lixiong Qin, Bo Yu et al.

AAAI 2025
4
citations
#47

Channel Consistency Prior and Self-Reconstruction Strategy Based Unsupervised Image Deraining

Guanglu Dong, Tianheng Zheng, Yuanzhouhan Cao et al.

CVPR 2025
4
citations
#48

Generating Traffic Scenarios via In-Context Learning to Learn Better Motion Planner

Aizierjiang Aiersilan

AAAI 2025
4
citations
#49

Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning

Jun Li, Jinpeng Wang, Chaolei Tan et al.

ICCV 2025
4
citations
#50

Direct Post-Training Preference Alignment for Multi-Agent Motion Generation Model Using Implicit Feedback from Pre-training Demonstrations

Thomas Tian, Kratarth Goel

ICLR 2025arXiv:2503.20105
preference alignmentmotion generationmulti-agent simulationimplicit feedback+3
4
citations
#51

Reward-free World Models for Online Imitation Learning

Shangzhe Li, Zhiao Huang, Hao Su

ICML 2025
3
citations
#52

DUEL: Duplicate Elimination on Active Memory for Self-Supervised Class-Imbalanced Learning

Won-Seok Choi, Hyundo Lee, Dong-Sig Han et al.

AAAI 2024arXiv:2402.08963
self-supervised learningclass-imbalanced learningactive memoryduplicate elimination+3
3
citations
#53

Transfer Learning of Real Image Features with Soft Contrastive Loss for Fake Image Detection

Ziyou Liang, Weifeng Liu, Run Wang et al.

AAAI 2025
3
citations
#54

HaHeAE: Learning Generalisable Joint Representations of Human Hand and Head Movements in Extended Reality

Zhiming Hu, Guanhua Zhang, Zheming Yin et al.

ISMAR 2025
3
citations
#55

Morphing Tokens Draw Strong Masked Image Models

Taekyung Kim, Byeongho Heo, Dongyoon Han

ICLR 2025
3
citations
#56

A Lesson in Splats: Teacher-Guided Diffusion for 3D Gaussian Splats Generation with 2D Supervision

Chensheng Peng, Ido Sobol, Masayoshi Tomizuka et al.

ICCV 2025
3
citations
#57

Beyond-Expert Performance with Limited Demonstrations: Efficient Imitation Learning with Double Exploration

Heyang Zhao, Xingrui Yu, David Bossens et al.

ICLR 2025
2
citations
#58

AutoCGP: Closed-Loop Concept-Guided Policies from Unlabeled Demonstrations

Pei Zhou, Ruizhe Liu, Qian Luo et al.

ICLR 2025
2
citations
#59

Differentiable Rule Induction from Raw Sequence Inputs

Kun Gao, Katsumi Inoue, Yongzhi Cao et al.

ICLR 2025
2
citations
#60

Demonstration Selection for In-Context Learning via Reinforcement Learning

Xubin Wang, Jianfei Wu, Yuan Yichen et al.

ICML 2025
2
citations
#61

PseudoMapTrainer: Learning Online Mapping without HD Maps

Christian LΓΆwens, Thorben Funke, Jingchao Xie et al.

ICCV 2025
2
citations
#62

Imitation Learning from a Single Temporally Misaligned Video

William Huey, Yuki (Huaxiaoyue) Wang, Anne Wu et al.

ICML 2025
2
citations
#63

Self-Reinforcing Prototype Evolution with Dual-Knowledge Cooperation for Semi-Supervised Lifelong Person Re-Identification

Kunlun Xu, Fan Zhuo, Jiangmeng Li et al.

ICCV 2025
2
citations
#64

Forecasting Bimanual Object Manipulation Sequences from Unimanual Observations

Haziq Razali, Yiannis Demiris

AAAI 2024
1
citations
#65

Markov Balance Satisfaction Improves Performance in Strictly Batch Offline Imitation Learning

Rishabh Agrawal, Nathan Dahlin, Rahul Jain et al.

AAAI 2025
1
citations
#66

SΒ²DN: Learning to Denoise Unconvincing Knowledge for Inductive Knowledge Graph Completion

Tengfei Ma, Yujie Chen, Liang Wang et al.

AAAI 2025
1
citations
#67

Attend and Enrich: Enhanced Visual Prompt for Zero-Shot Learning

Man Liu, Huihui Bai, Feng Li et al.

AAAI 2025
1
citations
#68

RDD: Retrieval-Based Demonstration Decomposer for Planner Alignment in Long-Horizon Tasks

Mingxuan Yan, Yuping Wang, Zechun Liu et al.

NeurIPS 2025
1
citations
#69

TreeSBA: Tree-Transformer for Self-Supervised Sequential Brick Assembly

Mengqi GUO, Chen Li, Yuyang Zhao et al.

ECCV 2024
1
citations
#70

Learning to Build by Building Your Own Instructions

Aaron Walsman, Muru Zhang, Adam Fishman et al.

ECCV 2024
1
citations
#71

MoSiC: Optimal-Transport Motion Trajectory for Dense Self-Supervised Learning

Mohammadreza Salehi, Shashanka Venkataramanan, Ioana Simion et al.

ICCV 2025arXiv:2506.08694
self-supervised learningmotion trajectory clusteringoptimal transportdense representation learning+4
1
citations
#72

Beyond the Data Imbalance: Employing the Heterogeneous Datasets for Vehicle Maneuver Prediction

Hyeongseok Jeon, Sanmin Kim, Abi Rahman Syamil et al.

ECCV 2024
1
citations
#73

Zero-Shot Offline Imitation Learning via Optimal Transport

Thomas Rupf, Marco Bagatella, Nico GΓΌrtler et al.

ICML 2025
β€”
not collected
#74

Action-Constrained Imitation Learning

Chia-Han Yeh, Tse-Sheng Nan, Risto Vuorio et al.

ICML 2025
β€”
not collected
#75

Joint Reward and Policy Learning with Demonstrations and Human Feedback Improves Alignment

Chenliang Li, Siliang Zeng, Zeyi Liao et al.

ICLR 2025
β€”
not collected
#76

Imitation Beyond Expectation Using Pluralistic Stochastic Dominance

Ali Farajzadeh, Danyal Saeed, Syed M Abbas et al.

NeurIPS 2025
β€”
not collected
#77

Imitation Learning from Observation with Automatic Discount Scheduling

Yuyang Liu, Weijun Dong, Yingdong Hu et al.

ICLR 2024
β€”
not collected
#78

Memory-Consistent Neural Networks for Imitation Learning

Kaustubh Sridhar, Souradeep Dutta, Dinesh Jayaraman et al.

ICLR 2024
β€”
not collected
#79

PN-GAIL: Leveraging Non-optimal Information from Imperfect Demonstrations

Qiang Liu, Huiqiao Fu, Kaiqiang Tang et al.

ICLR 2025
β€”
not collected
#80

Grounding Language Plans in Demonstrations Through Counterfactual Perturbations

Yanwei Wang, Johnson (Tsun-Hsuan) Wang, Jiayuan Mao et al.

ICLR 2024
β€”
not collected
#81

Fast Imitation via Behavior Foundation Models

Matteo Pirotta, Andrea Tirinzoni, Ahmed Touati et al.

ICLR 2024
β€”
not collected
#82

Subtask-Aware Visual Reward Learning from Segmented Demonstrations

Changyeon Kim, Minho Heo, Doohyun Lee et al.

ICLR 2025
β€”
not collected
#83

Videos are Sample-Efficient Supervisions: Behavior Cloning from Videos via Latent Representations

Xin Liu, Haoran Li, Dongbin Zhao

NeurIPS 2025
β€”
not collected
#84

Demonstration-Regularized RL

Daniil Tiapkin, Denis Belomestny, Daniele Calandriello et al.

ICLR 2024
β€”
not collected
#85

Learning to Act from Actionless Videos through Dense Correspondences

Po-Chen Ko, Jiayuan Mao, Yilun Du et al.

ICLR 2024
β€”
not collected
#86

Reverse Forward Curriculum Learning for Extreme Sample and Demo Efficiency

Stone Tao, Arth Shukla, Tse-kai Chan et al.

ICLR 2024
β€”
not collected
#87

Student-Informed Teacher Training

Nico Messikommer, Jiaxu Xing, Elie Aljalbout et al.

ICLR 2025
β€”
not collected
#88

Learning from Demonstrations via Capability-Aware Goal Sampling

Yuanlin Duan, Yuning Wang, Wenjie Qiu et al.

NeurIPS 2025
β€”
not collected
#89

Behaviour Distillation

Andrei Lupu, Chris Lu, Jarek Liesen et al.

ICLR 2024
β€”
not collected
#90

Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion

Aleksandar Jevtić, Christoph Reich, Felix Wimbauer et al.

ICCV 2025
β€”
not collected
#91

Rapid Selection and Ordering of In-Context Demonstrations via Prompt Embedding Clustering

Kha Pham, Hung Le, Man Ngo et al.

ICLR 2025
β€”
not collected
#92

GenH2R: Learning Generalizable Human-to-Robot Handover via Scalable Simulation Demonstration and Imitation

Zifan Wang, Junyu Chen, Ziqing Chen et al.

CVPR 2024
β€”
not collected
#93

MIRE: Matched Implicit Neural Representations

Dhananjaya Jayasundara, Heng Zhao, Demetrio Labate et al.

CVPR 2025
β€”
not collected
#94

Incremental Object Keypoint Learning

Mingfu Liang, Jiahuan Zhou, Xu Zou et al.

CVPR 2025
β€”
not collected
#95

LASIL: Learner-Aware Supervised Imitation Learning For Long-term Microscopic Traffic Simulation

Ke Guo, Zhenwei Miao, Wei Jing et al.

CVPR 2024
β€”
not collected
#96

Data Scaling Laws in Imitation Learning for Robotic Manipulation

Fanqi Lin, Yingdong Hu, Pingyue Sheng et al.

ICLR 2025
β€”
not collected
#97

Learning Parameterized Skills from Demonstrations

Vedant Gupta, Haotian Fu, Calvin Luo et al.

NeurIPS 2025
β€”
not collected
#98

DOLLAR: Few-Step Video Generation via Distillation and Latent Reward Optimization

Zihan Ding, Chi Jin, Difan Liu et al.

ICCV 2025
β€”
not collected
#99

From Gaze to Movement: Predicting Visual Attention for Autonomous Driving Human-Machine Interaction based on Programmatic Imitation Learning

Yexin Huang, Yongbin Lin, Lishengsa Yue et al.

ICCV 2025
β€”
not collected
#100

Learning Physics From Video: Unsupervised Physical Parameter Estimation for Continuous Dynamical Systems

Alejandro CastaΓ±eda Garcia, Jan Warchocki, Jan van Gemert et al.

CVPR 2025
β€”
not collected