ICLR Oral Papers

444 papers found • Page 2 of 9

CViT: Continuous Vision Transformer for Operator Learning

Sifan Wang, Jacob Seidman, Shyam Sankaran et al.

ICLR 2025oralarXiv:2405.13998
26
citations

DailyDilemmas: Revealing Value Preferences of LLMs with Quandaries of Daily Life

Yu Ying Chiu, Liwei Jiang, Yejin Choi

ICLR 2025oral

DeepLTL: Learning to Efficiently Satisfy Complex LTL Specifications for Multi-Task RL

Mathias Jackermeier, Alessandro Abate

ICLR 2025oral

Deep Random Features for Scalable Interpolation of Spatiotemporal Data

Weibin Chen, Azhir Mahmood, Michel Tsamados et al.

ICLR 2025oral

DeepTAGE: Deep Temporal-Aligned Gradient Enhancement for Optimizing Spiking Neural Networks

Wei Liu, Li Yang, Mingxuan Zhao et al.

ICLR 2025oral

Dense Video Object Captioning from Disjoint Supervision

Xingyi Zhou, Anurag Arnab, Chen Sun et al.

ICLR 2025oralarXiv:2306.11729
7
citations

Depth Any Video with Scalable Synthetic Data

Honghui Yang, Di Huang, Wei Yin et al.

ICLR 2025oralarXiv:2410.10815
44
citations

Diffusion$^2$: Dynamic 3D Content Generation via Score Composition of Video and Multi-view Diffusion Models

Zeyu Yang, Zijie Pan, Chun Gu et al.

ICLR 2025oralarXiv:2404.02148
18
citations

Diffusion Transformer Captures Spatial-Temporal Dependencies: A Theory for Gaussian Process Data

Hengyu Fu, Zehao Dou, Jiawei Guo et al.

ICLR 2025oralarXiv:2407.16134
3
citations

Digi-Q: Learning VLM Q-Value Functions for Training Device-Control Agents

Hao Bai, Yifei Zhou, Li Li et al.

ICLR 2025oral

Discovering Temporally Compositional Neural Manifolds with Switching Infinite GPFA

Changmin Yu, Maneesh Sahani, Máté Lengyel

ICLR 2025oral

Disentangling 3D Animal Pose Dynamics with Scrubbed Conditional Latent Variables

Joshua Wu, Hari Koneru, James Ravenel et al.

ICLR 2025oral

Diverse Policies Recovering via Pointwise Mutual Information Weighted Imitation Learning

Hanlin Yang, Jian Yao, Weiming Liu et al.

ICLR 2025oral
2
citations

Do Egocentric Video-Language Models Truly Understand Hand-Object Interactions?

BOSHEN XU, Ziheng Wang, Yang Du et al.

ICLR 2025oral

DriveTransformer: Unified Transformer for Scalable End-to-End Autonomous Driving

Xiaosong Jia, Junqi You, Zhiyuan Zhang et al.

ICLR 2025oral

DyCAST: Learning Dynamic Causal Structure from Time Series

Yue Cheng, Bochen Lyu, Weiwei Xing et al.

ICLR 2025oral
1
citations

Dynamical Diffusion: Learning Temporal Dynamics with Diffusion Models

Xingzhuo Guo, Yu Zhang, Baixu Chen et al.

ICLR 2025oral
6
citations

Dynamic Contrastive Skill Learning with State-Transition Based Skill Clustering and Dynamic Length Adjustment

Jinwoo Choi, Seung-Woo Seo

ICLR 2025oralarXiv:2504.14805
1
citations

Dynamic Gaussians Mesh: Consistent Mesh Reconstruction from Dynamic Scenes

Isabella Liu, Hao Su, Xiaolong Wang

ICLR 2025oral
15
citations

Earlier Tokens Contribute More: Learning Direct Preference Optimization From Temporal Decay Perspective

Ruichen Shao, Bei Li, Gangao Liu et al.

ICLR 2025oral

EC-Diffuser: Multi-Object Manipulation via Entity-Centric Behavior Generation

Carl Qi, Dan Haramati, Tal Daniel et al.

ICLR 2025oral
3
citations

Efficient Exploration and Discriminative World Model Learning with an Object-Centric Abstraction

Anthony GX-Chen, Kenneth Marino, Rob Fergus

ICLR 2025oral

Efficient Masked AutoEncoder for Video Object Counting and A Large-Scale Benchmark

Bing Cao, Quanhao Lu, Jiekang Feng et al.

ICLR 2025oral
2
citations

Efficient Multi-agent Offline Coordination via Diffusion-based Trajectory Stitching

Lei Yuan, Yuqi Bian, Lihe Li et al.

ICLR 2025oral
5
citations

EG4D: Explicit Generation of 4D Object without Score Distillation

Qi Sun, Zhiyang Guo, Ziyu Wan et al.

ICLR 2025oralarXiv:2405.18132
39
citations

EgoExo-Gen: Ego-centric Video Prediction by Watching Exo-centric Videos

Jilan Xu, Yifei Huang, Baoqi Pei et al.

ICLR 2025oral

EgoSim: Egocentric Exploration in Virtual Worlds with Multi-modal Conditioning

Wei Yu, Songheng Yin, Steve Easterbrook et al.

ICLR 2025oral

Eliciting Human Preferences with Language Models

Belinda Li, Alex Tamkin, Noah Goodman et al.

ICLR 2025oral
76
citations

Episodic Memories Generation and Evaluation Benchmark for Large Language Models

Alexis Huet, Zied Houidi, Dario Rossi

ICLR 2025oralarXiv:2501.13121
8
citations

Episodic Novelty Through Temporal Distance

Yuhua Jiang, Qihan Liu, Yiqin Yang et al.

ICLR 2025oral
5
citations

Error-quantified Conformal Inference for Time Series

Junxi Wu, Dongjian Hu, Yajie Bao et al.

ICLR 2025oralarXiv:2502.00818
8
citations

Expand and Compress: Exploring Tuning Principles for Continual Spatio-Temporal Graph Forecasting

Wei Chen, Yuxuan Liang

ICLR 2025oral
6
citations

Exposure Bracketing Is All You Need For A High-Quality Image

Zhilu Zhang, Shuohao Zhang, Renlong Wu et al.

ICLR 2025oral

FACTS: A Factored State-Space Framework for World Modelling

Li Nanbo, Firas Laakom, Yucheng XU et al.

ICLR 2025oral
1
citations

Fast and Slow Streams for Online Time Series Forecasting Without Information Leakage

Ying-yee Ava Lau, Zhiwen Shao, Dit-Yan Yeung

ICLR 2025oral
8
citations

FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

Zhengyao Lyu, Chenyang Si, Junhao Song et al.

ICLR 2025oral

Flow Matching with Gaussian Process Priors for Probabilistic Time Series Forecasting

Marcel Kollovieh, Marten Lienen, David Lüdke et al.

ICLR 2025oral

Gaussian Ensemble Belief Propagation for Efficient Inference in High-Dimensional, Black-box Systems

Dan MacKinlay, Russell Tsuchida, Daniel Pagendam et al.

ICLR 2025oral

Generalized Video Moment Retrieval

Qin You, Qilong Wu, Yicong Li et al.

ICLR 2025oral
1
citations

GenXD: Generating Any 3D and 4D Scenes

Yuyang Zhao, Chung-Ching Lin, Kevin Lin et al.

ICLR 2025oral

Glad: A Streaming Scene Generator for Autonomous Driving

Bin Xie, Yingfei Liu, Tiancai Wang et al.

ICLR 2025oral
11
citations

GLOMA: Global Video Text Spotting with Morphological Association

Han Wang, Yanjie Wang, Yang Li et al.

ICLR 2025oral
2
citations

Going Beyond Static: Understanding Shifts with Time-Series Attribution

Jiashuo Liu, Nabeel Seedat, Peng Cui et al.

ICLR 2025oral
1
citations

GPS: A Probabilistic Distributional Similarity with Gumbel Priors for Set-to-Set Matching

Ziming Zhang, Fangzhou Lin, Haotian Liu et al.

ICLR 2025oral

Graph Neural Networks Are More Than Filters: Revisiting and Benchmarking from A Spectral Perspective

Yushun Dong, Patrick Soga, Yinhan He et al.

ICLR 2025oral
2
citations

GUI-World: A Video Benchmark and Dataset for Multimodal GUI-oriented Understanding

Dongping Chen, Yue Huang, Siyuan Wu et al.

ICLR 2025oral
23
citations

Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation

Jiahao Cui, Hui Li, Yao Yao et al.

ICLR 2025oral

Handling Delay in Real-Time Reinforcement Learning

Ivan Anokhin, Rishav Rishav, Matt Riemer et al.

ICLR 2025oral

High-Dynamic Radar Sequence Prediction for Weather Nowcasting Using Spatiotemporal Coherent Gaussian Representation

Ziye Wang, Yiran Qin, Lin Zeng et al.

ICLR 2025oral
1
citations

High-Quality Joint Image and Video Tokenization with Causal VAE

Dawit Mureja Argaw, Xian Liu, Qinsheng Zhang et al.

ICLR 2025oral
1
citations