Most Cited ICLR "spatio-temporal reasoning" Papers
Probabilistic Self-supervised Representation Learning via Scoring Rules Minimization
Amirhossein Vahidi, Simon Schosser, Lisa Wimmer et al.
$\mathbb{D}^2$ Pruning: Message Passing for Balancing Diversity & Difficulty in Data Pruning
Adyasha Maharana, Prateek Yadav, Mohit Bansal
Scaling physics-informed hard constraints with mixture-of-experts
Nithin Chalapathi, Yiheng Du, Aditi Krishnapriyan
On Stationary Point Convergence of PPO-Clip
Ruinan Jin, Shuai Li, Baoxiang Wang
How to Capture Higher-order Correlations? Generalizing Matrix Softmax Attention to Kronecker Computation
Josh Alman, Zhao Song
General Graph Random Features
Isaac Reid, Krzysztof Choromanski, Eli Berger et al.
Are Models Biased on Text without Gender-related Language?
Catarina Belém, Preethi Seshadri, Yasaman Razeghi et al.
Privacy-Preserving In-Context Learning for Large Language Models
Tong Wu, Ashwinee Panda, Jiachen (Tianhao) Wang et al.
A Discretization Framework for Robust Contextual Stochastic Optimization
Rares Cristian, Georgia Perakis
Chain of Log-Concave Markov Chains
Saeed Saremi, Ji Won Park, Francis Bach
Perceptual Scales Predicted by Fisher Information Metrics
Jonathan Vacher, Pascal Mamassian
Protein Discovery with Discrete Walk-Jump Sampling
Nathan Frey, Dan Berenberg, Karina Zadorozhny et al.
A Simple and Scalable Representation for Graph Generation
Yunhui Jang, Seul Lee, Sungsoo Ahn
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
Tri Dao
TokenFlow: Consistent Diffusion Features for Consistent Video Editing
Michal Geyer, Omer Bar Tal, Shai Bagon et al.
Turning large language models into cognitive models
Marcel Binz, Eric Schulz
Neural Snowflakes: Universal Latent Graph Inference via Trainable Latent Geometries
Haitz Sáez de Ocáriz Borde, Anastasis Kratsios
Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts
Ahmed Hendawy, Jan Peters, Carlo D'Eramo
Unveiling Options with Neural Network Decomposition
Mahdi Alikhasi, Levi Lelis
Towards the Fundamental Limits of Knowledge Transfer over Finite Domains
Qingyue Zhao, Banghua Zhu
Active Test-Time Adaptation: Theoretical Analyses and An Algorithm
Shurui Gui, Xiner Li, Shuiwang Ji
Quantifying and Enhancing Multi-modal Robustness with Modality Preference
Zequn Yang, Yake Wei, Ce Liang et al.
RingAttention with Blockwise Transformers for Near-Infinite Context
Hao Liu, Matei Zaharia, Pieter Abbeel
Improved Techniques for Training Consistency Models
Yang Song, Prafulla Dhariwal
Modeling Boundedly Rational Agents with Latent Inference Budgets
Athul Jacob, Abhishek Gupta, Jacob Andreas
HYPO: Hyperspherical Out-Of-Distribution Generalization
Haoyue Bai, Yifei Ming, Julian Katz-Samuels et al.
On the Foundations of Shortcut Learning
Katherine Hermann, Hossein Mobahi, Thomas Fel et al.
Emergent Communication with Conversational Repair
Mitja Nikolaus
SmartPlay: A Benchmark for LLMs as Intelligent Agents
Yue Wu, Xuan Tang, Tom Mitchell et al.
A General Framework for User-Guided Bayesian Optimization
Carl Hvarfner, Frank Hutter, Luigi Nardi
InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior
Chenguo Lin, Yadong Mu
HIFA: High-fidelity Text-to-3D Generation with Advanced Diffusion Guidance
Junzhe Zhu, Peiye Zhuang, Sanmi Koyejo
Understanding Domain Generalization: A Noise Robustness Perspective
Rui Qiao, Bryan Kian Hsiang Low
Can Transformers Capture Spatial Relations between Objects?
Chuan Wen, Dinesh Jayaraman, Yang Gao
The LLM Surgeon
Tycho van der Ouderaa, Markus Nagel, Mart van Baalen et al.
Equivariant Scalar Fields for Molecular Docking with Fast Fourier Transforms
Bowen Jing, Tommi Jaakkola, Bonnie Berger
Diffusion-TS: Interpretable Diffusion for General Time Series Generation
Xinyu Yuan, Yan Qiao
Why is SAM Robust to Label Noise?
Christina Baek, J Kolter, Aditi Raghunathan
An Efficient Tester-Learner for Halfspaces
Aravind Gollakota, Adam Klivans, Konstantinos Stavropoulos et al.
Batch normalization is sufficient for universal function approximation in CNNs
Rebekka Burkholz
Predictive, scalable and interpretable knowledge tracing on structured domains
Hanqi Zhou, Robert Bamler, Charley Wu et al.
Imitation Learning from Observation with Automatic Discount Scheduling
Yuyang Liu, Weijun Dong, Yingdong Hu et al.
Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition
Feng Lu, Lijun Zhang, Xiangyuan Lan et al.
ImageNet-OOD: Deciphering Modern Out-of-Distribution Detection Algorithms
William Yang, Byron Zhang, Olga Russakovsky
GNNX-BENCH: Unravelling the Utility of Perturbation-based GNN Explainers through In-depth Benchmarking
Mert Kosan, Samidha Verma, Burouj Armgaan et al.
A Benchmark Study on Calibration
Linwei Tao, Younan Zhu, Haolan Guo et al.
Guaranteed Approximation Bounds for Mixed-Precision Neural Operators
Renbo Tu, Colin White, Jean Kossaifi et al.
Lifting Architectural Constraints of Injective Flows
Peter Sorrenson, Felix Draxler, Armand Rousselot et al.
Aux-NAS: Exploiting Auxiliary Labels with Negligibly Extra Inference Cost
Yuan Gao, Weizhong Zhang, Wenhan Luo et al.
Language Model Self-improvement by Reinforcement Learning Contemplation
Jing-Cheng Pang, Pengyuan Wang, Kaiyuan Li et al.
Fast Updating Truncated SVD for Representation Learning with Sparse Matrices
Haoran Deng, Yang Yang, Jiahe Li et al.
SOInter: A Novel Deep Energy-Based Interpretation Method for Explaining Structured Output Models
S. Fatemeh Seyyedsalehi, Mahdieh Baghshah, Hamid Rabiee
Sparse Spiking Neural Network: Exploiting Heterogeneity in Timescales for Pruning Recurrent SNN
Biswadeep Chakraborty, Beomseok Kang, Harshit Kumar et al.
Policy Rehearsing: Training Generalizable Policies for Reinforcement Learning
Chengxing Jia, Chen-Xiao Gao, Hao Yin et al.
ModernTCN: A Modern Pure Convolution Structure for General Time Series Analysis
DongHao Luo, Xue Wang
Prototypical Information Bottlenecking and Disentangling for Multimodal Cancer Survival Prediction
Yilan Zhang, Yingxue Xu, Jianqi Chen et al.
On the Role of General Function Approximation in Offline Reinforcement Learning
Chenjie Mao, Qiaosheng Zhang, Zhen Wang et al.
Learning Delays in Spiking Neural Networks using Dilated Convolutions with Learnable Spacings
Ilyass Hammouamri, Ismail Khalfaoui Hassani, Timothée Masquelier
The Generative AI Paradox: “What It Can Create, It May Not Understand”
Peter West, Ximing Lu, Nouha Dziri et al.
The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning
Bill Yuchen Lin, Abhilasha Ravichander, Ximing Lu et al.
Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement
Linlu Qiu, Liwei Jiang, Ximing Lu et al.
Evaluating Large Language Models at Evaluating Instruction Following
Zhiyuan Zeng, Jiatong Yu, Tianyu Gao et al.
Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
Mengzhou Xia, Tianyu Gao, Zhiyuan Zeng et al.
The Truth is in There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction
Pratyusha Sharma, Jordan Ash, Dipendra Kumar Misra
Learning Grounded Action Abstractions from Language
Lio Wong, Jiayuan Mao, Pratyusha Sharma et al.
Scaling Laws for Sparsely-Connected Foundation Models
Elias Frantar, Carlos Riquelme Ruiz, Neil Houlsby et al.
From Sparse to Soft Mixtures of Experts
Joan Puigcerver, Carlos Riquelme Ruiz, Basil Mustafa et al.
iGraphMix: Input Graph Mixup Method for Node Classification
Jongwon Jeong, Hoyeop Lee, Hyui Geon Yoon et al.
Retrieval-Enhanced Contrastive Vision-Text Models
Ahmet Iscen, Mathilde Caron, Alireza Fathi et al.
Raidar: geneRative AI Detection viA Rewriting
Chengzhi Mao, Carl Vondrick, Hao Wang et al.
Function Vectors in Large Language Models
Eric Todd, Millicent Li, Arnab Sen Sharma et al.
Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization
Yang Jin, Kun Xu, Kun Xu et al.
A Policy Gradient Method for Confounded POMDPs
Mao Hong, Zhengling Qi, Yanxun Xu
MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data
Yinya Huang, Xiaohan Lin, Zhengying Liu et al.
LEGO-Prover: Neural Theorem Proving with Growing Libraries
Haiming Wang, Huajian Xin, Chuanyang Zheng et al.
Thought Propagation: An Analogical Approach to Complex Reasoning with Large Language Models
Junchi Yu, Ran He, Rex Ying
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher
Youliang Yuan, Wenxiang Jiao, Wenxuan Wang et al.
On the Humanity of Conversational AI: Evaluating the Psychological Portrayal of LLMs
Jen-tse Huang, Wenxuan Wang, Eric John Li et al.
Accurate Retraining-free Pruning for Pretrained Encoder-based Language Models
Seungcheol Park, Hojun Choi, U Kang
INViTE: INterpret and Control Vision-Language Models with Text Explanations
Haozhe Chen, Junfeng Yang, Carl Vondrick et al.
Effective pruning of web-scale datasets based on complexity of concept clusters
Amro Kamal, Evgenia Rusak, Kushal Tirumala et al.
Sin3DM: Learning a Diffusion Model from a Single 3D Textured Shape
Rundi Wu, Ruoshi Liu, Carl Vondrick et al.
Remote Sensing Vision-Language Foundation Models without Annotations via Ground Remote Alignment
Utkarsh Kumar Mall, Cheng Perng Phoo, Meilin Liu et al.
DiffEnc: Variational Diffusion with a Learned Encoder
Beatrix M. G. Nielsen, Anders Christensen, Andrea Dittadi et al.
GIM: Learning Generalizable Image Matcher From Internet Videos
Xuelun Shen, Zhipeng Cai, Wei Yin et al.
DyVal: Dynamic Evaluation of Large Language Models for Reasoning Tasks
Kaijie Zhu, Jiaao Chen, Jindong Wang et al.
The Unreasonable Effectiveness of Linear Prediction as a Perceptual Metric
Daniel Severo, Lucas Theis, Johannes Ballé
FFB: A Fair Fairness Benchmark for In-Processing Group Fairness Methods
Xiaotian Han, Jianfeng Chi, Yu Chen et al.
SAN: Inducing Metrizability of GAN with Discriminative Normalized Linear Layer
Yuhta Takida, Masaaki Imaizumi, Takashi Shibuya et al.
Learning to Relax: Setting Solver Parameters Across a Sequence of Linear System Instances
Mikhail Khodak, Edmond Chow, Nina Balcan et al.
Self-supervised Pocket Pretraining via Protein Fragment-Surroundings Alignment
Bowen Gao, Yinjun Jia, Yuanle Mo et al.
Solving Diffusion ODEs with Optimal Boundary Conditions for Better Image Super-Resolution
Yiyang Ma, Huan Yang, Wenhan Yang et al.
Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning
Ruizhe Shi, Yuyao Liu, Yanjie Ze et al.
Rayleigh Quotient Graph Neural Networks for Graph-level Anomaly Detection
Xiangyu Dong, Xingyi Zhang, Sibo Wang
DQ-LoRe: Dual Queries with Low Rank Approximation Re-ranking for In-Context Learning
Jing Xiong, Zixuan Li, Chuanyang Zheng et al.
MetaGPT: Meta Programming for A Multi-Agent Collaborative Framework
Sirui Hong, Mingchen Zhuge, Jonathan Chen et al.
In-context Autoencoder for Context Compression in a Large Language Model
Tao Ge, Hu Jing, Lei Wang et al.
GenCorres: Consistent Shape Matching via Coupled Implicit-Explicit Shape Generative Models
Haitao Yang, Xiangru Huang, Bo Sun et al.
Hard-Constrained Deep Learning for Climate Downscaling
Paula Harder, Alex Hernandez-Garcia, Venkatesh Ramesh et al.
Information Bottleneck Analysis of Deep Neural Networks via Lossy Compression
Ivan Butakov, Aleksandr Tolmachev, Sofia Malanchuk et al.
ZipIt! Merging Models from Different Tasks without Training
George Stoica, Daniel Bolya, Jakob Bjorner et al.
Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLMs
Yuxin Zhang, Lirui Zhao, Mingbao Lin et al.
MIntRec2.0: A Large-scale Benchmark Dataset for Multimodal Intent Recognition and Out-of-scope Detection in Conversations
Hanlei Zhang, Xin Wang, Hua Xu et al.
RLCD: Reinforcement Learning from Contrastive Distillation for LM Alignment
Kevin Yang, Dan Klein, Asli Celikyilmaz et al.
Localizing and Editing Knowledge In Text-to-Image Generative Models
Samyadeep Basu, Nanxuan Zhao, Vlad Morariu et al.
Efficient Dynamics Modeling in Interactive Environments with Koopman Theory
Arnab Mondal, Siba Smarak Panigrahi, Sai Rajeswar et al.
Generative Learning for Financial Time Series with Irregular and Scale-Invariant Patterns
Hongbin Huang, Minghua Chen, Xiao Qiao
Linear attention is (maybe) all you need (to understand Transformer optimization)
Kwangjun Ahn, Xiang Cheng, Minhak Song et al.
Scalable Diffusion for Materials Generation
Sherry Yang, Kwanghwan Cho, Amil Merchant et al.
MG-TSD: Multi-Granularity Time Series Diffusion Models with Guided Learning Process
Xinyao Fan, Yueying Wu, Chang Xu et al.
RT-Trajectory: Robotic Task Generalization via Hindsight Trajectory Sketches
Jiayuan Gu, Sean Kirmani, Paul Wohlhart et al.
Simplicial Representation Learning with Neural $k$-Forms
Kelly Maggs, Celia Hacker, Bastian Rieck
HAZARD Challenge: Embodied Decision Making in Dynamically Changing Environments
Qinhong Zhou, Sunli Chen, Yisong Wang et al.
Efficient Multi-agent Reinforcement Learning by Planning
Qihan Liu, Jianing Ye, Xiaoteng Ma et al.
DSPy: Compiling Declarative Language Model Calls into State-of-the-Art Pipelines
Omar Khattab, Arnav Singhvi, Paridhi Maheshwari et al.
On the Stability of Iterative Retraining of Generative Models on their own Data
Quentin Bertrand, Joey Bose, Alexandre Duplessis et al.
A Study of Bayesian Neural Network Surrogates for Bayesian Optimization
Yucen Li, Tim G. J. Rudner, Andrew Gordon Wilson
Fine-Tuned Language Models Generate Stable Inorganic Materials as Text
Nate Gruver, Anuroop Sriram, Andrea Madotto et al.
Prediction Error-based Classification for Class-Incremental Learning
Michał Zając, Tinne Tuytelaars, Gido M van de Ven
Deep Geodesic Canonical Correlation Analysis for Covariance-Based Neuroimaging Data
Ce Ju, Reinmar Kobler, Liyao Tang et al.
Beyond Weisfeiler-Lehman: A Quantitative Framework for GNN Expressiveness
Bohang Zhang, Jingchu Gai, Yiheng Du et al.
Adapting to Distribution Shift by Visual Domain Prompt Generation
Zhixiang Chi, Li Gu, Tao Zhong et al.
ImplicitSLIM and How it Improves Embedding-based Collaborative Filtering
Ilya Shenbin, Sergey Nikolenko
Universal Image Restoration Pre-training via Degradation Classification
Jiakui Hu, Lujia Jin, Zhengjian Yao et al.