Most Cited ICLR "spatiotemporal context" Papers
6,124 papers found • Page 16 of 31
Capability Localization: Capabilities Can be Localized rather than Individual Knowledge
Xiusheng Huang, Jiaxiang Liu, Yequan Wang et al.
PolyhedronNet: Representation Learning for Polyhedra with Surface-attributed Graph
Dazhou Yu, Genpei Zhang, Liang Zhao
CoMRes: Semi-Supervised Time Series Forecasting Utilizing Consensus Promotion of Multi-Resolution
Yunju Cho, Jay-Yoon Lee
Multimodal Unsupervised Domain Generalization by Retrieving Across the Modality Gap
Christopher Liao, Christian So, Theodoros Tsiligkaridis et al.
PRDP: Progressively Refined Differentiable Physics
Kanishk Bhatia, Felix Koehler, Nils Thuerey
Prompt as Knowledge Bank: Boost Vision-language model via Structural Representation for zero-shot medical detection
Yuguang Yang, Tongfei Chen, Haoyu Huang et al.
Trajectory-Class-Aware Multi-Agent Reinforcement Learning
Hyungho Na, Kwanghyeon Lee, Sumin Lee et al.
The Journey Matters: Average Parameter Count over Pre-training Unifies Sparse and Dense Scaling Laws
Tian Jin, Ahmed Imtiaz Humayun, Utku Evci et al.
Optimizing importance weighting in the presence of sub-population shifts
Floris Holstege, Bram Wouters, Noud Giersbergen et al.
Training Robust Ensembles Requires Rethinking Lipschitz Continuity
Ali Ebrahimpour Boroojeny, Hari Sundaram, Varun Chandrasekaran
InCoDe: Interpretable Compressed Descriptions For Image Generation
Armand Comas, Aditya Chattopadhyay, Feliu Formosa et al.
Solving Differential Equations with Constrained Learning
Viggo Moro, Luiz Chamon
When narrower is better: the narrow width limit of Bayesian parallel branching neural networks
Zechen Zhang, Haim Sompolinsky
Scalable Universal T-Cell Receptor Embeddings from Adaptive Immune Repertoires
Paidamoyo Chapfuwa, Ilker Demirel, Lorenzo Pisani et al.
Distribution-Specific Agnostic Conditional Classification With Halfspaces
Jizhou Huang, Brendan Juba
Building, Reusing, and Generalizing Abstract Representations from Concrete Sequences
Shuchen Wu, Mirko Thalmann, Peter Dayan et al.
SelKD: Selective Knowledge Distillation via Optimal Transport Perspective
Liangliang Shi, Zhengyan Shi, Junchi Yan
Causal Discovery via Bayesian Optimization
Bao Duong, Sunil Gupta, Thin Nguyen
Contrastive Learning from Synthetic Audio Doppelgängers
Manuel Cherep, Nikhil Singh
Lines of Thought in Large Language Models
Raphaël Sarfati, Toni Liu, Nicolas Boulle et al.
Neuron Platonic Intrinsic Representation From Dynamics Using Contrastive Learning
Wei Wu, Can Liao, Zizhen Deng et al.
On Minimizing Adversarial Counterfactual Error in Adversarial Reinforcement Learning
Roman Belaire, Arunesh Sinha, Pradeep Varakantham
Do Mice Grok? Glimpses of Hidden Progress in Sensory Cortex
Tanishq Kumar, Blake Bordelon, Cengiz Pehlevan et al.
SAVA: Scalable Learning-Agnostic Data Valuation
Samuel Kessler, Tam Le, Vu Nguyen
Boost Self-Supervised Dataset Distillation via Parameterization, Predefined Augmentation, and Approximation
Sheng-Feng Yu, Jia-Jiun Yao, Wei-Chen Chiu
Neural Causal Graph for Interpretable and Intervenable Classification
Jiawei Wang, Shaofei Lu, Da Cao et al.
GALA: Geometry-Aware Local Adaptive Grids for Detailed 3D Generation
Dingdong Yang, Yizhi Wang, Konrad Schindler et al.
Release the Powers of Prompt Tuning: Cross-Modality Prompt Transfer
Ningyuan Zhang, Jie Lu, Keqiuyin Li et al.
Adaptive Batch Size for Privately Finding Second-Order Stationary Points
Daogao Liu, Kunal Talwar
Benign Overfitting in Out-of-Distribution Generalization of Linear Models
Shange Tang, Jiayun Wu, Jianqing Fan et al.
Boosting Methods for Interval-censored Data with Regression and Classification
Yuan Bian, Grace Yi, Wenqing He
The Hidden Cost of Waiting for Accurate Predictions
Ali Shirali, Ariel Procaccia, Rediet Abebe
Shared-AE: Automatic Identification of Shared Subspaces in High-dimensional Neural and Behavioral Activity
Daiyao Yi, Hao Dong, Michael Higley et al.
Alchemy: Amplifying Theorem-Proving Capability Through Symbolic Mutation
Shaonan Wu, Shuai Lu, Yeyun Gong et al.
Near-optimal Active Regression of Single-Index Models
Yi Li, Wai Ming Tai
Integrating Protein Dynamics into Structure-Based Drug Design via Full-Atom Stochastic Flows
Xiangxin Zhou, Yi Xiao, Haowei Lin et al.
Incremental Causal Effect for Time to Treatment Initialization
Andrew Ying, Zhichen Zhao, Ronghui Xu
SleepSMC: Ubiquitous Sleep Staging via Supervised Multimodal Coordination
Shuo Ma, Yingwei Zhang, Yiqiang Chen et al.
The "Law" of the Unconscious Contrastive Learner: Probabilistic Alignment of Unpaired Modalities
Yongwei Che, Benjamin Eysenbach
In vivo cell-type and brain region classification via multimodal contrastive learning
Han Yu, Hanrui Lyu, YiXun Xu et al.
Effective post-training embedding compression via temperature control in contrastive training
Georgiana Dinu, Corey Barrett, Yi Xiang et al.
Joint Reward and Policy Learning with Demonstrations and Human Feedback Improves Alignment
Chenliang Li, Siliang Zeng, Zeyi Liao et al.
Amortized Control of Continuous State Space Feynman-Kac Model for Irregular Time Series
Byoungwoo Park, Hyungi Lee, Juho Lee
SymDiff: Equivariant Diffusion via Stochastic Symmetrisation
Leo Zhang, Kianoosh Ashouritaklimi, Yee Whye Teh et al.
Token-Supervised Value Models for Enhancing Mathematical Problem-Solving Capabilities of Large Language Models
Jung Hyun Lee, June Yong Yang, Byeongho Heo et al.
Improved Convergence Rate for Diffusion Probabilistic Models
Gen Li, Yuchen Jiao
The Illustrated AlphaFold
Elana Simon, Jake Silberg
Denoising Levy Probabilistic Models
Dario Shariatian, Umut Simsekli, Alain Oliviero Durmus
Generalizing Reasoning Problems to Longer Lengths
Changnan Xiao, Bing Liu
Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risks of Language Models
Andy K Zhang, Neil Perry, Riya Dulepet et al.
Learn-by-interact: A Data-Centric Framework For Self-Adaptive Agents in Realistic Environments
Hongjin Su, Ruoxi Sun, Jinsung Yoon et al.
LLM Unlearning via Loss Adjustment with Only Forget Data
Yaxuan Wang, Jiaheng Wei, Yuhao Liu et al.
DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback
Guojun Xiong, Ujwal Dinesha, Debajoy Mukherjee et al.
GPUDrive: Data-driven, multi-agent driving simulation at 1 million FPS
Saman Kazemkhani, Aarav Pandya, Daphne Cornelisse et al.
EdgeRunner: Auto-regressive Auto-encoder for Artistic Mesh Generation
Jiaxiang Tang, Max Li, Zekun Hao et al.
Action Sequence Augmentation for Action Anticipation
Yihui Qiu, Deepu Rajan
Topological Zigzag Spaghetti for Diffusion-based Generation and Prediction on Graphs
Yuzhou Chen, Yulia Gel
VL-Cache: Sparsity and Modality-Aware KV Cache Compression for Vision-Language Model Inference Acceleration
Dezhan Tu, Danylo Vashchilenko, Yuzhe Lu et al.
Learning to Select Nodes in Branch and Bound with Sufficient Tree Representation
Sijia Zhang, Shuli Zeng, Shaoang Li et al.
Collab: Controlled Decoding using Mixture of Agents for LLM Alignment
Souradip Chakraborty, Sujay Bhatt, Udari Sehwag et al.
Computational Explorations of Total Variation Distance
Arnab Bhattacharyya, Sutanu Gayen, Kuldeep S. Meel et al.
Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs
Siyan Zhao, Mingyi Hong, Yang Liu et al.
Convex Formulations for Training Two-Layer ReLU Neural Networks
Karthik Prakhya, Tolga Birdal, Alp Yurtsever
Long-Short Decision Transformer: Bridging Global and Local Dependencies for Generalized Decision-Making
Jincheng Wang, Penny Karanasou, Pengyuan Wei et al.
Accelerating Task Generalisation with Multi-Level Skill Hierarchies
Thomas Cannon, Özgür Şimşek
Learning to Explore and Exploit with GNNs for Unsupervised Combinatorial Optimization
Utku Umur Acikalin, Aaron Ferber, Carla Gomes
HeadMap: Locating and Enhancing Knowledge Circuits in LLMs
Xuehao Wang, Liyuan Wang, Binghuai Lin et al.
MTSAM: Multi-Task Fine-Tuning for Segment Anything Model
Xuehao Wang, Zhan Zhuang, Feiyang Ye et al.
It Helps to Take a Second Opinion: Teaching Smaller LLMs To Deliberate Mutually via Selective Rationale Optimisation
Sohan Patnaik, Milan Aggarwal, Sumit Bhatia et al.
Do not write that jailbreak paper
Javier Rando
OPTAMI: Global Superlinear Convergence of High-order Methods
Dmitry Kamzolov, Artem Agafonov, Dmitry Pasechnyuk et al.
DELTA: Dense Efficient Long-Range 3D Tracking for Any Video
Tuan Ngo, Peiye Zhuang, Evangelos Kalogerakis et al.
BlendRL: A Framework for Merging Symbolic and Neural Policy Learning
Hikaru Shindo, Quentin Delfosse, Devendra Singh Dhami et al.
MGMapNet: Multi-Granularity Representation Learning for End-to-End Vectorized HD Map Construction
Jing Yang, Minyue Jiang, Sen Yang et al.
Chain-of-Focus Prompting: Leveraging Sequential Visual Cues to Prompt Large Autoregressive Vision Models
Jiyang Zheng, Jialiang Shen, Yu Yao et al.
Towards Unbiased Learning in Semi-Supervised Semantic Segmentation
Rui Sun, Huayu Mai, Wangkai Li et al.
AIR-BENCH 2024: A Safety Benchmark based on Regulation and Policies Specified Risk Categories
Yi Zeng, Yu Yang, Andy Zhou et al.
YouTube-SL-25: A Large-Scale, Open-Domain Multilingual Sign Language Parallel Corpus
Garrett Tanzer, Biao Zhang
Learning Causal Alignment for Reliable Disease Diagnosis
Mingzhou Liu, Ching-Wen Lee, Xinwei Sun et al.
Curriculum-aware Training for Discriminating Molecular Property Prediction Models
Hansi Yang, Quanming Yao, James Kwok
“I Am the One and Only, Your Cyber BFF”: Understanding the Impact of GenAI Requires Understanding the Impact of Anthropomorphic AI
Myra Cheng, Alicia DeVrio, Lisa Egede et al.
Immunogenicity Prediction with Dual Attention Enables Vaccine Target Selection
Song Li, Yang Tan, Song Ke et al.
Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF
Tengyang Xie, Dylan Foster, Akshay Krishnamurthy et al.
Online Reward-Weighted Fine-Tuning of Flow Matching with Wasserstein Regularization
Jiajun Fan, Shuaike Shen, Chaoran Cheng et al.
ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation
Tianchen Zhao, Tongcheng Fang, Haofeng Huang et al.
TexTailor: Customized Text-aligned Texturing via Effective Resampling
Suin Lee, Dae Shik Kim
On the Adversarial Risk of Test Time Adaptation: An Investigation into Realistic Test-Time Data Poisoning
Yongyi Su, Yushu Li, Nanqing Liu et al.
Credit-based self organizing maps: training deep topographic networks with minimal performance degradation
Amir Ozhan Dehghani, Xinyu Qian, Asa Farahani et al.
KAA: Kolmogorov-Arnold Attention for Enhancing Attentive Graph Neural Networks
Taoran Fang, Tianhong Gao, Chunping Wang et al.
Towards Non-Asymptotic Convergence for Diffusion-Based Generative Models
Gen Li, Yuting Wei, Yuxin Chen et al.
DECO: Unleashing the Potential of ConvNets for Query-based Detection and Segmentation
Xinghao Chen, Siwei Li, Yijing Yang et al.
A Theoretical Framework for Partially-Observed Reward States in RLHF
Chinmaya Kausik, Mirco Mutti, Aldo Pacchiano et al.
Shallow diffusion networks provably learn hidden low-dimensional structure
Nicholas Boffi, Arthur Jacot, Stephen Tu et al.
Revisiting Energy Based Models as Policies: Ranking Noise Contrastive Estimation and Interpolating Energy Models
Sumeet Singh, Vikas Sindhwani, Stephen Tu
HR-Extreme: A High-Resolution Dataset for Extreme Weather Forecasting
Nian Ran, Peng Xiao, Yue Wang et al.
Jamba: Hybrid Transformer-Mamba Language Models
Barak Lenz, Opher Lieber, Alan Arazi et al.
Positive-Unlabeled Diffusion Models for Preventing Sensitive Data Generation
Hiroshi Takahashi, Tomoharu Iwata, Atsutoshi Kumagai et al.
MAI: A Multi-turn Aggregation-Iteration Model for Composed Image Retrieval
Yanzhe Chen, Zhiwen Yang, Jinglin Xu et al.
kNN Attention Demystified: A Theoretical Exploration for Scalable Transformers
Themistoklis Haris
Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment
Huayu Chen, Hang Su, Peize Sun et al.
Rethinking and Improving Autoformalization: Towards a Faithful Metric and a Dependency Retrieval-based Approach
Qi Liu, Xinhao Zheng, Xudong Lu et al.
Learning Structured Universe Graph with Outlier OOD Detection for Partial Matching
Zetian Jiang, Jiaxin Lu, Haizhao Fan et al.
What Secrets Do Your Manifolds Hold? Understanding the Local Geometry of Generative Models
Ahmed Imtiaz Humayun, Ibtihel Amara, Cristina Nader Vasconcelos et al.
Progressive Token Length Scaling in Transformer Encoders for Efficient Universal Segmentation
Abhishek Aich, Yumin Suh, Samuel Schulter et al.
Forget the Data and Fine-Tuning! Just Fold the Network to Compress
Dong Wang, Haris Šikić, Lothar Thiele et al.
Learning Geometric Reasoning Networks For Robot Task And Motion Planning
Smail Ait Bouhsain, Rachid Alami, Thierry Simeon
Broaden your SCOPE! Efficient Multi-turn Conversation Planning for LLMs with Semantic Space
Zhiliang Chen, Xinyuan Niu, Chuan Sheng Foo et al.
RECAST: Reparameterized, Compact weight Adaptation for Sequential Tasks
Nazia Tasnim, Bryan Plummer
DocMIA: Document-Level Membership Inference Attacks against DocVQA Models
Khanh Nguyen, Raouf Kerkouche, Mario Fritz et al.
Semi-Parametric Retrieval via Binary Bag-of-Tokens Index
Jiawei Zhou, Li Dong, Furu Wei et al.
NeurFlow: Interpreting Neural Networks through Neuron Groups and Functional Interactions
Tue Cao, Nhat Hoang-Xuan, Hieu Pham et al.
Generalized Consistency Trajectory Models for Image Manipulation
Beomsu Kim, Jaemin Kim, Jeongsol Kim et al.
An Information Criterion for Controlled Disentanglement of Multimodal Data
Chenyu Wang, Sharut Gupta, Xinyi Zhang et al.
Uncertainty-Aware Decoding with Minimum Bayes Risk
Nico Daheim, Clara Meister, Thomas Möllenhoff et al.
Learning to Search from Demonstration Sequences
Dixant Mittal, Liwei Kang, Wee Sun Lee
Fantastic Targets for Concept Erasure in Diffusion Models and Where To Find Them
Anh Bui, Thuy-Trang Vu, Long Vuong et al.
Cross-Domain Off-Policy Evaluation and Learning for Contextual Bandits
Yuta Natsubori, Masataka Ushiku, Yuta Saito
Tracking objects that change in appearance with phase synchrony
Sabine Muzellec, Drew Linsley, Alekh Ashok et al.
Descent with Misaligned Gradients and Applications to Hidden Convexity
Aditya Bhaskara, Ashok Cutkosky, Ravi Kumar et al.
On the Price of Differential Privacy for Hierarchical Clustering
Chengyuan Deng, Jie Gao, Jalaj Upadhyay et al.
Diffusion State-Guided Projected Gradient for Inverse Problems
Rayhan Zirvi, Bahareh Tolooshams, Anima Anandkumar
Improving Pretraining Data Using Perplexity Correlations
Tristan Thrush, Christopher Potts, Tatsunori Hashimoto
Towards Unbiased Calibration using Meta-Regularization
Jacek Golebiowski, Cheng Wang
A Distributional Approach to Uncertainty-Aware Preference Alignment Using Offline Demonstrations
Sheng Xu, Bo Yue, Hongyuan Zha et al.
Estimating the Probabilities of Rare Outputs in Language Models
Gabriel Wu, Jacob Hilton
Methods with Local Steps and Random Reshuffling for Generally Smooth Non-Convex Federated Optimization
Yury Demidovich, Petr Ostroukhov, Grigory Malinovsky et al.
Atomas: Hierarchical Adaptive Alignment on Molecule-Text for Unified Molecule Understanding and Generation
Yikun Zhang, Geyan Ye, Chaohao Yuan et al.
Differential Transformer
Tianzhu Ye, Li Dong, Yuqing Xia et al.
Brain Bandit: A Biologically Grounded Neural Network for Efficient Control of Exploration
Chen Jiang, Jiahui An, Yating Liu et al.
Exponential Topology-enabled Scalable Communication in Multi-agent Reinforcement Learning
Xinran Li, Xiaolu Wang, Chenjia Bai et al.
Mind the GAP: Glimpse-based Active Perception improves generalization and sample efficiency of visual reasoning
Oleh Kolner, Thomas Ortner, Stanisław Woźniak et al.
Test of Time: A Benchmark for Evaluating LLMs on Temporal Reasoning
Bahare Fatemi, Seyed Mehran Kazemi, Anton Tsitsulin et al.
Accelerated Over-Relaxation Heavy-Ball Method: Achieving Global Accelerated Convergence with Broad Generalization
Jingrong Wei, Long Chen
Training Free Guided Flow-Matching with Optimal Control
Luran Wang, Chaoran Cheng, Yizhen Liao et al.
S$^2$AC: Energy-Based Reinforcement Learning with Stein Soft Actor Critic
Safa Messaoud, Billel Mokeddem, Zhenghai Xue et al.
On Discriminative Probabilistic Modeling for Self-Supervised Representation Learning
Bokun Wang, Yunwen Lei, Yiming Ying et al.
Pre-training of Foundation Adapters for LLM Fine-tuning
Linh The Nguyen, Dat Quoc Nguyen
NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation
Zhiyuan Liu, Yanchen Luo, Han Huang et al.
NeuroLM: A Universal Multi-task Foundation Model for Bridging the Gap between Language and EEG Signals
Wei-Bang Jiang, Yansen Wang, Bao-liang Lu et al.
Robust Transfer of Safety-Constrained Reinforcement Learning Agents
Markel Zubia, Thiago Simão, Nils Jansen
From Sparse Dependence to Sparse Attention: Unveiling How Chain-of-Thought Enhances Transformer Sample Efficiency
Kaiyue Wen, Huaqing Zhang, Hongzhou Lin et al.
A deep inverse-mapping model for a flapping robotic wing
Hadar Sharvit, Raz Karl, Tsevi Beatus
VQGraph: Rethinking Graph Representation Space for Bridging GNNs and MLPs
Ling Yang, Ye Tian, Minkai Xu et al.
Generalizable Human Gaussians from Single-View Image
Jinnan Chen, Chen Li, Jianfeng Zhang et al.
Stochastic variance-reduced Gaussian variational inference on the Bures-Wasserstein manifold
Hoang Phuc Hau Luu, Hanlin Yu, Bernardo Williams et al.
Efficient Interpolation between Extragradient and Proximal Methods for Weak MVIs
Thomas Pethick, Ioannis Mavrothalassitis, Volkan Cevher
Plug, Play, and Generalize: Length Extrapolation with Pointer-Augmented Neural Memory
Svetha Venkatesh, Kien Do, Hung Le et al.
Two Effects, One Trigger: On the Modality Gap, Object Bias, and Information Imbalance in Contrastive Vision-Language Models
Simon Schrodi, David T. Hoffmann, Max Argus et al.
A Linear Algebraic Framework for Counterfactual Generation
Jong-Hoon Ahn, Akshay Vashist
Rapid Selection and Ordering of In-Context Demonstrations via Prompt Embedding Clustering
Kha Pham, Hung Le, Man Ngo et al.
MA$^2$E: Addressing Partial Observability in Multi-Agent Reinforcement Learning with Masked Auto-Encoder
Sehyeok Kang, Yongsik Lee, Gahee Kim et al.
VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking
Runyi Hu, Jie Zhang, Yiming Li et al.
Influence Functions for Scalable Data Attribution in Diffusion Models
Bruno Mlodozeniec, Runa Eschenhagen, Juhan Bae et al.
Exact Byte-Level Probabilities from Tokenized Language Models for FIM-Tasks and Model Ensembles
Buu Phan, Brandon Amos, Itai Gat et al.
Does Training with Synthetic Data Truly Protect Privacy?
Yunpeng Zhao, Jie Zhang
Improved Training Technique for Latent Consistency Models
Minh Quan Dao, Khanh Doan, Di Liu et al.
Learning to Reject Meets Long-tail Learning
Harikrishna Narasimhan, Aditya Krishna Menon, Wittawat Jitkrittum et al.
GETS: Ensemble Temperature Scaling for Calibration in Graph Neural Networks
Dingyi Zhuang, Chonghe Jiang, Yunhan Zheng et al.
NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals
Jaden Fiotto-Kaufman, Alexander Loftus, Eric Todd et al.
GeoX: Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training
Renqiu Xia, Mingsheng Li, Hancheng Ye et al.
Safety Layers in Aligned Large Language Models: The Key to LLM Security
Shen Li, Liuyi Yao, Lan Zhang et al.
Learning Harmonized Representations for Speculative Sampling
Lefan Zhang, Xiaodan Wang, Yanhua Huang et al.
Relation-Aware Diffusion for Heterogeneous Graphs with Partially Observed Features
Daeho Um, Yoonji Lee, Jiwoong Park et al.
Bayesian Optimization via Continual Variational Last Layer Training
Paul Brunzema, Mikkel Jordahn, John Willes et al.
KinFormer: Generalizable Dynamical Symbolic Regression for Catalytic Organic Reaction Kinetics
Jindou Chen, Jidong Tian, Liang Wu et al.
SimpleTM: A Simple Baseline for Multivariate Time Series Forecasting
Hui Chen, Viet Luong, Lopamudra Mukherjee et al.
CLIPure: Purification in Latent Space via CLIP for Adversarially Robust Zero-Shot Classification
Mingkun Zhang, Keping Bi, Wei Chen et al.
REMEDY: Recipe Merging Dynamics in Large Vision-Language Models
Didi Zhu, Yibing Song, Tao Shen et al.
Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality
Guanyu Zhou, Yibo Yan, Xin Zou et al.
Differentiable and Learnable Wireless Simulation with Geometric Transformers
Thomas Hehn, Markus Peschl, Tribhuvanesh Orekondy et al.
Noise Separation guided Candidate Label Reconstruction for Noisy Partial Label Learning
Xiaorui Peng, Yuheng Jia, Fuchao Yang et al.
MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation
Chenxi Wang, Xiang Chen, Ningyu Zhang et al.
Neural ODE Transformers: Analyzing Internal Dynamics and Adaptive Fine-tuning
Anh Tong, Thanh Nguyen-Tang, Dongeun Lee et al.
MMD-Regularized Unbalanced Optimal Transport
SakethaNath Jagarlapudi, Pratik Jawanpuria, Piyushi Manupriya
VibeCheck: Discover and Quantify Qualitative Differences in Large Language Models
Lisa Dunlap, Krishna Mandal, Trevor Darrell et al.
Projection Head is Secretly an Information Bottleneck
Zhuo Ouyang, Kaiwen Hu, Qi Zhang et al.
Bias Mitigation in Graph Diffusion Models
Meng Yu, Kun Zhan
Causal Graphical Models for Vision-Language Compositional Understanding
Fiorenzo Parascandolo, Nicholas Moratelli, Enver Sangineto et al.
An operator preconditioning perspective on training in physics-informed machine learning
Tim De Ryck, Florent Bonnet, Siddhartha Mishra et al.
Scalable Extraction of Training Data from Aligned, Production Language Models
Milad Nasr, Javier Rando, Nicholas Carlini et al.
SOAP: Improving and Stabilizing Shampoo using Adam for Language Modeling
Nikhil Vyas, Depen Morwani, Rosie Zhao et al.
When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers
Hongkang Li, Yihua Zhang, Shuai Zhang et al.
SINGAPO: Single Image Controlled Generation of Articulated Parts in Objects
Jiayi Liu, Denys Iliash, Angel Chang et al.
O(d/T) Convergence Theory for Diffusion Probabilistic Models under Minimal Assumptions
Gen Li, Yuling Yan
OLMoE: Open Mixture-of-Experts Language Models
Niklas Muennighoff, Luca Soldaini, Dirk Groeneveld et al.
Deconstructing What Makes a Good Optimizer for Autoregressive Language Models
Rosie Zhao, Depen Morwani, David Brandfonbrener et al.
The Ramanujan Library - Automated Discovery on the Hypergraph of Integer Relations
Itay Beit Halachmi, Ido Kaminer
Bayesian Image Regression with Soft-thresholded Conditional Autoregressive Prior
Yuliang Xu, Jian Kang
Learning to Clarify: Multi-turn Conversations with Action-Based Contrastive Self-Training
Maximillian Chen, Ruoxi Sun, Tomas Pfister et al.
Understanding Methods for Scalable MCTS
Will Knipe
Approaching Rate-Distortion Limits in Neural Compression with Lattice Transform Coding
Eric Lei, Hamed Hassani, Shirin Saeedi Bidokhti
REFINE: Inversion-Free Backdoor Defense via Model Reprogramming
Yukun Chen, Shuo Shao, Enhao Huang et al.
LongVILA: Scaling Long-Context Visual Language Models for Long Videos
Yukang Chen, Fuzhao Xue, Dacheng Li et al.
GaussianAnything: Interactive Point Cloud Flow Matching for 3D Generation
Yushi Lan, Shangchen Zhou, Zhaoyang Lyu et al.
Hindsight PRIORs for Reward Learning from Human Preferences
Mudit Verma, Katherine Metcalf
Sparse components distinguish visual pathways & their alignment to neural networks
Ammar I Marvi, Nancy Kanwisher, Meenakshi Khosla
SmartPretrain: Model-Agnostic and Dataset-Agnostic Representation Learning for Motion Prediction
Yang Zhou, Hao Shao, Letian Wang et al.
MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation
Lu Li, Tianyu Zhang, Zhiqi Bu et al.
Nonlinear Sequence Embedding by Monotone Variational Inequality
Jonathan Y. Zhou, Yao Xie
Quantum (Inspired) $D^2$-sampling with Applications
Poojan Shah, Ragesh Jaiswal