Most Cited ICML "data curriculum" Papers
Rethinking Benign Overfitting in Two-Layer Neural Networks
Ruichen Xu, Kexin Chen
Core Knowledge Deficits in Multi-Modal Language Models
Yijiang Li, Qingying Gao, Tianwei Zhao et al.
A Sub-Problem Quantum Alternating Operator Ansatz for Correlation Clustering
Lucas Fabian Naumann, Jannik Irmai, Bjoern Andres
Sidechain conditioning and modeling for full-atom protein sequence design with FAMPNN
Talal Widatalla, Richard Shuai, Brian Hie et al.
CollabLLM: From Passive Responders to Active Collaborators
Shirley Wu, Michel Galley, Baolin Peng et al.
Efficient Core-set Selection for Deep Learning Through Squared Loss Minimization
Jianting Chen
Token Signature: Predicting Chain-of-Thought Gains with Token Decoding Feature in Large Language Models
peijie liu, Fengli Xu, Yong Li
On-the-Fly Adaptive Distillation of Transformer to Dual-State Linear Attention for Long-Context LLM Serving
Yeonju Ro, Zhenyu Zhang, Souvik Kundu et al.
Transfer Q-Learning with Composite MDP Structures
Jinhang Chai, Elynn Chen, Lin Yang
TabSDS: a Lightweight, Fully Non-Parametric, and Model Free Approach for Generating Synthetic Tabular Data
Elias Chaibub Neto
MultiPDENet: PDE-embedded Learning with Multi-time-stepping for Accelerated Flow Simulation
Qi Wang, Yuan Mi, Wang Haoyun et al.
Learning Input Encodings for Kernel-Optimal Implicit Neural Representations
Zhemin Li, Liyuan Ma, Hongxia Wang et al.
Taming Diffusion for Dataset Distillation with High Representativeness
Lin Zhao, Yushu Wu, Xinru Jiang et al.
Learning Utilities from Demonstrations in Markov Decision Processes
Filippo Lazzati, Alberto Maria Metelli
Clients Collaborate: Flexible Differentially Private Federated Learning with Guaranteed Improvement of Utility-Privacy Trade-off
Yuecheng Li, Lele Fu, Tong Wang et al.
DiffusionVLA: Scaling Robot Foundation Models via Unified Diffusion and Autoregression
Junjie Wen, Yichen Zhu, Minjie Zhu et al.
A Online Statistical Framework for Out-of-Distribution Detection
Xinsong Ma, Xin Zou, Weiwei Liu
TLLC: Transfer Learning-based Label Completion for Crowdsourcing
Wenjun Zhang, Liangxiao Jiang, Chaoqun Li
Batch List-Decodable Linear Regression via Higher Moments
Ilias Diakonikolas, Daniel Kane, Sushrut Karmalkar et al.
Outsourced Diffusion Sampling: Efficient Posterior Inference in Latent Spaces of Generative Models
Siddarth Venkatraman, Mohsin Hasan, Minsu Kim et al.
FEAT-KD: Learning Concise Representations for Single and Multi-Target Regression via TabNet Knowledge Distillation
Kei Sen Fong, Mehul Motani
Safety Reasoning with Guidelines
Haoyu Wang, Zeyu Qin, Li Shen et al.
DyPolySeg: Taylor Series-Inspired Dynamic Polynomial Fitting Network for Few-shot Point Cloud Semantic Segmentation
Changshuo Wang, Xiang Fang, Prayag Tiwari
LEVIS: Large Exact Verifiable Input Spaces for Neural Networks
Mohamad Chehade, Wenting Li, Brian Bell et al.
Online Learning in the Random-Order Model
Martino Bernasconi, Andrea Celli, Riccardo Colini Baldeschi et al.
Sum-of-Parts: Self-Attributing Neural Networks with End-to-End Learning of Feature Groups
Weiqiu You, Helen Qu, Marco Gatti et al.
One-dimensional Path Convolution
Xuanshu Luo, Martin Werner
Efficient Optimization with Orthogonality Constraint: a Randomized Riemannian Submanifold Method
Andi Han, Pierre-Louis Poirion, Akiko Takeda
Private Model Personalization Revisited
Conor Snedeker, Xinyu Zhou, Raef Bassily
Harmonizing Geometry and Uncertainty: Diffusion with Hyperspheres
Muskan Dosi, Chiranjeev Chiranjeev, Kartik Thakral et al.
MVA: Linear Attention with High-order Query-Keys Integration and Multi-level Vocabulary Decomposition
ning wang, Zekun Li, Tongxin Bai et al.
Stochastic Encodings for Active Feature Acquisition
Alexander Norcliffe, Changhee Lee, Fergus Imrie et al.
Elucidating the Design Space of Multimodal Protein Language Models
Cheng-Yen Hsieh, Xinyou Wang, Daiheng Zhang et al.
Maximum Total Correlation Reinforcement Learning
Bang You, Puze Liu, Huaping Liu et al.
HetSSNet: Spatial-Spectral Heterogeneous Graph Learning Network for Panchromatic and Multispectral Images Fusion
Mengting Ma, Yizhen Jiang, Mengjiao Zhao et al.
Skip the Equations: Learning Behavior of Personalized Dynamical Systems Directly From Data
Krzysztof Kacprzyk, Julianna Piskorz, Mihaela van der Schaar
Event-Customized Image Generation
Zhen Wang, Yilei JIANG, Dong Zheng et al.
CSV-Occ: Fusing Multi-frame Alignment for Occupancy Prediction with Temporal Cross State Space Model and Central Voting Mechanism
Ziming Zhu, Yu Zhu, Jiahao Chen et al.
Phase and Amplitude-aware Prompting for Enhancing Adversarial Robustness
Yibo Xu, Dawei Zhou, Decheng Liu et al.
Unified Screening for Multiple Diseases
Yiğit Narter, Alihan Hüyük, Mihaela van der Schaar et al.
ProDiff: Prototype-Guided Diffusion for Minimal Information Trajectory Imputation
Tianci Bu, Le Zhou, Wenchuan Yang et al.
Self-supervised Masked Graph Autoencoder via Structure-aware Curriculum
Haoyang Li, Xin Wang, Zeyang Zhang et al.
Generalization and Robustness of the Tilted Empirical Risk
Gholamali Aminian, Amir R. Asadi, Tian Li et al.
Multinoulli Extension: A Lossless Yet Effective Probabilistic Framework for Subset Selection over Partition Constraints
Qixin Zhang, Wei Huang, Can Jin et al.
Feature Shift Localization Network
Míriam Barrabés, Daniel Mas Montserrat, Kapal Dev et al.
Dynamic Similarity Graph Construction with Kernel Density Estimation
Steinar Laenen, Peter Macgregor, He Sun
Revealing Weaknesses in Text Watermarking Through Self-Information Rewrite Attacks
Yixin Cheng, Hongcheng Guo, Yangming Li et al.
Balancing the Scales: A Theoretical and Algorithmic Framework for Learning from Imbalanced Data
Corinna Cortes, Anqi Mao, Mehryar Mohri et al.
Geometry Informed Tokenization of Molecules for Language Model Generation
Xiner Li, Limei Wang, Youzhi Luo et al.
Online Learning in Risk Sensitive constrained MDP
Arnob Ghosh, Mehrdad Moharrami
Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts
Yike Yuan, Ziyu Wang, Zihao Huang et al.
EGPlace: An Efficient Macro Placement Method via Evolutionary Search with Greedy Repositioning Guided Mutation
ji deng, Zhao Li, Ji Zhang et al.
When to retrain a machine learning model
Florence Regol, Leo Schwinn, Kyle Sprague et al.
Improving Your Model Ranking on Chatbot Arena by Vote Rigging
Rui Min, Tianyu Pang, Chao Du et al.
Compelling ReLU Networks to Exhibit Exponentially Many Linear Regions at Initialization and During Training
Max Milkert, David Hyde, Forrest Laine
Non-Asymptotic and Non-Lipschitzian Bounds on Optimal Values in Stochastic Optimization Under Heavy Tails
Jindong Tong, Hongcheng Liu, Johannes Royset
BAME: Block-Aware Mask Evolution for Efficient N:M Sparse Training
Chenyi yang, Wenjie Nie, Yuxin Zhang et al.
Hierarchical Overlapping Clustering on Graphs: Cost Function, Algorithm and Scalability
Yicheng Pan, Renjie Chen, Pengyu Long et al.
Adversarial Robust Generalization of Graph Neural Networks
Chang Cao, Han Li, Yulong Wang et al.
CoPINN: Cognitive Physics-Informed Neural Networks
Siyuan Duan, Wenyuan Wu, Peng Hu et al.
What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensional Benchmark for Essential Virtual Agent Capabilities
Wendong Bu, Yang Wu, Qifan Yu et al.
Compositional Flows for 3D Molecule and Synthesis Pathway Co-design
Tony Shen, Seonghwan Seo, Ross Irwin et al.
SLiM: One-shot Quantization and Sparsity with Low-rank Approximation for LLM Weight Compression
Mohammad Mozaffari, Amir Yazdanbakhsh, Maryam Mehri Dehnavi
Habitizing Diffusion Planning for Efficient and Effective Decision Making
Haofei Lu, Yifei Shen, Dongsheng Li et al.
MARS: Unleashing the Power of Variance Reduction for Training Large Models
Huizhuo Yuan, Yifeng Liu, Shuang Wu et al.
FedBEns: One-Shot Federated Learning based on Bayesian Ensemble
Jacopo Talpini, Marco Savi, Giovanni Neglia
Do We Need to Verify Step by Step? Rethinking Process Supervision from a Theoretical Perspective
Zeyu Jia, Alexander Rakhlin, Tengyang Xie
In-Context Linear Regression Demystified: Training Dynamics and Mechanistic Interpretability of Multi-Head Softmax Attention
Jianliang He, Xintian Pan, Siyu Chen et al.
Fast Tensor Completion via Approximate Richardson Iteration
Mehrdad Ghadiri, Matthew Fahrbach, Yunbum Kook et al.
Safe-EF: Error Feedback for Non-smooth Constrained Optimization
Rustem Islamov, Yarden As, Ilyas Fatkhullin
Interchangeable Token Embeddings for Extendable Vocabulary and Alpha-Equivalence
İlker Işık, Ramazan Gokberk Cinbis, Ebru Gol
KoopSTD: Reliable Similarity Analysis between Dynamical Systems via Approximating Koopman Spectrum with Timescale Decoupling
Shimin Zhang, Ziyuan Ye, Yinsong Yan et al.
Iterative Vectors: In-Context Gradient Steering without Backpropagation
Yiting Liu, Zhi-Hong Deng
Mutual Learning for SAM Adaptation: A Dual Collaborative Network Framework for Source-Free Domain Transfer
Yabo Liu, Waikeung Wong, Chengliang Liu et al.
Balancing Model Efficiency and Performance: Adaptive Pruner for Long-tailed Data
Zhe Zhao, HaiBin Wen, Pengkun Wang et al.
Learning to Reuse Policies in State Evolvable Environments
Ziqian Zhang, Bohan Yang, Lihe Li et al.
Pruning for GNNs: Lower Complexity with Comparable Expressiveness
Dun Ma, Jianguo Chen, Wenguo Yang et al.
Learning Mean Field Control on Sparse Graphs
Christian Fabian, Kai Cui, Heinz Koeppl
Spurious Correlations in High Dimensional Regression: The Roles of Regularization, Simplicity Bias and Over-Parameterization
Simone Bombari, Marco Mondelli
SCENT: Robust Spatiotemporal Learning for Continuous Scientific Data via Scalable Conditioned Neural Fields
David K Park, Xihaier Luo, Guang Zhao et al.
E-LDA: Toward Interpretable LDA Topic Models with Strong Guarantees in Logarithmic Parallel Time
Adam Breuer
Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation
CHUANQI CHENG, Jian Guan, Wei Wu et al.
ATA: Adaptive Task Allocation for Efficient Resource Management in Distributed Machine Learning
Artavazd Maranjyan, El Mehdi Saad, Peter Richtarik et al.
Adaptive Flow Matching for Resolving Small-Scale Physics
Stathi Fotiadis, Noah Brenowitz, Tomas Geffner et al.
Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More
Xialie Zhuang, Zhikai Jia, Jianjin Li et al.
DLP: Dynamic Layerwise Pruning in Large Language Models
Yuli Chen, Bo Cheng, Jiale Han et al.
Complete-Tree Space Favors Data-Efficient Link Prediction
Chi Gao, Lukai Li, Yancheng Zhou et al.
Weisfeiler and Leman Go Gambling: Why Expressive Lottery Tickets Win
Lorenz Kummer, Samir Moustafa, Anatol Ehrlich et al.
Analytical Construction on Geometric Architectures: Transitioning from Static to Temporal Link Prediction
Yadong Sun, Xiaofeng Cao, Ivor Tsang et al.
EcoMapper: Generative Modeling for Climate-Aware Satellite Imagery
Muhammed Göktepe, Amir Hossein Shamseddin, Erencan Uysal et al.
Semi-Supervised Blind Quality Assessment with Confidence-quantifiable Pseudo-label Learning for Authentic Images
Yan Zhong, Chenxi Yang, Suyuan Zhao et al.
GraphGPT: Generative Pre-trained Graph Eulerian Transformer
Qifang Zhao, Weidong Ren, Tianyu Li et al.
Diverse Prototypical Ensembles Improve Robustness to Subpopulation Shift
Nguyen Nhat Minh To, Paul Wilson, Viet Nguyen et al.
Neural Event-Triggered Control with Optimal Scheduling
Luan Yang, Jingdong Zhang, Qunxi Zhu et al.
Near-optimal Sketchy Natural Gradients for Physics-Informed Neural Networks
Maricela Best Mckay, Avleen Kaur, Chen Greif et al.
Assessing Safety Risks and Quantization-aware Safety Patching for Quantized Large Language Models
Kejia Chen, Jiawen Zhang, Jiacong Hu et al.
Nesterov Method for Asynchronous Pipeline Parallel Optimization
Thalaiyasingam Ajanthan, Sameera Ramasinghe, Yan Zuo et al.
MaskTwins: Dual-form Complementary Masking for Domain-Adaptive Image Segmentation
Jiawen Wang, Yinda Chen, Xiaoyu Liu et al.
Improved Discretization Complexity Analysis of Consistency Models: Variance Exploding Forward Process and Decay Discretization Scheme
Ruofeng Yang, Bo Jiang, Cheng Chen et al.
Symmetry-Driven Discovery of Dynamical Variables in Molecular Simulations
Jeet Mohapatra, Nima Dehmamy, Csaba Both et al.
EgoPrivacy: What Your First-Person Camera Says About You?
Yijiang Li, Genpei Zhang, Jiacheng Cheng et al.
On the Role of Label Noise in the Feature Learning Process
Andi Han, Wei Huang, Zhanpeng Zhou et al.
Mixture of Experts Provably Detect and Learn the Latent Cluster Structure in Gradient-Based Learning
Ryotaro Kawata, Kohsei Matsutani, Yuri Kinoshita et al.
MARGE: Improving Math Reasoning with Guided Exploration
Jingyue Gao, Runji Lin, Keming Lu et al.
Physics Aware Neural Networks for Unsupervised Binding Energy Prediction
Ke Liu, Hao Chen, Chunhua Shen
RLTHF: Targeted Human Feedback for LLM Alignment
Yifei Xu, Tusher Chakraborty, Emre Kiciman et al.
FOUNDER: Grounding Foundation Models in World Models for Open-Ended Embodied Decision Making
Yucen Wang, Rui Yu, Shenghua Wan et al.
Adversarial Combinatorial Semi-bandits with Graph Feedback
Yuxiao Wen
Breaking the $n^{1.5}$ Additive Error Barrier for Private and Efficient Graph Sparsification via Private Expander Decomposition
Anders Aamand, Justin Chen, Mina Dalirrooyfard et al.
Heterogeneous Label Shift: Theory and Algorithm
Chao Xu, Xijia Tang, Chenping Hou
Improved Approximations for Hard Graph Problems using Predictions
Anders Aamand, Justin Chen, Siddharth Gollapudi et al.
MoHAVE: Mixture of Hierarchical Audio-Visual Experts for Robust Speech Recognition
Sungnyun Kim, Kangwook Jang, Sangmin Bae et al.
Ranked from Within: Ranking Large Multimodal Models Without Labels
Weijie Tu, Weijian Deng, Dylan Campbell et al.
Knowledge Retention in Continual Model-Based Reinforcement Learning
Haotian Fu, Yixiang Sun, Michael L. Littman et al.
Reinforcement Learning Control of a Physical Robot Device for Assisted Human Walking without a Simulator
junmin zhong, Emiliano Quinones Yumbla, Seyed Yousef Soltanian et al.
Adaptive Sample Sharing for Multi Agent Linear Bandits
Hamza Cherkaoui, Merwan Barlier, Igor Colin
Synthetic Text Generation for Training Large Language Models via Gradient Matching
Dang Nguyen, Zeman Li, MohammadHossein Bateni et al.
Principled Algorithms for Optimizing Generalized Metrics in Binary Classification
Anqi Mao, Mehryar Mohri, Yutao Zhong
The Limits of Predicting Agents from Behaviour
Alexis Bellot, Jonathan Richens, Tom Everitt
Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism
Aviv Bick, Eric Xing, Albert Gu
Risk-Sensitive Theory of Mind: Coordinating with Agents of Unknown Bias using Cumulative Prospect Theory
Mason O. Smith, Wenlong Zhang
Rényi Neural Processes
Xuesong Wang, He Zhao, Edwin V. Bonilla
SECOND: Mitigating Perceptual Hallucination in Vision-Language Models via Selective and Contrastive Decoding
Woohyeon Park, Woojin Kim, Jaeik Kim et al.
Robust Sparsification via Sensitivity
Chansophea Wathanak In, Yi Li, David Woodruff et al.
Federated Incomplete Multi-view Clustering with Globally Fused Graph Guidance
Guoqing Chao, Zhenghao Zhang, Lei Meng et al.
The Sparse-Plus-Low-Rank Quasi-Newton Method for Entropic-Regularized Optimal Transport
Chenrui Wang, Yixuan Qiu
Compositional Condition Question Answering in Tabular Understanding
Jun-Peng Jiang, Tao Zhou, De-Chuan Zhan et al.
The Complexity of Learning Sparse Superposed Features with Feedback
Akash Kumar
Textual Unlearning Gives a False Sense of Unlearning
Jiacheng Du, Zhibo Wang, Jie Zhang et al.
Bifurcate then Alienate: Incomplete Multi-view Clustering via Coupled Distribution Learning with Linear Overhead
Shengju Yu, Yiu-ming Cheung, Siwei Wang et al.
Offline Learning for Combinatorial Multi-armed Bandits
Xutong Liu, Xiangxiang Dai, Jinhang Zuo et al.
HyperIV: Real-time Implied Volatility Smoothing
Yongxin Yang, Wenqi Chen, Chao Shu et al.
SAM2Act: Integrating Visual Foundation Model with A Memory Architecture for Robotic Manipulation
Haoquan Fang, Markus Grotz, Wilbert Pumacay et al.
FLAM: Frame-Wise Language-Audio Modeling
Yusong Wu, Christos Tsirigotis, Ke Chen et al.
Geometric Resampling in Nearly Linear Time for Follow-the-Perturbed-Leader with Best-of-Both-Worlds Guarantee in Bandit Problems
Botao Chen, Jongyeong Lee, Junya Honda
Provably Cost-Sensitive Adversarial Defense via Randomized Smoothing
Yuan Xin, Dingfan Chen, Michael Backes et al.
Splitting & Integrating: Out-of-Distribution Detection via Adversarial Gradient Attribution
Jiayu Zhang, Xinyi Wang, Zhibo Jin et al.
IRBridge: Solving Image Restoration Bridge with Pre-trained Generative Diffusion Models
Hanting Wang, Tao Jin, Wang Lin et al.
A Computationally Efficient Algorithm for Infinite-Horizon Average-Reward Linear MDPs
Kihyuk Hong, Ambuj Tewari
SeedLoRA: A Fusion Approach to Efficient LLM Fine-Tuning
Yong Liu, Di Fu, Shenggan Cheng et al.
Lower Bounds for Chain-of-Thought Reasoning in Hard-Attention Transformers
Alireza Amiribavandpour, Xinting Huang, Mark Rofin et al.
Pfeife: Automatic Pipeline Parallelism for PyTorch
Ho Young Jhoo, Chung-Kil Hur, Nuno P. Lopes
The Price of Linear Time: Error Analysis of Structured Kernel Interpolation
Alexander Moreno, Justin Xiao, Jonathan Mei
Discrete Markov Probabilistic Models: An Improved Discrete Score-Based Framework with sharp convergence bounds under minimal assumptions
Le Tuyet Nhi PHAM, Dario Shariatian, Antonio Ocello et al.
Compositional Risk Minimization
Divyat Mahajan, Mohammad Pezeshki, Charles Arnal et al.
Runtime Analysis of Evolutionary NAS for Multiclass Classification
Zeqiong Lv, Chao Qian, Yun Liu et al.
ROME is Forged in Adversity: Robust Distilled Datasets via Information Bottleneck
Zheng Zhou, Wenquan Feng, Qiaosheng Zhang et al.
Enhancing the Influence of Labels on Unlabeled Nodes in Graph Convolutional Networks
Jincheng Huang, Yujie Mo, Xiaoshuang Shi et al.
Be Confident: Uncovering Overfitting in MLLM Multi-Task Tuning
Wenke Huang, Jian Liang, Guancheng Wan et al.
Position: Supervised Classifiers Answer the Wrong Questions for OOD Detection
Yucen Li, Daohan Lu, Polina Kirichenko et al.
Constant Stepsize Local GD for Logistic Regression: Acceleration by Instability
Michael Crawshaw, Blake Woodworth, Mingrui Liu
Probabilistic Group Mask Guided Discrete Optimization for Incremental Learning
Fengqiang Wan, Yang Yang
Multi-View Graph Clustering via Node-Guided Contrastive Encoding
Yazhou Ren, Junlong Ke, Zichen Wen et al.
Multiobjective distribution matching
Xiaoyuan Zhang, Peijie Li, Ying Ying YU et al.
Fixed-Confidence Multiple Change Point Identification under Bandit Feedback
Joseph Lazzaro, Ciara Pike-Burke
ReFrame: Layer Caching for Accelerated Inference in Real-Time Rendering
Lufei Liu, Tor Aamodt
Calibrating Video Watch-time Predictions with Credible Prototype Alignment
Chao, Shisong Tang, Fan Li et al.
UniMC: Taming Diffusion Transformer for Unified Keypoint-Guided Multi-Class Image Generation
Qin Guo, Ailing Zeng, Dongxu Yue et al.
Unveiling AI's Blind Spots: An Oracle for In-Domain, Out-of-Domain, and Adversarial Errors
Shuangpeng Han, Mengmi Zhang
Gradual Transition from Bellman Optimality Operator to Bellman Operator in Online Reinforcement Learning
Motoki Omura, Kazuki Ota, Takayuki Osa et al.
STAIR: Improving Safety Alignment with Introspective Reasoning
Yichi Zhang, Siyuan Zhang, Yao Huang et al.
Training Dynamics of In-Context Learning in Linear Attention
Yedi Zhang, Aaditya Singh, Peter Latham et al.
Fairness Overfitting in Machine Learning: An Information-Theoretic Perspective
Firas Laakom, Haobo Chen, Jürgen Schmidhuber et al.
Merge-Friendly Post-Training Quantization for Multi-Target Domain Adaptation
Juncheol Shin, Minsang Seok, Seonggon Kim et al.
Stochastic Smoothed Primal-Dual Algorithms for Nonconvex Optimization with Linear Inequality Constraints
Ruichuan Huang, Jiawei Zhang, Ahmet Alacaoglu
CSTrack: Enhancing RGB-X Tracking via Compact Spatiotemporal Features
xiaokun Feng, Dailing Zhang, Shiyu Hu et al.
Flow-field inference from neural data using deep recurrent networks
Timothy Doyeon Kim, Thomas Luo, Tankut Can et al.
Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations
Yucheng Hu, Yanjiang Guo, Pengchao Wang et al.
IntLoRA: Integral Low-rank Adaptation of Quantized Diffusion Models
Hang Guo, Yawei Li, Tao Dai et al.
EvFocus: Learning to Reconstruct Sharp Images from Out-of-Focus Event Streams
Lin Zhu, Xiantao Ma, Xiao Wang et al.
XAttention: Block Sparse Attention with Antidiagonal Scoring
Ruyi Xu, Guangxuan Xiao, Haofeng Huang et al.
Learning Along the Arrow of Time: Hyperbolic Geometry for Backward-Compatible Representation Learning
Ngoc Bui, Menglin Yang, Runjin Chen et al.
Learning Fused State Representations for Control from Multi-View Observations
Zeyu Wang, Yao-Hui Li, Xin Li et al.
Subgoal-Guided Policy Heuristic Search with Learned Subgoals
Jake Tuero, Michael Buro, Levi Lelis
Curse of High Dimensionality Issue in Transformer for Long Context Modeling
Shuhai Zhang, Zeng You, Yaofo Chen et al.
Symmetric Reinforcement Learning Loss for Robust Learning on Diverse Tasks and Model Scales
Ju-Seung Byun, Andrew Perrault
Large Language Models to Diffusion Finetuning
Edoardo Cetin, Tianyu Zhao, Yujin Tang
Accelerating Quantum Reinforcement Learning with a Quantum Natural Policy Gradient Based Approach
Yang Xu, Vaneet Aggarwal
Thickness-aware E(3)-Equivariant 3D Mesh Neural Networks
Sungwon Kim, Namkyeong Lee, Yunyoung Doh et al.
Portable Reward Tuning: Towards Reusable Fine-Tuning across Different Pretrained Models
Daiki Chijiwa, Taku Hasegawa, Kyosuke Nishida et al.
Pointwise Information Measures as Confidence Estimators in Deep Neural Networks: A Comparative Study
Shelvia Wongso, Rohan Ghosh, Mehul Motani
Bayesian Neural Scaling Law Extrapolation with Prior-Data Fitted Networks
Dongwoo Lee, Dong Bok Lee, Steven Adriaensen et al.
Be a Goldfish: Forgetting Bad Conditioning in Sparse Linear Regression via Variational Autoencoders
Kuheli Pratihar, Debdeep Mukhopadhyay
Large Continual Instruction Assistant
Jingyang Qiao, zhizhong zhang, Xin Tan et al.
K$^2$IE: Kernel Method-based Kernel Intensity Estimators for Inhomogeneous Poisson Processes
Hideaki Kim, Tomoharu Iwata, Akinori Fujino
A Non-Asymptotic Convergent Analysis for Scored-Based Graph Generative Model via a System of Stochastic Differential Equations
Junwei Su, Chuan Wu
Sparse Spectral Training and Inference on Euclidean and Hyperbolic Neural Networks
Jialin Zhao, Yingtao Zhang, Xinghang Li et al.
When Model Knowledge meets Diffusion Model: Diffusion-assisted Data-free Image Synthesis with Alignment of Domain and Class
Yujin Kim, Hyunsoo Kim, Hyunwoo Kim et al.
$\mathcal{V}ista\mathcal{DPO}$: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models
Haojian Huang, Haodong Chen, Shengqiong Wu et al.
CurvGAD: Leveraging Curvature for Enhanced Graph Anomaly Detection
Karish Grover, Geoff Gordon, Christos Faloutsos
Retrieval-Augmented Perception: High-resolution Image Perception Meets Visual RAG
Wenbin Wang, Yongcheng Jing, Liang Ding et al.
Unsupervised Learning for Class Distribution Mismatch
Pan Du, Zhao, Xinai Lu et al.
Beyond Atoms: Enhancing Molecular Pretrained Representations with 3D Space Modeling
Shuqi Lu, Xiaohong Ji, Bohang Zhang et al.
Computing Optimal Transport Maps and Wasserstein Barycenters Using Conditional Normalizing Flows
Gabriele Visentin, Patrick Cheridito
InfoCons: Identifying Interpretable Critical Concepts in Point Clouds via Information Theory
Feifei Li, Mi Zhang, Zhaoxiang Wang et al.
CERTAIN: Context Uncertainty-aware One-Shot Adaptation for Context-based Offline Meta Reinforcement Learning
Hongtu Zhou, Ruiling Yang, Yakun Zhu et al.
A Model of Place Field Reorganization During Reward Maximization
M Ganesh Kumar, Blake Bordelon, Jacob A Zavatone-Veth et al.
Leveraging Sparsity for Sample-Efficient Preference Learning: A Theoretical Perspective
Yunzhen Yao, Lie He, Michael Gastpar
Sample-Optimal Agnostic Boosting with Unlabeled Data
Udaya Ghai, Karan Singh
Boosting Adversarial Robustness with CLAT: Criticality Leveraged Adversarial Training
Bhavna Gopal, Huanrui Yang, Jingyang Zhang et al.