Most Cited 2025 "exchangeability assumption" Papers
Scaling In-the-Wild Training for Diffusion-based Illumination Harmonization and Editing by Imposing Consistent Light Transport
Lvmin Zhang, Anyi Rao, Maneesh Agrawala
Identifying latent state transitions in non-linear dynamical systems
Çağlar Hızlı, Çağatay Yıldız, Matthias Bethge et al.
Selective Task Group Updates for Multi-Task Optimization
Wooseong Jeong, Kuk-Jin Yoon
From Few to Many: Self-Improving Many-Shot Reasoners Through Iterative Optimization and Generation
Xingchen Wan, Han Zhou, Ruoxi Sun et al.
Training Verification-Friendly Neural Networks via Neuron Behavior Consistency
Zongxin Liu, Zhe Zhao, Fu Song et al.
PhiNets: Brain-inspired Non-contrastive Learning Based on Temporal Prediction Hypothesis
Satoki Ishikawa, Makoto Yamada, Han Bao et al.
Self-Supervised Diffusion Models for Electron-Aware Molecular Representation Learning
Gyoung S. Na, Chanyoung Park
Explore Theory of Mind: program-guided adversarial data generation for theory of mind reasoning
Melanie Sclar, Jane Dwivedi-Yu, Maryam Fazel-Zarandi et al.
Linear Recurrences Accessible to Everyone
Felix Sarnthein
Balanced Neural ODEs: nonlinear model order reduction and Koopman operator approximations
Julius Aka, Johannes Brunnemann, Jörg Eiden et al.
More Experts Than Galaxies: Conditionally-Overlapping Experts with Biologically-Inspired Fixed Routing
Sagi Shaier, Francisco Pereira, Katharina Kann et al.
Score Forgetting Distillation: A Swift, Data-Free Method for Machine Unlearning in Diffusion Models
Tianqi Chen, Shujian Zhang, Mingyuan Zhou
Has the Deep Neural Network learned the Stochastic Process? An Evaluation Viewpoint
Harshit Kumar, Beomseok Kang, Biswadeep Chakraborty et al.
Beyond Mere Token Analysis: A Hypergraph Metric Space Framework for Defending Against Socially Engineered LLM Attacks
Manohar Kaul, Aditya Saibewar, Sadbhavana Babar
One Model Transfer to All: On Robust Jailbreak Prompts Generation against LLMs
Linbao Li, Yannan Liu, Daojing He et al.
Reti-Diff: Illumination Degradation Image Restoration with Retinex-based Latent Diffusion Model
Chunming He, Chengyu Fang, Yulun Zhang et al.
Gap Preserving Distillation by Building Bidirectional Mappings with A Dynamic Teacher
Yong Guo, Shulian Zhang, Haolin Pan et al.
A Unified Theory of Quantum Neural Network Loss Landscapes
Eric Anschuetz
Improved Sampling Algorithms for Lévy-Itô Diffusion Models
Vadim Popov, Assel Yermekova, Tasnima Sadekova et al.
Counterfactual Realizability
Arvind Raghavan, Elias Bareinboim
Variance-Reducing Couplings for Random Features
Isaac Reid, Stratis Markou, Krzysztof Choromanski et al.
Geometry of Neural Reinforcement Learning in Continuous State and Action Spaces
Saket Tiwari, Omer Gottesman, George D Konidaris
Reasoning-Enhanced Healthcare Predictions with Knowledge Graph Community Retrieval
Pengcheng Jiang, Cao (Danica) Xiao, Minhao Jiang et al.
TabReD: Analyzing Pitfalls and Filling the Gaps in Tabular Deep Learning Benchmarks
Ivan Rubachev, Nikolay Kartashev, Yury Gorishniy et al.
Learning to Discover Regulatory Elements for Gene Expression Prediction
Xingyu Su, Haiyang Yu, Degui Zhi et al.
FlickerFusion: Intra-trajectory Domain Generalizing Multi-agent Reinforcement Learning
Woosung Koh, Wonbeen Oh, Siyeol Kim et al.
BaB-ND: Long-Horizon Motion Planning with Branch-and-Bound and Neural Dynamics
Keyi Shen, Jiangwei Yu, Jose Barreiros et al.
Regularization by Texts for Latent Diffusion Inverse Solvers
Jeongsol Kim, Geon Yeong Park, Hyungjin Chung et al.
Partial Gromov-Wasserstein Metric
Yikun Bai, Rocio Diaz Martin, Abihith Kothapalli et al.
Demystifying the Token Dynamics of Deep Selective State Space Models
Thieu Vo, Duy-Tung Pham, Xin Tong et al.
Iterative Label Refinement Matters More than Preference Optimization under Weak Supervision
Yaowen Ye, Cassidy Laidlaw, Jacob Steinhardt
MixMax: Distributional Robustness in Function Space via Optimal Data Mixtures
Anvith Thudi, Chris Maddison
Improving Unsupervised Constituency Parsing via Maximizing Semantic Information
Junjie Chen, Xiangheng He, Yusuke Miyao et al.
Efficient Top-m Data Values Identification for Data Selection
Xiaoqiang Lin, Xinyi Xu, See-Kiong Ng et al.
Jump Your Steps: Optimizing Sampling Schedule of Discrete Diffusion Models
Yong-Hyun Park, Chieh-Hsin Lai, Satoshi Hayakawa et al.
HyperPLR: Hypergraph Generation through Projection, Learning, and Reconstruction
Weihuang Wen, Tianshu Yu
M^3PC: Test-time Model Predictive Control using Pretrained Masked Trajectory Model
Kehan Wen, Yutong Hu, Yao Mu et al.
Scaling LLM Test-Time Compute Optimally Can be More Effective than Scaling Parameters for Reasoning
Charlie Snell, Jaehoon Lee, Kelvin Xu et al.
Intermediate Layer Classifiers for OOD generalization
Arnas Uselis, Seong Joon Oh
A Curious Case of the Missing Measure: Better Scores and Worse Generation
Joseph Turian, Jordie Shier
ViSAGe: Video-to-Spatial Audio Generation
Jaeyeon Kim, Heeseung Yun, Gunhee Kim
Deep Kernel Posterior Learning under Infinite Variance Prior Weights
Jorge Loría, Anindya Bhadra
Privacy Auditing of Large Language Models
Ashwinee Panda, Xinyu Tang, Christopher Choquette-Choo et al.
Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching
Enshu Liu, Xuefei Ning, Yu Wang et al.
PooDLe🐩: Pooled and dense self-supervised learning from naturalistic videos
Alex N. Wang, Christopher Hoang, Yuwen Xiong et al.
OCEAN: Offline Chain-of-thought Evaluation and Alignment in Large Language Models
Junda Wu, Xintong Li, Ruoyu Wang et al.
Permute-and-Flip: An optimally stable and watermarkable decoder for LLMs
Xuandong Zhao, Lei Li, Yu-Xiang Wang
Breaking Class Barriers: Efficient Dataset Distillation via Inter-Class Feature Compensator
Xin Zhang, Jiawei Du, Ping Liu et al.
Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
Fangyu Lei, Jixuan Chen, Yuxiao Ye et al.
How do we interpret the outputs of a neural network trained on classification?
Yudi Xie
Online Epsilon Net & Piercing Set for Geometric Concepts
Sujoy Bhore, Devdan Dey, Satyam Singh
Counterfactual Concept Bottleneck Models
Gabriele Dominici, Pietro Barbiero, Francesco Giannini et al.
Predicate Hierarchies Improve Few-Shot State Classification
Emily Jin, Joy Hsu, Jiajun Wu
Towards Neural Scaling Laws for Time Series Foundation Models
Qingren Yao, Chao-Han Huck Yang, Renhe Jiang et al.
On the Modeling Capabilities of Large Language Models for Sequential Decision Making
Martin Klissarov, R Devon Hjelm, Alexander Toshev et al.
Near-Optimal Online Learning for Multi-Agent Submodular Coordination: Tight Approximation and Communication Efficiency
Qixin Zhang, Zongqi Wan, Yu Yang et al.
On the Optimization Landscape of Low Rank Adaptation Methods for Large Language Models
Xu-Hui Liu, Yali Du, Jun Wang et al.
Multi-modal Learning: A Look Back and the Road Ahead
Divyam Madaan, Sumit Chopra, Kyunghyun Cho
Correlation and Navigation in the Vocabulary Key Representation Space of Language Models
Letian Peng, Chenyang An, Jingbo Shang
LLMOPT: Learning to Define and Solve General Optimization Problems from Scratch
Caigao Jiang, Xiang Shu, Hong Qian et al.
SOO-Bench: Benchmarks for Evaluating the Stability of Offline Black-Box Optimization
Hong Qian, Yiyi Zhu, Xiang Shu et al.
Emergent Orientation Maps: Mechanisms, Coding Efficiency and Robustness
Haixin Zhong, Haoyu Wang, Wei Dai et al.
Learning Partial Graph Matching via Optimal Partial Transport
Gathika Ratnayaka, James Nichols, Qing Wang
Modeling Unseen Environments with Language-guided Composable Causal Components in Reinforcement Learning
Xinyue Wang, Biwei Huang
HiRA: Parameter-Efficient Hadamard High-Rank Adaptation for Large Language Models
Qiushi Huang, Tom Ko, Zhan Zhuang et al.
Sharpness-Aware Black-Box Optimization
Feiyang Ye, Yueming Lyu, Xuehao Wang et al.
Better Instruction-Following Through Minimum Bayes Risk
Ian Wu, Patrick Fernandes, Amanda Bertsch et al.
Joint Gradient Balancing for Data Ordering in Finite-Sum Multi-Objective Optimization
Hansi Yang, James Kwok
Hierarchically Encapsulated Representation for Protocol Design in Self-Driving Labs
Yu-Zhe Shi, Mingchen Liu, Fanxu Meng et al.
Manifold Constraint Reduces Exposure Bias in Accelerated Diffusion Sampling
Learning Multi-Index Models with Neural Networks via Mean-Field Langevin Dynamics
Alireza Mousavi-Hosseini, Denny Wu, Murat A Erdogdu
Sensitivity-Aware Amortized Bayesian Inference
Lasse Elsemüller, Hans Olischläger, Marvin Schmitt et al.
Why In-Context Learning Models are Good Few-Shot Learners?
Shiguang Wu, Yaqing Wang, Quanming Yao
Co$^{\mathbf{3}}$Gesture: Towards Coherent Concurrent Co-speech 3D Gesture Generation with Interactive Diffusion
Xingqun Qi, Yatian Wang, Hengyuan Zhang et al.
Latent Action Pretraining from Videos
Seonghyeon Ye, Joel Jang, Byeongguk Jeon et al.
Efficient Neuron Segmentation in Electron Microscopy by Affinity-Guided Queries
Hang Chen, Chufeng Tang, Xiao Li et al.
Diff-PIC: Revolutionizing Particle-In-Cell Nuclear Fusion Simulation with Diffusion Models
Chuan Liu, Chunshu Wu, Shihui Cao et al.
LucidPPN: Unambiguous Prototypical Parts Network for User-centric Interpretable Computer Vision
Mateusz Pach, Koryna Lewandowska, Jacek Tabor et al.
FlashRNN: I/O-Aware Optimization of Traditional RNNs on modern hardware
Korbinian Pöppel, Maximilian Beck, Sepp Hochreiter
Neural Stochastic Differential Equations for Uncertainty-Aware Offline RL
Cevahir Koprulu, Franck Djeumou, Ufuk Topcu
Efficient and Robust Neural Combinatorial Optimization via Wasserstein-Based Coresets
Xu Wang, Fuyou Miao, Wenjie Liu et al.
Provable Uncertainty Decomposition via Higher-Order Calibration
Gustaf Ahdritz, Aravind Gollakota, Parikshit Gopalan et al.
Spread Preference Annotation: Direct Preference Judgment for Efficient LLM Alignment
Dongyoung Kim, Kimin Lee, Jinwoo Shin et al.
Synthesizing Realistic fMRI: A Physiological Dynamics-Driven Hierarchical Diffusion Model for Efficient fMRI Acquisition
Yufan Hu, Jiang, Wuyang Li et al.
AstroCompress: A benchmark dataset for multi-purpose compression of astronomical data
Tuan Truong, Rithwik Sudharsan, Yibo Yang et al.
SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation
Jaehong Yoon, Shoubin Yu, Vaidehi Ramesh Patil et al.
Following the Human Thread in Social Navigation
Luca Scofano, Alessio Sampieri, Tommaso Campari et al.
Population Transformer: Learning Population-level Representations of Neural Activity
Geeling Chau, Christopher Wang, Sabera Talukder et al.
PCNN: Probable-Class Nearest-Neighbor Explanations Improve Fine-Grained Image Classification Accuracy for AIs and Humans
Giang Nguyen, Valerie Chen, Mohammad Reza Taesiri et al.
ST-GCond: Self-supervised and Transferable Graph Dataset Condensation
Beining Yang, Qingyun Sun, Cheng Ji et al.
Duoduo CLIP: Efficient 3D Understanding with Multi-View Images
Han-Hung Lee, Yiming Zhang, Angel Chang
Rare event modeling with self-regularized normalizing flows: what can we learn from a single failure?
Charles Dawson, Van Tran, Max Li et al.
FairDen: Fair Density-Based Clustering
Lena Krieger, Anna Beer, Pernille Matthews et al.
Discrete Copula Diffusion
Anji Liu, Oliver Broadrick, Mathias Niepert et al.
Learn hybrid prototypes for multivariate time series anomaly detection
Ke-Yuan Shen
Transformer Block Coupling and its Correlation with Generalization in LLMs
Murdock Aubry, Haoming Meng, Anton Sugolov et al.
Aligned LLMs Are Not Aligned Browser Agents
Priyanshu Kumar, Elaine Lau, Saranya Vijayakumar et al.
Sensitivity Verification for Additive Decision Tree Ensembles
Arhaan Ahmad, Tanay Tayal, Ashutosh Gupta et al.
An Undetectable Watermark for Generative Image Models
Samuel Gunn, Xuandong Zhao, Dawn Song
Differentiable Causal Discovery for Latent Hierarchical Causal Models
Parjanya Prashant, Ignavier Ng, Kun Zhang et al.
Sketching for Convex and Nonconvex Regularized Least Squares with Sharp Guarantees
Yingzhen Yang, Ping Li
Fengbo: a Clifford Neural Operator pipeline for 3D PDEs in Computational Fluid Dynamics
Alberto Pepe, Mattia Montanari, Joan Lasenby
Problem-Parameter-Free Federated Learning
Wenjing Yan, Kai Zhang, Xiaolu Wang et al.
Progressive distillation induces an implicit curriculum
Abhishek Panigrahi, Bingbin Liu, Sadhika Malladi et al.
Better than Your Teacher: LLM Agents that learn from Privileged AI Feedback
Sanjiban Choudhury, Paloma Sodhi
Self-Improving Robust Preference Optimization
Eugene Choi, Arash Ahmadian, Matthieu Geist et al.
Think-on-Graph 2.0: Deep and Faithful Large Language Model Reasoning with Knowledge-guided Retrieval Augmented Generation
Shengjie Ma, Chengjin Xu, Xuhui Jiang et al.
Leveraging Variable Sparsity to Refine Pareto Stationarity in Multi-Objective Optimization
Zeou Hu, Yaoliang Yu
On the Convergence of No-Regret Dynamics in Information Retrieval Games with Proportional Ranking Functions
Omer Madmon, Idan Pipano, Itamar Jacob Reinman et al.
ToolGen: Unified Tool Retrieval and Calling via Generation
Renxi Wang, Xudong Han, Lei Ji et al.
Equivariant Denoisers Cannot Copy Graphs: Align Your Graph Diffusion Models
Najwa Laabid, Severi Rissanen, Markus Heinonen et al.
Towards a Complete Logical Framework for GNN Expressiveness
Tuo Xu
Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models
Zachary Ankner, Cody Blakeney, Kartik Sreenivasan et al.
Efficient and Context-Aware Label Propagation for Zero-/Few-Shot Training-Free Adaptation of Vision-Language Model
Yushu Li, Yongyi Su, Adam Goodge et al.
Matérn Kernels for Tunable Implicit Surface Reconstruction
Maximilian Weiherer, Bernhard Egger
Open-CK: A Large Multi-Physics Fields Coupling benchmarks in Combustion Kinetics
Zaige Fei, Fan Xu, Junyuan Mao et al.
CameraCtrl: Enabling Camera Control for Video Diffusion Models
Hao He, Yinghao Xu, Yuwei Guo et al.
No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images
Botao Ye, Sifei Liu, Haofei Xu et al.
MotionClone: Training-Free Motion Cloning for Controllable Video Generation
Pengyang Ling, Jiazi Bu, Pan Zhang et al.
Cluster-guided Contrastive Class-imbalanced Graph Classification
Wei Ju, Zhengyang Mao, Siyu Yi et al.
ACES: Automatic Cohort Extraction System for Event-Stream Datasets
Justin Xu, Jack Gallifant, Alistair Johnson et al.
Exposing and Addressing Cross-Task Inconsistency in Unified Vision-Language Models
Aniruddha Kembhavi, Mohit Bansal, Amita Kamath et al.
See It from My Perspective: How Language Affects Cultural Bias in Image Understanding
Amith Ananthram, Elias Stengel-Eskin, Mohit Bansal et al.
Streaming Algorithms For $\ell_p$ Flows and $\ell_p$ Regression
Amit Chakrabarti, Jeffrey Jiang, David Woodruff et al.
BOFormer: Learning to Solve Multi-Objective Bayesian Optimization via Non-Markovian RL
Yu Heng Hung, Kai-Jie Lin, Yu-Heng Lin et al.
Coreset Spectral Clustering
Ben Jourdan, Gregory Schwartzman, Peter Macgregor et al.
Start Smart: Leveraging Gradients For Enhancing Mask-based XAI Methods
Buelent Uendes, Shujian Yu, Mark Hoogendoorn
Learning Continually by Spectral Regularization
Alex Lewandowski, Michał Bortkiewicz, Saurabh Kumar et al.
MGDA Converges under Generalized Smoothness, Provably
Qi Zhang, Peiyao Xiao, Shaofeng Zou et al.
Boundary constrained Gaussian processes for robust physics-informed machine learning of linear partial differential equations
David Dalton, Alan Lazarus, Hao Gao et al.
Bayesian Regularization of Latent Representation
Chukwudi Paul Obite, Zhi Chang, Keyan Wu et al.
Boosting Ray Search Procedure of Hard-label Attacks with Transfer-based Priors
Chen Ma, Xinjie Xu, Shuyu Cheng et al.
DEPfold: RNA Secondary Structure Prediction as Dependency Parsing
Ke Wang, Shay B Cohen
Open-Source vs Close-Source: The Context Utilization Challenge
Litu Ou
Aria-MIDI: A Dataset of Piano MIDI Files for Symbolic Music Modeling
Louis Bradshaw, Simon Colton
Variational Bayesian Pseudo-Coreset
Hyungi Lee, Seungyoo Lee, Juho Lee
Scaling up the Banded Matrix Factorization Mechanism for Large Scale Differentially Private ML
Ryan McKenna
BoneMet: An Open Large-Scale Multi-Modal Murine Dataset for Breast Cancer Bone Metastasis Diagnosis and Prognosis
Tiankuo Chu, Fudong Lin, Shubo Wang et al.
ADAPT: Attentive Self-Distillation and Dual-Decoder Prediction Fusion for Continual Panoptic Segmentation
Ze Yang, Shichao Dong, Ruibo Li et al.
Flow With What You Know
Scott Hawley
Difference-of-submodular Bregman Divergence
Masanari Kimura, Takahiro Kawashima, Tasuku Soma et al.
Differential learning kinetics govern the transition from memorization to generalization during in-context learning
Alex Nguyen, Gautam Reddy Nallamala
GameGen-X: Interactive Open-world Game Video Generation
Haoxuan Che, Xuanhua He, Quande Liu et al.
Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers
Shijie Chen, Bernal Jimenez Gutierrez, Yu Su
Optimistic Games for Combinatorial Bayesian Optimization with Application to Protein Design
Melis Ilayda Bal, Pier Giuseppe Sessa, Mojmir Mutny et al.
Meta-Dynamical State Space Models for Integrative Neural Data Analysis
Ayesha Vermani, Josue Nassar, Hyungju Jeon et al.
Overcoming Slow Decision Frequencies in Continuous Control: Model-Based Sequence Reinforcement Learning for Model-Free Control
Devdhar Patel, Hava Siegelmann
Large Scale Knowledge Washing
Yu Wang, Ruihan Wu, Zexue He et al.
Rethinking Graph Neural Networks From A Geometric Perspective Of Node Features
Feng Ji, Yanan Zhao, Kai Zhao et al.
RAPID: Retrieval Augmented Training of Differentially Private Diffusion Models
Tanqiu Jiang, Changjiang Li, Fenglong Ma et al.
Modeling dynamic social vision highlights gaps between deep learning and humans
Kathy Garcia, Emalie McMahon, Colin Conwell et al.
MoLEx: Mixture of Layer Experts for Fine-tuning with Sparse Upcycling
Rachel Teo, Tan Nguyen
Transformer Meets Twicing: Harnessing Unattended Residual Information
Laziz Abdullaev, Tan Nguyen
RAG-SR: Retrieval-Augmented Generation for Neural Symbolic Regression
Hengzhe Zhang, Qi Chen, Bing Xue et al.
Inverse Attention Agents for Multi-Agent Systems
Qian Long, Ruoyan Li, Minglu Zhao et al.
SelectFormer in Data Markets: Privacy-Preserving and Efficient Data Selection for Transformers with Multi-Party Computation
Xu Ouyang, Felix Xiaozhu Lin, Yangfeng Ji
Verifying Properties of Binary Neural Networks Using Sparse Polynomial Optimization
Jianting Yang, Srecko Durasinovic, Jean Bernard Lasserre et al.
IterGen: Iterative Semantic-aware Structured LLM Generation with Backtracking
Shubham Dipak Ugare, Rohan Gumaste, Tarun Suresh et al.
AnalogGenie: A Generative Engine for Automatic Discovery of Analog Circuit Topologies
Jian Gao, Weidong Cao, Junyi Yang et al.
The Last Iterate Advantage: Empirical Auditing and Principled Heuristic Analysis of Differentially Private SGD
Milad Nasr, Thomas Steinke, Borja Balle et al.
Near-Exact Privacy Amplification for Matrix Mechanisms
Christopher Choquette-Choo, Arun Ganesh, Saminul Haque et al.
Looking into User’s Long-term Interests through the Lens of Conservative Evidential Learning
Dingrong Wang, Krishna Neupane, Ervine Zheng et al.
TopoLM: brain-like spatio-functional organization in a topographic language model
Neil Rathi, Johannes Mehrer, Badr AlKhamissi et al.
SSOLE: Rethinking Orthogonal Low-rank Embedding for Self-Supervised Learning
Lun Huang, Qiang Qiu, Guillermo Sapiro
Large Convolutional Model Tuning via Filter Subspace
Wei Chen, Zichen Miao, Qiang Qiu
Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization
Yuxin Jiang, Bo Huang, Yufei Wang et al.
DPaI: Differentiable Pruning at Initialization with Node-Path Balance Principle
Lichuan Xiang, Quan Nguyen-Tri, Lan-Cuong Nguyen et al.
Tailoring Mixup to Data for Calibration
Quentin Bouniot, Pavlo Mozharovskyi, Florence d'Alché-Buc
On Evaluating the Durability of Safeguards for Open-Weight LLMs
Xiangyu Qi, Boyi Wei, Nicholas Carlini et al.
LeanVec: Searching vectors faster by making them fit
Ishwar Bhati, Cecilia Aguerrebere, Mark Hildebrand et al.
Efficient Dictionary Learning with Switch Sparse Autoencoders
Anish Mudide, Josh Engels, Eric Michaud et al.
Rationalizing and Augmenting Dynamic Graph Neural Networks
Guibin Zhang, Yiyan Qi, Ziyang Cheng et al.
Maintaining Structural Integrity in Parameter Spaces for Parameter Efficient Fine-tuning
Chongjie Si, Xuehui Wang, Xue Yang et al.
Evidential Learning-based Certainty Estimation for Robust Dense Feature Matching
Lile Cai, Chuan Sheng Foo, Xun Xu et al.
Policy Design in Long-run Welfare Dynamics
Jiduan Wu, Rediet Abebe, Moritz Hardt et al.
SPA-Bench: A Comprehensive Benchmark for Smartphone Agent Evaluation
Jingxuan Chen, Derek Yuen, Bin Xie et al.
Privacy-Preserving V2X Collaborative Perception Integrating Unknown Collaborators
Bin Lu, Xinyu Xiao, Changzhou Zhang et al.
DeeperForward: Enhanced Forward-Forward Training for Deeper and Better Performance
Liang Sun, Yang Zhang, Weizhao He et al.
SVBench: A Benchmark with Temporal Multi-Turn Dialogues for Streaming Video Understanding
Zhenyu Yang, Yuhang Hu, Zemin Du et al.
Spherical Tree-Sliced Wasserstein Distance
Viet-Hoang Tran, Thanh Chu, Minh-Khoi Nguyen-Nhat et al.
Disentangled Representation Learning with the Gromov-Monge Gap
Théo Uscidda, Luca Eyring, Karsten Roth et al.
Towards Unified Human Motion-Language Understanding via Sparse Interpretable Characterization
Guangtao Lyu, Chenghao Xu, Jiexi Yan et al.
Efficient Low-Bit Quantization with Adaptive Scales for Multi-Task Co-Training
Boyu Liu, Haoyu Huang, Linlin Yang et al.
Regularizing Energy among Training Samples for Out-of-Distribution Generalization
Yiting Chen, Qitian Wu, Junchi Yan
To Clip or not to Clip: the Dynamics of SGD with Gradient Clipping in High-Dimensions
Noah Marshall, Ke Liang Xiao, Atish Agarwala et al.
Locality Alignment Improves Vision-Language Models
Ian Covert, Tony Sun, James Y Zou et al.
Statistical Advantages of Perturbing Cosine Router in Mixture of Experts
Huy Nguyen, Pedram Akbarian Saravi, Trang Pham et al.
Prompting Fairness: Integrating Causality to Debias Large Language Models
Jingling Li, Zeyu Tang, Xiaoyu Liu et al.
Dynamic Negative Guidance of Diffusion Models
Felix Koulischer, Johannes Deleu, Gabriel Raya et al.
Bilinear MLPs enable weight-based mechanistic interpretability
Michael Pearce, Thomas Dooms, Alice Rigg et al.
Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization
Wenkai Yang, Shiqi Shen, Guangyao Shen et al.
Training-Free Diffusion Model Alignment with Sampling Demons
Po-Hung Yeh, Kuang-Huei Lee, Jun-Cheng Chen
LDAdam: Adaptive Optimization from Low-Dimensional Gradient Statistics
Thomas Robert, Mher Safaryan, Ionut-Vlad Modoranu et al.
Self-Normalized Resets for Plasticity in Continual Learning
Vivek Farias, Adam Jozefiak
Parameter and Memory Efficient Pretraining via Low-rank Riemannian Optimization
Zhanfeng Mo, Long-Kai Huang, Sinno Jialin Pan
Training on the Test Task Confounds Evaluation and Emergence
Ricardo Dominguez-Olmedo, Florian Eddie Dorner, Moritz Hardt
COME: Test-time Adaption by Conservatively Minimizing Entropy
Qingyang Zhang, Yatao Bian, Xinke Kong et al.
Oracle efficient truncated statistics
Konstantinos Karatapanis, Vasilis Kontonis, Christos Tzamos
SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation
Mingjie Li, Wai Man Si, Michael Backes et al.
Conformalized Interactive Imitation Learning: Handling Expert Shift and Intermittent Feedback
Michelle Zhao, Henny Admoni, Reid Simmons et al.