Most Cited ICML "target network soft update" Papers
5,975 papers found • Page 14 of 30
Conference
Preference Optimization for Combinatorial Optimization Problems
Mingjun Pan, Guanquan Lin, You-Wei Luo et al.
Adversarial Robust Generalization of Graph Neural Networks
Chang Cao, Han Li, Yulong Wang et al.
Censor Dependent Variational Inference
Chuanhui Liu, Xiao Wang
Hierarchical Overlapping Clustering on Graphs: Cost Function, Algorithm and Scalability
Yicheng Pan, Renjie Chen, Pengyu Long et al.
Efficient Personalized Adaptation for Physiological Signal Foundation Model
Chenrui Wu, Haishuai Wang, Xiang Zhang et al.
ABKD: Pursuing a Proper Allocation of the Probability Mass in Knowledge Distillation via $\alpha$-$\beta$-Divergence
Guanghui Wang, Zhiyong Yang, Zitai Wang et al.
QuEst: Enhancing Estimates of Quantile-Based Distributional Measures Using Model Predictions
Xinyu Yang, Tom Zollo, Benjamin Eyre et al.
Compelling ReLU Networks to Exhibit Exponentially Many Linear Regions at Initialization and During Training
Max Milkert, David Hyde, Forrest Laine
Neural Interpretable PDEs: Harmonizing Fourier Insights with Attention for Scalable and Interpretable Physics Discovery
Ning Liu, Yue Yu
Optimization Proxies using Limited Labeled Data and Training Time -- A Semi-Supervised Bayesian Neural Network Approach
Parikshit Pareek, Abhijith Jayakumar, Kaarthik Sundar et al.
Graph Attention is Not Always Beneficial: A Theoretical Analysis of Graph Attention Mechanisms via Contextual Stochastic Block Models
Zhongtian Ma, Qiaosheng Zhang, Bocheng Zhou et al.
Improving Your Model Ranking on Chatbot Arena by Vote Rigging
Rui Min, Tianyu Pang, Chao Du et al.
Supervised Contrastive Learning from Weakly-Labeled Audio Segments for Musical Version Matching
Joan Serrà, Recep Oguz Araz, Dmitry Bogdanov et al.
EGPlace: An Efficient Macro Placement Method via Evolutionary Search with Greedy Repositioning Guided Mutation
ji deng, Zhao Li, Ji Zhang et al.
DRAG: Data Reconstruction Attack using Guided Diffusion
Wa-Kin Lei, Jun-Cheng Chen, Shang-Tse Chen
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking
Xinyu Guan, Li Lyna Zhang, Yifei Liu et al.
EduLLM: Leveraging Large Language Models and Framelet-Based Signed Hypergraph Neural Networks for Student Performance Prediction
Ming Li, Yukang Cheng, Lu Bai et al.
Pruning for GNNs: Lower Complexity with Comparable Expressiveness
Dun Ma, Jianguo Chen, Wenguo Yang et al.
SafeAuto: Knowledge-Enhanced Safe Autonomous Driving with Multimodal Foundation Models
Jiawei Zhang, Xuan Yang, Taiqi Wang et al.
Test-Time Selective Adaptation for Uni-Modal Distribution Shift in Multi-Modal Data
MingCai Chen, Baoming Zhang, Zongbo Han et al.
Disparate Conditional Prediction in Multiclass Classifiers
Sivan Sabato, Eran Treister, Elad Yom-Tov
DUNIA: Pixel-Sized Embeddings via Cross-Modal Alignment for Earth Observation Applications
Ibrahim Fayad, Max Zimmer, Martin Schwartz et al.
From RAG to Memory: Non-Parametric Continual Learning for Large Language Models
Bernal Jimenez Gutierrez, Yiheng Shu, Weijian Qi et al.
Multinoulli Extension: A Lossless Yet Effective Probabilistic Framework for Subset Selection over Partition Constraints
Qixin Zhang, Wei Huang, Can Jin et al.
Self-supervised Masked Graph Autoencoder via Structure-aware Curriculum
Haoyang Li, Xin Wang, Zeyang Zhang et al.
Flex3D: Feed-Forward 3D Generation with Flexible Reconstruction Model and Input View Curation
Junlin Han, Jianyuan Wang, Andrea Vedaldi et al.
Stochastic Layer-Wise Shuffle for Improving Vision Mamba Training
Zizheng Huang, Haoxing Chen, Jiaqi Li et al.
Trustworthy Machine Learning through Data-Specific Indistinguishability
Hanshen Xiao, Zhen Yang, Edward Suh
Test-Time Canonicalization by Foundation Models for Robust Perception
Utkarsh Singhal, Ryan Feng, Stella Yu et al.
Textural or Textual: How Vision-Language Models Read Text in Images
Hanzhang Wang, Qingyuan Ma
Direct Prediction Set Minimization via Bilevel Conformal Classifier Training
Yuanjie Shi, Hooman Shahrokhi, Xuesong Jia et al.
Weisfeiler and Leman Go Gambling: Why Expressive Lottery Tickets Win
Lorenz Kummer, Samir Moustafa, Anatol Ehrlich et al.
Commute Graph Neural Networks
Wei Zhuo, Han Yu, Guang Tan et al.
Omni-Angle Assault: An Invisible and Powerful Physical Adversarial Attack on Face Recognition
Shuai Yuan, Hongwei Li, Rui Zhang et al.
ArrayDPS: Unsupervised Blind Speech Separation with a Diffusion Prior
Zhongweiyang Xu, Xulin Fan, Zhong-Qiu Wang et al.
Event-Customized Image Generation
Zhen Wang, Yilei JIANG, Dong Zheng et al.
Flow Matching for Few-Trial Neural Adaptation with Stable Latent Dynamics
Puli Wang, Yu Qi, Yueming Wang et al.
The Geometry of Refusal in Large Language Models: Concept Cones and Representational Independence
Tom Wollschläger, Jannes Elstner, Simon Geisler et al.
Catoni Contextual Bandits are Robust to Heavy-tailed Rewards
Chenlu Ye, Yujia Jin, Alekh Agarwal et al.
OmiAD: One-Step Adaptive Masked Diffusion Model for Multi-class Anomaly Detection via Adversarial Distillation
Yaoxuan Feng, Wenchao Chen, yuxin li et al.
Mastering Multiple-Expert Routing: Realizable $H$-Consistency and Strong Guarantees for Learning to Defer
Anqi Mao, Mehryar Mohri, Yutao Zhong
Autonomy-of-Experts Models
Ang Lv, Ruobing Xie, Yining Qian et al.
Computing Voting Rules with Improvement Feedback
Evi Micha, Vasilis Varsamis
(How) Do Language Models Track State?
Belinda Li, Carl Guo, Jacob Andreas
Free Process Rewards without Process Labels
Lifan Yuan, Wendi Li, Huayu Chen et al.
FireFlow: Fast Inversion of Rectified Flow for Image Semantic Editing
Yingying Deng, Xiangyu He, Changwang Mei et al.
Symmetry-Aware GFlowNets
Hohyun Kim, Seunggeun Lee, Min-hwan Oh
Maximum Total Correlation Reinforcement Learning
Bang You, Puze Liu, Huaping Liu et al.
Graph Neural Network Generalization With Gaussian Mixture Model Based Augmentation
Yassine Abbahaddou, Fragkiskos Malliaros, Johannes Lutzeyer et al.
Multi-band Frequency Reconstruction for Neural Psychoacoustic Coding
Dianwen Ng, Kun Zhou, Yi-Wen Chao et al.
Are Large Brainwave Foundation Models Capable Yet ? Insights from Fine-Tuning
Na Lee, Konstantinos Barmpas, Yannis Panagakis et al.
Linear Contextual Bandits With Interference
Yang Xu, Wenbin Lu, Rui Song
Feature Importance Metrics in the Presence of Missing Data
Henrik von Kleist, Joshua Wendland, Ilya Shpitser et al.
Clustering Items through Bandit Feedback: Finding the Right Feature out of Many
Maximilian Graf, Victor Thuot, Nicolas Verzelen
Improving Transformer World Models for Data-Efficient RL
Antoine Dedieu, Joseph Ortiz, Xinghua Lou et al.
MaskTwins: Dual-form Complementary Masking for Domain-Adaptive Image Segmentation
Jiawen Wang, Yinda Chen, Xiaoyu Liu et al.
Reducing Confounding Bias without Data Splitting for Causal Inference via Optimal Transport
Yuguang Yan, Zongyu Li, Haolin Yang et al.
Pairwise Maximum Likelihood For Multi-Class Logistic Regression Model With Multiple Rare Classes
Xuetong Li, Danyang Huang, Hansheng Wang
Feature-Mapping Topology Optimization with Neural Heaviside Signed Distance Functions
Aleksandr Kolomeitsev, ANH-HUY PHAN
ViTally Consistent: Scaling Biological Representation Learning for Cell Microscopy
Kian Kenyon-Dean, Zitong Jerry Wang, John Urbanik et al.
One-dimensional Path Convolution
Xuanshu Luo, Martin Werner
CodeIO: Condensing Reasoning Patterns via Code Input-Output Prediction
Junlong Li, Daya Guo, Dejian Yang et al.
Efficient ANN-SNN Conversion with Error Compensation Learning
chang liu, Jiangrong Shen, Xuming Ran et al.
AffinityFlow: Guided Flows for Antibody Affinity Maturation
Can Chen, Karla-Luise Herpoldt, Chenchao Zhao et al.
An End-to-End Model for Logits-Based Large Language Models Watermarking
KA HIM WONG, Jicheng Zhou, Jiantao Zhou et al.
OneForecast: A Universal Framework for Global and Regional Weather Forecasting
Yuan Gao, Hao Wu, Ruiqi Shu et al.
FairPFN: A Tabular Foundation Model for Causal Fairness
Jake Robertson, Noah Hollmann, Samuel Gabriel Müller et al.
Zebra: In-Context Generative Pretraining for Solving Parametric PDEs
Louis Serrano, Armand Kassaï Koupaï, Thomas Wang et al.
FSTLLM: Spatio-Temporal LLM for Few Shot Time Series Forecasting
Yue Jiang, Yile Chen, Xiucheng Li et al.
Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback
Qiwei Di, Jiafan He, Quanquan Gu
CoMemo: LVLMs Need Image Context with Image Memory
Shi Liu, Weijie Su, Xizhou Zhu et al.
LEAPS: A discrete neural sampler via locally equivariant networks
Peter Holderrieth, Michael Albergo, Tommi Jaakkola
Learning Progress Driven Multi-Agent Curriculum
Wenshuai Zhao, Zhiyuan Li, Joni Pajarinen
Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback
Yafu Li, Xuyang Hu, Xiaoye Qu et al.
Position: Language model developers should report train-test overlap
Andy Zhang, Kevin Klyman, Yifan Mai et al.
Steerable Transformers for Volumetric Data
Soumyabrata Kundu, Risi Kondor
DyPolySeg: Taylor Series-Inspired Dynamic Polynomial Fitting Network for Few-shot Point Cloud Semantic Segmentation
Changshuo Wang, Xiang Fang, Prayag Tiwari
Safety Reasoning with Guidelines
Haoyu Wang, Zeyu Qin, Li Shen et al.
When Can Proxies Improve the Sample Complexity of Preference Learning?
Yuchen Zhu, Daniel Augusto de Souza, Zhengyan Shi et al.
Towards Graph Foundation Models: Learning Generalities Across Graphs via Task-Trees
Zehong Wang, Zheyuan Zhang, Tianyi MA et al.
Heterogeneous Label Shift: Theory and Algorithm
Chao Xu, Xijia Tang, Chenping Hou
Falcon: Fast Visuomotor Policies via Partial Denoising
Haojun Chen, Minghao Liu, Chengdong Ma et al.
Uniform Mean Estimation for Heavy-Tailed Distributions via Median-of-Means
Mikael Møller Høgsgaard, Andrea Paudice
RBench: Graduate-level Multi-disciplinary Benchmarks for LLM & MLLM Complex Reasoning Evaluation
Meng-Hao Guo, Jiajun Xu, Yi Zhang et al.
OpenworldAUC: Towards Unified Evaluation and Optimization for Open-world Prompt Tuning
Cong Hua, Qianqian Xu, Zhiyong Yang et al.
TLLC: Transfer Learning-based Label Completion for Crowdsourcing
Wenjun Zhang, Liangxiao Jiang, Chaoqun Li
The Price of Freedom: Exploring Expressivity and Runtime Tradeoffs in Equivariant Tensor Products
YuQing Xie, Ameya Daigavane, Mit Kotak et al.
PolyConf: Unlocking Polymer Conformation Generation through Hierarchical Generative Models
Fanmeng Wang, Wentao Guo, Qi Ou et al.
Multi-Stage Manipulation with Demonstration-Augmented Reward, Policy, and World Model Learning
Adrià López Escoriza, Nicklas Hansen, Stone Tao et al.
Revisiting Noise Resilience Strategies in Gesture Recognition: Short-Term Enhancement in sEMG Analysis
Weiyu Guo, Ziyue Qiao, Ying Sun et al.
Measuring Variable Importance in Heterogeneous Treatment Effects with Confidence
Joseph Paillard, Angel REYERO LOBO, Vitaliy Kolodyazhniy et al.
LAION-C: An Out-of-Distribution Benchmark for Web-Scale Vision Models
Fanfei Li, Thomas Klein, Wieland Brendel et al.
Clients Collaborate: Flexible Differentially Private Federated Learning with Guaranteed Improvement of Utility-Privacy Trade-off
Yuecheng Li, Lele Fu, Tong Wang et al.
From Black Boxes to Transparent Minds: Evaluating and Enhancing the Theory of Mind in Multimodal Large Language Models
Xinyang Li, Siqi Liu, Bochao Zou et al.
BSemiFL: Semi-supervised Federated Learning via a Bayesian Approach
Haozhao Wang, Shengyu Wang, Jiaming Li et al.
Learning Robust Neural Processes with Risk-Averse Stochastic Optimization
Huafeng Liu, Yiran Fu, Liping Jing et al.
Learning Utilities from Demonstrations in Markov Decision Processes
Filippo Lazzati, Alberto Maria Metelli
Learning Input Encodings for Kernel-Optimal Implicit Neural Representations
Zhemin Li, Liyuan Ma, Hongxia Wang et al.
MultiPDENet: PDE-embedded Learning with Multi-time-stepping for Accelerated Flow Simulation
Qi Wang, Yuan Mi, Wang Haoyun et al.
Adjustment for Confounding using Pre-Trained Representations
Rickmer Schulte, David Rügamer, Thomas Nagler
Positive-unlabeled AUC Maximization under Covariate Shift
Atsutoshi Kumagai, Tomoharu Iwata, Hiroshi Takahashi et al.
Pareto Merging: Multi-Objective Optimization for Preference-Aware Model Merging
Weiyu CHEN, James Kwok
MathConstruct: Challenging LLM Reasoning with Constructive Proofs
Mislav Balunovic, Jasper Dekoninck, Nikola Jovanović et al.
Extractive Structures Learned in Pretraining Enable Generalization on Finetuned Facts
Jiahai Feng, Stuart Russell, Jacob Steinhardt
Laplace Transform Based Low-Complexity Learning of Continuous Markov Semigroups
Vladimir Kostic, Karim Lounici, Hélène Halconruy et al.
Logits are All We Need to Adapt Closed Models
Gaurush Hiranandani, Haolun Wu, Subhojyoti Mukherjee et al.
A Variational Framework for Improving Naturalness in Generative Spoken Language Models
Li-Wei Chen, Takuya Higuchi, Zakaria Aldeneh et al.
Transfer Q-Learning with Composite MDP Structures
Jinhang Chai, Elynn Chen, Lin Yang
Contour Integration Underlies Human-Like Vision
Ben Lonnqvist, Elsa Scialom, Abdulkadir Gokce et al.
Scaling Laws for Pre-training Agents and World Models
Tim Pearce, Tabish Rashid, David Bignell et al.
Spectral-Aware Reservoir Computing for Fast and Accurate Time Series Classification
Shikang Liu, Chuyang Wei, Xiren Zhou et al.
When Will It Fail?: Anomaly to Prompt for Forecasting Future Anomalies in Time Series
Min-Yeong Park, Won-Jeong Lee, Seong Tae Kim et al.
Leveraging Model Guidance to Extract Training Data from Personalized Diffusion Models
Xiaoyu Wu, Jiaru Zhang, Steven Wu
Constrained Online Convex Optimization with Polyak Feasibility Steps
Spencer Hutchinson, Mahnoosh Alizadeh
Algorithms and Hardness for Active Learning on Graphs
Vincent Cohen-Addad, Silvio Lattanzi, Simon Meierhans
Sidechain conditioning and modeling for full-atom protein sequence design with FAMPNN
Talal Widatalla, Richard Shuai, Brian Hie et al.
Bifurcate then Alienate: Incomplete Multi-view Clustering via Coupled Distribution Learning with Linear Overhead
Shengju Yu, Yiu-ming Cheung, Siwei Wang et al.
HEAP: Hyper Extended A-PDHG Operator for Constrained High-dim PDEs
Mingquan Feng, Weixin Liao, Yixin Huang et al.
Revolve: Optimizing AI Systems by Tracking Response Evolution in Textual Optimization
Peiyan Zhang, Haibo Jin, Leyang Hu et al.
GLGENN: A Novel Parameter-Light Equivariant Neural Networks Architecture Based on Clifford Geometric Algebras
Ekaterina Filimoshina, Dmitry Shirokov
WildChat-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training
Benjamin Feuer, Chinmay Hegde
TimeBase: The Power of Minimalism in Efficient Long-term Time Series Forecasting
Qihe Huang, Zhengyang Zhou, Kuo Yang et al.
Upcycling Text-to-Image Diffusion Models for Multi-Task Capabilities
Ruchika Chavhan, Abhinav Mehrotra, Malcolm Chadwick et al.
S4S: Solving for a Fast Diffusion Model Solver
Eric Frankel, Sitan Chen, Jerry Li et al.
Active Reward Modeling: Adaptive Preference Labeling for Large Language Model Alignment
Yunyi Shen, Hao Sun, Jean-Francois Ton
WATCH: Adaptive Monitoring for AI Deployments via Weighted-Conformal Martingales
Drew Prinster, Xing Han, Anqi Liu et al.
Identifying Causal Direction via Variational Bayesian Compression
Quang-Duy Tran, Bao Duong, Phuoc Nguyen et al.
Pre-Training Graph Contrastive Masked Autoencoders are Strong Distillers for EEG
Xinxu Wei, kanhao zhao, Yong Jiao et al.
CASE-Bench: Context-Aware SafEty Benchmark for Large Language Models
Guangzhi Sun, Xiao Zhan, Shutong Feng et al.
KGMark: A Diffusion Watermark for Knowledge Graphs
Hongrui Peng, Haolang Lu, Yuanlong Yu et al.
Ergodic Generative Flows
Leo Brunswic, Mateo Clémente, Rui Heng Yang et al.
Rethinking Score Distilling Sampling for 3D Editing and Generation
Xingyu Miao, Haoran Duan, Yang Long et al.
O-MAPL: Offline Multi-agent Preference Learning
The Viet Bui, Tien Mai, Thanh Nguyen
Improving Flow Matching by Aligning Flow Divergence
Yuhao Huang, Taos Transue, Shih-Hsin Wang et al.
Accelerating PDE-Constrained Optimization by the Derivative of Neural Operators
Ze Cheng, Zhuoyu Li, Wang Xiaoqiang et al.
PEAKS: Selecting Key Training Examples Incrementally via Prediction Error Anchored by Kernel Similarity
Mustafa Burak Gurbuz, Xingyu Zheng, Constantine Dovrolis
The Best of Both Worlds: Bridging Quality and Diversity in Data Selection with Bipartite Graph
Minghao Wu, Thuy-Trang Vu, Lizhen Qu et al.
Adaptive Data Collection for Robust Learning Across Multiple Distributions
Chengbo Zang, Mehmet Turkcan, Gil Zussman et al.
Does Generation Require Memorization? Creative Diffusion Models using Ambient Diffusion
Kulin Shah, Alkis Kalavasis, Adam Klivans et al.
ProofAug: Efficient Neural Theorem Proving via Fine-grained Proof Structure Analysis
Haoxiong Liu, Jiacheng Sun, Zhenguo Li et al.
Implicit degree bias in the link prediction task
Rachith Aiyappa, Xin Wang, Munjung Kim et al.
Trajectory Inference with Smooth Schrödinger Bridges
Wanli Hong, Yuliang Shi, Jonathan Niles-Weed
Shortcut-connected Expert Parallelism for Accelerating Mixture of Experts
Weilin Cai, Juyong Jiang, Le Qin et al.
Adversaries Can Misuse Combinations of Safe Models
Erik Jones, Anca Dragan, Jacob Steinhardt
Protriever: End-to-End Differentiable Protein Homology Search for Fitness Prediction
Ruben Weitzman, Peter Mørch Groth, Lood van Niekerk et al.
Learning Parametric Distributions from Samples and Preferences
Marc Jourdan, Gizem Yüce, Nicolas Flammarion
LSCD: Lomb--Scargle Conditioned Diffusion for Time series Imputation
Elizabeth M Fons Etcheverry, Alejandro Sztrajman, Yousef El-Laham et al.
Morse: Dual-Sampling for Lossless Acceleration of Diffusion Models
Chao Li, Jiawei Fan, Anbang Yao
Efficient Quantification of Multimodal Interaction at Sample Level
Zequn Yang, Hongfa Wang, Di Hu
Adaptive Multi-prompt Contrastive Network for Few-shot Out-of-distribution Detection
Xiang Fang, Arvind Easwaran, Blaise Genest
Feature Learning beyond the Lazy-Rich Dichotomy: Insights from Representational Geometry
Chi-Ning Chou, Hang Le, Yichen Wang et al.
Probabilistic Group Mask Guided Discrete Optimization for Incremental Learning
Fengqiang Wan, Yang Yang
False Coverage Proportion Control for Conformal Prediction
Alexandre Blain, Thirion Bertrand, Pierre Neuvial
Dendritic Localized Learning: Toward Biologically Plausible Algorithm
Changze Lv, Jingwen Xu, Yiyang Lu et al.
Multiobjective distribution matching
Xiaoyuan Zhang, Peijie Li, Ying Ying YU et al.
Fixed-Confidence Multiple Change Point Identification under Bandit Feedback
Joseph Lazzaro, Ciara Pike-Burke
ReFrame: Layer Caching for Accelerated Inference in Real-Time Rendering
Lufei Liu, Tor Aamodt
DEFAME: Dynamic Evidence-based FAct-checking with Multimodal Experts
Tobias Braun, Mark Rothermel, Marcus Rohrbach et al.
UniMC: Taming Diffusion Transformer for Unified Keypoint-Guided Multi-Class Image Generation
Qin Guo, Ailing Zeng, Dongxu Yue et al.
De-AntiFake: Rethinking the Protective Perturbations Against Voice Cloning Attacks
Wei Fan, Kejiang Chen, Chang Liu et al.
Smoothed Preference Optimization via ReNoise Inversion for Aligning Diffusion Models with Varied Human Preferences
Yunhong Lu, Qichao Wang, Hengyuan Cao et al.
Improving Generalization in Federated Learning with Highly Heterogeneous Data via Momentum-Based Stochastic Controlled Weight Averaging
Junkang Liu, Yuanyuan Liu, Fanhua Shang et al.
Reinforcement Learning with Segment Feedback
Yihan Du, Anna Winnicki, Gal Dalal et al.
Contrastive Learning with Simplicial Convolutional Networks for Short-Text Classification
Liang Huang, Benedict Lee, Daniel Ng et al.
Kandinsky Conformal Prediction: Beyond Class- and Covariate-Conditional Coverage
Konstantina Bairaktari, Jiayun Wu, Steven Wu
FlipAttack: Jailbreak LLMs via Flipping
Yue Liu, Xiaoxin He, Miao Xiong et al.
BSLoRA: Enhancing the Parameter Efficiency of LoRA with Intra-Layer and Inter-Layer Sharing
Yuhua Zhou, Ruifeng Li, Changhai Zhou et al.
ExLM: Rethinking the Impact of $\texttt{[MASK]}$ Tokens in Masked Language Models
Kangjie Zheng, Junwei Yang, Siyue Liang et al.
On Explaining Equivariant Graph Networks via Improved Relevance Propagation
Hongyi Ling, Haiyang Yu, Zhimeng Jiang et al.
3D Question Answering via only 2D Vision-Language Models
FENGYUN WANG, Sicheng Yu, Jiawei Wu et al.
AdvI2I: Adversarial Image Attack on Image-to-Image Diffusion Models
Yaopei Zeng, Yuanpu Cao, Bochuan Cao et al.
Stabilizing Sample Similarity in Representation via Mitigating Random Consistency
Jieting Wang, ZhangZelong Zhang, Feijiang Li et al.
Diff-MoE: Diffusion Transformer with Time-Aware and Space-Adaptive Experts
Kun Cheng, Xiao He, Lei Yu et al.
Bi-perspective Splitting Defense: Achieving Clean-Seed-Free Backdoor Security
Yangyang Shen, Xiao Tan, Dian Shen et al.
Structure-Guided Large Language Models for Text-to-SQL Generation
Qinggang Zhang, Hao Chen, Junnan Dong et al.
Step-DAD: Semi-Amortized Policy-Based Bayesian Experimental Design
Marcel Hedman, Desi Ivanova, Cong Guan et al.
Spherical-Nested Diffusion Model for Panoramic Image Outpainting
Xiancheng Sun, Senmao Ma, Shengxi Li et al.
CSTrack: Enhancing RGB-X Tracking via Compact Spatiotemporal Features
xiaokun Feng, Dailing Zhang, Shiyu Hu et al.
Learning Event Completeness for Weakly Supervised Video Anomaly Detection
Yu Wang, Shiwei Chen
FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching
Sucheng Ren, Qihang Yu, Ju He et al.
Low-distortion and GPU-compatible Tree Embeddings in Hyperbolic Space
Max van Spengler, Pascal Mettes
A Mixed-Curvature based Pre-training Paradigm for Multi-Task Vehicle Routing Solver
Suyu Liu, Zhiguang Cao, Shanshan Feng et al.
Near-optimal Regret Using Policy Optimization in Online MDPs with Aggregate Bandit Feedback
Tal Lancewicki, Yishay Mansour
Occult: Optimizing Collaborative Communications across Experts for Accelerated Parallel MoE Training and Inference
Shuqing Luo, Pingzhi Li, Jie Peng et al.
XAttention: Block Sparse Attention with Antidiagonal Scoring
Ruyi Xu, Guangxuan Xiao, Haofeng Huang et al.
Learning Along the Arrow of Time: Hyperbolic Geometry for Backward-Compatible Representation Learning
Ngoc Bui, Menglin Yang, Runjin Chen et al.
Learning to Steer Learners in Games
Yizhou Zhang, Yian Ma, Eric Mazumdar
$\texttt{I$^2$MoE}$: Interpretable Multimodal Interaction-aware Mixture-of-Experts
Jiayi Xin, Sukwon Yun, Jie Peng et al.
A Hitchhiker's Guide to Scaling Law Estimation
Leshem Choshen, Yang Zhang, Jacob Andreas
Variance as a Catalyst: Efficient and Transferable Semantic Erasure Adversarial Attack for Customized Diffusion Models
Jiachen Yang, Yusong Wang, Yanmei Fang et al.
A Recipe for Causal Graph Regression: Confounding Effects Revisited
Yujia Yin, Tianyi Qu, Zihao Wang et al.
Decoding Rewards in Competitive Games: Inverse Game Theory with Entropy Regularization
Junyi Liao, Zihan Zhu, Ethan Fang et al.
Flexible and Efficient Grammar-Constrained Decoding
Kanghee Park, Timothy Zhou, Loris D'Antoni
On the Diversity of Adversarial Ensemble Learning
Jun-Qi Guo, Meng-Zhang Qian, Wei Gao et al.
BECAME: Bayesian Continual Learning with Adaptive Model Merging
Mei Li, Yuxiang Lu, Qinyan Dai et al.
A Near Linear Query Lower Bound for Submodular Maximization
Binghui Peng, Aviad Rubinstein
Accelerating Quantum Reinforcement Learning with a Quantum Natural Policy Gradient Based Approach
Yang Xu, Vaneet Aggarwal
Clipping Improves Adam-Norm and AdaGrad-Norm when the Noise Is Heavy-Tailed
Savelii Chezhegov, Klyukin Yaroslav, Andrei Semenov et al.
Modalities Contribute Unequally: Enhancing Medical Multi-modal Learning through Adaptive Modality Token Re-balancing
Jie Peng, Jenna Ballard, Mohan Zhang et al.
Representation Shattering in Transformers: A Synthetic Study with Knowledge Editing
Kento Nishi, Rahul Ramesh, Maya Okawa et al.