Most Cited 2025 "w4a4 quantization" Papers

22,274 papers found • Page 37 of 112

#7201

ELICIT: LLM Augmentation Via External In-context Capability

Futing Wang, Jianhao (Elliott) Yan, Yue Zhang et al.

ICLR 2025arXiv:2410.09343
6
citations
#7202

ALLVB: All-in-One Long Video Understanding Benchmark

Xichen Tan, Yuanjing Luo, Yunfan Ye et al.

AAAI 2025paperarXiv:2503.07298
6
citations
#7203

CoDe: Communication Delay-Tolerant Multi-Agent Collaboration via Dual Alignment of Intent and Timeliness

Shoucheng Song, Youfang Lin, Sheng Han et al.

AAAI 2025paperarXiv:2501.05207
6
citations
#7204

Certification of Speaker Recognition Models to Additive Perturbations

Dmitrii Korzh, Elvir Karimov, Mikhail Pautov et al.

AAAI 2025paperarXiv:2404.18791
6
citations
#7205

Spurious Correlations in High Dimensional Regression: The Roles of Regularization, Simplicity Bias and Over-Parameterization

Simone Bombari, Marco Mondelli

ICML 2025arXiv:2502.01347
6
citations
#7206

Simulate and Eliminate: Revoke Backdoors for Generative Large Language Models

Haoran Li, Yulin Chen, Zihao Zheng et al.

AAAI 2025paperarXiv:2405.07667
6
citations
#7207

Mitigating Hallucinations in Large Vision-Language Models by Adaptively Constraining Information Flow

Jiaqi Bai, Hongcheng Guo, Zhongyuan Peng et al.

AAAI 2025paperarXiv:2502.20750
6
citations
#7208

Understanding Synthetic Context Extension via Retrieval Heads

Xinyu Zhao, Fangcong Yin, Greg Durrett

ICML 2025arXiv:2410.22316
6
citations
#7209

Models of Heavy-Tailed Mechanistic Universality

Liam Hodgkinson, Zhichao Wang, Michael Mahoney

ICML 2025arXiv:2506.03470
6
citations
#7210

Tuning LLM Judge Design Decisions for 1/1000 of the Cost

David Salinas, Omar Swelam, Frank Hutter

ICML 2025arXiv:2501.17178
6
citations
#7211

Selective Response Strategies for GenAI

Boaz Taitler, Omer Ben-Porat

ICML 2025arXiv:2502.00729
6
citations
#7212

Severing Spurious Correlations with Data Pruning

Varun Mulchandani, Jung-Eun Kim

ICLR 2025arXiv:2503.18258
6
citations
#7213

Mitigating Reward Over-Optimization in RLHF via Behavior-Supported Regularization

Juntao Dai, Taiye Chen, Yaodong Yang et al.

ICLR 2025arXiv:2503.18130
6
citations
#7214

When Maximum Entropy Misleads Policy Optimization

Ruipeng Zhang, Ya-Chien Chang, Sicun Gao

ICML 2025arXiv:2506.05615
6
citations
#7215

Inspection and Control of Self-Generated-Text Recognition Ability in Llama3-8b-Instruct

Christopher Ackerman, Nina Panickssery

ICLR 2025oralarXiv:2410.02064
6
citations
#7216

Dual Conditioned Motion Diffusion for Pose-Based Video Anomaly Detection

Hongsong Wang, Andi Xu, Pinle Ding et al.

AAAI 2025paperarXiv:2412.17210
6
citations
#7217

Aligning Language Models Using Follow-up Likelihood as Reward Signal

Chen Zhang, Dading Chong, Feng Jiang et al.

AAAI 2025paperarXiv:2409.13948
6
citations
#7218

Features are fate: a theory of transfer learning in high-dimensional regression

Javan Tahir, Surya Ganguli, Grant Rotskoff

ICML 2025arXiv:2410.08194
6
citations
#7219

ESPFormer: Doubly-Stochastic Attention with Expected Sliced Transport Plans

Ashkan Shahbazi, Elaheh Akbari, Darian Salehi et al.

ICML 2025arXiv:2502.07962
6
citations
#7220

Scaling Probabilistic Circuits via Monarch Matrices

Honghua Zhang, Meihua Dang, Benjie Wang et al.

ICML 2025arXiv:2506.12383
6
citations
#7221

Analyze Feature Flow to Enhance Interpretation and Steering in Language Models

Daniil Laptev, Nikita Balagansky, Yaroslav Aksenov et al.

ICML 2025arXiv:2502.03032
6
citations
#7222

Elucidating the design space of language models for image generation

Xuantong Liu, Shaozhe Hao, Xianbiao Qi et al.

ICML 2025arXiv:2410.16257
6
citations
#7223

AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring

Xinyi Wang, Na Zhao, Zhiyuan Han et al.

AAAI 2025paperarXiv:2501.09428
6
citations
#7224

Position: The Artificial Intelligence and Machine Learning Community Should Adopt a More Transparent and Regulated Peer Review Process

Jing Yang

ICML 2025arXiv:2502.00874
6
citations
#7225

Revisiting a Design Choice in Gradient Temporal Difference Learning

Xiaochi Qian, Shangtong Zhang

ICLR 2025oralarXiv:2308.01170
6
citations
#7226

MODULI: Unlocking Preference Generalization via Diffusion Models for Offline Multi-Objective Reinforcement Learning

Yifu Yuan, Zhenrui Zheng, Zibin Dong et al.

ICML 2025arXiv:2408.15501
6
citations
#7227

Autocorrelation Matters: Understanding the Role of Initialization Schemes for State Space Models

Fusheng Liu, Qianxiao Li

ICLR 2025oralarXiv:2411.19455
6
citations
#7228

LLM-RG4: Flexible and Factual Radiology Report Generation Across Diverse Input Contexts

Zhuhao Wang, Yihua Sun, Zihan Li et al.

AAAI 2025paperarXiv:2412.12001
6
citations
#7229

Pretraining Generative Flow Networks with Inexpensive Rewards for Molecular Graph Generation

Mohit Pandey, Gopeshh Subbaraj, Artem Cherkasov et al.

ICML 2025arXiv:2503.06337
6
citations
#7230

PanAdapter: Two-Stage Fine-Tuning with Spatial-Spectral Priors Injecting for Pansharpening

RuoCheng Wu, Zien Zhang, Shangqi Deng et al.

AAAI 2025paperarXiv:2409.06980
6
citations
#7231

A Stronger Mixture of Low-Rank Experts for Fine-Tuning Foundation Models

Mengyang Sun, Yihao Wang, Tao Feng et al.

ICML 2025arXiv:2502.15828
6
citations
#7232

Can LLMs Handle WebShell Detection? Overcoming Detection Challenges with Behavioral Function-Aware Framework

Feijiang Han, Jiaming Zhang, Chuyi Deng et al.

COLM 2025paperarXiv:2504.13811
6
citations
#7233

FedAA: A Reinforcement Learning Perspective on Adaptive Aggregation for Fair and Robust Federated Learning

Jialuo He, Wei Chen, Xiaojin Zhang

AAAI 2025paperarXiv:2402.05541
6
citations
#7234

Predicting mutational effects on protein binding from folding energy

Arthur Deng, Karsten Householder, Fang Wu et al.

ICML 2025arXiv:2507.05502
6
citations
#7235

Activation Space Interventions Can Be Transferred Between Large Language Models

Narmeen Oozeer, Dhruv Nathawani, Nirmalendu Prakash et al.

ICML 2025arXiv:2503.04429
6
citations
#7236

PhysAug: A Physical-guided and Frequency-based Data Augmentation for Single-Domain Generalized Object Detection

Xiaoran Xu, Jiangang Yang, Wenhui Shi et al.

AAAI 2025paperarXiv:2412.11807
6
citations
#7237

Offline Safe Reinforcement Learning Using Trajectory Classification

Ze Gong, Akshat Kumar, Pradeep Varakantham

AAAI 2025paperarXiv:2412.15429
6
citations
#7238

Learning Physics Informed Neural ODEs with Partial Measurements

Paul Ghanem, Ahmet Demirkaya, Tales Imbiriba et al.

AAAI 2025paperarXiv:2412.08681
6
citations
#7239

Corrupted by Reasoning: Reasoning Language Models Become Free-Riders in Public Goods Games

David Guzman Piedrahita, Yongjin Yang, Mrinmaya Sachan et al.

COLM 2025paper
6
citations
#7240

The Perils of Optimizing Learned Reward Functions: Low Training Error Does Not Guarantee Low Regret

Lukas Fluri, Leon Lang, Alessandro Abate et al.

ICML 2025arXiv:2406.15753
6
citations
#7241

No-Regret is not enough! Bandits with General Constraints through Adaptive Regret Minimization

Martino Bernasconi, Matteo Castiglioni, Andrea Celli

ICML 2025arXiv:2405.06575
6
citations
#7242

SGTC: Semantic-Guided Triplet Co-training for Sparsely Annotated Semi-Supervised Medical Image Segmentation

Ke Yan, Qing Cai, Fan Zhang et al.

AAAI 2025paperarXiv:2412.15526
6
citations
#7243

Bi-Directional Multi-Scale Graph Dataset Condensation via Information Bottleneck

Xingcheng Fu, Yisen Gao, Beining Yang et al.

AAAI 2025paperarXiv:2412.17355
6
citations
#7244

6D Object Pose Tracking in Internet Videos for Robotic Manipulation

Georgy Ponimatkin, Martin Cífka, Tomas Soucek et al.

ICLR 2025oralarXiv:2503.10307
6
citations
#7245

SUMI-IFL: An Information-Theoretic Framework for Image Forgery Localization with Sufficiency and Minimality Constraints

Ziqi Sheng, Wei Lu, Xiangyang Luo et al.

AAAI 2025paperarXiv:2412.09981
6
citations
#7246

PSMGD: Periodic Stochastic Multi-Gradient Descent for Fast Multi-Objective Optimization

Mingjing Xu, Peizhong Ju, Jia Liu et al.

AAAI 2025paperarXiv:2412.10961
6
citations
#7247

Diffusion Prior Interpolation for Flexibility Real-World Face Super-Resolution

Jiarui Yang, Tao Dai, Yufei Zhu et al.

AAAI 2025paperarXiv:2412.16552
6
citations
#7248

DUO: Diverse, Uncertain, On-Policy Query Generation and Selection for Reinforcement Learning from Human Feedback

Xuening Feng, Zhaohui Jiang, Timo Kaufmann et al.

AAAI 2025paper
6
citations
#7249

Self-Normalized Resets for Plasticity in Continual Learning

Vivek Farias, Adam Jozefiak

ICLR 2025arXiv:2410.20098
6
citations
#7250

Universal Biological Sequence Reranking for Improved De Novo Peptide Sequencing

Zijie Qiu, Jiaqi Wei, Xiang Zhang et al.

ICML 2025arXiv:2505.17552
6
citations
#7251

Topo2Seq: Enhanced Topology Reasoning via Topology Sequence Learning

Yiming Yang, Yueru Luo, Bingkun He et al.

AAAI 2025paperarXiv:2502.08974
6
citations
#7252

Counterfactual Concept Bottleneck Models

Gabriele Dominici, Pietro Barbiero, Francesco Giannini et al.

ICLR 2025arXiv:2402.01408
6
citations
#7253

SADA: Stability-guided Adaptive Diffusion Acceleration

Ting Jiang, Yixiao Wang, Hancheng Ye et al.

ICML 2025arXiv:2507.17135
6
citations
#7254

On the Optimal Memorization Capacity of Transformers

Tokio Kajitsuka, Issei Sato

ICLR 2025arXiv:2409.17677
6
citations
#7255

AdaFisher: Adaptive Second Order Optimization via Fisher Information

Damien GOMES, Yanlei Zhang, Eugene Belilovsky et al.

ICLR 2025arXiv:2405.16397
6
citations
#7256

Deep Incomplete Multi-view Learning via Cyclic Permutation of VAEs

Xin Gao, Jian Pu

ICLR 2025arXiv:2502.11037
6
citations
#7257

T-JEPA: Augmentation-Free Self-Supervised Learning for Tabular Data

Hugo Thimonier, José Lucas De Melo Costa, Fabrice Popineau et al.

ICLR 2025arXiv:2410.05016
6
citations
#7258

SLiM: One-shot Quantization and Sparsity with Low-rank Approximation for LLM Weight Compression

Mohammad Mozaffari, Amir Yazdanbakhsh, Maryam Mehri Dehnavi

ICML 2025arXiv:2410.09615
6
citations
#7259

Residual Matrix Transformers: Scaling the Size of the Residual Stream

Brian Mak, Jeffrey Flanigan

ICML 2025arXiv:2506.22696
6
citations
#7260

One Leaf Reveals the Season: Occlusion-Based Contrastive Learning with Semantic-Aware Views for Efficient Visual Representation

Xiaoyu Yang, Lijian Xu, Hongsheng Li et al.

ICML 2025arXiv:2411.09858
6
citations
#7261

Interpretable Face Anti-Spoofing: Enhancing Generalization with Multimodal Large Language Models

Guosheng Zhang, Keyao Wang, Haixiao Yue et al.

AAAI 2025paperarXiv:2501.01720
6
citations
#7262

UV-Attack: Physical-World Adversarial Attacks on Person Detection via Dynamic-NeRF-based UV Mapping

Yanjie Li, Kaisheng Liang, Bin Xiao

ICLR 2025arXiv:2501.05783
6
citations
#7263

Zero-resource Hallucination Detection for Text Generation via Graph-based Contextual Knowledge Triples Modeling

Xinyue Fang, Zhen Huang, Zhiliang Tian et al.

AAAI 2025paperarXiv:2409.11283
6
citations
#7264

StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization

Jinlu Zhang, Jiji Tang, Rongsheng Zhang et al.

AAAI 2025paperarXiv:2412.07375
6
citations
#7265

Spherical Tree-Sliced Wasserstein Distance

Viet-Hoang Tran, Thanh Chu, Minh-Khoi Nguyen-Nhat et al.

ICLR 2025arXiv:2503.11249
6
citations
#7266

End-to-end Learning of Gaussian Mixture Priors for Diffusion Sampler

Denis Blessing, Xiaogang Jia, Gerhard Neumann

ICLR 2025arXiv:2503.00524
6
citations
#7267

Stable Hadamard Memory: Revitalizing Memory-Augmented Agents for Reinforcement Learning

Hung Le, Dung Nguyen, Kien Do et al.

ICLR 2025arXiv:2410.10132
6
citations
#7268

MTGA: Multi-View Temporal Granularity Aligned Aggregation for Event-Based Lip-Reading

Wenhao Zhang, Jun Wang, Yong Luo et al.

AAAI 2025paperarXiv:2404.11979
6
citations
#7269

Beyond Spatial Domain: Cross-domain Promoted Fourier Convolution Helps Single Image Dehazing

Xiaozhe Zhang, Haidong Ding, Fengying Xie et al.

AAAI 2025paper
6
citations
#7270

Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues

Yan Zhang, Gangyan Zeng, Huawen Shen et al.

AAAI 2025paperarXiv:2412.12502
6
citations
#7271

Category Prompt Mamba Network for Nuclei Segmentation and Classification

Ye Zhang, Zijie Fang, Yifeng Wang et al.

AAAI 2025paperarXiv:2503.10422
6
citations
#7272

Improving Generalization of Universal Adversarial Perturbation via Dynamic Maximin Optimization

Yechao Zhang, Yingzhe Xu, Junyu Shi et al.

AAAI 2025paperarXiv:2503.12793
6
citations
#7273

Supercharging Graph Transformers with Advective Diffusion

Qitian Wu, Chenxiao Yang, Kaipeng Zeng et al.

ICML 2025arXiv:2310.06417
6
citations
#7274

Addressing Imbalanced Domain-Incremental Learning through Dual-Balance Collaborative Experts

Lan Li, Da-Wei Zhou, Han-Jia Ye et al.

ICML 2025arXiv:2507.07100
6
citations
#7275

Robust Conformal Outlier Detection under Contaminated Reference Data

Meshi Bashari, Matteo Sesia, Yaniv Romano

ICML 2025arXiv:2502.04807
6
citations
#7276

ZeroHAR: Sensor Context Augments Zero-Shot Wearable Action Recognition

Ranak Roy Chowdhury, Ritvik Kapila, Ameya Panse et al.

AAAI 2025paper
6
citations
#7277

Reward-Augmented Data Enhances Direct Preference Alignment of LLMs

Shenao Zhang, Zhihan Liu, Boyi Liu et al.

ICML 2025arXiv:2410.08067
6
citations
#7278

Implicit Bias of Gradient Descent for Non-Homogeneous Deep Networks

Yuhang Cai, Kangjie Zhou, Jingfeng Wu et al.

ICML 2025arXiv:2502.16075
6
citations
#7279

Not All LLM-Generated Data Are Equal: Rethinking Data Weighting in Text Classification

Hsun-Yu Kuo, Yin-Hsiang Liao, Yu-Chieh Chao et al.

ICLR 2025arXiv:2410.21526
6
citations
#7280

Capturing Temporal Dynamics in Large-Scale Canopy Tree Height Estimation

Jan Pauls, Max Zimmer, Berkant Turan et al.

ICML 2025oralarXiv:2501.19328
6
citations
#7281

Phoneme-Level Feature Discrepancies: A Key to Detecting Sophisticated Speech Deepfakes

Kuiyuan Zhang, Zhongyun Hua, Rushi Lan et al.

AAAI 2025paperarXiv:2412.12619
6
citations
#7282

Locally Convex Global Loss Network for Decision-Focused Learning

Haeun Jeon, Hyunglip Bae, Minsu Park et al.

AAAI 2025paperarXiv:2403.01875
6
citations
#7283

Scalable Generation of Spatial Transcriptomics from Histology Images via Whole-Slide Flow Matching

Tinglin Huang, Tianyu Liu, Mehrtash Babadi et al.

ICML 2025spotlightarXiv:2506.05361
6
citations
#7284

Differential Coding for Training-Free ANN-to-SNN Conversion

Zihan Huang, Wei Fang, Tong Bu et al.

ICML 2025arXiv:2503.00301
6
citations
#7285

Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources

Weizhi Wang, Yu Tian, Linjie Yang et al.

COLM 2025paperarXiv:2504.00595
6
citations
#7286

IDInit: A Universal and Stable Initialization Method for Neural Network Training

Yu Pan, Chaozheng Wang, Zekai Wu et al.

ICLR 2025arXiv:2503.04626
6
citations
#7287

Exploit Your Latents: Coarse-Grained Protein Backmapping with Latent Diffusion Models

Rongchao Zhang, Yu Huang, Yiwei Lou et al.

AAAI 2025paper
6
citations
#7288

Robust Weight Initialization for Tanh Neural Networks with Fixed Point Analysis

Hyunwoo Lee, Hayoung Choi, Hyunju Kim

ICLR 2025arXiv:2410.02242
6
citations
#7289

MUC: Mixture of Uncalibrated Cameras for Robust 3D Human Body Reconstruction

Yitao Zhu, Sheng Wang, Mengjie Xu et al.

AAAI 2025paperarXiv:2403.05055
6
citations
#7290

Revisiting the Predictability of Performative, Social Events

Juan Perdomo

ICML 2025arXiv:2503.11713
6
citations
#7291

Towards Trustworthy Federated Learning with Untrusted Participants

Youssef Allouah, Rachid Guerraoui, John Stephan

ICML 2025arXiv:2505.01874
6
citations
#7292

Constrained Belief Updates Explain Geometric Structures in Transformer Representations

Mateusz Piotrowski, Paul Riechers, Daniel Filan et al.

ICML 2025arXiv:2502.01954
6
citations
#7293

Massively Parallel Continuous Local Search for Hybrid SAT Solving on GPUs

Yunuo Cen, Zhiwei Zhang, Xuanyao Fong

AAAI 2025paperarXiv:2308.15020
6
citations
#7294

Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer

Yilun Kong, Guozheng Ma, Qi Zhao et al.

ICML 2025arXiv:2505.24378
6
citations
#7295

To Steer or Not to Steer? Mechanistic Error Reduction with Abstention for Language Models

Anna Hedström, Salim I. Amoukou, Tom Bewley et al.

ICML 2025arXiv:2510.13290
6
citations
#7296

DRL: Decomposed Representation Learning for Tabular Anomaly Detection

Hangting Ye, He Zhao, Wei Fan et al.

ICLR 2025
6
citations
#7297

Inverse Bridge Matching Distillation

Nikita Gushchin, David Li, Daniil Selikhanovych et al.

ICML 2025arXiv:2502.01362
5
citations
#7298

Efficient Time Series Processing for Transformers and State-Space Models through Token Merging

Leon Götz, Marcel Kollovieh, Stephan Günnemann et al.

ICML 2025arXiv:2405.17951
5
citations
#7299

The Lock-in Hypothesis: Stagnation by Algorithm

Tianyi Qiu, Zhonghao He, Tejasveer Chugh et al.

ICML 2025arXiv:2506.06166
5
citations
#7300

Data-adaptive Differentially Private Prompt Synthesis for In-Context Learning

Fengyu Gao, Ruida Zhou, Tianhao Wang et al.

ICLR 2025arXiv:2410.12085
5
citations
#7301

Scaling Laws for Floating–Point Quantization Training

Xingwu Sun, Shuaipeng Li, Ruobing Xie et al.

ICML 2025
5
citations
#7302

PARQ: Piecewise-Affine Regularized Quantization

Lisa Jin, Jianhao Ma, Zechun Liu et al.

ICML 2025arXiv:2503.15748
5
citations
#7303

Manifold Induced Biases for Zero-shot and Few-shot Detection of Generated Images

Jonathan Brokman, Amit Giloni, Omer Hofman et al.

ICLR 2025arXiv:2504.15470
5
citations
#7304

How Transformers Learn Regular Language Recognition: A Theoretical Study on Training Dynamics and Implicit Bias

Ruiquan Huang, Yingbin LIANG, Jing Yang

ICML 2025arXiv:2505.00926
5
citations
#7305

Scaling Analysis of Interleaved Speech-Text Language Models

Gallil Maimon, Michael Hassid, Amit Roth et al.

COLM 2025paperarXiv:2504.02398
5
citations
#7306

Test-Time Training Provably Improves Transformers as In-context Learners

Halil Alperen Gozeten, Muhammed Emrullah Ildiz, Xuechen Zhang et al.

ICML 2025arXiv:2503.11842
5
citations
#7307

QA-Calibration of Language Model Confidence Scores

Putra Manggala, Atalanti A Mastakouri, Elke Kirschbaum et al.

ICLR 2025arXiv:2410.06615
5
citations
#7308

MAPS: Advancing Multi-Modal Reasoning in Expert-Level Physical Science

Erle Zhu, Yadi Liu, Zhe Zhang et al.

ICLR 2025arXiv:2501.10768
5
citations
#7309

When Diffusion Models Memorize: Inductive Biases in Probability Flow of Minimum-Norm Shallow Neural Nets

Chen Zeno, Hila Manor, Gregory Ongie et al.

ICML 2025arXiv:2506.19031
5
citations
#7310

Learning Soft Sparse Shapes for Efficient Time-Series Classification

Zhen Liu, Yicheng Luo, Boyuan Li et al.

ICML 2025oralarXiv:2505.06892
5
citations
#7311

Reflection-Window Decoding: Text Generation with Selective Refinement

Zeyu Tang, Zhenhao Chen, Xiangchen Song et al.

ICML 2025arXiv:2502.03678
5
citations
#7312

ERL-MPP: Evolutionary Reinforcement Learning with Multi-head Puzzle Perception for Solving Large-scale Jigsaw Puzzles of Eroded Gaps

Xingke Song, Xiaoying Yang, Chenglin Yao et al.

AAAI 2025paperarXiv:2504.09608
5
citations
#7313

Constructing Confidence Intervals for Average Treatment Effects from Multiple Datasets

Yuxin Wang, Maresa Schröder, Dennis Frauen et al.

ICLR 2025arXiv:2412.11511
5
citations
#7314

Motion Control of High-Dimensional Musculoskeletal Systems with Hierarchical Model-Based Planning

Yunyue Wei, Shanning Zhuang, Vincent Zhuang et al.

ICLR 2025arXiv:2505.08238
5
citations
#7315

PixelMan: Consistent Object Editing with Diffusion Models via Pixel Manipulation and Generation

Liyao Jiang, Negar Hassanpour, Mohammad Salameh et al.

AAAI 2025paperarXiv:2412.14283
5
citations
#7316

Matcha: Mitigating Graph Structure Shifts with Test-Time Adaptation

Wenxuan Bao, Zhichen Zeng, Zhining Liu et al.

ICLR 2025arXiv:2410.06976
5
citations
#7317

Learning Graph Invariance by Harnessing Spuriosity

Tianjun Yao, Yongqiang Chen, Kai Hu et al.

ICLR 2025
5
citations
#7318

RAZOR: Sharpening Knowledge by Cutting Bias with Unsupervised Text Rewriting

Shuo Yang, Bardh Prenkaj, Gjergji Kasneci

AAAI 2025paperarXiv:2412.07675
5
citations
#7319

Offline Hierarchical Reinforcement Learning via Inverse Optimization

Carolin Schmidt, Daniele Gammelli, James Harrison et al.

ICLR 2025arXiv:2410.07933
5
citations
#7320

DesignEdit: Unify Spatial-Aware Image Editing via Training-free Inpainting with a Multi-Layered Latent Diffusion Framework

Yueru Jia, Aosong Cheng, Yuhui Yuan et al.

AAAI 2025paper
5
citations
#7321

ReFF: Reinforcing Format Faithfulness in Language Models Across Varied Tasks

Jiashu Yao, Heyan Huang, Zeming Liu et al.

AAAI 2025paperarXiv:2412.09173
5
citations
#7322

MetaDesigner: Advancing Artistic Typography through AI-Driven, User-Centric, and Multilingual WordArt Synthesis

Jun-Yan He, Zhi-Qi Cheng, Chenyang Li et al.

ICLR 2025arXiv:2406.19859
5
citations
#7323

Scaling Combinatorial Optimization Neural Improvement Heuristics with Online Search and Adaptation

Federico Julian Camerota Verdù, Lorenzo Castelli, Luca Bortolussi

AAAI 2025paperarXiv:2412.10163
5
citations
#7324

Every Component Counts: Rethinking the Measure of Success for Medical Semantic Segmentation in Multi-Instance Segmentation Tasks

Alexander Jaus, Constantin Marc Seibold, Simon Reiß et al.

AAAI 2025paperarXiv:2410.18684
5
citations
#7325

FedSPU: Personalized Federated Learning for Resource-Constrained Devices with Stochastic Parameter Update

Ziru Niu, Hai Dong, A. K. Qin

AAAI 2025paperarXiv:2403.11464
5
citations
#7326

Towards Homogeneous Lexical Tone Decoding from Heterogeneous Intracranial Recordings

Di Wu, Siyuan Li, Chen Feng et al.

ICLR 2025arXiv:2410.12866
5
citations
#7327

Diffusion-based Adversarial Purification from the Perspective of the Frequency Domain

Gaozheng Pei, Ke Ma, Yingfei Sun et al.

ICML 2025spotlightarXiv:2505.01267
5
citations
#7328

Generalists vs. Specialists: Evaluating LLMs on Highly-Constrained Biophysical Sequence Optimization Tasks

Angelica Chen, Samuel Stanton, Frances Ding et al.

ICML 2025arXiv:2410.22296
5
citations
#7329

Hierarchically-Structured Open-Vocabulary Indoor Scene Synthesis with Pre-trained Large Language Model

Weilin Sun, Xinran Li, Manyi Li et al.

AAAI 2025paperarXiv:2502.10675
5
citations
#7330

Diversifying Query: Region-Guided Transformer for Temporal Sentence Grounding

Xiaolong Sun, Liushuai Shi, Le Wang et al.

AAAI 2025paperarXiv:2406.00143
5
citations
#7331

Understanding and Mitigating Memorization in Generative Models via Sharpness of Probability Landscapes

Dongjae Jeon, Dueun Kim, Albert No

ICML 2025spotlightarXiv:2412.04140
5
citations
#7332

Generative Medical Segmentation

Jiayu Huo, Xi Ouyang, Sébastien Ourselin et al.

AAAI 2025paperarXiv:2403.18198
5
citations
#7333

FeatSharp: Your Vision Model Features, Sharper

Mike Ranzinger, Greg Heinrich, Pavlo Molchanov et al.

ICML 2025arXiv:2502.16025
5
citations
#7334

p-Mean Regret for Stochastic Bandits

Anand Krishna, Philips George John, Adarsh Barik et al.

AAAI 2025paperarXiv:2412.10751
5
citations
#7335

MTVHunter: Smart Contracts Vulnerability Detection Based on Multi-Teacher Knowledge Translation

Guokai Sun, Yuan Zhuang, Shuo Zhang et al.

AAAI 2025paperarXiv:2502.16955
5
citations
#7336

Wasserstein Distances, Neuronal Entanglement, and Sparsity

Shashata Sawmya, Linghao Kong, Ilia Markov et al.

ICLR 2025arXiv:2405.15756
5
citations
#7337

Memorize and Rank: Elevating Large Language Models for Clinical Diagnosis Prediction

Mingyu Derek Ma, Xiaoxuan Wang, Yijia Xiao et al.

AAAI 2025paperarXiv:2501.17326
5
citations
#7338

AdaSplash: Adaptive Sparse Flash Attention

Nuno Gonçalves, Marcos V. Treviso, Andre Martins

ICML 2025oralarXiv:2502.12082
5
citations
#7339

Dueling Convex Optimization with General Preferences

Aadirupa Saha, Tomer Koren, Yishay Mansour

ICML 2025arXiv:2210.02562
5
citations
#7340

PFDiff: Training-Free Acceleration of Diffusion Models Combining Past and Future Scores

Guangyi Wang, Yuren Cai, lijiang Li et al.

ICLR 2025arXiv:2408.08822
5
citations
#7341

A Training-free Synthetic Data Selection Method for Semantic Segmentation

Hao Tang, Siyue Yu, Jian Pang et al.

AAAI 2025paperarXiv:2501.15201
5
citations
#7342

Expressive Power of Temporal Message Passing

Przemysław Andrzej Wałęga, Michael Rawson

AAAI 2025paperarXiv:2408.09918
5
citations
#7343

On the Hölder Stability of Multiset and Graph Neural Networks

Yair Davidson, Nadav Dym

ICLR 2025arXiv:2406.06984
5
citations
#7344

Active Large Language Model-Based Knowledge Distillation for Session-Based Recommendation

Yingpeng Du, Zhu Sun, Ziyan Wang et al.

AAAI 2025paperarXiv:2502.15685
5
citations
#7345

Efficient Construction of Model Family through Progressive Training Using Model Expansion

Kazuki Yano, Sho Takase, Sosuke Kobayashi et al.

COLM 2025paperarXiv:2504.00623
5
citations
#7346

SAP: Corrective Machine Unlearning with Scaled Activation Projection for Label Noise Robustness

Sangamesh Kodge, Deepak Ravikumar, Gobinda Saha et al.

AAAI 2025paperarXiv:2403.08618
5
citations
#7347

Density Ratio Estimation with Conditional Probability Paths

Hanlin Yu, Arto Klami, Aapo Hyvarinen et al.

ICML 2025arXiv:2502.02300
5
citations
#7348

Direct Motion Models for Assessing Generated Videos

Kelsey Allen, Carl Doersch, Guangyao Zhou et al.

ICML 2025oralarXiv:2505.00209
5
citations
#7349

ScImage: How good are multimodal large language models at scientific text-to-image generation?

Leixin Zhang, Steffen Eger, Yinjie Cheng et al.

ICLR 2025arXiv:2412.02368
5
citations
#7350

Two-Timescale Critic-Actor for Average Reward MDPs with Function Approximation

Prashansa Panda, Shalabh Bhatnagar

AAAI 2025paperarXiv:2402.01371
5
citations
#7351

Specifying What You Know or Not for Multi-Label Class-Incremental Learning

Aoting Zhang, Dongbao Yang, Chang Liu et al.

AAAI 2025paperarXiv:2503.17017
5
citations
#7352

CityAnchor: City-scale 3D Visual Grounding with Multi-modality LLMs

Jinpeng Li, Haiping Wang, Jiabin chen et al.

ICLR 2025
5
citations
#7353

VProChart: Answering Chart Question Through Visual Perception Alignment Agent and Programmatic Solution Reasoning

Muye Huang, Lingling Zhang, Han Lai et al.

AAAI 2025paperarXiv:2409.01667
5
citations
#7354

Contextualizing biological perturbation experiments through language

Menghua (Rachel) Wu, Russell Littman, Jacob Levine et al.

ICLR 2025arXiv:2502.21290
5
citations
#7355

The Price of Freedom: Exploring Expressivity and Runtime Tradeoffs in Equivariant Tensor Products

YuQing Xie, Ameya Daigavane, Mit Kotak et al.

ICML 2025arXiv:2506.13523
5
citations
#7356

SyncMind: Measuring Agent Out-of-Sync Recovery in Collaborative Software Engineering

Xuehang Guo, Xingyao Wang, Yangyi Chen et al.

ICML 2025arXiv:2502.06994
5
citations
#7357

(Im)possibility of Automated Hallucination Detection in Large Language Models

Amin Karbasi, Omar Montasser, John Sous et al.

COLM 2025paperarXiv:2504.17004
5
citations
#7358

On-the-fly Preference Alignment via Principle-Guided Decoding

Mingye Zhu, Yi Liu, Lei Zhang et al.

ICLR 2025arXiv:2502.14204
5
citations
#7359

MonoBox: Tightness-Free Box-Supervised Polyp Segmentation Using Monotonicity Constraint

Qiang Hu, Zhenyu Yi, Ying Zhou et al.

AAAI 2025paperarXiv:2404.01188
5
citations
#7360

Spatial Reasoning with Denoising Models

Christopher Wewer, Bartlomiej Pogodzinski, Bernt Schiele et al.

ICML 2025arXiv:2502.21075
5
citations
#7361

MPO: An Efficient Post-Processing Framework for Mixing Diverse Preference Alignment

Tianze Wang, Dongnan Gui, Yifan Hu et al.

ICML 2025arXiv:2502.18699
5
citations
#7362

Poplar: Efficient Scaling of Distributed DNN Training on Heterogeneous GPU Clusters

WenZheng Zhang, Yang Hu, Jing Shi et al.

AAAI 2025paperarXiv:2408.12596
5
citations
#7363

C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing

Zhongyang Li, Ziyue Li, Tianyi Zhou

COLM 2025paper
5
citations
#7364

Depth Degeneracy in Neural Networks: Vanishing Angles in Fully Connected ReLU Networks on Initialization

Cameron Jakub, Mihai Nica

ICML 2025arXiv:2302.09712
5
citations
#7365

Aligning with Logic: Measuring, Evaluating and Improving Logical Preference Consistency in Large Language Models

Yinhong Liu, Zhijiang Guo, Tianya Liang et al.

ICML 2025spotlightarXiv:2410.02205
5
citations
#7366

Generative Intervention Models for Causal Perturbation Modeling

Nora Schneider, Lars Lorch, Niki Kilbertus et al.

ICML 2025arXiv:2411.14003
5
citations
#7367

The Global Convergence Time of Stochastic Gradient Descent in Non-Convex Landscapes: Sharp Estimates via Large Deviations

Waïss Azizian, Franck Iutzeler, Jérôme Malick et al.

ICML 2025arXiv:2503.16398
5
citations
#7368

Continuous Visual Autoregressive Generation via Score Maximization

Chenze Shao, Fandong Meng, Jie Zhou

ICML 2025arXiv:2505.07812
5
citations
#7369

Linear Transformers as VAR Models: Aligning Autoregressive Attention Mechanisms with Autoregressive Forecasting

Jiecheng Lu, Shihao Yang

ICML 2025arXiv:2502.07244
5
citations
#7370

Enforcing Latent Euclidean Geometry in Single-Cell VAEs for Manifold Interpolation

Alessandro Palma, Sergei Rybakov, Leon Hetzel et al.

ICML 2025spotlightarXiv:2507.11789
5
citations
#7371

GRADEO: Towards Human-Like Evaluation for Text-to-Video Generation via Multi-Step Reasoning

Zhun Mou, Bin Xia, Zhengchao Huang et al.

ICML 2025arXiv:2503.02341
5
citations
#7372

Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC

Tyler Clark, Mark Towers, Christine Evers et al.

ICML 2025arXiv:2411.03820
5
citations
#7373

MagicNaming: Consistent Identity Generation by Finding a “Name Space” in T2I Diffusion Models

Jing Zhao, Heliang Zheng, Chaoyue Wang et al.

AAAI 2025paperarXiv:2412.14902
5
citations
#7374

SysBench: Can LLMs Follow System Message?

Yanzhao Qin, Tao Zhang, Tao Zhang et al.

ICLR 2025
5
citations
#7375

Attribute-based Visual Reprogramming for Vision-Language Models

Chengyi Cai, Zesheng Ye, Lei Feng et al.

ICLR 2025arXiv:2501.13982
5
citations
#7376

Gating is Weighting: Understanding Gated Linear Attention through In-context Learning

Yingcong Li, Davoud Ataee Tarzanagh, Ankit Singh Rawat et al.

COLM 2025paper
5
citations
#7377

Guided Search Strategies in Non-Serializable Environments with Applications to Software Engineering Agents

Karina Zainullina, Aleksandr Golubev, Maria Trofimova et al.

ICML 2025arXiv:2505.13652
5
citations
#7378

Score-based Pullback Riemannian Geometry: Extracting the Data Manifold Geometry using Anisotropic Flows

Willem Diepeveen, Georgios Batzolis, Zakhar Shumaylov et al.

ICML 2025arXiv:2410.01950
5
citations
#7379

RoSTE: An Efficient Quantization-Aware Supervised Fine-Tuning Approach for Large Language Models

Quan Wei, Chung-Yiu Yau, Hoi To Wai et al.

ICML 2025arXiv:2502.09003
5
citations
#7380

Fast and Low-Cost Genomic Foundation Models via Outlier Removal

Haozheng Luo, Chenghao Qiu, Maojiang Su et al.

ICML 2025arXiv:2505.00598
5
citations
#7381

Position: Build Agent Advocates, Not Platform Agents

Sayash Kapoor, Noam Kolt, Seth Lazar

ICML 2025
5
citations
#7382

LoX: Low-Rank Extrapolation Robustifies LLM Safety Against Fine-tuning

Gabriel Jacob Perin, Runjin Chen, Xuxi Chen et al.

COLM 2025paperarXiv:2506.15606
5
citations
#7383

Intra and Inter Parser-Prompted Transformers for Effective Image Restoration

Cong Wang, Jinshan Pan, Liyan Wang et al.

AAAI 2025paperarXiv:2503.14037
5
citations
#7384

Efficient ANN-SNN Conversion with Error Compensation Learning

chang liu, Jiangrong Shen, Xuming Ran et al.

ICML 2025arXiv:2506.01968
5
citations
#7385

On Zero-Initialized Attention: Optimal Prompt and Gating Factor Estimation

Nghiem Diep, Huy Nguyen, Chau Nguyen et al.

ICML 2025arXiv:2502.03029
5
citations
#7386

BigCharts-R1: Enhanced Chart Reasoning with Visual Reinforcement Finetuning

Ahmed Masry, Abhay Puri, Masoud Hashemi et al.

COLM 2025paperarXiv:2508.09804
5
citations
#7387

Learning Causal Alignment for Reliable Disease Diagnosis

Mingzhou Liu, Ching-Wen Lee, Xinwei Sun et al.

ICLR 2025arXiv:2310.01766
5
citations
#7388

Devil is in the Details: Density Guidance for Detail-Aware Generation with Flow Models

Rafał Karczewski, Markus Heinonen, Vikas Garg

ICML 2025arXiv:2502.05807
5
citations
#7389

Refining Adaptive Zeroth-Order Optimization at Ease

Yao Shu, Qixin Zhang, Kun He et al.

ICML 2025arXiv:2502.01014
5
citations
#7390

BLS-GAN: A Deep Layer Separation Framework for Eliminating Bone Overlap in Conventional Radiographs

Haolin Wang, Yafei Ou, Prasoon Ambalathankandy et al.

AAAI 2025paperarXiv:2409.07304
5
citations
#7391

ComPC: Completing a 3D Point Cloud with 2D Diffusion Priors

Tianxin Huang, Zhiwen Yan, Yuyang Zhao et al.

ICLR 2025arXiv:2404.06814
5
citations
#7392

MetaNeRV: Meta Neural Representations for Videos with Spatial-Temporal Guidance

Jialong Guo, Ke Liu, Jiangchao Yao et al.

AAAI 2025paperarXiv:2501.02427
5
citations
#7393

Scalable Quantum-Inspired Optimization Through Dynamic Qubit Compression

Co Tran, Quoc-Bao Tran, Hy Truong Son et al.

AAAI 2025paperarXiv:2412.18571
5
citations
#7394

Whole Genome Transformer for Gene Interaction Effects in Microbiome Habitat Specificity

Zhufeng Li, Sandeep Suresh Cranganore, Nicholas Youngblut et al.

AAAI 2025paperarXiv:2405.05998
5
citations
#7395

Optimal Non-Asymptotic Rates of Value Iteration for Average-Reward Markov Decision Processes

Jongmin Lee, Ernest Ryu

ICLR 2025arXiv:2504.09913
5
citations
#7396

FRUGAL: Memory-Efficient Optimization by Reducing State Overhead for Scalable Training

Filipp Zmushko, Aleksandr Beznosikov, Martin Takac et al.

ICML 2025arXiv:2411.07837
5
citations
#7397

Efficient Connectivity-Preserving Instance Segmentation with Supervoxel-Based Loss Function

Anna Grim, Jayaram Chandrashekar, Uygar Sümbül

AAAI 2025paperarXiv:2501.01022
5
citations
#7398

Natural Language Inference Improves Compositionality in Vision-Language Models

Paola Cascante-Bonilla, Yu (Hope) Hou, Yang Cao et al.

ICLR 2025arXiv:2410.22315
5
citations
#7399

A Simple Approach to Unifying Diffusion-based Conditional Generation

Xirui Li, Charles Herrmann, Kelvin Chan et al.

ICLR 2025arXiv:2410.11439
5
citations
#7400

Robust Multimodal Large Language Models Against Modality Conflict

Zongmeng Zhang, Wengang Zhou, Jie Zhao et al.

ICML 2025arXiv:2507.07151
5
citations