Most Cited 2025 "tokenization" Papers
22,274 papers found • Page 37 of 112
Conference
ELICIT: LLM Augmentation Via External In-context Capability
Futing Wang, Jianhao (Elliott) Yan, Yue Zhang et al.
ALLVB: All-in-One Long Video Understanding Benchmark
Xichen Tan, Yuanjing Luo, Yunfan Ye et al.
CoDe: Communication Delay-Tolerant Multi-Agent Collaboration via Dual Alignment of Intent and Timeliness
Shoucheng Song, Youfang Lin, Sheng Han et al.
Certification of Speaker Recognition Models to Additive Perturbations
Dmitrii Korzh, Elvir Karimov, Mikhail Pautov et al.
Spurious Correlations in High Dimensional Regression: The Roles of Regularization, Simplicity Bias and Over-Parameterization
Simone Bombari, Marco Mondelli
Simulate and Eliminate: Revoke Backdoors for Generative Large Language Models
Haoran Li, Yulin Chen, Zihao Zheng et al.
Mitigating Hallucinations in Large Vision-Language Models by Adaptively Constraining Information Flow
Jiaqi Bai, Hongcheng Guo, Zhongyuan Peng et al.
Understanding Synthetic Context Extension via Retrieval Heads
Xinyu Zhao, Fangcong Yin, Greg Durrett
Models of Heavy-Tailed Mechanistic Universality
Liam Hodgkinson, Zhichao Wang, Michael Mahoney
Tuning LLM Judge Design Decisions for 1/1000 of the Cost
David Salinas, Omar Swelam, Frank Hutter
Selective Response Strategies for GenAI
Boaz Taitler, Omer Ben-Porat
Severing Spurious Correlations with Data Pruning
Varun Mulchandani, Jung-Eun Kim
Mitigating Reward Over-Optimization in RLHF via Behavior-Supported Regularization
Juntao Dai, Taiye Chen, Yaodong Yang et al.
When Maximum Entropy Misleads Policy Optimization
Ruipeng Zhang, Ya-Chien Chang, Sicun Gao
Inspection and Control of Self-Generated-Text Recognition Ability in Llama3-8b-Instruct
Christopher Ackerman, Nina Panickssery
Dual Conditioned Motion Diffusion for Pose-Based Video Anomaly Detection
Hongsong Wang, Andi Xu, Pinle Ding et al.
Aligning Language Models Using Follow-up Likelihood as Reward Signal
Chen Zhang, Dading Chong, Feng Jiang et al.
Features are fate: a theory of transfer learning in high-dimensional regression
Javan Tahir, Surya Ganguli, Grant Rotskoff
ESPFormer: Doubly-Stochastic Attention with Expected Sliced Transport Plans
Ashkan Shahbazi, Elaheh Akbari, Darian Salehi et al.
Scaling Probabilistic Circuits via Monarch Matrices
Honghua Zhang, Meihua Dang, Benjie Wang et al.
Analyze Feature Flow to Enhance Interpretation and Steering in Language Models
Daniil Laptev, Nikita Balagansky, Yaroslav Aksenov et al.
Elucidating the design space of language models for image generation
Xuantong Liu, Shaozhe Hao, Xianbiao Qi et al.
AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring
Xinyi Wang, Na Zhao, Zhiyuan Han et al.
Position: The Artificial Intelligence and Machine Learning Community Should Adopt a More Transparent and Regulated Peer Review Process
Jing Yang
Revisiting a Design Choice in Gradient Temporal Difference Learning
Xiaochi Qian, Shangtong Zhang
MODULI: Unlocking Preference Generalization via Diffusion Models for Offline Multi-Objective Reinforcement Learning
Yifu Yuan, Zhenrui Zheng, Zibin Dong et al.
Autocorrelation Matters: Understanding the Role of Initialization Schemes for State Space Models
Fusheng Liu, Qianxiao Li
LLM-RG4: Flexible and Factual Radiology Report Generation Across Diverse Input Contexts
Zhuhao Wang, Yihua Sun, Zihan Li et al.
Pretraining Generative Flow Networks with Inexpensive Rewards for Molecular Graph Generation
Mohit Pandey, Gopeshh Subbaraj, Artem Cherkasov et al.
PanAdapter: Two-Stage Fine-Tuning with Spatial-Spectral Priors Injecting for Pansharpening
RuoCheng Wu, Zien Zhang, Shangqi Deng et al.
A Stronger Mixture of Low-Rank Experts for Fine-Tuning Foundation Models
Mengyang Sun, Yihao Wang, Tao Feng et al.
Can LLMs Handle WebShell Detection? Overcoming Detection Challenges with Behavioral Function-Aware Framework
Feijiang Han, Jiaming Zhang, Chuyi Deng et al.
FedAA: A Reinforcement Learning Perspective on Adaptive Aggregation for Fair and Robust Federated Learning
Jialuo He, Wei Chen, Xiaojin Zhang
Predicting mutational effects on protein binding from folding energy
Arthur Deng, Karsten Householder, Fang Wu et al.
Activation Space Interventions Can Be Transferred Between Large Language Models
Narmeen Oozeer, Dhruv Nathawani, Nirmalendu Prakash et al.
PhysAug: A Physical-guided and Frequency-based Data Augmentation for Single-Domain Generalized Object Detection
Xiaoran Xu, Jiangang Yang, Wenhui Shi et al.
Offline Safe Reinforcement Learning Using Trajectory Classification
Ze Gong, Akshat Kumar, Pradeep Varakantham
Learning Physics Informed Neural ODEs with Partial Measurements
Paul Ghanem, Ahmet Demirkaya, Tales Imbiriba et al.
Corrupted by Reasoning: Reasoning Language Models Become Free-Riders in Public Goods Games
David Guzman Piedrahita, Yongjin Yang, Mrinmaya Sachan et al.
The Perils of Optimizing Learned Reward Functions: Low Training Error Does Not Guarantee Low Regret
Lukas Fluri, Leon Lang, Alessandro Abate et al.
No-Regret is not enough! Bandits with General Constraints through Adaptive Regret Minimization
Martino Bernasconi, Matteo Castiglioni, Andrea Celli
SGTC: Semantic-Guided Triplet Co-training for Sparsely Annotated Semi-Supervised Medical Image Segmentation
Ke Yan, Qing Cai, Fan Zhang et al.
Bi-Directional Multi-Scale Graph Dataset Condensation via Information Bottleneck
Xingcheng Fu, Yisen Gao, Beining Yang et al.
6D Object Pose Tracking in Internet Videos for Robotic Manipulation
Georgy Ponimatkin, Martin Cífka, Tomas Soucek et al.
SUMI-IFL: An Information-Theoretic Framework for Image Forgery Localization with Sufficiency and Minimality Constraints
Ziqi Sheng, Wei Lu, Xiangyang Luo et al.
PSMGD: Periodic Stochastic Multi-Gradient Descent for Fast Multi-Objective Optimization
Mingjing Xu, Peizhong Ju, Jia Liu et al.
Diffusion Prior Interpolation for Flexibility Real-World Face Super-Resolution
Jiarui Yang, Tao Dai, Yufei Zhu et al.
DUO: Diverse, Uncertain, On-Policy Query Generation and Selection for Reinforcement Learning from Human Feedback
Xuening Feng, Zhaohui Jiang, Timo Kaufmann et al.
Self-Normalized Resets for Plasticity in Continual Learning
Vivek Farias, Adam Jozefiak
Universal Biological Sequence Reranking for Improved De Novo Peptide Sequencing
Zijie Qiu, Jiaqi Wei, Xiang Zhang et al.
Topo2Seq: Enhanced Topology Reasoning via Topology Sequence Learning
Yiming Yang, Yueru Luo, Bingkun He et al.
Counterfactual Concept Bottleneck Models
Gabriele Dominici, Pietro Barbiero, Francesco Giannini et al.
SADA: Stability-guided Adaptive Diffusion Acceleration
Ting Jiang, Yixiao Wang, Hancheng Ye et al.
On the Optimal Memorization Capacity of Transformers
Tokio Kajitsuka, Issei Sato
AdaFisher: Adaptive Second Order Optimization via Fisher Information
Damien GOMES, Yanlei Zhang, Eugene Belilovsky et al.
Deep Incomplete Multi-view Learning via Cyclic Permutation of VAEs
Xin Gao, Jian Pu
T-JEPA: Augmentation-Free Self-Supervised Learning for Tabular Data
Hugo Thimonier, José Lucas De Melo Costa, Fabrice Popineau et al.
SLiM: One-shot Quantization and Sparsity with Low-rank Approximation for LLM Weight Compression
Mohammad Mozaffari, Amir Yazdanbakhsh, Maryam Mehri Dehnavi
Residual Matrix Transformers: Scaling the Size of the Residual Stream
Brian Mak, Jeffrey Flanigan
One Leaf Reveals the Season: Occlusion-Based Contrastive Learning with Semantic-Aware Views for Efficient Visual Representation
Xiaoyu Yang, Lijian Xu, Hongsheng Li et al.
Interpretable Face Anti-Spoofing: Enhancing Generalization with Multimodal Large Language Models
Guosheng Zhang, Keyao Wang, Haixiao Yue et al.
UV-Attack: Physical-World Adversarial Attacks on Person Detection via Dynamic-NeRF-based UV Mapping
Yanjie Li, Kaisheng Liang, Bin Xiao
Zero-resource Hallucination Detection for Text Generation via Graph-based Contextual Knowledge Triples Modeling
Xinyue Fang, Zhen Huang, Zhiliang Tian et al.
StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization
Jinlu Zhang, Jiji Tang, Rongsheng Zhang et al.
Spherical Tree-Sliced Wasserstein Distance
Viet-Hoang Tran, Thanh Chu, Minh-Khoi Nguyen-Nhat et al.
End-to-end Learning of Gaussian Mixture Priors for Diffusion Sampler
Denis Blessing, Xiaogang Jia, Gerhard Neumann
Stable Hadamard Memory: Revitalizing Memory-Augmented Agents for Reinforcement Learning
Hung Le, Dung Nguyen, Kien Do et al.
MTGA: Multi-View Temporal Granularity Aligned Aggregation for Event-Based Lip-Reading
Wenhao Zhang, Jun Wang, Yong Luo et al.
Beyond Spatial Domain: Cross-domain Promoted Fourier Convolution Helps Single Image Dehazing
Xiaozhe Zhang, Haidong Ding, Fengying Xie et al.
Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues
Yan Zhang, Gangyan Zeng, Huawen Shen et al.
Category Prompt Mamba Network for Nuclei Segmentation and Classification
Ye Zhang, Zijie Fang, Yifeng Wang et al.
Improving Generalization of Universal Adversarial Perturbation via Dynamic Maximin Optimization
Yechao Zhang, Yingzhe Xu, Junyu Shi et al.
Supercharging Graph Transformers with Advective Diffusion
Qitian Wu, Chenxiao Yang, Kaipeng Zeng et al.
Addressing Imbalanced Domain-Incremental Learning through Dual-Balance Collaborative Experts
Lan Li, Da-Wei Zhou, Han-Jia Ye et al.
Robust Conformal Outlier Detection under Contaminated Reference Data
Meshi Bashari, Matteo Sesia, Yaniv Romano
ZeroHAR: Sensor Context Augments Zero-Shot Wearable Action Recognition
Ranak Roy Chowdhury, Ritvik Kapila, Ameya Panse et al.
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
Shenao Zhang, Zhihan Liu, Boyi Liu et al.
Implicit Bias of Gradient Descent for Non-Homogeneous Deep Networks
Yuhang Cai, Kangjie Zhou, Jingfeng Wu et al.
Not All LLM-Generated Data Are Equal: Rethinking Data Weighting in Text Classification
Hsun-Yu Kuo, Yin-Hsiang Liao, Yu-Chieh Chao et al.
Capturing Temporal Dynamics in Large-Scale Canopy Tree Height Estimation
Jan Pauls, Max Zimmer, Berkant Turan et al.
Phoneme-Level Feature Discrepancies: A Key to Detecting Sophisticated Speech Deepfakes
Kuiyuan Zhang, Zhongyun Hua, Rushi Lan et al.
Locally Convex Global Loss Network for Decision-Focused Learning
Haeun Jeon, Hyunglip Bae, Minsu Park et al.
Scalable Generation of Spatial Transcriptomics from Histology Images via Whole-Slide Flow Matching
Tinglin Huang, Tianyu Liu, Mehrtash Babadi et al.
Differential Coding for Training-Free ANN-to-SNN Conversion
Zihan Huang, Wei Fang, Tong Bu et al.
Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources
Weizhi Wang, Yu Tian, Linjie Yang et al.
IDInit: A Universal and Stable Initialization Method for Neural Network Training
Yu Pan, Chaozheng Wang, Zekai Wu et al.
Exploit Your Latents: Coarse-Grained Protein Backmapping with Latent Diffusion Models
Rongchao Zhang, Yu Huang, Yiwei Lou et al.
Robust Weight Initialization for Tanh Neural Networks with Fixed Point Analysis
Hyunwoo Lee, Hayoung Choi, Hyunju Kim
MUC: Mixture of Uncalibrated Cameras for Robust 3D Human Body Reconstruction
Yitao Zhu, Sheng Wang, Mengjie Xu et al.
Revisiting the Predictability of Performative, Social Events
Juan Perdomo
Towards Trustworthy Federated Learning with Untrusted Participants
Youssef Allouah, Rachid Guerraoui, John Stephan
Constrained Belief Updates Explain Geometric Structures in Transformer Representations
Mateusz Piotrowski, Paul Riechers, Daniel Filan et al.
Massively Parallel Continuous Local Search for Hybrid SAT Solving on GPUs
Yunuo Cen, Zhiwei Zhang, Xuanyao Fong
Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer
Yilun Kong, Guozheng Ma, Qi Zhao et al.
To Steer or Not to Steer? Mechanistic Error Reduction with Abstention for Language Models
Anna Hedström, Salim I. Amoukou, Tom Bewley et al.
DRL: Decomposed Representation Learning for Tabular Anomaly Detection
Hangting Ye, He Zhao, Wei Fan et al.
Inverse Bridge Matching Distillation
Nikita Gushchin, David Li, Daniil Selikhanovych et al.
Efficient Time Series Processing for Transformers and State-Space Models through Token Merging
Leon Götz, Marcel Kollovieh, Stephan Günnemann et al.
The Lock-in Hypothesis: Stagnation by Algorithm
Tianyi Qiu, Zhonghao He, Tejasveer Chugh et al.
Data-adaptive Differentially Private Prompt Synthesis for In-Context Learning
Fengyu Gao, Ruida Zhou, Tianhao Wang et al.
Scaling Laws for Floating–Point Quantization Training
Xingwu Sun, Shuaipeng Li, Ruobing Xie et al.
PARQ: Piecewise-Affine Regularized Quantization
Lisa Jin, Jianhao Ma, Zechun Liu et al.
Manifold Induced Biases for Zero-shot and Few-shot Detection of Generated Images
Jonathan Brokman, Amit Giloni, Omer Hofman et al.
How Transformers Learn Regular Language Recognition: A Theoretical Study on Training Dynamics and Implicit Bias
Ruiquan Huang, Yingbin LIANG, Jing Yang
Scaling Analysis of Interleaved Speech-Text Language Models
Gallil Maimon, Michael Hassid, Amit Roth et al.
Test-Time Training Provably Improves Transformers as In-context Learners
Halil Alperen Gozeten, Muhammed Emrullah Ildiz, Xuechen Zhang et al.
QA-Calibration of Language Model Confidence Scores
Putra Manggala, Atalanti A Mastakouri, Elke Kirschbaum et al.
MAPS: Advancing Multi-Modal Reasoning in Expert-Level Physical Science
Erle Zhu, Yadi Liu, Zhe Zhang et al.
When Diffusion Models Memorize: Inductive Biases in Probability Flow of Minimum-Norm Shallow Neural Nets
Chen Zeno, Hila Manor, Gregory Ongie et al.
Learning Soft Sparse Shapes for Efficient Time-Series Classification
Zhen Liu, Yicheng Luo, Boyuan Li et al.
Reflection-Window Decoding: Text Generation with Selective Refinement
Zeyu Tang, Zhenhao Chen, Xiangchen Song et al.
ERL-MPP: Evolutionary Reinforcement Learning with Multi-head Puzzle Perception for Solving Large-scale Jigsaw Puzzles of Eroded Gaps
Xingke Song, Xiaoying Yang, Chenglin Yao et al.
Constructing Confidence Intervals for Average Treatment Effects from Multiple Datasets
Yuxin Wang, Maresa Schröder, Dennis Frauen et al.
Motion Control of High-Dimensional Musculoskeletal Systems with Hierarchical Model-Based Planning
Yunyue Wei, Shanning Zhuang, Vincent Zhuang et al.
PixelMan: Consistent Object Editing with Diffusion Models via Pixel Manipulation and Generation
Liyao Jiang, Negar Hassanpour, Mohammad Salameh et al.
Matcha: Mitigating Graph Structure Shifts with Test-Time Adaptation
Wenxuan Bao, Zhichen Zeng, Zhining Liu et al.
Learning Graph Invariance by Harnessing Spuriosity
Tianjun Yao, Yongqiang Chen, Kai Hu et al.
RAZOR: Sharpening Knowledge by Cutting Bias with Unsupervised Text Rewriting
Shuo Yang, Bardh Prenkaj, Gjergji Kasneci
Offline Hierarchical Reinforcement Learning via Inverse Optimization
Carolin Schmidt, Daniele Gammelli, James Harrison et al.
DesignEdit: Unify Spatial-Aware Image Editing via Training-free Inpainting with a Multi-Layered Latent Diffusion Framework
Yueru Jia, Aosong Cheng, Yuhui Yuan et al.
ReFF: Reinforcing Format Faithfulness in Language Models Across Varied Tasks
Jiashu Yao, Heyan Huang, Zeming Liu et al.
MetaDesigner: Advancing Artistic Typography through AI-Driven, User-Centric, and Multilingual WordArt Synthesis
Jun-Yan He, Zhi-Qi Cheng, Chenyang Li et al.
Scaling Combinatorial Optimization Neural Improvement Heuristics with Online Search and Adaptation
Federico Julian Camerota Verdù, Lorenzo Castelli, Luca Bortolussi
Every Component Counts: Rethinking the Measure of Success for Medical Semantic Segmentation in Multi-Instance Segmentation Tasks
Alexander Jaus, Constantin Marc Seibold, Simon Reiß et al.
FedSPU: Personalized Federated Learning for Resource-Constrained Devices with Stochastic Parameter Update
Ziru Niu, Hai Dong, A. K. Qin
Towards Homogeneous Lexical Tone Decoding from Heterogeneous Intracranial Recordings
Di Wu, Siyuan Li, Chen Feng et al.
Diffusion-based Adversarial Purification from the Perspective of the Frequency Domain
Gaozheng Pei, Ke Ma, Yingfei Sun et al.
Generalists vs. Specialists: Evaluating LLMs on Highly-Constrained Biophysical Sequence Optimization Tasks
Angelica Chen, Samuel Stanton, Frances Ding et al.
Hierarchically-Structured Open-Vocabulary Indoor Scene Synthesis with Pre-trained Large Language Model
Weilin Sun, Xinran Li, Manyi Li et al.
Diversifying Query: Region-Guided Transformer for Temporal Sentence Grounding
Xiaolong Sun, Liushuai Shi, Le Wang et al.
Understanding and Mitigating Memorization in Generative Models via Sharpness of Probability Landscapes
Dongjae Jeon, Dueun Kim, Albert No
Generative Medical Segmentation
Jiayu Huo, Xi Ouyang, Sébastien Ourselin et al.
FeatSharp: Your Vision Model Features, Sharper
Mike Ranzinger, Greg Heinrich, Pavlo Molchanov et al.
p-Mean Regret for Stochastic Bandits
Anand Krishna, Philips George John, Adarsh Barik et al.
MTVHunter: Smart Contracts Vulnerability Detection Based on Multi-Teacher Knowledge Translation
Guokai Sun, Yuan Zhuang, Shuo Zhang et al.
Wasserstein Distances, Neuronal Entanglement, and Sparsity
Shashata Sawmya, Linghao Kong, Ilia Markov et al.
Memorize and Rank: Elevating Large Language Models for Clinical Diagnosis Prediction
Mingyu Derek Ma, Xiaoxuan Wang, Yijia Xiao et al.
AdaSplash: Adaptive Sparse Flash Attention
Nuno Gonçalves, Marcos V. Treviso, Andre Martins
Dueling Convex Optimization with General Preferences
Aadirupa Saha, Tomer Koren, Yishay Mansour
PFDiff: Training-Free Acceleration of Diffusion Models Combining Past and Future Scores
Guangyi Wang, Yuren Cai, lijiang Li et al.
A Training-free Synthetic Data Selection Method for Semantic Segmentation
Hao Tang, Siyue Yu, Jian Pang et al.
Expressive Power of Temporal Message Passing
Przemysław Andrzej Wałęga, Michael Rawson
On the Hölder Stability of Multiset and Graph Neural Networks
Yair Davidson, Nadav Dym
Active Large Language Model-Based Knowledge Distillation for Session-Based Recommendation
Yingpeng Du, Zhu Sun, Ziyan Wang et al.
Efficient Construction of Model Family through Progressive Training Using Model Expansion
Kazuki Yano, Sho Takase, Sosuke Kobayashi et al.
SAP: Corrective Machine Unlearning with Scaled Activation Projection for Label Noise Robustness
Sangamesh Kodge, Deepak Ravikumar, Gobinda Saha et al.
Density Ratio Estimation with Conditional Probability Paths
Hanlin Yu, Arto Klami, Aapo Hyvarinen et al.
Direct Motion Models for Assessing Generated Videos
Kelsey Allen, Carl Doersch, Guangyao Zhou et al.
ScImage: How good are multimodal large language models at scientific text-to-image generation?
Leixin Zhang, Steffen Eger, Yinjie Cheng et al.
Two-Timescale Critic-Actor for Average Reward MDPs with Function Approximation
Prashansa Panda, Shalabh Bhatnagar
Specifying What You Know or Not for Multi-Label Class-Incremental Learning
Aoting Zhang, Dongbao Yang, Chang Liu et al.
CityAnchor: City-scale 3D Visual Grounding with Multi-modality LLMs
Jinpeng Li, Haiping Wang, Jiabin chen et al.
VProChart: Answering Chart Question Through Visual Perception Alignment Agent and Programmatic Solution Reasoning
Muye Huang, Lingling Zhang, Han Lai et al.
Contextualizing biological perturbation experiments through language
Menghua (Rachel) Wu, Russell Littman, Jacob Levine et al.
The Price of Freedom: Exploring Expressivity and Runtime Tradeoffs in Equivariant Tensor Products
YuQing Xie, Ameya Daigavane, Mit Kotak et al.
SyncMind: Measuring Agent Out-of-Sync Recovery in Collaborative Software Engineering
Xuehang Guo, Xingyao Wang, Yangyi Chen et al.
(Im)possibility of Automated Hallucination Detection in Large Language Models
Amin Karbasi, Omar Montasser, John Sous et al.
On-the-fly Preference Alignment via Principle-Guided Decoding
Mingye Zhu, Yi Liu, Lei Zhang et al.
MonoBox: Tightness-Free Box-Supervised Polyp Segmentation Using Monotonicity Constraint
Qiang Hu, Zhenyu Yi, Ying Zhou et al.
Spatial Reasoning with Denoising Models
Christopher Wewer, Bartlomiej Pogodzinski, Bernt Schiele et al.
MPO: An Efficient Post-Processing Framework for Mixing Diverse Preference Alignment
Tianze Wang, Dongnan Gui, Yifan Hu et al.
Poplar: Efficient Scaling of Distributed DNN Training on Heterogeneous GPU Clusters
WenZheng Zhang, Yang Hu, Jing Shi et al.
C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing
Zhongyang Li, Ziyue Li, Tianyi Zhou
Depth Degeneracy in Neural Networks: Vanishing Angles in Fully Connected ReLU Networks on Initialization
Cameron Jakub, Mihai Nica
Aligning with Logic: Measuring, Evaluating and Improving Logical Preference Consistency in Large Language Models
Yinhong Liu, Zhijiang Guo, Tianya Liang et al.
Generative Intervention Models for Causal Perturbation Modeling
Nora Schneider, Lars Lorch, Niki Kilbertus et al.
The Global Convergence Time of Stochastic Gradient Descent in Non-Convex Landscapes: Sharp Estimates via Large Deviations
Waïss Azizian, Franck Iutzeler, Jérôme Malick et al.
Continuous Visual Autoregressive Generation via Score Maximization
Chenze Shao, Fandong Meng, Jie Zhou
Linear Transformers as VAR Models: Aligning Autoregressive Attention Mechanisms with Autoregressive Forecasting
Jiecheng Lu, Shihao Yang
Enforcing Latent Euclidean Geometry in Single-Cell VAEs for Manifold Interpolation
Alessandro Palma, Sergei Rybakov, Leon Hetzel et al.
GRADEO: Towards Human-Like Evaluation for Text-to-Video Generation via Multi-Step Reasoning
Zhun Mou, Bin Xia, Zhengchao Huang et al.
Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC
Tyler Clark, Mark Towers, Christine Evers et al.
MagicNaming: Consistent Identity Generation by Finding a “Name Space” in T2I Diffusion Models
Jing Zhao, Heliang Zheng, Chaoyue Wang et al.
SysBench: Can LLMs Follow System Message?
Yanzhao Qin, Tao Zhang, Tao Zhang et al.
Attribute-based Visual Reprogramming for Vision-Language Models
Chengyi Cai, Zesheng Ye, Lei Feng et al.
Gating is Weighting: Understanding Gated Linear Attention through In-context Learning
Yingcong Li, Davoud Ataee Tarzanagh, Ankit Singh Rawat et al.
Guided Search Strategies in Non-Serializable Environments with Applications to Software Engineering Agents
Karina Zainullina, Aleksandr Golubev, Maria Trofimova et al.
Score-based Pullback Riemannian Geometry: Extracting the Data Manifold Geometry using Anisotropic Flows
Willem Diepeveen, Georgios Batzolis, Zakhar Shumaylov et al.
RoSTE: An Efficient Quantization-Aware Supervised Fine-Tuning Approach for Large Language Models
Quan Wei, Chung-Yiu Yau, Hoi To Wai et al.
Fast and Low-Cost Genomic Foundation Models via Outlier Removal
Haozheng Luo, Chenghao Qiu, Maojiang Su et al.
Position: Build Agent Advocates, Not Platform Agents
Sayash Kapoor, Noam Kolt, Seth Lazar
LoX: Low-Rank Extrapolation Robustifies LLM Safety Against Fine-tuning
Gabriel Jacob Perin, Runjin Chen, Xuxi Chen et al.
Intra and Inter Parser-Prompted Transformers for Effective Image Restoration
Cong Wang, Jinshan Pan, Liyan Wang et al.
Efficient ANN-SNN Conversion with Error Compensation Learning
chang liu, Jiangrong Shen, Xuming Ran et al.
On Zero-Initialized Attention: Optimal Prompt and Gating Factor Estimation
Nghiem Diep, Huy Nguyen, Chau Nguyen et al.
BigCharts-R1: Enhanced Chart Reasoning with Visual Reinforcement Finetuning
Ahmed Masry, Abhay Puri, Masoud Hashemi et al.
Learning Causal Alignment for Reliable Disease Diagnosis
Mingzhou Liu, Ching-Wen Lee, Xinwei Sun et al.
Devil is in the Details: Density Guidance for Detail-Aware Generation with Flow Models
Rafał Karczewski, Markus Heinonen, Vikas Garg
Refining Adaptive Zeroth-Order Optimization at Ease
Yao Shu, Qixin Zhang, Kun He et al.
BLS-GAN: A Deep Layer Separation Framework for Eliminating Bone Overlap in Conventional Radiographs
Haolin Wang, Yafei Ou, Prasoon Ambalathankandy et al.
ComPC: Completing a 3D Point Cloud with 2D Diffusion Priors
Tianxin Huang, Zhiwen Yan, Yuyang Zhao et al.
MetaNeRV: Meta Neural Representations for Videos with Spatial-Temporal Guidance
Jialong Guo, Ke Liu, Jiangchao Yao et al.
Scalable Quantum-Inspired Optimization Through Dynamic Qubit Compression
Co Tran, Quoc-Bao Tran, Hy Truong Son et al.
Whole Genome Transformer for Gene Interaction Effects in Microbiome Habitat Specificity
Zhufeng Li, Sandeep Suresh Cranganore, Nicholas Youngblut et al.
Optimal Non-Asymptotic Rates of Value Iteration for Average-Reward Markov Decision Processes
Jongmin Lee, Ernest Ryu
FRUGAL: Memory-Efficient Optimization by Reducing State Overhead for Scalable Training
Filipp Zmushko, Aleksandr Beznosikov, Martin Takac et al.
Efficient Connectivity-Preserving Instance Segmentation with Supervoxel-Based Loss Function
Anna Grim, Jayaram Chandrashekar, Uygar Sümbül
Natural Language Inference Improves Compositionality in Vision-Language Models
Paola Cascante-Bonilla, Yu (Hope) Hou, Yang Cao et al.
A Simple Approach to Unifying Diffusion-based Conditional Generation
Xirui Li, Charles Herrmann, Kelvin Chan et al.
Robust Multimodal Large Language Models Against Modality Conflict
Zongmeng Zhang, Wengang Zhou, Jie Zhao et al.