Most Cited 2025 "pseudo-label inference" Papers
22,274 papers found • Page 37 of 112
Conference
Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues
Yan Zhang, Gangyan Zeng, Huawen Shen et al.
When Bad Data Leads to Good Models
Kenneth Li, Yida Chen, Fernanda Viégas et al.
Activation by Interval-wise Dropout: A Simple Way to Prevent Neural Networks from Plasticity Loss
Sangyeon Park, Isaac Han, Seungwon Oh et al.
Bayesian Experimental Design Via Contrastive Diffusions
Jacopo Iollo, Christophe Heinkelé, Pierre Alliez et al.
Efficient First-Order Optimization on the Pareto Set for Multi-Objective Learning under Preference Guidance
Lisha Chen, Quan Xiao, Ellen Fukuda et al.
Conformal Language Model Reasoning with Coherent Factuality
Maxon Rubin-Toles, Maya Gambhir, Keshav Ramji et al.
Lawma: The Power of Specialization for Legal Annotation
Ricardo Dominguez-Olmedo, Vedant Nanda, Rediet Abebe et al.
Are Expressive Models Truly Necessary for Offline RL?
Guan Wang, Haoyi Niu, Jianxiong Li et al.
Ask, and it shall be given: On the Turing completeness of prompting
Ruizhong Qiu, Zhe Xu, Wenxuan Bao et al.
Let Me Grok for You: Accelerating Grokking via Embedding Transfer from a Weaker Model
Zhiwei Xu, Zhiyu Ni, Yixin Wang et al.
New Perspectives on the Polyak Stepsize: Surrogate Functions and Negative Results
Francesco Orabona, Ryan D'Orazio
Robust Offline Reinforcement Learning with Linearly Structured $f$-Divergence Regularization
Cheng Tang, Zhishuai Liu, Pan Xu
Tight Clusters Make Specialized Experts
Stefan Nielsen, Rachel Teo, Laziz Abdullaev et al.
Jacobian Sparse Autoencoders: Sparsify Computations, Not Just Activations
Lucy Farnik, Tim Lawson, Conor Houghton et al.
SigDiffusions: Score-Based Diffusion Models for Time Series via Log-Signature Embeddings
Barbora Barancikova, Zhuoyue Huang, Cristopher Salvi
Understanding Synthetic Context Extension via Retrieval Heads
Xinyu Zhao, Fangcong Yin, Greg Durrett
BlockDialect: Block-wise Fine-grained Mixed Format Quantization for Energy-Efficient LLM Inference
Wonsuk Jang, Thierry Tambe
Dynamic Graph Learning with Static Relations for Credit Risk Assessment
Qi Yuan, Yang Liu, Yateng Tang et al.
Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better
Enshu Liu, Junyi Zhu, Zinan Lin et al.
Bridging Training and Execution via Dynamic Directed Graph-Based Communication in Cooperative Multi-Agent Systems
Zhuohui Zhang, Bin He, Bin Cheng et al.
How to Train Your LLM Web Agent: A Statistical Diagnosis
Dheeraj Vattikonda, Santhoshi Ravichandran, Emiliano Penaloza et al.
FedTMOS: Efficient One-Shot Federated Learning with Tsetlin Machine
Shannon How, Jagmohan Chauhan, Geoff Merrett et al.
EquivaMap: Leveraging LLMs for Automatic Equivalence Checking of Optimization Formulations
Haotian Zhai, Connor Lawless, Ellen Vitercik et al.
CAMEx: Curvature-aware Merging of Experts
Dung Viet Nguyen, Minh Nguyen, Luc Nguyen et al.
Asymmetric Visual Semantic Embedding Framework for Efficient Vision-Language Alignment
Yang Liu, Mengyuan Liu, Shudong Huang et al.
Position: AI Competitions Provide the Gold Standard for Empirical Rigor in GenAI Evaluation
D. Sculley, William Cukierski, Phil Culliton et al.
Multifaceted User Modeling in Recommendation: A Federated Foundation Models Approach
Chunxu Zhang, Guodong Long, Hongkuan Guo et al.
WaterDiffusion: Learning a Prior-involved Unrolling Diffusion for Joint Underwater Saliency Detection and Visual Restoration
Laibin Chang, Yunke Wang, Longxiang Deng et al.
Aligning Multimodal Representations through an Information Bottleneck
Antonio Almudévar, Jose Miguel Hernandez-Lobato, Sameer Khurana et al.
Forte : Finding Outliers with Representation Typicality Estimation
Debargha Ganguly, Warren Morningstar, Andrew Yu et al.
Bootstrapped Model Predictive Control
Yuhang Wang, Hanwei Guo, Sizhe Wang et al.
Text2Relight: Creative Portrait Relighting with Text Guidance
Junuk Cha, Mengwei Ren, Krishna Kumar Singh et al.
SGTC: Semantic-Guided Triplet Co-training for Sparsely Annotated Semi-Supervised Medical Image Segmentation
Ke Yan, Qing Cai, Fan Zhang et al.
Making Large Vision Language Models to Be Good Few-Shot Learners
Fan Liu, Wenwen Cai, Jian Huo et al.
PhysAug: A Physical-guided and Frequency-based Data Augmentation for Single-Domain Generalized Object Detection
Xiaoran Xu, Jiangang Yang, Wenhui Shi et al.
Comparing noisy neural population dynamics using optimal transport distances
Amin Nejatbakhsh, Victor Geadah, Alex Williams et al.
Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents
Han Lin, Jaemin Cho, Amir Zadeh et al.
Zero-resource Hallucination Detection for Text Generation via Graph-based Contextual Knowledge Triples Modeling
Xinyue Fang, Zhen Huang, Zhiliang Tian et al.
Linear combinations of latents in generative models: subspaces and beyond
Erik Bodin, Alexandru Stere, Dragos Margineantu et al.
Causal Discovery from Conditionally Stationary Time Series
Carles Balsells-Rodas, Xavier Sumba, Tanmayee Narendra et al.
Bridging Information Asymmetry in Text-video Retrieval: A Data-centric Approach
Zechen Bai, Tianjun Xiao, Tong He et al.
Bridging Molecular Graphs and Large Language Models
Runze Wang, Mingqi Yang, Yanming Shen
EARL-BO: Reinforcement Learning for Multi-Step Lookahead, High-Dimensional Bayesian Optimization
Mujin Cheon, Jay Lee, Dong-Yeun Koh et al.
Is Large-scale Pretraining the Secret to Good Domain Generalization?
Piotr Teterwak, Kuniaki Saito, Theodoros Tsiligkaridis et al.
Automatically Generating Numerous Context-Driven SFT Data for LLMs Across Diverse Granularity
Shanghaoran Quan
SADA: Stability-guided Adaptive Diffusion Acceleration
Ting Jiang, Yixiao Wang, Hancheng Ye et al.
Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning
Amin Karimi Monsefi, Mengxi Zhou, Nastaran Monsefi et al.
General framework for online-to-nonconvex conversion: Schedule-free SGD is also effective for nonconvex optimization
Kwangjun Ahn, Gagik Magakyan, Ashok Cutkosky
Population Aware Diffusion for Time Series Generation
Yang Li, Han Meng, Zhenyu Bi et al.
Biologically Plausible Brain Graph Transformer
Ciyuan Peng, Yuelong Huang, Qichao Dong et al.
Investigating the Pre-Training Dynamics of In-Context Learning: Task Recognition vs. Task Learning
Xiaolei Wang, Xinyu Tang, Junyi Li et al.
Accelerating Linear Recurrent Neural Networks for the Edge with Unstructured Sparsity
Alessandro Pierro, Steven Abreu, Jonathan Timcheck et al.
ExcluIR: Exclusionary Neural Information Retrieval
Wenhao Zhang, Mengqi Zhang, Shiguang Wu et al.
SAFEx: Analyzing Vulnerabilities of MoE-Based LLMs via Stable Safety-critical Expert Identification
Zhenglin Lai, Mengyao Liao, Bingzhe Wu et al.
Precedence-Constrained Winter Value for Effective Graph Data Valuation
Hongliang Chi, Wei Jin, Charu Aggarwal et al.
Robust Conformal Outlier Detection under Contaminated Reference Data
Meshi Bashari, Matteo Sesia, Yaniv Romano
Decoupling Layout from Glyph in Online Chinese Handwriting Generation
Minsi Ren, Yan-Ming Zhang, yi chen
EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting
Bohao Liao, Wei Zhai, Zengyu Wan et al.
When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding
Yan Shu, Hangui Lin, Yexin Liu et al.
Infer Human’s Intentions Before Following Natural Language Instructions
Yanming Wan, Yue Wu, Yiping Wang et al.
Understanding and Mitigating Memorization in Diffusion Models for Tabular Data
Zhengyu Fang, Zhimeng Jiang, Huiyuan Chen et al.
A Robust Prototype-Based Network with Interpretable RBF Classifier Foundations
Sascha Saralajew, Ashish Rana, Thomas Villmann et al.
One Leaf Reveals the Season: Occlusion-Based Contrastive Learning with Semantic-Aware Views for Efficient Visual Representation
Xiaoyu Yang, Lijian Xu, Hongsheng Li et al.
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
Shenao Zhang, Zhihan Liu, Boyi Liu et al.
AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring
Xinyi Wang, Na Zhao, Zhiyuan Han et al.
MLC-NC: Long-Tailed Multi-Label Image Classification Through the Lens of Neural Collapse
Zijian Tao, Shao-Yuan Li, Wenhai Wan et al.
Autocorrelation Matters: Understanding the Role of Initialization Schemes for State Space Models
Fusheng Liu, Qianxiao Li
Cocoon: Robust Multi-Modal Perception with Uncertainty-Aware Sensor Fusion
Minkyoung Cho, Yulong Cao, Jiachen Sun et al.
Implicit Bias of Gradient Descent for Non-Homogeneous Deep Networks
Yuhang Cai, Kangjie Zhou, Jingfeng Wu et al.
Optimal Strong Regret and Violation in Constrained MDPs via Policy Optimization
Francesco Emanuele Stradi, Matteo Castiglioni, Alberto Marchesi et al.
Topo2Seq: Enhanced Topology Reasoning via Topology Sequence Learning
Yiming Yang, Yueru Luo, Bingkun He et al.
DUO: Diverse, Uncertain, On-Policy Query Generation and Selection for Reinforcement Learning from Human Feedback
Xuening Feng, Zhaohui Jiang, Timo Kaufmann et al.
MathSpeech: Leveraging Small LMs for Accurate Conversion in Mathematical Speech-to-Formula
Sieun Hyeon, Kyudan Jung, Jaehee Won et al.
Activation Space Interventions Can Be Transferred Between Large Language Models
Narmeen Oozeer, Dhruv Nathawani, Nirmalendu Prakash et al.
Stiefel Flow Matching for Moment-Constrained Structure Elucidation
Austin H Cheng, Alston Lo, Kin Long Kelvin Lee et al.
DiffRetouch: Using Diffusion to Retouch on the Shoulder of Experts
Zheng-Peng Duan, Jiawei Zhang, Zheng Lin et al.
Neuron-based Multifractal Analysis of Neuron Interaction Dynamics in Large Models
Xiongye Xiao, Heng Ping, Chenyu Zhou et al.
Revisiting a Design Choice in Gradient Temporal Difference Learning
Xiaochi Qian, Shangtong Zhang
Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference
Matt Riemer, Gopeshh Raaj Subbaraj, Glen Berseth et al.
Planning from Imagination: Episodic Simulation and Episodic Memory for Vision-and-Language Navigation
Yiyuan Pan, Yunzhe Xu, Zhe Liu et al.
DualCP: Rehearsal-Free Domain-Incremental Learning via Dual-Level Concept Prototype
Qiang Wang, Yuhang He, Songlin Dong et al.
Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection
Chuhan ZHANG, Chaoyang Zhu, Pingcheng Dong et al.
Capturing Temporal Dynamics in Large-Scale Canopy Tree Height Estimation
Jan Pauls, Max Zimmer, Berkant Turan et al.
Collapse-Proof Non-Contrastive Self-Supervised Learning
EMANUELE SANSONE, Tim Lebailly, Tinne Tuytelaars
Q-MAML: Quantum Model-Agnostic Meta-Learning for Variational Quantum Algorithms
Junyong Lee, Jeihee Cho, Shiho Kim
Playmate: Flexible Control of Portrait Animation via 3D-Implicit Space Guided Diffusion
Xingpei Ma, Jiaran Cai, Yuansheng Guan et al.
DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation
Jiashuo Sun, Xianrui Zhong, Sizhe Zhou et al.
B-score: Detecting biases in large language models using response history
An Vo, Mohammad Reza Taesiri, Daeyoung Kim et al.
Predicting mutational effects on protein binding from folding energy
Arthur Deng, Karsten Householder, Fang Wu et al.
Regress, Don't Guess: A Regression-like Loss on Number Tokens for Language Models
Jonas Zausinger, Lars Pennig, Anamarija Kozina et al.
Muses: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration
Yanbo Ding, Shaobin Zhuang, Kunchang Li et al.
PoseLLaVA: Pose Centric Multimodal LLM for Fine-Grained 3D Pose Manipulation
Dong Feng, Ping Guo, Encheng Peng et al.
SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs
Xin Su, Man Luo, Kris Pan et al.
SPD: Sync-Point Drop for Efficient Tensor Parallelism of Large Language Models
Han-Byul Kim, Duc Hoang, Arnav Kundu et al.
Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval
Guangyuan Ma, Yongliang Ma, Xing Wu et al.
MPO: An Efficient Post-Processing Framework for Mixing Diverse Preference Alignment
Tianze Wang, Dongnan Gui, Yifan Hu et al.
Think Thrice Before You Act: Progressive Thought Refinement in Large Language Models
Chengyu Du, Jinyi Han, Yizhou Ying et al.
Conditional Diffusion Models are Minimax-Optimal and Manifold-Adaptive for Conditional Distribution Estimation
Rong Tang, Lizhen Lin, Yun Yang
Sparis: Neural Implicit Surface Reconstruction of Indoor Scenes from Sparse Views
Yulun Wu, Han Huang, Wenyuan Zhang et al.
Faithful and Accurate Self-Attention Attribution for Message Passing Neural Networks via the Computation Tree Viewpoint
Yong-Min Shin, Siqing Li, Xin Cao et al.
Handling Delay in Real-Time Reinforcement Learning
Ivan Anokhin, Rishav Rishav, Matt Riemer et al.
Scaling Sparse Feature Circuits For Studying In-Context Learning
Dmitrii Kharlapenko, Stepan Shabalin, Arthur Conmy et al.
SyncMind: Measuring Agent Out-of-Sync Recovery in Collaborative Software Engineering
Xuehang Guo, Xingyao Wang, Yangyi Chen et al.
Optimal Non-Asymptotic Rates of Value Iteration for Average-Reward Markov Decision Processes
Jongmin Lee, Ernest Ryu
Zeroth-Order Optimization Finds Flat Minima
Liang Zhang, Bingcong Li, Kiran Thekumparampil et al.
A Generic Framework for Conformal Fairness
Aditya Vadlamani, Anutam Srinivasan, Pranav Maneriker et al.
Generative Intervention Models for Causal Perturbation Modeling
Nora Schneider, Lars Lorch, Niki Kilbertus et al.
One Image is Worth a Thousand Words: A Usability Preservable Text-Image Collaborative Erasing Framework
Feiran Li, Qianqian Xu, Shilong Bao et al.
Neural Entropy
Akhil Premkumar
Tuning Sequential Monte Carlo Samplers via Greedy Incremental Divergence Minimization
Kyurae Kim, Zuheng Xu, Jacob Gardner et al.
Every Rollout Counts: Optimal Resource Allocation for Efficient Test-Time Scaling
Xinglin Wang, Yiwei Li, Shaoxiong Feng et al.
Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark
Yongliang Wu, Wenbo Zhu, Jiawang Cao et al.
Kernel-based Unsupervised Embedding Alignment for Enhanced Visual Representation in Vision-language Models
Shizhan Gong, Yankai Jiang, DOU QI et al.
Cape: Context-Aware Prompt Perturbation Mechanism with Differential Privacy
Haoqi Wu, Wei Dai, Wang Li et al.
Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models
Linh Tran, Wei Sun, Stacy Patterson et al.
CSA: Data-efficient Mapping of Unimodal Features to Multimodal Features
Po-han Li, Sandeep Chinchali, ufuk topcu
ERL-MPP: Evolutionary Reinforcement Learning with Multi-head Puzzle Perception for Solving Large-scale Jigsaw Puzzles of Eroded Gaps
Xingke Song, Xiaoying Yang, Chenglin Yao et al.
On the Vulnerability of Applying Retrieval-Augmented Generation within Knowledge-Intensive Application Domains
Xun Xian, Ganghua Wang, Xuan Bi et al.
Focus On This, Not That! Steering LLMs with Adaptive Feature Specification
Tom A. Lamb, Adam Davies, Alasdair J Paren et al.
Spatial Reasoning with Denoising Models
Christopher Wewer, Bartlomiej Pogodzinski, Bernt Schiele et al.
FlowDrag: 3D-aware Drag-based Image Editing with Mesh-guided Deformation Vector Flow Fields
Gwanhyeong Koo, Sunjae Yoon, Younghwan Lee et al.
KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models
Fan Wang, Juyong Jiang, Chansung Park et al.
Feature Clipping for Uncertainty Calibration
Linwei Tao, Minjing Dong, Chang Xu
Compressed and distributed least-squares regression: convergence rates with applications to federated learning
Constantin Philippenko, Aymeric Dieuleveut
Self-Consuming Generative Models with Adversarially Curated Data
Xiukun Wei, Xueru Zhang
When Diffusion Models Memorize: Inductive Biases in Probability Flow of Minimum-Norm Shallow Neural Nets
Chen Zeno, Hila Manor, Gregory Ongie et al.
Adaptive Retention & Correction: Test-Time Training for Continual Learning
Haoran Chen, Micah Goldblum, Zuxuan Wu et al.
Structured IB: Improving Information Bottleneck with Structured Feature Learning
Hanzhe Yang, Youlong Wu, Dingzhu Wen et al.
LIFT the Veil for the Truth: Principal Weights Emerge after Rank Reduction for Reasoning-Focused Supervised Fine-Tuning
Zihang Liu, Tianyu Pang, Oleg Balabanov et al.
Attention Mechanisms Perspective: Exploring LLM Processing of Graph-Structured Data
Guan Zhong, Likang Wu, Hongke Zhao et al.
HeterGP: Bridging Heterogeneity in Graph Neural Networks with Multi-View Prompting
Fengyu Yan, Xiaobao Wang, Dongxiao He et al.
Amortized Control of Continuous State Space Feynman-Kac Model for Irregular Time Series
Byoungwoo Park, Hyungi Lee, Juho Lee
Conditional Diffusion Models Based Conditional Independence Testing
Yanfeng Yang, Shuai Li, Yingjie Zhang et al.
Reflection-Window Decoding: Text Generation with Selective Refinement
Zeyu Tang, Zhenhao Chen, Xiangchen Song et al.
Multimodal Variational Autoencoder: A Barycentric View
Peijie Qiu, Wenhui Zhu, Sayantan Kumar et al.
Streamlining Prediction in Bayesian Deep Learning
Rui Li, Marcus Klasson, Arno Solin et al.
Self-Evolutionary Large Language Models Through Uncertainty-Enhanced Preference Optimization
Jianing Wang, Yang Zhou, Xiaocheng Zhang et al.
WGFormer: An SE(3)-Transformer Driven by Wasserstein Gradient Flows for Molecular Ground-State Conformation Prediction
Fanmeng Wang, Minjie Cheng, Hongteng Xu
SeRA: Self-Reviewing and Alignment of LLMs using Implicit Reward Margins
Jongwoo Ko, Saket Dingliwal, Bhavana Ganesh et al.
Microcanonical Langevin Ensembles: Advancing the Sampling of Bayesian Neural Networks
Emanuel Sommer, Jakob Robnik, Giorgi Nozadze et al.
Learning High-Degree Parities: The Crucial Role of the Initialization
Emmanuel Abbe, Elisabetta Cornacchia, Jan Hązła et al.
Beyond FVD: An Enhanced Evaluation Metrics for Video Generation Distribution Quality
Ge Ya Luo, Gian M Favero, Zhi Hao Luo et al.
Test-Time Training Provably Improves Transformers as In-context Learners
Halil Alperen Gozeten, Muhammed Emrullah Ildiz, Xuechen Zhang et al.
AdaSplash: Adaptive Sparse Flash Attention
Nuno Gonçalves, Marcos V. Treviso, Andre Martins
MOFFlow: Flow Matching for Structure Prediction of Metal-Organic Frameworks
Nayoung Kim, Seongsu Kim, Minsu Kim et al.
Linear Transformers as VAR Models: Aligning Autoregressive Attention Mechanisms with Autoregressive Forecasting
Jiecheng Lu, Shihao Yang
ELITE: Enhanced Language-Image Toxicity Evaluation for Safety
Wonjun Lee, Doehyeon Lee, Eugene Choi et al.
Logarithmic Regret for Online KL-Regularized Reinforcement Learning
Heyang Zhao, Chenlu Ye, Wei Xiong et al.
Linear Mixture Distributionally Robust Markov Decision Processes
Zhishuai Liu, Pan Xu
Preserving AUC Fairness in Learning with Noisy Protected Groups
Mingyang Wu, Li Lin, Wenbin Zhang et al.
Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism
Aviv Bick, Eric Xing, Albert Gu
BiMark: Unbiased Multilayer Watermarking for Large Language Models
Xiaoyan Feng, He Zhang, Yanjun Zhang et al.
Two-Timescale Critic-Actor for Average Reward MDPs with Function Approximation
Prashansa Panda, Shalabh Bhatnagar
Importance Corrected Neural JKO Sampling
Johannes Hertrich, Robert Gruhlke
Prot2Text-V2: Protein Function Prediction with Multimodal Contrastive Alignment
Xiao Fei, Michail Chatzianastasis, Sarah Carneiro et al.
Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning
Lang Feng, Weihao Tan, Zhiyi Lyu et al.
Homophily Enhanced Graph Domain Adaptation
Ruiyi Fang, Bingheng Li, Jingyu Zhao et al.
Toward Generalizing Visual Brain Decoding to Unseen Subjects
Xiangtao Kong, Kexin Huang, Ping Li et al.
Towards Understanding Text Hallucination of Diffusion Models via Local Generation Bias
Rui Lu, Runzhe Wang, Kaifeng Lyu et al.
$K^2$VAE: A Koopman-Kalman Enhanced Variational AutoEncoder for Probabilistic Time Series Forecasting
Xingjian Wu, Xiangfei Qiu, Hongfan Gao et al.
Model-Free Offline Reinforcement Learning with Enhanced Robustness
Chi Zhang, Zain Ulabedeen Farhat, George Atia et al.
FedSPU: Personalized Federated Learning for Resource-Constrained Devices with Stochastic Parameter Update
Ziru Niu, Hai Dong, A. K. Qin
AI-Generated Video Detection via Perceptual Straightening
Christian Internò, Robert Geirhos, Markus Olhofer et al.
Boltzmann-Aligned Inverse Folding Model as a Predictor of Mutational Effects on Protein-Protein Interactions
Xiaoran Jiao, Weian Mao, Wengong Jin et al.
Field Matching: an Electrostatic Paradigm to Generate and Transfer Data
Alexander Kolesov, S. Manukhov, Vladimir Palyulin et al.
Scaling Laws for Floating–Point Quantization Training
Xingwu Sun, Shuaipeng Li, Ruobing Xie et al.
PARQ: Piecewise-Affine Regularized Quantization
Lisa Jin, Jianhao Ma, Zechun Liu et al.
Learning Soft Sparse Shapes for Efficient Time-Series Classification
Zhen Liu, Yicheng Luo, Boyuan Li et al.
GLoRa: A Benchmark to Evaluate the Ability to Learn Long-Range Dependencies in Graphs
Dongzhuoran Zhou, Evgeny Kharlamov, Egor Kostylev
Knee-Deep in C-RASP: A Transformer Depth Hierarchy
Andy J Yang, Michaël Cadilhac, David Chiang
Inverse Bridge Matching Distillation
Nikita Gushchin, David Li, Daniil Selikhanovych et al.
Feature-Based Online Bilateral Trade
Solenne Gaucher, Martino Bernasconi, Matteo Castiglioni et al.
Correlating instruction-tuning (in multimodal models) with vision-language processing (in the brain)
SUBBA REDDY OOTA, Akshett Rai Jindal, Ishani Mondal et al.
Understanding the Generalization of In-Context Learning in Transformers: An Empirical Study
Xingxuan Zhang, Haoran Wang, Jiansheng Li et al.
The Global Convergence Time of Stochastic Gradient Descent in Non-Convex Landscapes: Sharp Estimates via Large Deviations
Waïss Azizian, Franck Iutzeler, Jérôme Malick et al.
Interpreting Language Reward Models via Contrastive Explanations
Junqi Jiang, Tom Bewley, Saumitra Mishra et al.
Hierarchically Encapsulated Representation for Protocol Design in Self-Driving Labs
Yu-Zhe Shi, Mingchen Liu, Fanxu Meng et al.
Devil is in the Details: Density Guidance for Detail-Aware Generation with Flow Models
Rafał Karczewski, Markus Heinonen, Vikas Garg
Graph Structure Refinement with Energy-based Contrastive Learning
Xianlin Zeng, Yufeng Wang, Yuqi Sun et al.
Specifying What You Know or Not for Multi-Label Class-Incremental Learning
Aoting Zhang, Dongbao Yang, Chang Liu et al.
Co-Reinforcement Learning for Unified Multimodal Understanding and Generation
Jingjing Jiang, Chongjie Si, Jun Luo et al.
BOFormer: Learning to Solve Multi-Objective Bayesian Optimization via Non-Markovian RL
Yu Heng Hung, Kai-Jie Lin, Yu-Heng Lin et al.
From Probability to Counterfactuals: the Increasing Complexity of Satisfiability in Pearl's Causal Hierarchy
Julian Dörfler, Benito van der Zander, Markus Bläser et al.
Sensitivity-Constrained Fourier Neural Operators for Forward and Inverse Problems in Parametric Differential Equations
Abdolmehdi Behroozi, Chaopeng Shen, Daniel Kifer
Sharpness-Aware Black-Box Optimization
Feiyang YE, YUEMING LYU, Xuehao Wang et al.
Learning to Communicate Through Implicit Communication Channels
Han Wang, Binbin Chen, zhang et al.
MGDA Converges under Generalized Smoothness, Provably
Qi Zhang, Peiyao Xiao, Shaofeng Zou et al.
any4: Learned 4-bit Numeric Representation for LLMs
Mostafa Elhoushi, Jeff Johnson
Angle Domain Guidance: Latent Diffusion Requires Rotation Rather Than Extrapolation
Cheng Jin, Zhenyu Xiao, Chutao Liu et al.
On Generalization Across Environments In Multi-Objective Reinforcement Learning
Jayden Teoh, Pradeep Varakantham, Peter Vamplew
Through the Dual-Prism: A Spectral Perspective on Graph Data Augmentation for Graph Classifications
Yutong Xia, Runpeng Yu, Yuxuan Liang et al.
Universal Approximation Theorem of Deep Q-Networks
Qian Qi
On-the-fly Preference Alignment via Principle-Guided Decoding
Mingye Zhu, Yi Liu, Lei Zhang et al.
How Do Images Align and Complement LiDAR? Towards a Harmonized Multi-modal 3D Panoptic Segmentation
Yining Pan, Qiongjie Cui, Xulei Yang et al.
Hierarchically-Structured Open-Vocabulary Indoor Scene Synthesis with Pre-trained Large Language Model
Weilin Sun, Xinran Li, Manyi Li et al.
LowRA: Accurate and Efficient LoRA Fine-Tuning of LLMs under 2 Bits
Zikai Zhou, Qizheng Zhang, Hermann Kumbong et al.
Improved Rates of Differentially Private Nonconvex-Strongly-Concave Minimax Optimization
Ruijia Zhang, Mingxi Lei, Meng Ding et al.
Feedback Favors the Generalization of Neural ODEs
Jindou Jia, Zihan Yang, Meng Wang et al.
SCOPE: A Self-supervised Framework for Improving Faithfulness in Conditional Text Generation
Song Duong, Florian Le Bronnec, Alexandre Allauzen et al.
Guided Search Strategies in Non-Serializable Environments with Applications to Software Engineering Agents
Karina Zainullina, Aleksandr Golubev, Maria Trofimova et al.