Most Cited 2025 "language model contrast" Papers
Regression for the Mean: Auto-Evaluation and Inference with Few Labels through Post-hoc Regression
Benjamin Eyre, David Madras
Auto-GDA: Automatic Domain Adaptation for Efficient Grounding Verification in Retrieval-Augmented Generation
Tobias Leemann, Periklis Petridis, Giuseppe Vietri et al.
EventMamba: Enhancing Spatio-Temporal Locality with State Space Models for Event-Based Video Reconstruction
Chengjie Ge, Xueyang Fu, Peng He et al.
Statistical Hypothesis Testing for Auditing Robustness in Language Models
Paulius Rauba, Qiyao Wei, Mihaela van der Schaar
Flopping for FLOPs: Leveraging Equivariance for Computational Efficiency
Georg Bökman, David Nordström, Fredrik Kahl
Model Selection for Off-policy Evaluation: New Algorithms and Experimental Protocol
Pai Liu, Lingfeng Zhao, Shivangi Agarwal et al.
SSAN: A Symbol Spatial-Aware Network for Handwritten Mathematical Expression Recognition
Haoran Zhang, Xiangdong Su, Xingxiang Zhou et al.
CoT-lized Diffusion: Let's Reinforce T2I Generation Step-by-step
Zheyuan Liu, Munan Ning, Qihui Zhang et al.
PhantomWiki: On-Demand Datasets for Reasoning and Retrieval Evaluation
Albert Gong, Kamilė Stankevičiūtė, Chao Wan et al.
Visual Abstraction: A Plug-and-Play Approach for Text-Visual Retrieval
Guofeng Ding, Yiding Lu, Peng Hu et al.
PatentLMM: Large Multimodal Model for Generating Descriptions for Patent Figures
Shreya Shukla, Nakul Sharma, Manish Gupta et al.
Efficient and Accurate Explanation Estimation with Distribution Compression
Hubert Baniecki, Giuseppe Casalicchio, Bernd Bischl et al.
SADBA: Self-Adaptive Distributed Backdoor Attack Against Federated Learning
Jun Feng, Yuzhe Lai, Hong Sun et al.
Regional Expected Improvement for Efficient Trust Region Selection in High-Dimensional Bayesian Optimization
Nobuo Namura, Sho Takemori
Tensor Product Neural Networks for Functional ANOVA Model
Seokhun Park, Insung Kong, Yongchan Choi et al.
Exploring The Loss Landscape Of Regularized Neural Networks Via Convex Duality
Sungyoon Kim, Aaron Mishkin, Mert Pilanci
Learning Robust and Privacy-Preserving Representations via Information Theory
Binghui Zhang, Sayedeh Leila Noorbakhsh, Yun Dong et al.
Long-Tailed Out-of-Distribution Detection: Prioritizing Attention to Tail
Yina He, Lei Peng, Yongcun Zhang et al.
Position: A Theory of Deep Learning Must Include Compositional Sparsity
David A. Danhofer, Davide D'Ascenzo, Rafael Dubach et al.
Disentangle Nighttime Lens Flares: Self-supervised Generation-based Lens Flare Removal
Yuwen He, Wei Wang, Wanyu Wu et al.
Demystifying the Paradox of Importance Sampling with an Estimated History-Dependent Behavior Policy in Off-Policy Evaluation
Hongyi Zhou, Josiah Hanna, Jin Zhu et al.
Complexity Lower Bounds of Adaptive Gradient Algorithms for Non-convex Stochastic Optimization under Relaxed Smoothness
Michael Crawshaw, Mingrui Liu
Knowledge Distillation with Multi-granularity Mixture of Priors for Image Super-Resolution
Simiao Li, Yun Zhang, Wei Li et al.
Delay as Payoff in MAB
Ofir Schlisselberg, Ido Cohen, Tal Lancewicki et al.
Non-Equilibrium Dynamics of Hybrid Continuous-Discrete Ground-State Sampling
Timothee Leleu, Sam Reifenstein
When Witnesses Defend: A Witness Graph Topological Layer for Adversarial Graph Learning
Naheed Anjum Arafat, Debabrota Basu, Yulia Gel et al.
Wasserstein Policy Optimization
David Pfau, Ian Davies, Diana Borsa et al.
HUANG: A Robust Diffusion Model-based Targeted Adversarial Attack Against Deep Hashing Retrieval
Chihan Huang, Xiaobo Shen
MamKO: Mamba-based Koopman operator for modeling and predictive control
Zhaoyang Li, Minghao Han, Xunyuan Yin
SAFE: Finding Sparse and Flat Minima to Improve Pruning
Dongyeop Lee, Kwanhee Lee, Jinseok Chung et al.
Cross-PCR: A Robust Cross-Source Point Cloud Registration Framework
Guiyu Zhao, Zhentao Guo, Zewen Du et al.
Convergence and Implicit Bias of Gradient Descent on Continual Linear Classification
Hyunji Jung, Hanseul Cho, Chulhee Yun
Self-supervised contrastive learning performs non-linear system identification
Rodrigo Gonzalez Laiz, Tobias Schmidt, Steffen Schneider
Learning to engineer protein flexibility
Petr Kouba, Joan Planas-Iglesias, Jiri Damborsky et al.
EditBoard: Towards a Comprehensive Evaluation Benchmark for Text-Based Video Editing Models
Yupeng Chen, Penglin Chen, Xiaoyu Zhang et al.
Pilot: Building the Federated Multimodal Instruction Tuning Framework
Baochen Xiong, Xiaoshan Yang, Yaguang Song et al.
RE-IMAGINE: Symbolic Benchmark Synthesis for Reasoning Evaluation
Xinnuo Xu, Rachel Lawrence, Kshitij Dubey et al.
LLM-wrapper: Black-Box Semantic-Aware Adaptation of Vision-Language Models for Referring Expression Comprehension
Amaia Cardiel, Eloi Zablocki, Elias Ramzi et al.
Improving Pareto Set Learning for Expensive Multi-objective Optimization via Stein Variational Hypernetworks
Minh-Duc Nguyen, Phuong Mai Dinh, Quang-Huy Nguyen et al.
Efficient Pre-Training of LLMs via Topology-Aware Communication Alignment on More Than 9600 GPUs
Guoliang He, Youhe Jiang, Wencong Xiao et al.
Comparing Targeting Strategies for Maximizing Social Welfare with Limited Resources
Vibhhu Sharma, Bryan Wilder
RRT-MVS: Recurrent Regularization Transformer for Multi-View Stereo
Jianfei Jiang, Liyong Wang, Haochen Yu et al.
Internal Causal Mechanisms Robustly Predict Language Model Out-of-Distribution Behaviors
Jing Huang, Junyi Tao, Thomas Icard et al.
Continuous Diffusion Model for Language Modeling
Jaehyeong Jo, Sung Ju Hwang
Quality over Quantity: Boosting Data Efficiency Through Ensembled Multimodal Data Curation
Jinda Xu, Yuhao Song, Daming Wang et al.
LATTE: Improving Latex Recognition for Tables and Formulae with Iterative Refinement
Nan Jiang, Shanchao Liang, Chengxiao Wang et al.
A Novel Diffusion Model for Pairwise Geoscience Data Generation with Unbalanced Training Dataset
Junhuan Yang, Yuzhou Zhang, Yi Sheng et al.
Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding
Wenbo Zhang, Lu Zhang, Ping Hu et al.
Enhancing Implicit Neural Representations via Symmetric Power Transformation
Weixiang Zhang, Shuzhao Xie, Chengwei Ren et al.
SpatioTemporal Learning for Human Pose Estimation in Sparsely-Labeled Videos
Yingying Jiao, Zhigang Wang, Sifan Wu et al.
Higher-Order Graphon Neural Networks: Approximation and Cut Distance
Daniel Herbst, Stefanie Jegelka
Higher Order Structures for Graph Explanations
Akshit Sinha, Sreeram Vennam, Charu Sharma et al.
Nonasymptotic Analysis of Stochastic Gradient Descent with the Richardson–Romberg Extrapolation
Marina Sheshukova, Denis Belomestny, Alain Oliviero Durmus et al.
Dynamic Target Distribution Estimation for Source-Free Open-Set Domain Adaptation
Zhiqi Yu, Zhichao Liao, Jingjing Li et al.
Efficient Self-Supervised Video Hashing with Selective State Spaces
Jinpeng Wang, Niu Lian, Jun Li et al.
Benchmarking and Understanding Compositional Relational Reasoning of LLMs
Ruikang Ni, Da Xiao, Qingye Meng et al.
Certifying Language Model Robustness with Fuzzed Randomized Smoothing: An Efficient Defense Against Backdoor Attacks
Bowei He, Lihao Yin, Huiling Zhen et al.
DRoP: Distributionally Robust Data Pruning
Artem Vysogorets, Kartik Ahuja, Julia Kempe
HiCM²: Hierarchical Compact Memory Modeling for Dense Video Captioning
Minkuk Kim, Hyeon Bae Kim, Jinyoung Moon et al.
Adjacent Words, Divergent Intents: Jailbreaking Large Language Models via Task Concurrency
Yukun Jiang, Mingjie Li, Michael Backes et al.
Balancing Interference and Correlation in Spatial Experimental Designs: A Causal Graph Cut Approach
Jin Zhu, Jingyi Li, Hongyi Zhou et al.
Functional Homotopy: Smoothing Discrete Optimization via Continuous Parameters for LLM Jailbreak Attacks
Zi Wang, Divyam Anshumaan, Ashish Hooda et al.
On the Convergence of No-Regret Dynamics in Information Retrieval Games with Proportional Ranking Functions
Omer Madmon, Idan Pipano, Itamar Jacob Reinman et al.
MHBench: Demystifying Motion Hallucination in VideoLLMs
Ming Kong, Xianzhou Zeng, Luyuan Chen et al.
Balanced Token Pruning: Accelerating Vision Language Models Beyond Local Optimization
Kaiyuan Li, Xiaoyue Chen, Chen Gao et al.
DSRC: Learning Density-Insensitive and Semantic-Aware Collaborative Representation Against Corruptions
Jingyu Zhang, Yilei Wang, Lang Qian et al.
On Understanding Attention-Based In-Context Learning for Categorical Data
Aaron Wang, William Convertino, Xiang Cheng et al.
PCNN: Probable-Class Nearest-Neighbor Explanations Improve Fine-Grained Image Classification Accuracy for AIs and Humans
Giang Nguyen, Valerie Chen, Mohammad Reza Taesiri et al.
Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning
Wesley Suttle, Aamodh Suresh, Carlos Nieto-Granda
In-Context Linear Regression Demystified: Training Dynamics and Mechanistic Interpretability of Multi-Head Softmax Attention
Jianliang He, Xintian Pan, Siyu Chen et al.
3D Annotation-Free Learning by Distilling 2D Open-Vocabulary Segmentation Models for Autonomous Driving
Boyi Sun, Yuhang Liu, Xingxia Wang et al.
Design Considerations in Offline Preference-based RL
Alekh Agarwal, Christoph Dann, Teodor Vanislavov Marinov
DualDynamics: Synergizing Implicit and Explicit Methods for Robust Irregular Time Series Analysis
YongKyung Oh, Dong-Young Lim, Sungil Kim
Efficient Few-Shot Neural Architecture Search by Counting the Number of Nonlinear Functions
Youngmin Oh, Hyunju Lee, Bumsub Ham
MeToken: Uniform Micro-environment Token Boosts Post-Translational Modification Prediction
Cheng Tan, Zhenxiao Cao, Zhangyang Gao et al.
Sable: a Performant, Efficient and Scalable Sequence Model for MARL
Omayma Mahjoub, Sasha Abramowitz, Ruan de Kock et al.
ToMA: Token Merge with Attention for Diffusion Models
Wenbo Lu, Shaoyi Zheng, Yuxuan Xia et al.
Debiased Distillation for Consistency Regularization
Lu Wang, Liuchi Xu, Xiong Yang et al.
RLGF: Reinforcement Learning with Geometric Feedback for Autonomous Driving Video Generation
Tianyi Yan, Wencheng Han, Xia Zhou et al.
CALLIC: Content Adaptive Learning for Lossless Image Compression
Daxin Li, Yuanchao Bai, Kai Wang et al.
Can Video LLMs Refuse to Answer? Alignment for Answerability in Video Large Language Models
Eunseop Yoon, Hee Suk Yoon, Mark Hasegawa-Johnson et al.
Optimizing Adaptive Attacks against Watermarks for Language Models
Abdulrahman Diaa, Toluwani Aremu, Nils Lukas
VEGAS: Towards Visually Explainable and Grounded Artificial Social Intelligence
Hao Li, Hao Fei, Zechao Hu et al.
AIF-SFDA: Autonomous Information Filter Driven Source-Free Domain Adaptation for Medical Image Segmentation
Haojin Li, Heng Li, Jianyu Chen et al.
CEGA: A Cost-Effective Approach for Graph-Based Model Extraction and Acquisition
Zebin Wang, Menghan Lin, Bolin Shen et al.
Gaze Label Alignment: Alleviating Domain Shift for Gaze Estimation
Guanzhong Zeng, Jingjing Wang, Zefu Xu et al.
Enhanced Sample Selection with Confidence Tracking: Identifying Correctly Labeled Yet Hard-to-Learn Samples in Noisy Data
Weiran Pan, Wei Wei, Feida Zhu et al.
Laplace Transform Based Low-Complexity Learning of Continuous Markov Semigroups
Vladimir Kostic, Karim Lounici, Hélène Halconruy et al.
Hierarchical Masked Autoregressive Models with Low-Resolution Token Pivots
Guangting Zheng, Yehao Li, Yingwei Pan et al.
Rethinking U-Net: Task-Adaptive Mixture of Skip Connections for Enhanced Medical Image Segmentation
Zichen Luo, Xinshan Zhu, Lan Zhang et al.
AIM: Additional Image Guided Generation of Transferable Adversarial Attacks
Teng Li, Xingjun Ma, Yu-Gang Jiang
PIORF: Physics-Informed Ollivier-Ricci Flow for Long-Range Interactions in Mesh Graph Neural Networks
Youn-Yeol Yu, Jeongwhan Choi, Jaehyeon Park et al.
RA-SGG: Retrieval-Augmented Scene Graph Generation Framework via Multi-Prototype Learning
Kanghoon Yoon, Kibum Kim, Jaehyeong Jeon et al.
PersonaMagic: Stage-Regulated High-Fidelity Face Customization with Tandem Equilibrium
Xinzhe Li, Jiahui Zhan, Shengfeng He et al.
Learn Singularly Perturbed Solutions via Homotopy Dynamics
Chuqi Chen, Yahong Yang, Yang Xiang et al.
Conditional Feature Importance with Generative Modeling Using Adversarial Random Forests
Kristin Blesch, Niklas Koenen, Jan Kapar et al.
TimeStep Master: Asymmetrical Mixture of Timestep LoRA Experts for Versatile and Efficient Diffusion Models in Vision
Shaobin Zhuang, Yiwei Guo, Yanbo Ding et al.
On the Role of Label Noise in the Feature Learning Process
Andi Han, Wei Huang, Zhanpeng Zhou et al.
Provable In-Context Vector Arithmetic via Retrieving Task Concepts
Dake Bu, Wei Huang, Andi Han et al.
FairPFN: A Tabular Foundation Model for Causal Fairness
Jake Robertson, Noah Hollmann, Samuel Gabriel Müller et al.
GCD: Advancing Vision-Language Models for Incremental Object Detection via Global Alignment and Correspondence Distillation
Xu Wang, Zilei Wang, Zihan Lin
Realistic Noise Synthesis with Diffusion Models
Qi Wu, Mingyan Han, Ting Jiang et al.
Unified Coding for Both Human Perception and Generalized Machine Analytics with CLIP Supervision
Kangsheng Yin, Quan Liu, Xuelin Shen et al.
ReGen: Generative Robot Simulation via Inverse Design
Peter (Phat) Nguyen, Johnson (Tsun-Hsuan) Wang, Zhang-Wei Hong et al.
Improving Generalization with Flat Hilbert Bayesian Inference
Tuan Truong, Quyen Tran, Ngoc Quan Pham et al.
Transformers Handle Endogeneity in In-Context Linear Regression
Haodong Liang, Krishna Balasubramanian, Lifeng Lai
Make Haste Slowly: A Theory of Emergent Structured Mixed Selectivity in Feature Learning ReLU Networks
Devon Jarvis, Richard Klein, Benjamin Rosman et al.
Adversarial Attacks on Event-Based Pedestrian Detectors: A Physical Approach
Guixu Lin, Muyao Niu, Qingtian Zhu et al.
Foundation Molecular Grammar: Multi-Modal Foundation Models Induce Interpretable Molecular Graph Languages
Michael Sun, Weize Yuan, Gang Liu et al.
Bayesian Low-Rank Learning (Bella): A Practical Approach to Bayesian Neural Networks
Bao Gia Doan, Afshar Shamsi, Xiao-Yu Guo et al.
FlowMamba: Learning Point Cloud Scene Flow with Global Motion Propagation
Min Lin, Gangwei Xu, Yun Wang et al.
Attention-Only Transformers via Unrolled Subspace Denoising
Peng Wang, Yifu Lu, Yaodong Yu et al.
Precise Parameter Localization for Textual Generation in Diffusion Models
Łukasz Staniszewski, Bartosz Cywiński, Franziska Boenisch et al.
Text to Point Cloud Localization with Multi-Level Negative Contrastive Learning
Dunqiang Liu, Shujun Huang, Wen Li et al.
Single Image Rolling Shutter Removal with Diffusion Models
Zhanglei Yang, Haipeng Li, Mingbo Hong et al.
ProtoCar: Learning 3D Vehicle Prototypes from Single-View and Unconstrained Driving Scene Images
Hongyuan Liu, Haochen Yu, Bochao Zou et al.
Does Low Rank Adaptation Lead to Lower Robustness against Training-Time Attacks?
Zi Liang, Haibo Hu, Qingqing Ye et al.
DECT: Harnessing LLM-assisted Fine-Grained Linguistic Knowledge and Label-Switched and Label-Preserved Data Generation for Diagnosis of Alzheimer’s Disease
Tingyu Mo, Jacqueline C. K. Lam, Victor O. K. Li et al.
Feedforward Few-shot Species Range Estimation
Christian Lange, Max Hamilton, Elijah Cole et al.
Pursuing Better Decision Boundaries for Long-Tailed Object Detection via Category Information Amount
Yanbiao Ma, Wei Dai, Jiayi Chen
TraceGrad: a Framework Learning Expressive SO(3)-equivariant Non-linear Representations for Electronic-Structure Hamiltonian Prediction
Shi Yin, Xinyang Pan, Fengyan Wang et al.
Bridge Diffusion Model: Bridge Chinese Text-to-Image Diffusion Model with English Communities
Shanyuan Liu, Bo Cheng, Yuhang Ma et al.
SOLA-GCL: Subgraph-Oriented Learnable Augmentation Method for Graph Contrastive Learning
Tianhao Peng, Xuhong Li, Haitao Yuan et al.
On the Linear Speedup of Personalized Federated Reinforcement Learning with Shared Representations
Guojun Xiong, Shufan Wang, Daniel Jiang et al.
Learning to Plan Before Answering: Self-Teaching LLMs to Learn Abstract Plans for Problem Solving
Jin Zhang, Flood Sung, Zhilin Yang et al.
How Far Are We from True Unlearnability?
Kai Ye, Liangcai Su, Chenxiong Qian
GaussianBlock: Building Part-Aware Compositional and Editable 3D Scene by Primitives and Gaussians
Shuyi Jiang, Qihao Zhao, Hossein Rahmani et al.
Are Large Brainwave Foundation Models Capable Yet? Insights from Fine-Tuning
Na Lee, Konstantinos Barmpas, Yannis Panagakis et al.
Non-Asymptotic Guarantees for Average-Reward Q-Learning with Adaptive Stepsizes
Zaiwei Chen
A Large-Scale 3D Face Mesh Video Dataset via Neural Re-parameterized Optimization
Kim Youwang, Lee Hyun, Kim Sung-Bin et al.
Revisiting Convolution Architecture in the Realm of DNA Foundation Models
Yu Bo, Weian Mao, Daniel Shao et al.
RMath: A Logic Reasoning-Focused Datasets Toward Mathematical Multistep Reasoning Tasks
Ziyi Hu, Jun Liu, Zhongzhi Liu et al.
BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities
Shaozhe Hao, Xuantong Liu, Xianbiao Qi et al.
Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation
Tao Liu, Rongjie Li, Chongyu Wang et al.
Lightspeed Geometric Dataset Distance via Sliced Optimal Transport
Khai Nguyen, Hai Nguyen, Tuan Pham et al.
GRAIL: Graph Edit Distance and Node Alignment using LLM-Generated Code
Samidha Verma, Arushi Goyal, Ananya Mathur et al.
Regret Analysis of Multi-task Representation Learning for Linear-Quadratic Adaptive Control
Bruce D. Lee, Leonardo F. Toso, Thomas T. Zhang et al.
Tree of Attributes Prompt Learning for Vision-Language Models
Tong Ding, Wanhua Li, Zhongqi Miao et al.
Bridging the Semantic Gap Between Text and Table: A Case Study on NL2SQL
Lin Long, Xijun Gu, Xinjie Sun et al.
Pause Tokens Strictly Increase the Expressivity of Constant-Depth Transformers
Charles London, Varun Kanade
Training Matting Models Without Alpha Labels
Wenze Liu, Zixuan Ye, Hao Lu et al.
UniDrive: Towards Universal Driving Perception Across Camera Configurations
Ye Li, Wenzhao Zheng, Xiaonan Huang et al.
Thinking in Granularity: Dynamic Quantization for Image Super-Resolution by Intriguing Multi-Granularity Clues
Mingshen Wang, Zhao Zhang, Feng Li et al.
High-Dimensional Bayesian Optimisation with Gaussian Process Prior Variational Autoencoders
Siddharth Ramchandran, Manuel Haussmann, Harri Lähdesmäki
Breaking the Reclustering Barrier in Centroid-based Deep Clustering
Lukas Miklautz, Timo Klein, Kevin Sidak et al.
Domain Guidance: A Simple Transfer Approach for a Pre-trained Diffusion Model
Jincheng Zhong, XiangCheng Zhang, Jianmin Wang et al.
MetaAgent: Automatically Constructing Multi-Agent Systems Based on Finite State Machines
Yaolun Zhang, Xiaogeng Liu, Chaowei Xiao
Global Well-posedness and Convergence Analysis of Score-based Generative Models via Sharp Lipschitz Estimates
Connor Mooney, Zhongjian Wang, Jack Xin et al.
On Linear Convergence in Smooth Convex-Concave Bilinearly-Coupled Saddle-Point Optimization: Lower Bounds and Optimal Algorithms
Ekaterina Borodich, Alexander Gasnikov, Dmitry Kovalev
GenDataAgent: On-the-fly Dataset Augmentation with Synthetic Data
Zhiteng Li, Lele Chen, Jerone Andrews et al.
Enhancing Portuguese Variety Identification with Cross-Domain Approaches
Hugo Sousa, Rúben Almeida, Purificação Silvano et al.
Rethinking Time Encoding via Learnable Transformation Functions
Xi Chen, Yateng Tang, Jiarong Xu et al.
Interacted Object Grounding in Spatio-Temporal Human-Object Interactions
Xiaoyang Liu, Boran Wen, Xinpeng Liu et al.
MARS: A Malignity-Aware Backdoor Defense in Federated Learning
Wei Wan, Ning Yuxuan, Zhicong Huang et al.
Raptor: Scalable Train-Free Embeddings for 3D Medical Volumes Leveraging Pretrained 2D Foundation Models
Ulzee An, Moonseong Jeong, Simon Lee et al.
GridMix: Exploring Spatial Modulation for Neural Fields in PDE Modeling
Honghui Wang, Shiji Song, Gao Huang
Exposure Bracketing Is All You Need For A High-Quality Image
Zhilu Zhang, Shuohao Zhang, Renlong Wu et al.
Multi-level Certified Defense Against Poisoning Attacks in Offline Reinforcement Learning
Shijie Liu, Andrew Cullen, Paul Montague et al.
Inductive Gradient Adjustment for Spectral Bias in Implicit Neural Representations
Kexuan Shi, Hai Chen, Leheng Zhang et al.
k-HyperEdge Medoids for Clustering Ensemble
Feijiang Li, Jieting Wang, Liuya Zhang et al.
Unveiling the Threat of Fraud Gangs to Graph Neural Networks: Multi-Target Graph Injection Attacks Against GNN-Based Fraud Detectors
Jinhyeok Choi, Heehyeon Kim, Joyce Jiyoung Whang
Projection Optimization: A General Framework for Multi-Objective and Multi-Group RLHF
Nuoya Xiong, Aarti Singh
Understanding protein function with a multimodal retrieval-augmented foundation model
Timothy Truong Jr, Tristan Bepler
Determining Layer-wise Sparsity for Large Language Models Through a Theoretical Perspective
Weizhong Huang, Yuxin Zhang, Xiawu Zheng et al.
LiteReality: Graphic-Ready 3D Scene Reconstruction from RGB-D Scans
Zhening Huang, Xiaoyang Wu, Fangcheng Zhong et al.
MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners
Fang-Duo Tsai, Shih-Lun Wu, Weijaw Lee et al.
The underlying structures of self-attention: symmetry, directionality, and emergent dynamics in Transformer training
Matteo Saponati, Pascal J. Sager, Pau Vilimelis Aceituno et al.
DFF: Decision-Focused Fine-Tuning for Smarter Predict-Then-Optimize with Limited Data
Jiaqi Yang, Enming Liang, Zicheng Su et al.
3D StreetUnveiler with Semantic-aware 2DGS - a simple baseline
Jingwei Xu, Yikai Wang, Yiqun Zhao et al.
Latent Safety-Constrained Policy Approach for Safe Offline Reinforcement Learning
Prajwal Koirala, Zhanhong Jiang, Soumik Sarkar et al.
Enhancing Decision-Making of Large Language Models via Actor-Critic
Heng Dong, Kefei Duan, Chongjie Zhang
Learning Mask Invariant Mutual Information for Masked Image Modeling
Tao Huang, Yanxiang Ma, Shan You et al.
Innovative Thinking, Infinite Humor: Humor Research of Large Language Models through Structured Thought Leaps
Han Wang, Yilin Zhao, Dian Li et al.
R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts
Zhongyang Li, Ziyue Li, Tianyi Zhou
Promoting Knowledge Base Question Answering by Directing LLMs to Generate Task-relevant Logical Forms
Jianqi Gao, Jian Cao, Ranran Bu et al.
Balanced Neural ODEs: nonlinear model order reduction and Koopman operator approximations
Julius Aka, Johannes Brunnemann, Jörg Eiden et al.
To Predict or Not to Predict? Proportionally Masked Autoencoders for Tabular Data Imputation
Jungkyu Kim, Kibok Lee, Taeyoung Park
ParaSolver: A Hierarchical Parallel Integral Solver for Diffusion Models
Jianrong Lu, Zhiyu Zhu, Junhui Hou
Human-Aligned Chess With a Bit of Search
Yiming Zhang, Athul Jacob, Vivian Lai et al.
Varying Shades of Wrong: Aligning LLMs with Wrong Answers Only
Jihan Yao, Wenxuan Ding, Shangbin Feng et al.
HyPoGen: Optimization-Biased Hypernetworks for Generalizable Policy Generation
Hanxiang Ren, Li Sun, Xulong Wang et al.
Learning to Discover Regulatory Elements for Gene Expression Prediction
Xingyu Su, Haiyang Yu, Degui Zhi et al.
Position: An Empirically Grounded Identifiability Theory Will Accelerate Self Supervised Learning Research
Patrik Reizinger, Randall Balestriero, David Klindt et al.
Uncertainty Herding: One Active Learning Method for All Label Budgets
Wonho Bae, Danica Sutherland, Gabriel Oliveira
Learning to Generate Gradients for Test-Time Adaptation via Test-Time Training Layers
Qi Deng, Shuaicheng Niu, Ronghao Zhang et al.
MELODI: Exploring Memory Compression for Long Contexts
Yinpeng Chen, DeLesley Hutchins, Aren Jansen et al.
Linear Transformer Topological Masking with Graph Random Features
Isaac Reid, Kumar Dubey, Deepali Jain et al.
Ferret: Federated Full-Parameter Tuning at Scale for Large Language Models
Yao Shu, Wenyang Hu, See-Kiong Ng et al.
Robust SAM: On the Adversarial Robustness of Vision Foundation Models
Jiahuan Long, Zhengqin Xu, Tingsong Jiang et al.
Fact-R1: Towards Explainable Video Misinformation Detection with Deep Reasoning
Fanrui Zhang, Dian Li, Qiang Zhang et al.
Better Estimation of the Kullback–Leibler Divergence Between Language Models
Afra Amini, Tim Vieira, Ryan Cotterell
Boosting Masked ECG-Text Auto-Encoders as Discriminative Learners
Hung Manh Pham, Aaqib Saeed, Dong Ma
On the Performance Analysis of Momentum Method: A Frequency Domain Perspective
Xianliang Li, Jun Luo, Zhiwei Zheng et al.
Adaptive Decision Boundary for Few-Shot Class-Incremental Learning
Linhao Li, Yongzhang Tan, Siyuan Yang et al.
The Panaceas for Improving Low-Rank Decomposition in Communication-Efficient Federated Learning
Shiwei Li, Xiandi Luo, Haozhao Wang et al.
Infinite-Resolution Integral Noise Warping for Diffusion Models
Yitong Deng, Winnie Lin, Lingxiao Li et al.
Chameleon: A Flexible Data-mixing Framework for Language Model Pretraining and Finetuning
Wanyun Xie, Francesco Tonin, Volkan Cevher
BARNN: A Bayesian Autoregressive and Recurrent Neural Network
Dario Coscia, Max Welling, Nicola Demo et al.
Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model
Yaxuan Huang, Xili Dai, Jianan Wang et al.
Discriminating image representations with principal distortions
Jenelle Feather, David Lipshutz, Sarah Harvey et al.