Most Cited 2025 Poster Papers
22,274 papers found • Page 38 of 112
Conference
Poplar: Efficient Scaling of Distributed DNN Training on Heterogeneous GPU Clusters
WenZheng Zhang, Yang Hu, Jing Shi et al.
NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for Retrieval
Sepanta Zeighami, Zac Wellmer, Aditya Parameswaran
Diversifying Query: Region-Guided Transformer for Temporal Sentence Grounding
Xiaolong Sun, Liushuai Shi, Le Wang et al.
Accelerating Multimodal Large Language Models via Dynamic Visual-Token Exit and the Empirical Findings
Qiong Wu, Wenhao Lin, Yiyi Zhou et al.
Neural Interactive Proofs
Lewis Hammond, Sam Adam-Day
Harnessing Event Sensory Data for Error Pattern Prediction in Vehicles: A Language Model Approach
Hugo Math, Rainer Lienhart, Robin Schön
LiteSearch: Efficient Tree Search with Dynamic Exploration Budget for Math Reasoning
Ante Wang, Linfeng Song, Ye Tian et al.
Graph Data Selection for Domain Adaptation: A Model-Free Approach
Ting-Wei Li, Ruizhong Qiu, Hanghang Tong
ComPC: Completing a 3D Point Cloud with 2D Diffusion Priors
Tianxin Huang, Zhiwen Yan, Yuyang Zhao et al.
PALM: Pushing Adaptive Learning Rate Mechanisms for Continual Test-Time Adaptation
Sarthak Kumar Maharana, Baoming Zhang, Yunhui Guo
QA-Calibration of Language Model Confidence Scores
Putra Manggala, Atalanti A Mastakouri, Elke Kirschbaum et al.
The Relationship Between No-Regret Learning and Online Conformal Prediction
Ramya Ramalingam, Shayan Kiyani, Aaron Roth
Data-adaptive Differentially Private Prompt Synthesis for In-Context Learning
Fengyu Gao, Ruida Zhou, Tianhao Wang et al.
ZETA: Leveraging $Z$-order Curves for Efficient Top-$k$ Attention
Qiuhao Zeng, Jierui Huang, Peng Lu et al.
Fundamental Limits of Visual Autoregressive Transformers: Universal Approximation Abilities
Yifang Chen, Xiaoyu Li, Yingyu Liang et al.
Is There No Such Thing as a Bad Question? H4R: HalluciBot for Ratiocination, Rewriting, Ranking, and Routing
William Watson, Nicole Cho, Nishan Srishankar
Learning Causal Alignment for Reliable Disease Diagnosis
Mingzhou Liu, Ching-Wen Lee, Xinwei Sun et al.
MoRE-Brain: Routed Mixture of Experts for Interpretable and Generalizable Cross-Subject fMRI Visual Decoding
YUXIANG WEI, Yanteng Zhang, Xi Xiao et al.
Where is the Truth? The Risk of Getting Confounded in a Continual World
Florian Peter Busch, Roshni Ramanna Kamath, Rupert Mitchell et al.
Incomplete and Unpaired Multi-View Graph Clustering with Cross-View Feature Fusion
Liang Zhao, Ziyue Wang, Xiao Wang et al.
Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization
Yuxin Jiang, Bo Huang, Yufei Wang et al.
A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement
Hui Yuan, Yifan Zeng, Yue Wu et al.
Online Estimation and Inference for Robust Policy Evaluation in Reinforcement Learning
Weidong Liu, Jiyuan Tu, Xi Chen et al.
What should a neuron aim for? Designing local objective functions based on information theory
Andreas C. Schneider, Valentin Neuhaus, David Ehrlich et al.
MODA: MOdular Duplex Attention for Multimodal Perception, Cognition, and Emotion Understanding
Zhicheng Zhang, Wuyou Xia, Chenxi Zhao et al.
Robust Multi-bit Text Watermark with LLM-based Paraphrasers
Xiaojun Xu, jinghan jia, Yuanshun Yao et al.
LeanVec: Searching vectors faster by making them fit
Ishwar Bhati, Cecilia Aguerrebere, Mark Hildebrand et al.
No Free Lunch: Fundamental Limits of Learning Non-Hallucinating Generative Models
Changlong Wu, Ananth Grama, Wojciech Szpankowski
KV Shifting Attention Enhances Language Modeling
Mingyu Xu, Bingning Wang, Weipeng Chen
Does Data Scaling Lead to Visual Compositional Generalization?
Arnas Uselis, Andrea Dittadi, Seong Joon Oh
Attribute-based Visual Reprogramming for Vision-Language Models
Chengyi Cai, Zesheng Ye, Lei Feng et al.
Core Context Aware Transformers for Long Context Language Modeling
Yaofo Chen, Zeng You, Shuhai Zhang et al.
GraSP: Simple Yet Effective Graph Similarity Predictions
Haoran Zheng, Jieming Shi, Renchi Yang
Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation
Yi-Chen Li, Fuxiang Zhang, Wenjie Qiu et al.
Optimizing Backward Policies in GFlowNets via Trajectory Likelihood Maximization
Timofei Gritsaev, Nikita Morozov, Sergey Samsonov et al.
Variational Search Distributions
Dan Steinberg, Rafael Oliveira, Cheng Soon Ong et al.
Refining Adaptive Zeroth-Order Optimization at Ease
Yao Shu, Qixin Zhang, Kun He et al.
Tree-Sliced Wasserstein Distance with Nonlinear Projection
Thanh Tran, Viet Hoang Tran, Thanh Chu et al.
Scalable Gaussian Processes with Latent Kronecker Structure
Jihao Andreas Lin, Sebastian Ament, Maximilian Balandat et al.
Guaranteed Generation from Large Language Models
Minbeom Kim, Thibaut Thonet, Jos Rozen et al.
AUTOCIRCUIT-RL: Reinforcement Learning-Driven LLM for Automated Circuit Topology Generation
Prashanth Vijayaraghavan, Luyao Shi, Ehsan Degan et al.
Constructing Confidence Intervals for Average Treatment Effects from Multiple Datasets
Yuxin Wang, Maresa Schröder, Dennis Frauen et al.
Disentangling Representations through Multi-task Learning
Pantelis Vafidis, Aman Bhargava, Antonio Rangel
When Selection Meets Intervention: Additional Complexities in Causal Discovery
Haoyue Dai, Ignavier Ng, Jianle Sun et al.
TreeLoRA: Efficient Continual Learning via Layer-Wise LoRAs Guided by a Hierarchical Gradient-Similarity Tree
Yu-Yang Qian, Yuan-Ze Xu, Zhen-Yu Zhang et al.
Weak-to-Strong Generalization Even in Random Feature Networks, Provably
Marko Medvedev, Kaifeng Lyu, Dingli Yu et al.
ProtCLIP: Function-Informed Protein Multi-Modal Learning
Hanjing Zhou, Mingze Yin, Wei Wu et al.
Revisiting Projection-Free Online Learning with Time-Varying Constraints
Yibo Wang, Yuanyu Wan, Lijun Zhang
Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning
Md Rifat Arefin, Gopeshh Raaj Subbaraj, Nicolas Gontier et al.
CoInD: Enabling Logical Compositions in Diffusion Models
Sachit Gaudi, Gautam Sreekumar, Vishnu Boddeti
OASIS Uncovers: High-Quality T2I Models, Same Old Stereotypes
Sepehr Dehdashtian, Gautam Sreekumar, Vishnu Boddeti
SToFM: a Multi-scale Foundation Model for Spatial Transcriptomics
Suyuan Zhao, YIZHEN LUO, Ganbo Yang et al.
MIRAGE: Assessing Hallucination in Multimodal Reasoning Chains of MLLM
Bowen Dong, Minheng Ni, Zitong Huang et al.
Estimating the Probabilities of Rare Outputs in Language Models
Gabriel Wu, Jacob Hilton
DiSK: Differentially Private Optimizer with Simplified Kalman Filter for Noise Reduction
Xinwei Zhang, Zhiqi Bu, Borja Balle et al.
A Training-free Synthetic Data Selection Method for Semantic Segmentation
Hao Tang, Siyue Yu, Jian Pang et al.
Reward Learning from Multiple Feedback Types
Yannick Metz, Andras Geiszl, Raphaël Baur et al.
Decentralized Federated Learning with Model Caching on Mobile Agents
Xiaoyu Wang, Guojun Xiong, Houwei Cao et al.
The Lock-in Hypothesis: Stagnation by Algorithm
Tianyi Qiu, Zhonghao He, Tejasveer Chugh et al.
From Thousands to Billions: 3D Visual Language Grounding via Render-Supervised Distillation from 2D VLMs
Ang Cao, Sergio Arnaud, Oleksandr Maksymets et al.
Blink of an eye: a simple theory for feature localization in generative models
Marvin Li, Aayush Karan, Sitan Chen
FedSA: A Unified Representation Learning via Semantic Anchors for Prototype-based Federated Learning
Yanbing Zhou, Xiangmou Qu, Chenlong You et al.
Few-Shot, No Problem: Descriptive Continual Relation Extraction
Nguyen Xuan Thanh, Anh Duc Le, Quyen Tran et al.
Cluster Based Heterogeneous Federated Foundation Model Adaptation and Fine-Tuning
Xianda Wang, Yaqi Qiao, Duo Wu et al.
Provable Efficiency of Guidance in Diffusion Models for General Data Distribution
Gen Li, Yuchen Jiao
On-Device Collaborative Language Modeling via a Mixture of Generalists and Specialists
Dongyang Fan, Bettina Messmer, Nikita Doikov et al.
Epsilon: Exploring Comprehensive Visual-Semantic Projection for Multi-Label Zero-Shot Learning
Ziming Liu, Jingcai Guo, Song Guo et al.
Faster Inference of Flow-Based Generative Models via Improved Data-Noise Coupling
Aram Davtyan, Leello Dadi, Volkan Cevher et al.
TODO: Enhancing LLM Alignment with Ternary Preferences
Yuxiang Guo, Lu Yin, Bo Jiang et al.
A Simple Graph Contrastive Learning Framework for Short Text Classification
Yonghao Liu, Fausto Giunchiglia, Lan Huang et al.
Tree-Sliced Wasserstein Distance: A Geometric Perspective
Viet Hoang Tran, Trang Pham, Tho Tran Huu et al.
Unlocking the Power of SAM 2 for Few-Shot Segmentation
Qianxiong Xu, Lanyun Zhu, Xuanyi Liu et al.
High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity
Qian Yu, Peng-Tao Jiang, Hao Zhang et al.
AFiRe: Anatomy-Driven Self-Supervised Learning for Fine-Grained Representation in Radiographic Images
Yihang Liu, Lianghua He, Ying Wen et al.
InstructSAM: A Training-free Framework for Instruction-Oriented Remote Sensing Object Recognition
Yijie Zheng, Weijie Wu, Qingyun Li et al.
Shallow diffusion networks provably learn hidden low-dimensional structure
Nicholas Boffi, Arthur Jacot, Stephen Tu et al.
Projection Head is Secretly an Information Bottleneck
Zhuo Ouyang, Kaiwen Hu, Qi Zhang et al.
HR-Extreme: A High-Resolution Dataset for Extreme Weather Forecasting
Nian Ran, Peng Xiao, Yue Wang et al.
Anti-Exposure Bias in Diffusion Models
Junyu Zhang, Daochang Liu, Eunbyung Park et al.
Self-Updatable Large Language Models by Integrating Context into Model Parameters
Yu Wang, Xinshuang Liu, Xiusi Chen et al.
Doubly Optimal Policy Evaluation for Reinforcement Learning
Shuze Liu, Claire Chen, Shangtong Zhang
Dueling Convex Optimization with General Preferences
Aadirupa Saha, Tomer Koren, Yishay Mansour
GSE: Group-wise Sparse and Explainable Adversarial Attacks
Shpresim Sadiku, Moritz Wagner, Sebastian Pokutta
Context Clues: Evaluating Long Context Models for Clinical Prediction Tasks on EHR Data
Michael Wornow, Suhana Bedi, Miguel Angel Fuentes Hernandez et al.
eQMARL: Entangled Quantum Multi-Agent Reinforcement Learning for Distributed Cooperation over Quantum Channels
Alexander DeRieux, Walid Saad
CL-DiffPhyCon: Closed-loop Diffusion Control of Complex Physical Systems
Long Wei, Haodong Feng, Yuchen Yang et al.
Metamizer: A Versatile Neural Optimizer for Fast and Accurate Physics Simulations
Nils Wandel, Stefan Schulz, Reinhard Klein
VA-AR: Learning Velocity-Aware Action Representations with Mixture of Window Attention
Jiangning Wei, Lixiong Qin, Bo Yu et al.
HALL-E: Hierarchical Neural Codec Language Model for Minute-Long Zero-Shot Text-to-Speech Synthesis
Yuto Nishimura, Takumi Hirose, Masanari Ohi et al.
Subgraph Aggregation for Out-of-Distribution Generalization on Graphs
Bowen Liu, Haoyang Li, Shuning Wang et al.
SysBench: Can LLMs Follow System Message?
Yanzhao Qin, Tao Zhang, Tao Zhang et al.
Robust Feature Learning for Multi-Index Models in High Dimensions
Alireza Mousavi-Hosseini, Adel Javanmard, Murat A Erdogdu
Contextualizing biological perturbation experiments through language
Menghua (Rachel) Wu, Russell Littman, Jacob Levine et al.
ChA-MAEViT: Unifying Channel-Aware Masked Autoencoders and Multi-Channel Vision Transformers for Improved Cross-Channel Learning
Chau Pham, Juan C. Caicedo, Bryan Plummer
MedSG-Bench: A Benchmark for Medical Image Sequences Grounding
Jingkun Yue, Siqi Zhang, Zinan Jia et al.
Towards Homogeneous Lexical Tone Decoding from Heterogeneous Intracranial Recordings
Di Wu, Siyuan Li, Chen Feng et al.
Offline Hierarchical Reinforcement Learning via Inverse Optimization
Carolin Schmidt, Daniele Gammelli, James Harrison et al.
Diffusion-based Adversarial Purification from the Perspective of the Frequency Domain
Gaozheng Pei, Ke Ma, Yingfei Sun et al.
Manifold Induced Biases for Zero-shot and Few-shot Detection of Generated Images
Jonathan Brokman, Amit Giloni, Omer Hofman et al.
CABS: Conflict-Aware and Balanced Sparsification for Enhancing Model Merging
Zongzhen Yang, Binhang Qi, Hailong Sun et al.
Efficient Multi-agent Offline Coordination via Diffusion-based Trajectory Stitching
Lei Yuan, Yuqi Bian, Lihe Li et al.
CoPEFT: Fast Adaptation Framework for Multi-Agent Collaborative Perception with Parameter-Efficient Fine-Tuning
Quanmin Wei, Penglin Dai, Wei Li et al.
MaRS: A Fast Sampler for Mean Reverting Diffusion based on ODE and SDE Solvers
Ao Li, Wei Fang, Hongbo Zhao et al.
ANaGRAM: A Natural Gradient Relative to Adapted Model for efficient PINNs learning
Nilo Schwencke, Cyril Furtlehner
The Emperor's New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination
Yifan Sun, Han Wang, Dongbai Li et al.
Multi-Scale Fusion for Object Representation
Rongzhen Zhao, Vivienne Huiling Wang, Juho Kannala et al.
Large Language Models to Diffusion Finetuning
Edoardo Cetin, Tianyu Zhao, Yujin Tang
Towards Learnable Anchor for Deep Multi-View Clustering
Bocheng Wang, Chusheng Zeng, Mulin Chen et al.
An Evolved Universal Transformer Memory
Edoardo Cetin, Qi Sun, Tianyu Zhao et al.
HASARD: A Benchmark for Vision-Based Safe Reinforcement Learning in Embodied Agents
Tristan Tomilin, Meng Fang, Mykola Pechenizkiy
3DMolFormer: A Dual-channel Framework for Structure-based Drug Discovery
Xiuyuan Hu, Guoqing Liu, Can Chen et al.
ADMM for Nonconvex Optimization under Minimal Continuity Assumption
Ganzhao Yuan
Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning
Calarina Muslimani, Matthew E Taylor
Black Sheep in the Herd: Playing with Spuriously Correlated Attributes for Vision-Language Recognition
Xinyu Tian, Shu Zou, Zhaoyuan Yang et al.
Incorporating Visual Correspondence into Diffusion Model for Virtual Try-On
Siqi Wan, Jingwen Chen, Yingwei Pan et al.
Expressive Power of Temporal Message Passing
Przemysław Andrzej Wałęga, Michael Rawson
Exploring the Design Space of Visual Context Representation in Video MLLMs
Yifan Du, Yuqi Huo, Kun Zhou et al.
Understanding the Logic of Direct Preference Alignment through Logic
Kyle Richardson, Vivek Srikumar, Ashish Sabharwal
NestQuant: nested lattice quantization for matrix products and LLMs
Semyon Savkin, Eitan Porat, Or Ordentlich et al.
Edge Contrastive Learning: An Augmentation-Free Graph Contrastive Learning Model
Yujun Li, Hongyuan Zhang, Yuan Yuan
QMamba: On First Exploration of Vision Mamba for Image Quality Assessment
Fengbin Guan, Xin Li, Zihao Yu et al.
Deep Rank-One Tensor Functional Factorization for Multi-Dimensional Data Recovery
Yanyi Li, Xi Zhang, Yisi Luo et al.
Wasserstein-Regularized Conformal Prediction under General Distribution Shift
Rui Xu, Chao Chen, Yue Sun et al.
Multi-band Frequency Reconstruction for Neural Psychoacoustic Coding
Dianwen Ng, Kun Zhou, Yi-Wen Chao et al.
Falcon: Fast Visuomotor Policies via Partial Denoising
Haojun Chen, Minghao Liu, Chengdong Ma et al.
DenoiseVAE: Learning Molecule-Adaptive Noise Distributions for Denoising-based 3D Molecular Pre-training
Yurou Liu, Jiahao Chen, Rui Jiao et al.
Exploring a Principled Framework for Deep Subspace Clustering
Xianghan Meng, Zhiyuan Huang, Wei He et al.
OSVI-WM: One-Shot Visual Imitation for Unseen Tasks using World-Model-Guided Trajectory Generation
Raktim Goswami, Prashanth Krishnamurthy, Yann LeCun et al.
Lift Your Molecules: Molecular Graph Generation in Latent Euclidean Space
Mohamed Amine Ketata, Nicholas Gao, Johanna Sommer et al.
Diversity-Rewarded CFG Distillation
Geoffrey Cideron, Andrea Agostinelli, Johan Ferret et al.
RAZOR: Sharpening Knowledge by Cutting Bias with Unsupervised Text Rewriting
Shuo Yang, Bardh Prenkaj, Gjergji Kasneci
Advancing Graph Generation through Beta Diffusion
Xinyang Liu, Yilin He, Bo Chen et al.
Improving Large Language Model Planning with Action Sequence Similarity
Xinran Zhao, Hanie Sedghi, Bernd Bohnet et al.
Improving Multimodal Learning Balance and Sufficiency through Data Remixing
Xiaoyu Ma, Hao Chen, Yongjian Deng
Gumiho: A Hybrid Architecture to Prioritize Early Tokens in Speculative Decoding
Jinze Li, Yixing Xu, Haiduo Huang et al.
UTILITY: Utilizing Explainable Reinforcement Learning to Improve Reinforcement Learning
Shicheng Liu, Minghui Zhu
Continuous Autoregressive Modeling with Stochastic Monotonic Alignment for Speech Synthesis
Weiwei Lin, Chenhang HE
How Many Lines to Paint the City: Exact Edge-Cover in Temporal Graphs
Argyrios Deligkas, Michelle Döring, Eduard Eiben et al.
Positional Attention: Expressivity and Learnability of Algorithmic Computation
Artur Back de Luca, George Giapitzakis, Shenghao Yang et al.
Exponential Topology-enabled Scalable Communication in Multi-agent Reinforcement Learning
Xinran Li, Xiaolu Wang, Chenjia Bai et al.
Denoising with a Joint-Embedding Predictive Architecture
Chen Dengsheng, Jie Hu, Xiaoming Wei et al.
FloE: On-the-Fly MoE Inference on Memory-constrained GPU
Yuxin Zhou, Zheng Li, Jun Zhang et al.
Unlocking Point Processes through Point Set Diffusion
David Lüdke, Enric Rabasseda Raventós, Marcel Kollovieh et al.
Learning Spatial-Semantic Features for Robust Video Object Segmentation
Xin Li, Deshui Miao, Zhenyu He et al.
Annealing Flow Generative Models Towards Sampling High-Dimensional and Multi-Modal Distributions
Dongze Wu, Yao Xie
InfantAgent-Next: A Multimodal Generalist Agent for Automated Computer Interaction
Bin Lei, Weitai Kang, Zijian Zhang et al.
Analytic DAG Constraints for Differentiable DAG Learning
Zhen Zhang, Ignavier Ng, Dong Gong et al.
Scaling Combinatorial Optimization Neural Improvement Heuristics with Online Search and Adaptation
Federico Julian Camerota Verdù, Lorenzo Castelli, Luca Bortolussi
SEAS: Self-Evolving Adversarial Safety Optimization for Large Language Models
Muxi Diao, Rumei Li, Shiyang Liu et al.
ReFF: Reinforcing Format Faithfulness in Language Models Across Varied Tasks
Jiashu Yao, Heyan Huang, Zeming Liu et al.
What makes an Ensemble (Un) Interpretable?
Shahaf Bassan, Guy Amir, Meirav Zehavi et al.
Nonlinearly Preconditioned Gradient Methods under Generalized Smoothness
Konstantinos Oikonomidis, Jan Quan, Emanuel Laude et al.
Graph Assisted Offline-Online Deep Reinforcement Learning for Dynamic Workflow Scheduling
Yifan Yang, Gang Chen, Hui Ma et al.
QMP: Q-switch Mixture of Policies for Multi-Task Behavior Sharing
Grace Zhang, Ayush Jain, Injune Hwang et al.
LDMol: A Text-to-Molecule Diffusion Model with Structurally Informative Latent Space Surpasses AR Models
Jinho Chang, Jong Chul YE
Matcha: Mitigating Graph Structure Shifts with Test-Time Adaptation
Wenxuan Bao, Zhichen Zeng, Zhining Liu et al.
Atomic Thinking of LLMs: Decoupling and Exploring Mathematical Reasoning Abilities
Jiayi Kuang, Haojing Huang, Yinghui Li et al.
Learning Graph Invariance by Harnessing Spuriosity
Tianjun Yao, Yongqiang Chen, Kai Hu et al.
FeatSharp: Your Vision Model Features, Sharper
Mike Ranzinger, Greg Heinrich, Pavlo Molchanov et al.
MetaDesigner: Advancing Artistic Typography through AI-Driven, User-Centric, and Multilingual WordArt Synthesis
Jun-Yan He, Zhi-Qi Cheng, Chenyang Li et al.
Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation
Sungnyun Kim, Sungwoo Cho, Sangmin Bae et al.
Wasserstein Distances, Neuronal Entanglement, and Sparsity
Shashata Sawmya, Linghao Kong, Ilia Markov et al.
TTFSFormer: A TTFS-based Lossless Conversion of Spiking Transformer
Lusen Zhao, Zihan Huang, Ding Jianhao et al.
PFDiff: Training-Free Acceleration of Diffusion Models Combining Past and Future Scores
Guangyi Wang, Yuren Cai, lijiang Li et al.
ScImage: How good are multimodal large language models at scientific text-to-image generation?
Leixin Zhang, Steffen Eger, Yinjie Cheng et al.
CityAnchor: City-scale 3D Visual Grounding with Multi-modality LLMs
Jinpeng Li, Haiping Wang, Jiabin chen et al.
Preference-Oriented Supervised Fine-Tuning: Favoring Target Model over Aligned Large Language Models
Yuchen Fan, Yuzhong Hong, Qiushi Wang et al.
Vintix: Action Model via In-Context Reinforcement Learning
Andrei Polubarov, Nikita Lyubaykin, Alexander Derevyagin et al.
When LLMs Play the Telephone Game: Cultural Attractors as Conceptual Tools to Evaluate LLMs in Multi-turn Settings
Jérémy Perez, Grgur Kovac, Corentin Léger et al.
CITI: Enhancing Tool Utilizing Ability in Large Language Models Without Sacrificing General Performance
Yupu Hao, Pengfei Cao, Zhuoran Jin et al.
Thousand Voices of Trauma: A Large-Scale Synthetic Dataset for Modeling Prolonged Exposure Therapy Conversations
Suhas BN, Andrew Sherrill, Rosa I. Arriaga et al.
Natural Language Inference Improves Compositionality in Vision-Language Models
Paola Cascante-Bonilla, Yu (Hope) Hou, Yang Cao et al.
On Teacher Hacking in Language Model Distillation
Daniil Tiapkin, Daniele Calandriello, Johan Ferret et al.
RoSTE: An Efficient Quantization-Aware Supervised Fine-Tuning Approach for Large Language Models
Quan Wei, Chung-Yiu Yau, Hoi To Wai et al.
Zero-shot Model-based Reinforcement Learning using Large Language Models
Abdelhakim Benechehab, Youssef Attia El Hili, Ambroise Odonnat et al.
SymmetricDiffusers: Learning Discrete Diffusion on Finite Symmetric Groups
Yongxing Zhang, Donglin Yang, Renjie Liao
RILQ: Rank-Insensitive LoRA-Based Quantization Error Compensation for Boosting 2-Bit Large Language Model Accuracy
Geonho Lee, Janghwan Lee, Sukjin Hong et al.
CommVQ: Commutative Vector Quantization for KV Cache Compression
Junyan Li, Yang Zhang, Muhammad Yusuf Hassan et al.
Differentiable and Learnable Wireless Simulation with Geometric Transformers
Thomas Hehn, Markus Peschl, Tribhuvanesh Orekondy et al.
Learning Interleaved Image-Text Comprehension in Vision-Language Large Models
Chenyu Zhou, Mengdan Zhang, Peixian Chen et al.
Taming Transformer Without Using Learning Rate Warmup
Xianbiao Qi, Yelin He, Jiaquan Ye et al.
Random Forest Autoencoders for Guided Representation Learning
Adrien Aumon, Shuang Ni, Myriam Lizotte et al.
Neural Genetic Search in Discrete Spaces
Hyeonah Kim, Sanghyeok Choi, Jiwoo Son et al.
AutoCLIP: Auto-tuning Zero-Shot Classifiers for Vision-Language Models
Jan Metzen, Piyapat Saranrittichai, Chaithanya Kumar Mummadi
DeblurDiff: Real-Word Image Deblurring with Generative Diffusion Models
Lingshun Kong, Jiawei Zhang, Dongqing Zou et al.
Efficient Perplexity Bound and Ratio Matching in Discrete Diffusion Language Models
Etrit Haxholli, Yeti Z. Gurbuz, Oğul Can et al.
Outlier Gradient Analysis: Efficiently Identifying Detrimental Training Samples for Deep Learning Models
Anshuman Chhabra, Bo Li, Jian Chen et al.
Doubly Protected Estimation for Survival Outcomes Utilizing External Controls for Randomized Clinical Trials
Chenyin Gao, Shu Yang, Mingyang Shan et al.
MAPS: Advancing Multi-Modal Reasoning in Expert-Level Physical Science
Erle Zhu, Yadi Liu, Zhe Zhang et al.
Composable Interventions for Language Models
Arinbjörn Kolbeinsson, Kyle O'Brien, Tianjin Huang et al.
Learning Dynamics in Continual Pre-Training for Large Language Models
Xingjin Wang, Howe Tissue, Lu Wang et al.
Ultra Lowrate Image Compression with Semantic Residual Coding and Compression-aware Diffusion
Anle Ke, Xu Zhang, Tong Chen et al.
p-Mean Regret for Stochastic Bandits
Anand Krishna, Philips George John, Adarsh Barik et al.
QuadricFormer: Scene as Superquadrics for 3D Semantic Occupancy Prediction
Sicheng Zuo, Wenzhao Zheng, Xiaoyong Han et al.
On the Hölder Stability of Multiset and Graph Neural Networks
Yair Davidson, Nadav Dym
Uncertainty and Influence aware Reward Model Refinement for Reinforcement Learning from Human Feedback
Zexu Sun, Yiju Guo, Yankai Lin et al.
ComaDICE: Offline Cooperative Multi-Agent Reinforcement Learning with Stationary Distribution Shift Regularization
The Viet Bui, Thanh Nguyen, Tien Mai
SAP: Corrective Machine Unlearning with Scaled Activation Projection for Label Noise Robustness
Sangamesh Kodge, Deepak Ravikumar, Gobinda Saha et al.
CodeSync: Synchronizing Large Language Models with Dynamic Code Evolution at Scale
Chenlong Wang, Zhaoyang Chu, Zhengxiang Cheng et al.
From Individual Experience to Collective Evidence: A Reporting-Based Framework for Identifying Systemic Harms
Jessica Dai, Paula Gradu, Inioluwa Raji et al.