Spotlight Papers
1,421 papers found • Page 24 of 29
LIDAO: Towards Limited Interventions for Debiasing (Large) Language Models
Tianci Liu, Haoyu Wang, Shiyang Wang et al.
LiDAR: Sensing Linear Probing Performance in Joint Embedding SSL Architectures
Vimal Thilak, Chen Huang, Omid Saremi et al.
Likelihood Training of Cascaded Diffusion Models via Hierarchical Volume-preserving Maps
Henry Li, Ronen Basri, Yuval Kluger
Linearity of Relation Decoding in Transformer Language Models
Evan Hernandez, Arnab Sen Sharma, Tal Haklay et al.
Lion Secretly Solves a Constrained Optimization: As Lyapunov Predicts
Lizhang Chen, Bo Liu, Kaizhao Liang et al.
LLM Maybe LongLM: SelfExtend LLM Context Window Without Tuning
Hongye Jin, Xiaotian Han, Jingfeng Yang et al.
LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset
Lianmin Zheng, Wei-Lin Chiang, Ying Sheng et al.
Locally Estimated Global Perturbations are Better than Local Perturbations for Federated Sharpness-aware Minimization
Ziqing Fan, Shengchao Hu, Jiangchao Yao et al.
Local Search GFlowNets
Minsu Kim, Yun Taeyoung, Emmanuel Bengio et al.
Local vs. Global Interpretability: A Computational Complexity Perspective
Shahaf Bassan, Guy Amir, Guy Katz
Long-Term Typhoon Trajectory Prediction: A Physics-Conditioned Approach Without Reanalysis Data
Young-Jae Park, Minseok Seo, Doyi Kim et al.
Making Pre-trained Language Models Great on Tabular Prediction
Jiahuan Yan, Bo Zheng, Hongxia Xu et al.
MALIBO: Meta-learning for Likelihood-free Bayesian Optimization
Jiarong Pan, Stefan Falkner, Felix Berkenkamp et al.
MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning
Xiang Yue, Xingwei Qu, Ge Zhang et al.
MAPE-PPI: Towards Effective and Efficient Protein-Protein Interaction Prediction via Microenvironment-Aware Protein Embedding
Lirong Wu, Yijun Tian, Yufei Huang et al.
Mask-Based Modeling for Neural Radiance Fields
Ganlin Yang, Guoqiang Wei, Zhizheng Zhang et al.
Masked Face Recognition with Generative-to-Discriminative Representations
Shiming Ge, Weijia Guo, Chenyu Li et al.
Masks, Signs, And Learning Rate Rewinding
Advait Gadhikar, Rebekka Burkholz
Massively Scalable Inverse Reinforcement Learning in Google Maps
Matt Barnes, Matthew Abueg, Oliver Lange et al.
Maximum Entropy Heterogeneous-Agent Reinforcement Learning
Jiarong Liu, Yifan Zhong, Siyi Hu et al.
Mayfly: a Neural Data Structure for Graph Stream Summarization
yuan feng, Yukun Cao, Hairu Wang et al.
Memorization Capacity of Multi-Head Attention in Transformers
Sadegh Mahdavi, Renjie Liao, Christos Thrampoulidis
Memorization Through the Lens of Curvature of Loss Function Around Samples
Isha Garg, Deepak Ravikumar, Kaushik Roy
Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy
Pingzhi Li, Zhenyu Zhang, Prateek Yadav et al.
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
Longhui Yu, Weisen JIANG, Han Shi et al.
MetaPhysiCa: Improving OOD Robustness in Physics-informed Machine Learning
S Chandra Mouli, Muhammad Alam, Bruno Ribeiro
Minimax Optimality of Score-based Diffusion Models: Beyond the Density Lower Bound Assumptions
Kaihong Zhang, Heqi Yin, Feng Liang et al.
Mixtures of Experts Unlock Parameter Scaling for Deep RL
Johan Obando Ceron, Ghada Sokar, Timon Willi et al.
MMD Graph Kernel: Effective Metric Learning for Graphs via Maximum Mean Discrepancy
Yan Sun, Jicong Fan
Model Alignment as Prospect Theoretic Optimization
Kawin Ethayarajh, Winnie Xu, Niklas Muennighoff et al.
ModernTCN: A Modern Pure Convolution Structure for General Time Series Analysis
DongHao Luo, Xue Wang
MovingParts: Motion-based 3D Part Discovery in Dynamic Radiance Field
Kaizhi Yang, Xiaoshuai Zhang, Zhiao Huang et al.
MT-Ranker: Reference-free machine translation evaluation by inter-system ranking
Ibraheem Muhammad Moosa, Rui Zhang, Wenpeng Yin
Multi-resolution HuBERT: Multi-resolution Speech Self-Supervised Learning with Masked Unit Prediction
Jiatong Shi, Hirofumi Inaguma, Xutai Ma et al.
Multiscale Positive-Unlabeled Detection of AI-Generated Texts
Yuchuan Tian, Hanting Chen, Xutao Wang et al.
Multi-Track Message Passing: Tackling Oversmoothing and Oversquashing in Graph Learning via Preventing Heterophily Mixing
Hongbin Pei, Yu Li, Huiqi Deng et al.
Multi-View Causal Representation Learning with Partial Observability
Dingling Yao, Danru Xu, Sébastien Lachapelle et al.
MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning
Zayne Sprague, Xi Ye, Kaj Bostrom et al.
MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data
Yinya Huang, Xiaohan Lin, Zhengying Liu et al.
Nash Learning from Human Feedback
REMI MUNOS, Michal Valko, Daniele Calandriello et al.
NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers
Kai Shen, Zeqian Ju, Xu Tan et al.
Navigating Scaling Laws: Compute Optimality in Adaptive Model Training
Sotiris Anagnostidis, Gregor Bachmann, Imanol Schlag et al.
Nearly $d$-Linear Convergence Bounds for Diffusion Models via Stochastic Localization
Joe Benton, Valentin De Bortoli, Arnaud Doucet et al.
Negative Label Guided OOD Detection with Pretrained Vision-Language Models
Xue JIANG, Feng Liu, Zhen Fang et al.
Nemesis: Normalizing the Soft-prompt Vectors of Vision-Language Models
Shuai Fu, Shuai Fu, Xiequn Wang et al.
NetInfoF Framework: Measuring and Exploiting Network Usable Information
Meng-Chieh Lee, Haiyang Yu, Jian Zhang et al.
Neural Contractive Dynamical Systems
Hadi Beik Mohammadi, Søren Hauberg, Georgios Arvanitidis et al.
Neuron Activation Coverage: Rethinking Out-of-distribution Detection and Generalization
Yibing Liu, Chris Xing TIAN, Haoliang Li et al.
No Dimensional Sampling Coresets for Classification
Meysam Alishahi, Jeff Phillips
NoiseDiffusion: Correcting Noise for Image Interpolation with Diffusion Models beyond Spherical Linear Interpolation
Pengfei Zheng, Yonggang Zhang, Zhen Fang et al.