Most Cited 2025 "numerical reconstruction" Papers
22,274 papers found • Page 66 of 112
Conference
ViLU: Learning Vision-Language Uncertainties for Failure Prediction
Marc Lafon, Yannis Karmim, Julio Silva-Rodríguez et al.
Subjective Camera 1.0: Bridging Human Cognition and Visual Reconstruction through Sequence-Aware Sketch-Guided Diffusion
Haoyang Chen, Dongfang Sun, Caoyuan Ma et al.
CGS-GAN: 3D Consistent Gaussian Splatting GANs for High Resolution Human Head Synthesis
Florian Barthel, Wieland Morgenstern, Paul Hinzer et al.
FedMVP: Federated Multimodal Visual Prompt Tuning for Vision-Language Models
Mainak Singha, Subhankar Roy, Sarthak Mehrotra et al.
On Feasible Rewards in Multi-Agent Inverse Reinforcement Learning
Till Freihaut, Giorgia Ramponi
InfGen: A Resolution-Agnostic Paradigm for Scalable Image Synthesis
Tao Han, Wanghan Xu, Junchao Gong et al.
Adapting Dense Matching for Homography Estimation with Grid-based Acceleration
Kaining Zhang, Yuxin Deng, Jiayi Ma et al.
CAFA: a Controllable Automatic Foley Artist
Roi Benita, Michael Finkelson, Tavi Halperin et al.
Supercharged One-step Text-to-Image Diffusion Models with Negative Prompts
Viet Nguyen, Anh Nguyen, Trung Dao et al.
Reward-Aware Proto-Representations in Reinforcement Learning
Hon Tik Tse, Siddarth Chandrasekar, Marlos C. Machado
Denoising Token Prediction in Masked Autoregressive Models
Ting Yao, Yehao Li, Yingwei Pan et al.
Alignment of Large Language Models with Constrained Learning
Botong Zhang, Shuo Li, Ignacio Hounie et al.
Multimodal Prompt Alignment for Facial Expression Recognition
Fuyan Ma, Yiran He, Bin Sun et al.
CarGait: Cross-Attention based Re-ranking for Gait recognition
Gavriel Habib, Noa Barzilay, Or Shimshi et al.
EntitySAM: Segment Everything in Video
Mingqiao Ye, Seoung Wug Oh, Lei Ke et al.
Multi-Group Proportional Representations for Text-to-Image Models
Sangwon Jung, Alex Oesterling, Claudio Mayrink Verdun et al.
GBlobs: Explicit Local Structure via Gaussian Blobs for Improved Cross-Domain LiDAR-based 3D Object Detection
Dušan Malić, Christian Fruhwirth-Reisinger, Samuel Schulter et al.
Encapsulated Composition of Text-to-Image and Text-to-Video Models for High-Quality Video Synthesis
Tongtong Su, Chengyu Wang, Bingyan Liu et al.
PoseBH: Prototypical Multi-Dataset Training Beyond Human Pose Estimation
Uyoung Jeong, Jonathan Freer, Seungryul Baek et al.
DNF-Intrinsic: Deterministic Noise-Free Diffusion for Indoor Inverse Rendering
Rongjia Zheng, Qing Zhang, Chengjiang Long et al.
Can Large Language Models Master Complex Card Games?
Wei Wang, Fuqing Bie, Junzhe Chen et al.
ScaleLSD: Scalable Deep Line Segment Detection Streamlined
Zeran Ke, Bin Tan, Xianwei Zheng et al.
PROGRESSOR: A Perceptually Guided Reward Estimator with Self-Supervised Online Refinement
Tewodros W. Ayalew, Xiao Zhang, Kevin Y Wu et al.
Controlling the Flow: Stability and Convergence for Stochastic Gradient Descent with Decaying Regularization
Sebastian Kassing, Simon Weissmann, Leif Döring
On the Impact of Performative Risk Minimization for Binary Random Variables
Nikita Tsoy, Ivan Kirev, Negin Rahimiyazdi et al.
Convex Markov Games: A New Frontier for Multi-Agent Reinforcement Learning
Ian Gemp, Andreas Haupt, Luke Marris et al.
S2-Track: A Simple yet Strong Approach for End-to-End 3D Multi-Object Tracking
Tao Tang, Lijun Zhou, Pengkun Hao et al.
GenZSL: Generative Zero-Shot Learning Via Inductive Variational Autoencoder
Shiming Chen, Dingjie Fu, Salman Khan et al.
Watch Out Your Album! On the Inadvertent Privacy Memorization in Multi-Modal Large Language Models
Tianjie Ju, Yi Hua, Hao Fei et al.
Impact-driven Context Filtering For Cross-file Code Completion
Yanzhou Li, Shangqing Liu, Kangjie Chen et al.
Context-Adaptive Multi-Prompt Embedding with Large Language Models for Vision-Language Alignment
Dahun Kim, Anelia Angelova
Humans overrely on overconfident language models, across languages
Neil Rathi, Dan Jurafsky, Kaitlyn Zhou
RARe: Retrieval Augmented Retrieval with In-Context Examples
Atula Tejaswi, Yoonsang Lee, sujay sanghavi et al.
Bridging Layout and RTL: Knowledge Distillation based Timing Prediction
Mingjun Wang, Yihan Wen, Bin Sun et al.
From Queries to Criteria: Understanding How Astronomers Evaluate LLMs
Alina Hyk, Kiera McCormick, Mian Zhong et al.
Function-to-Style Guidance of LLMs for Code Translation
Longhui Zhang, Bin Wang, Jiahao Wang et al.
Neural Representational Consistency Emerges from Probabilistic Neural-Behavioral Representation Alignment
Yu Zhu, Chunfeng Song, Wanli Ouyang et al.
Differentiable Solver Search for Fast Diffusion Sampling
shuai wang, Zexian Li, Qipeng zhang et al.
GANQ: GPU-Adaptive Non-Uniform Quantization for Large Language Models
Pengxiang Zhao, Xiaoming Yuan
Learning With Multi-Group Guarantees For Clusterable Subpopulations
Jessica Dai, Nika Haghtalab, Eric Zhao
Enabling Optimal Decisions in Rehearsal Learning under CARE Condition
Wen-Bo Du, Hao-Yi Lei, Lue Tao et al.
Probing Syntax in Large Language Models: Successes and Remaining Challenges
Pablo J. Diego Simon, Emmanuel Chemla, Jean-Remi King et al.
In-Context Occam’s Razor: How Transformers Prefer Simpler Hypotheses on the Fly
Puneesh Deora, Bhavya Vasudeva, Tina Behnia et al.
BiMaCoSR: Binary One-Step Diffusion Model Leveraging Flexible Matrix Compression for Real Super-Resolution
Kai Liu, Kaicheng Yang, Zheng Chen et al.
Of Mice and Machines: A Comparison of Learning Between Real World Mice and RL Agents
Shuo Han, German Espinosa, Junda Huang et al.
Approximating Language Model Training Data from Weights
John Xavier Morris, Junjie Oscar Yin, Woojeong Kim et al.
In-Context Reinforcement Learning From Suboptimal Historical Data
Juncheng Dong, Moyang Guo, Ethan Fang et al.
Stochastic Online Conformal Prediction with Semi-Bandit Feedback
Haosen Ge, Hamsa Bastani, Osbert Bastani
Language Agents Mirror Human Causal Reasoning Biases. How Can We Help Them Think Like Scientists?
Anthony GX-Chen, Dongyan Lin, Mandana Samiei et al.
Self-Bootstrapping for Versatile Test-Time Adaptation
Shuaicheng Niu, Guohao Chen, Peilin Zhao et al.
Deep Binding of Language Model Virtual Personas: a Study on Approximating Political Partisan Misperceptions
Minwoo Kang, Suhong Moon, Seung Hyeong Lee et al.
MS-SSM: A Multi-Scale State Space Model for Efficient Sequence Modeling
Mahdi Karami, Ali Behrouz, Peilin Zhong et al.
Optimal Transport Barycenter via Nonconvex-Concave Minimax Optimization
Kaheon Kim, Rentian Yao, Changbo Zhu et al.
Combinatorial Reinforcement Learning with Preference Feedback
Joongkyu Lee, Min-hwan Oh
Visual Representations inside the Language Model
Benlin Liu, Amita Kamath, Madeleine Grunde-McLaughlin et al.
SpectR: Dynamically Composing LM Experts with Spectral Routing
William Fleshman, Benjamin Van Durme
When Does Metadata Conditioning (NOT) Work for Language Model Pre-Training? A Study with Context-Free Grammars
Rei Higuchi, Ryotaro Kawata, Naoki Nishikawa et al.
Nonparametric Teaching for Graph Property Learners
Chen Zhang, Weixin Bu, Zeyi Ren et al.
RankAlign: A Ranking View of the Generator-Validator Gap in Large Language Models
Juan Diego Rodriguez, Wenxuan Ding, Katrin Erk et al.
Sharpe Ratio-Guided Active Learning for Preference Optimization in RLHF
Syrine Belakaria, Joshua Kazdan, Charles Marx et al.
UniMate: A Unified Model for Mechanical Metamaterial Generation, Property Prediction, and Condition Confirmation
Wangzhi Zhan, Chen Jianpeng, Dongqi Fu et al.
CogMath: Assessing LLMs' Authentic Mathematical Ability from a Human Cognitive Perspective
Jiayu Liu, Zhenya Huang, Wei Dai et al.
Single-Pass Document Scanning for Question Answering
Weili Cao, Jianyou Wang, Youze Zheng et al.
Style over Substance: Distilled Language Models Reason Via Stylistic Replication
Philip Lippmann, Jie Yang
Nesterov Method for Asynchronous Pipeline Parallel Optimization
Thalaiyasingam Ajanthan, Sameera Ramasinghe, Yan Zuo et al.
Olica: Efficient Structured Pruning of Large Language Models without Retraining
Jiujun He, Huazhen Lin
Compositional Generalization via Forced Rendering of Disentangled Latents
Qiyao Liang, Daoyuan Qian, Liu Ziyin et al.
Correctness-Guaranteed Code Generation via Constrained Decoding
Lingxiao Li, salar rahili, Yiwei Zhao
Ehrenfeucht-Haussler Rank and Chain of Thought
Pablo Barcelo, Alexander Kozachinskiy, Tomasz Steifer
FedECADO: A Dynamical System Model of Federated Learning
Aayushya Agarwal, Gauri Joshi, Lawrence Pileggi
Neutral residues: revisiting adapters for model extension
Franck TALLA, Edouard Grave, Herve Jegou
Relational Conformal Prediction for Correlated Time Series
Andrea Cini, Alexander Jenkins, Danilo Mandic et al.
Adaptive Multi-prompt Contrastive Network for Few-shot Out-of-distribution Detection
Xiang Fang, Arvind Easwaran, Blaise Genest
Prediction via Shapley Value Regression
Amr Alkhatib, Roman Bresson, Henrik Boström et al.
Imagine All The Relevance: Scenario-Profiled Indexing with Knowledge Expansion for Dense Retrieval
Sangam Lee, Ryang Heo, SeongKu Kang et al.
A Taxonomy of Transcendence
Natalie Abreu, Edwin Zhang, Eran Malach et al.
Efficient Quantification of Multimodal Interaction at Sample Level
Zequn Yang, Hongfa Wang, Di Hu
Learning from Loss Landscape: Generalizable Mixed-Precision Quantization via Adaptive Sharpness-Aware Gradient Aligning
Lianbo Ma, Jianlun Ma, Yuee Zhou et al.
CursorCore: Assist Programming through Aligning Anything
Hao Jiang, Qi Liu, Rui Li et al.
Are Large Language Models Ready for Multi-Turn Tabular Data Analysis?
Jinyang Li, Nan Huo, Yan Gao et al.
Distributed Conformal Prediction via Message Passing
Haifeng Wen, Hong XING, Osvaldo Simeone
MixAssist: An Audio-Language Dataset for Co-Creative AI Assistance in Music Mixing
Michael Paul Clemens, Ana Marasovic
Beyond One-Hot Labels: Semantic Mixing for Model Calibration
Haoyang Luo, Linwei Tao, Minjing Dong et al.
AutoGFM: Automated Graph Foundation Model with Adaptive Architecture Customization
Haibo Chen, Xin Wang, Zeyang Zhang et al.
Noiser: Bounded Input Perturbations for Attributing Large Language Models
Mohammad Reza Ghasemi Madani, Aryo Pradipta Gema, Yu Zhao et al.
Agree to Disagree? A Meta-Evaluation of LLM Misgendering
Arjun Subramonian, Vagrant Gautam, Preethi Seshadri et al.
Demonstration Selection for In-Context Learning via Reinforcement Learning
Xubin Wang, Jianfei Wu, Yuan Yichen et al.
Scaling Value Iteration Networks to 5000 Layers for Extreme Long-Term Planning
Yuhui Wang, Qingyuan Wu, Dylan Ashley et al.
Long-Short Alignment for Effective Long-Context Modeling in LLMs
Tianqi Du, Haotian Huang, Yifei Wang et al.
A Cognac Shot To Forget Bad Memories: Corrective Unlearning for Graph Neural Networks
Varshita Kolipaka, Akshit Sinha, Debangan Mishra et al.
From Black Boxes to Transparent Minds: Evaluating and Enhancing the Theory of Mind in Multimodal Large Language Models
Xinyang Li, Siqi Liu, Bochao Zou et al.
Pessimism Principle Can Be Effective: Towards a Framework for Zero-Shot Transfer Reinforcement Learning
Chi Zhang, Ziying Jia, George Atia et al.
A Theoretical Justification for Asymmetric Actor-Critic Algorithms
Gaspard Lambrechts, Damien Ernst, Aditya Mahajan
Reflection-Bench: Evaluating Epistemic Agency in Large Language Models
Lingyu Li, Yixu Wang, Haiquan Zhao et al.
ADAPT: Actively Discovering and Adapting to Preferences for any Task
Maithili Patel, Xavier Puig, Ruta Desai et al.
AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset
Bingxiang He, Wenbin Zhang, Jiaxi Song et al.
Permutation Equivariant Neural Networks for Symmetric Tensors
Edward Pearce-Crump
LLM Unlearning Without an Expert Curated Dataset
Xiaoyuan Zhu, Muru Zhang, Ollie Liu et al.
LADA: Scalable Label-Specific CLIP Adapter for Continual Learning
Mao-Lin Luo, Zi-Hao Zhou, Tong Wei et al.
Adjustment for Confounding using Pre-Trained Representations
Rickmer Schulte, David Rügamer, Thomas Nagler
SQuat: Subspace-orthogonal KV Cache Quantization
Hao Wang, Ligong Han, Kai Xu et al.
Position: We Need Responsible, Application-Driven (RAD) AI Research
Sarah Hartman, Cheng Soon Ong, Julia Powles et al.
CLIPPER: Compression enables long-context synthetic data generation
Chau Minh Pham, Yapei Chang, Mohit Iyyer
The Surprising Effectiveness of Membership Inference with Simple N-Gram Coverage
Skyler Hallinan, Jaehun Jung, Melanie Sclar et al.
Guided Reasoning in LLM-Driven Penetration Testing Using Structured Attack Trees
Katsuaki Nakano, Reza Fayyazi, Shanchieh Yang et al.
SPACE: Your Genomic Profile Predictor is a Powerful DNA Foundation Model
Zhao Yang, jiwei zhu, Bing Su
Online Learning with Unknown Constraints
Karthik Sridharan, Seung Won Wilson Yoo
Learning Policy Committees for Effective Personalization in MDPs with Diverse Tasks
Luise Ge, Michael Lanier, Anindya Sarkar et al.
Evaluating and Designing Sparse Autoencoders by Approximating Quasi-Orthogonality
Sewoong Lee, Adam Davies, Marc E. Canby et al.
The Zero Body Problem: Probing LLM Use of Sensory Language
Rebecca M. M. Hicke, Sil Hamilton, David Mimno
Sparse Spectral Training and Inference on Euclidean and Hyperbolic Neural Networks
Jialin Zhao, Yingtao Zhang, Xinghang Li et al.
Distilling the Knowledge in Data Pruning
Emanuel Ben Baruch, Adam Botach, Igor Kviatkovsky et al.
Open-Det: An Efficient Learning Framework for Open-Ended Detection
Guiping Cao, Tao Wang, Wenjian Huang et al.
An in depth look at the Procrustes-Wasserstein distance: properties and barycenters
Davide Adamo, Marco Corneli, Manon Vuillien et al.
UTF-8 Plumbing: Byte-level Tokenizers Unavoidably Enable LLMs to Generate Ill-formed UTF-8
Preston Firestone, Shubham Ugare, Gagandeep Singh et al.
Learning Curves of Stochastic Gradient Descent in Kernel Regression
Haihan Zhang, Weicheng Lin, Yuanshi Liu et al.
Noise Conditional Variational Score Distillation
Xinyu Peng, Ziyang Zheng, Yaoming Wang et al.
Consensus Based Stochastic Optimal Control
Liyao Lyu, Jingrun Chen
Tracking The Best Expert Privately
Hilal Asi, Vinod Raman, Aadirupa Saha
KEA: Keeping Exploration Alive by Proactively Coordinating Exploration Strategies
Shih-Min Yang, Martin Magnusson, Johannes Stork et al.
SUICA: Learning Super-high Dimensional Sparse Implicit Neural Representations for Spatial Transcriptomics
Qingtian Zhu, Yumin Zheng, Yuling Sang et al.
Model Uncertainty Quantification by Conformal Prediction in Continual Learning
Rui Gao, Weiwei Liu
Foundation Model Insights and a Multi-Model Approach for Superior Fine-Grained One-shot Subset Selection
Zhijing Wan, Zhixiang Wang, Zheng Wang et al.
BiAssemble: Learning Collaborative Affordance for Bimanual Geometric Assembly
Yan Shen, Ruihai Wu, Yubin Ke et al.
Time to Spike? Understanding the Representational Power of Spiking Neural Networks in Discrete Time
Duc Anh Nguyen, Ernesto Araya, Adalbert Fono et al.
Active Learning with Selective Time-Step Acquisition for PDEs
Yegon Kim, Hyunsu Kim, Gyeonghoon Ko et al.
A Bayesian Model Selection Criterion for Selecting Pretraining Checkpoints
Michael Munn, Susan Wei
From Token to Rhythm: A Multi-Scale Approach for ECG-Language Pretraining
Fuying Wang, Jiacheng Xu, Lequan Yu
Modeling All-Atom Glycan Structures via Hierarchical Message Passing and Multi-Scale Pre-training
Minghao Xu, Jiaze Song, Keming Wu et al.
A Versatile Influence Function for Data Attribution with Non-Decomposable Loss
Junwei Deng, Weijing Tang, Jiaqi Ma
Fast, Accurate Manifold Denoising by Tunneling Riemannian Optimization
Shiyu Wang, Mariam Avagyan, Yihan Shen et al.
Enforcing Idempotency in Neural Networks
Nikolaj Jensen, Jamie Vicary
Rhomboid Tiling for Geometric Graph Deep Learning
Yipeng Zhang, Longlong Li, Kelin Xia
RISE: Radius of Influence based Subgraph Extraction for 3D Molecular Graph Explanation
Jingxiang Qu, Wenhan Gao, Jiaxing Zhang et al.
Learning Dynamics under Environmental Constraints via Measurement-Induced Bundle Structures
Dongzhe Zheng, Wenjie Mei
You Always Recognize Me (YARM): Robust Texture Synthesis Against Multi-View Corruption
Weihang Ran, Wei Yuan, Yinqiang Zheng
Splitting with Importance-aware Updating for Heterogeneous Federated Learning with Large Language Models
Yangxu Liao, Wenke Huang, Guancheng Wan et al.
When do neural networks learn world models?
Tianren Zhang, Guanyu Chen, Feng Chen
Efficient Fine-Grained Guidance for Diffusion Model Based Symbolic Music Generation
Tingyu Zhu, Haoyu Liu, Ziyu Wang et al.
Concentration Distribution Learning from Label Distributions
Jiawei Tang, Yuheng Jia
Trust-Region Twisted Policy Improvement
Joery de Vries, Jinke He, Yaniv Oren et al.
$S^2$FGL: Spatial Spectral Federated Graph Learning
Zihan Tan, Suyuan Huang, Guancheng Wan et al.
Information Bottleneck-guided MLPs for Robust Spatial-temporal Forecasting
Min Chen, Guansong Pang, Wenjun Wang et al.
Prediction models that learn to avoid missing values
Lena Stempfle, Anton Matsson, Newton Mwai et al.
Latent Imputation before Prediction: A New Computational Paradigm for De Novo Peptide Sequencing
Ye DU, Chen Yang, Nanxi Yu et al.
TeDS: Joint Learning of Diachronic and Synchronic Perspectives in Quaternion Space for Temporal Knowledge Graph Completion
Jiujiang Guo, Mankun Zhao, Wenbin Zhang et al.
Breaking the Quadratic Barrier: Robust Cardinality Sketches for Adaptive Queries
Edith Cohen, Mihir Singhal, Uri Stemmer
Vision Graph Prompting via Semantic Low-Rank Decomposition
Zixiang Ai, Zichen Liu, Jiahuan Zhou
Variational Phylogenetic Inference with Products over Bipartitions
Evan Sidrow, Alexandre Bouchard-Côté, Lloyd Elliott
Aggregation Buffer: Revisiting DropEdge with a New Parameter Block
Dooho Lee, Myeong Kong, Sagad Hamid et al.
Video-Enhanced Offline Reinforcement Learning: A Model-Based Approach
Minting Pan, Yitao Zheng, Jiajian Li et al.
A Recipe for Causal Graph Regression: Confounding Effects Revisited
Yujia Yin, Tianyi Qu, Zihao Wang et al.
Learning Along the Arrow of Time: Hyperbolic Geometry for Backward-Compatible Representation Learning
Ngoc Bui, Menglin Yang, Runjin Chen et al.
CUPS: Improving Human Pose-Shape Estimators with Conformalized Deep Uncertainty
Harry Zhang, Luca Carlone
A Chaotic Dynamics Framework Inspired by Dorsal Stream for Event Signal Processing
yu chen, Jing Lian, Zhaofei Yu et al.
On Exact Bit-level Reversible Transformers Without Changing Architecture
Guoqiang Zhang, John Lewis, W. Bastiaan Kleijn
Automatic Differentiation of Optimization Algorithms with Time-Varying Updates
Sheheryar Mehmood, Peter Ochs
Learning Mixtures of Experts with EM: A Mirror Descent Perspective
Quentin Fruytier, Aryan Mokhtari, Sujay Sanghavi
Dimensionality Reduction on Complex Vector Spaces for Euclidean Distance with Dynamic Weights
Simone Moretti, Paolo Pellizzoni, Francesco Silvestri
Compositional Scene Understanding through Inverse Generative Modeling
Yanbo Wang, Justin Dauwels, Yilun Du
Variance-Reduced Forward-Reflected-Backward Splitting Methods for Nonmonotone Generalized Equations
Quoc Tran-Dinh
Learning Event Completeness for Weakly Supervised Video Anomaly Detection
Yu Wang, Shiwei Chen
FlexiClip: Locality-Preserving Free-Form Character Animation
Anant Khandelwal
TuCo: Measuring the Contribution of Fine-Tuning to Individual Responses of LLMs
Felipe Nuti, Tim Franzmeyer, Joao Henriques
CtrlSynth: Controllable Image Text Synthesis for Data-Efficient Multimodal Learning
Qingqing Cao, Mahyar Najibi, Sachin Mehta
3D Question Answering via only 2D Vision-Language Models
FENGYUN WANG, Sicheng Yu, Jiawei Wu et al.
A Mathematical Framework for AI-Human Integration in Work
L. Elisa Celis, Lingxiao Huang, Nisheeth K. Vishnoi
SpikeVideoFormer: An Efficient Spike-Driven Video Transformer with Hamming Attention and $\mathcal{O}(T)$ Complexity
Shihao Zou, Qingfeng Li, Wei Ji et al.
Position: Causal Machine Learning Requires Rigorous Synthetic Experiments for Broader Adoption
Audrey Poinsot, Panayiotis Panayiotou, Alessandro Leite et al.
Stable Offline Value Function Learning with Bisimulation-based Representations
Brahma Pavse, Yudong Chen, Qiaomin Xie et al.
Rethinking the Bias of Foundation Model under Long-tailed Distribution
Jiahao Chen, Bin Qin, Jiangmeng Li et al.
Meta Optimality for Demographic Parity Constrained Regression via Post-Processing
Kazuto Fukuchi
Can a Crow Hatch a Falcon? Lineage Matters in Predicting Large Language Model Performance
Takuya Tamura, Taro Yano, Masafumi Enomoto et al.
Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding
Fabian David Schmidt, Ivan Vulić, Goran Glavaš et al.
Hyperparameter Loss Surfaces Are Simple Near their Optima
Nicholas Lourie, He He, Kyunghyun Cho
Inference-Time Decomposition of Activations (ITDA): A Scalable Approach to Interpreting Large Language Models
Patrick Leask, Neel Nanda, Noura Al Moubayed
Navigating the Rabbit Hole: Emergent Biases in LLM-Generated Attack Narratives Targeting Mental Health Groups
Rijul Magu, Arka Dutta, Sean Kim et al.
Diagonal Symmetrization of Neural Network Solvers for the Many-Electron Schrödinger Equation
Kevin Han Huang, Ni Zhan, Elif Ertekin et al.
BiXSE: Improving Dense Retrieval via Probabilistic Graded Relevance Distillation
Christos Tsirigotis, Vaibhav Adlakha, Joao Monteiro et al.
The Negation Bias in Large Language Models: Investigating bias reflected in linguistic markers
Yishan Wang, Pia Sommerauer, Jelke Bloem
Layerwise Importance Analysis of Feed-Forward Networks in Transformer-based Language Models
Wataru Ikeda, Kazuki Yano, Ryosuke Takahashi et al.
Generative Modeling Reinvents Supervised Learning: Label Repurposing with Predictive Consistency Learning
Yang Li, Jiale Ma, Yebin Yang et al.
GenerationPrograms: Fine-grained Attribution with Executable Programs
David Wan, Eran Hirsch, Elias Stengel-Eskin et al.
Multi-Agent Retrieval-Augmented Framework for Evidence-Based Counterspeech Against Health Misinformation
Anirban Saha Anik, Xiaoying Song, Elliott Wang et al.
Task-Circuit Quantization: Leveraging Knowledge Localization and Interpretability for Compression
Hanqi Xiao, Yi-Lin Sung, Elias Stengel-Eskin et al.
Identifying Causal Direction via Variational Bayesian Compression
Quang-Duy Tran, Bao Duong, Phuoc Nguyen et al.
Learning Effective Language Representations for Sequential Recommendation via Joint Embedding Predictive Architecture
Nguyen Anh Minh, Dung D. Le
SEAM: Semantically Equivalent Across Modalities Benchmark for Vision-Language Models
Zhenwei Tang, Difan Jiao, Blair Yang et al.
Implicit In-Context Learning: Evidence from Artificial Language Experiments
Xiaomeng Ma, Qihui Xu
Exploring Large Language Model Agents for Piloting Social Experiments
Jinghua Piao, Yuwei Yan, Nian Li et al.
Do Large Language Models Have a Planning Theory of Mind? Evidence from MindGames: a Multi-Step Persuasion Task
Jared Moore, Ned Cooper, Rasmus Overmark et al.
Zero-Inflated Bandits
Haoyu Wei, Runzhe Wan, Lei Shi et al.
URANIA: Differentially Private Insights into AI Use
Daogao Liu, Edith Cohen, Badih Ghazi et al.
OpinioRAG: Towards Generating User-Centric Opinion Highlights from Large-scale Online Reviews
Mir Tafseer Nayeem, Davood Rafiei
Investigating Intersectional Bias in Large Language Models using Confidence Disparities in Coreference Resolution
Falaah Arif Khan, Nivedha Sivakumar, Yinong Oliver Wang et al.
Learning Utilities from Demonstrations in Markov Decision Processes
Filippo Lazzati, Alberto Maria Metelli
Arrow: Accelerator for Time Series Causal Discovery with Time Weaving
Yuanyuan Yao, Yuan Dong, Lu Chen et al.
Privately Learning from Graphs with Applications in Fine-tuning Large Language Models
Haoteng Yin, Rongzhe Wei, Eli Chien et al.
Exploring Sparse Adapters for Scalable Merging of Parameter Efficient Experts
Samin Yeasar Arnob, Zhan Su, Minseon Kim et al.
The World According to LLMs: How Geographic Origin Influences LLMs' Entity Deduction Capabilities
Harsh Nishant Lalai, Raj Sanjay Shah, Jiaxin Pei et al.