Most Cited 2025 "neural network embeddings" Papers
22,274 papers found • Page 111 of 112
Conference
Efficient Knowledge Transfer in Federated Recommendation for Joint Venture Ecosystem
Yichen Li, Yijing Shan, YI LIU et al.
AngleRoCL: Angle-Robust Concept Learning for Physically View-Invariant Adversarial Patches
Wenjun Ji, Yuxiang Fu, Luyang Ying et al.
I2VControl-Camera: Precise Video Camera Control with Adjustable Motion Strength
Wanquan Feng, Jiawei Liu, Pengqi Tu et al.
Improving the Straight-Through Estimator with Zeroth-Order Information
Ningfeng Yang, Tor Aamodt
Enhancing GUI Agent with Uncertainty-Aware Self-Trained Evaluator
Gongwei Chen, Lirong Jie, Lexiao Zou et al.
Diversity Is All You Need for Contrastive Learning: Spectral Bounds on Gradient Magnitudes
Peter Ochieng
GLNCD: Graph-Level Novel Category Discovery
Bowen Deng, Lele Fu, Sheng Huang et al.
Exploring Semantic-constrained Adversarial Example with Instruction Uncertainty Reduction
Jin Hu, Jiakai Wang, linna Jing et al.
Addressing Mark Imbalance in Integration-free Marked Temporal Point Processes
Sishun Liu, KE DENG, Yongli Ren et al.
The Surprising Effectiveness of Membership Inference with Simple N-Gram Coverage
Skyler Hallinan, Jaehun Jung, Melanie Sclar et al.
Dynamic Diffusion Schrödinger Bridge in Astrophysical Observational Inversions
Ye Zhu, Duo Xu, Zhiwei Deng et al.
Bridging Time and Linguistics: LLMs as Time Series Analyzer through Symbolization and Segmentation
Jianyang Qin, Chaoyang Li, Jinhao Cui et al.
Revealing Multimodal Causality with Large Language Models
Jin Li, Shoujin Wang, Qi Zhang et al.
Leveraging robust optimization for llm alignment under distribution shifts
Mingye Zhu, Yi Liu, Zheren Fu et al.
$\epsilon$-Seg: Sparsely Supervised Semantic Segmentation of Microscopy Data
Sheida Rahnamai Kordasiabi, Damian Nogare, Florian Jug
Second-order Optimization under Heavy-Tailed Noise: Hessian Clipping and Sample Complexity Limits
Abdurakhmon Sadiev, Peter Richtarik, Ilyas Fatkhullin
UFO-RL: Uncertainty-Focused Optimization for Efficient Reinforcement Learning Data Selection
Yang Zhao, Kai Xiong, Xiao Ding et al.
SPARTAN: A Sparse Transformer World Model Attending to What Matters
Anson Lei, Bernhard Schölkopf, Ingmar Posner
Aha! - Predicting What Matters Next: Online Highlight Detection Without Looking Ahead
Aiden Chang, Celso de Melo, Stephanie Lukin
Mixtures of Subspaces for Bandwidth Efficient Context Parallel Training
Sameera Ramasinghe, Thalaiyasingam Ajanthan, Hadi Mohaghegh Dolatabadi et al.
Rectifying Soft-Label Entangled Bias in Long-Tailed Dataset Distillation
Chenyang Jiang, Hang Zhao, Xinyu Zhang et al.
Sample-Efficient Tabular Self-Play for Offline Robust Reinforcement Learning
Na Li, Zewu Zheng, Wei Ni et al.
World Models as Reference Trajectories for Rapid Motor Adaptation
Carlos Stein Brito, Daniel McNamee
SPARKE: Scalable Prompt-Aware Diversity and Novelty Guidance in Diffusion Models via RKE Score
Mohammad Jalali, Haoyu Lei, Amin Gohari et al.
Optimal Rates for Generalization of Gradient Descent for Deep ReLU Classification
Yuanfan Li, Yunwen Lei, Zheng-Chu Guo et al.
Aligning Evaluation with Clinical Priorities: Calibration, Label Shift, and Error Costs
Gerardo Flores, Alyssa H. Smith, Julia Fukuyama et al.
Ranking-based Preference Optimization for Diffusion Models from Implicit User Feedback
Yi-Lun Wu, Bo-Kai Ruan, Chiang Tseng et al.
Residual Stream Analysis of Overfitting And Structural Disruptions
Quan Liu, Han Zhou, Wenquan Wu et al.
CausalVerse: Benchmarking Causal Representation Learning with Configurable High-Fidelity Simulations
Guangyi Chen, Yunlong Deng, Peiyuan Zhu et al.
ForgerySleuth: Empowering Multimodal Large Language Models for Image Manipulation Detection
Zhihao Sun, Haoran Jiang, Haoran Chen et al.
Heterogeneous Graph Transformers for Simultaneous Mobile Multi-Robot Task Allocation and Scheduling under Temporal Constraints
Batuhan Altundas, Shengkang Chen, Shivika Singh et al.
Stop the Nonconsensual Use of Nude Images in Research
Princessa Cintaqia, Arshia Arya, Elissa Redmiles et al.
Multimodal LiDAR-Camera Novel View Synthesis with Unified Pose-free Neural Fields
Weiyi Xue, Fan Lu, Yunwei Zhu et al.
BAM-ICL: Causal Hijacking In-Context Learning with Budgeted Adversarial Manipulation
Rui Chu, Bingyin Zhao, Hanling Jiang et al.
Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training
Brian Bartoldson, Siddarth Venkatraman, James Diffenderfer et al.
FairDICE: Fairness-Driven Offline Multi-Objective Reinforcement Learning
Woosung Kim, Jinho Lee, Jongmin Lee et al.
A Minimalistic Unified Framework for Incremental Learning across Image Restoration Tasks
Xiaoxuan Gong, Jie Ma
Consistency of the $k_n$-nearest neighbor rule under adaptive sampling
Robi Bhattacharjee, Geelon So, Sanjoy Dasgupta
On Learning Verifiers and Implications to Chain-of-Thought Reasoning
Maria-Florina Balcan, Avrim Blum, Zhiyuan Li et al.
GOATex: Geometry & Occlusion-Aware Texturing
Hyunjin Kim, Kunho Kim, Adam Lee et al.
Uncertain Knowledge Graph Completion via Semi-Supervised Confidence Distribution Learning
Tianxing Wu, Shutong Zhu, Jingting Wang et al.
Towards Generalizable Detector for Generated Image
Qianshu Cai, Chao Wu, Yonggang Zhang et al.
Understanding challenges to the interpretation of disaggregated evaluations of algorithmic fairness
Stephen Pfohl, Natalie Harris, Chirag Nagpal et al.
Optimal Single-Policy Sample Complexity and Transient Coverage for Average-Reward Offline RL
Matthew Zurek, Guy Zamir, Yudong Chen
Learning to Zoom with Anatomical Relations for Medical Structure Detection
Bin Pu, Liwen Wang, Xingbo Dong et al.
vHector and HeisenVec: Scalable Vector Graphics Generation Through Large Language Models
Leonardo Zini, Elia Frigieri, Sebastiano Aloscari et al.
Private Hyperparameter Tuning with Ex-Post Guarantee
Badih Ghazi, Pritish Kamath, Alexander Knop et al.
Effective Policy Learning for Multi-Agent Online Coordination Beyond Submodular Objectives
Qixin Zhang, Yan Sun, Can Jin et al.
Error Forcing in Recurrent Neural Networks
A Sağtekin, Colin Bredenberg, Cristina Savin
Proper Hölder-Kullback Dirichlet Diffusion: A Framework for High Dimensional Generative Modeling
Wanpeng Zhang, Yuhao Fang, Xihang Qiu et al.
LD-RoViS: Training-free Robust Video Steganography for Deterministic Latent Diffusion Model
Xiangkun Wang, Kejiang Chen, Lincong Li et al.
A Statistical Framework of Watermarks for Large Language Models: Pivot, Detection Efficiency and Optimal Rules
Xiang Li, Feng Ruan, Huiyuan Wang et al.
DevFD : Developmental Face Forgery Detection by Learning Shared and Orthogonal LoRA Subspaces
Tianshuo Zhang, Li Gao, Siran Peng et al.
Clustering via Hedonic Games: New Concepts and Algorithms
Gergely Csáji, Alexander Gundert, Jörg Rothe et al.
TRELLIS: Learning to Compress Key-Value Memory in Attention Models
Mahdi Karami, Ali Behrouz, Praneeth Kacham et al.
Fine-grained Analysis and Faster Algorithms for Iteratively Solving Linear Systems
Michal Derezinski, Daniel LeJeune, Deanna Needell et al.
Stochastic-Constrained Stochastic Optimization with Markovian Data
Yeongjong Kim, Dabeen Lee
Position: Towards Bidirectional Human-AI Alignment
Hua Shen, Tiffany Knearem, Reshmi Ghosh et al.
Defining and Discovering Hyper-meta-paths for Heterogeneous Hypergraphs
Yaming Yang, Ziyu Zheng, Weigang Lu et al.
DataSIR: A Benchmark Dataset for Sensitive Information Recognition
Fan Mo, Bo Liu, Yuan Fan et al.
FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios
Ethan Chern, Steffi Chern, Shiqi Chen et al.
Intend to Move: A Multimodal Dataset for Intention-Aware Human Motion Understanding
Ryo Umagami, Liu Yue, Xuangeng Chu et al.
DAVE: Diagnostic benchmark for Audio Visual Evaluation
Gorjan Radevski, Teodora Popordanoska, Matthew Blaschko et al.
OVERT: A Benchmark for Over-Refusal Evaluation on Text-to-Image Models
Ziheng Cheng, Yixiao Huang, Hui Xu et al.
MM-OPERA: Benchmarking Open-ended Association Reasoning for Large Vision-Language Models
Zimeng Huang, Jinxin Ke, Xiaoxuan Fan et al.
MolVision: Molecular Property Prediction with Vision Language Models
Deepan Adak, Yogesh Rawat, Shruti Vyas
CheMixHub: Datasets and Benchmarks for Chemical Mixture Property Prediction
Ella Miray Rajaonson, Mahyar Rajabi Kochi, Luis Martin Mejia Mendoza et al.
Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence
Yining Hong, Rui Sun, Bingxuan Li et al.
SURDS: Benchmarking Spatial Understanding and Reasoning in Driving Scenarios with Vision Language Models
Xianda Guo, Ruijun Zhang, Yiqun Duan et al.
TreeFinder: A US-Scale Benchmark Dataset for Individual Tree Mortality Monitoring Using High-Resolution Aerial Imagery
Zhihao Wang, Cooper Li, Ruichen Wang et al.
Dense Backpropagation Improves Training for Sparse Mixture-of-Experts
Ashwinee Panda, Vatsal Baherwani, Zain Sarwar et al.
Rethinking Entropy in Test-Time Adaptation: The Missing Piece from Energy Duality
Mincheol Park, Heeji Won, Won Woo Ro et al.
Collaborating Vision, Depth, and Thermal Signals for Multi-Modal Tracking: Dataset and Algorithm
Xue-Feng Zhu, Tianyang Xu, Yifan Pan et al.
REFED: A Subject Real-time Dynamic Labeled EEG-fNIRS Synchronized Recorded Emotion Dataset
Xiaojun Ning, Jing Wang, Zhiyang Feng et al.
PF∆: A Benchmark Dataset for Power Flow under Load, Generation, and Topology Variations
Ana Rivera Him, Anvita Bhagavathula, Alvaro Carbonero et al.
GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents
Manish Shetty, Naman Jain, Jinjian Liu et al.
PREAMBLE: Private and Efficient Aggregation via Block Sparse Vectors
Hilal Asi, Vitaly Feldman, Hannah Keller et al.
STSBench: A Large-Scale Dataset for Modeling Neuronal Activity in the Dorsal Stream of Primate Visual Cortex
Ethan Trepka, Ruobing Xia, Shude Zhu et al.
SeasonBench-EA: A Multi-Source Benchmark for Seasonal Prediction and Numerical Model Post-Processing in East Asia
Mengxuan Chen, Li, Zou Ziheng et al.
PUO-Bench: A Panel Understanding and Operation Benchmark with A Privacy-Preserving Framework
Wei LIN, Yiwei Zhou, Junkai Zhang et al.
Sequential Monte Carlo for Policy Optimization in Continuous POMDPs
Hany Abdulsamad, Sahel Mohammad Iqbal, Simo Sarkka
A Scalable, Causal, and Energy Efficient Framework for Neural Decoding with Spiking Neural Networks
Georgios Mentzelopoulos, Ioannis Asmanis, Konrad Kording et al.
Augmenting Biological Fitness Prediction Benchmarks with Landscapes Features from GraphFLA
Mingyu Huang, Shasha Zhou, Ke Li
egoEMOTION: Egocentric Vision and Physiological Signals for Emotion and Personality Recognition in Real-world Tasks
Matthias Jammot, Björn Braun, Paul Streli et al.
From Objects to Anywhere: A Holistic Benchmark for Multi-level Visual Grounding in 3D Scenes
Tianxu Wang, Zhuofan Zhang, Ziyu Zhu et al.
VMDT: Decoding the Trustworthiness of Video Foundation Models
Yujin Potter, Zhun Wang, Nicholas Crispino et al.
Towards Automated Petrography
Isai Daniel Chacon, Paola Ruiz Puentes, Jillian Pearse et al.
QCircuitBench: A Large-Scale Dataset for Benchmarking Quantum Algorithm Design
Rui Yang, Ziruo Wang, Yuntian Gu et al.
Multimodal Bandits: Regret Lower Bounds and Optimal Algorithms
William Réveillard, Richard Combes
Can Large Language Models Integrate Spatial Data? Empirical Insights into Reasoning Strengths and Computational Weaknesses
Bin HAN, Robert Wolfe, Anat Caspi et al.
MMIG-Bench: Towards Comprehensive and Explainable Evaluation of Multi-Modal Image Generation Models
Hang Hua, Ziyun Zeng, Yizhi Song et al.
Sampled Estimators For Softmax Must Be Biased
Li-Chung Lin, Yaxu Liu, Chih-Jen Lin
Beyond Last-Click: An Optimal Mechanism for Ad Attribution
Nan An, Weian Li, Qi Qi et al.
Benchmarking Egocentric Multimodal Goal Inference for Assistive Wearable Agents
Vijay Veerabadran, Fanyi Xiao, Nitin Kamra et al.
Improving Progressive Generation with Decomposable Flow Matching
Moayed Haji-Ali, Willi Menapace, Ivan Skorokhodov et al.
Long-term Intracortical Neural activity and Kinematics (LINK): An intracortical neural dataset for chronic brain-machine interfaces, neuroscience, and machine learning
Hisham Temmar, Yixuan Wang, Nina Gill et al.
IR-OptSet: An Optimization-Sensitive Dataset for Advancing LLM-Based IR Optimizer
Zi Yang, Lei Qiu, FANG LYU et al.
CGBench: Benchmarking Language Model Scientific Reasoning for Clinical Genetics Research
Owen Queen, Harrison Zhang, James Zou
Large Language Models for Lossless Image Compression: Next-Pixel Prediction in Language Space is All You Need
Kecheng Chen, Pingping Zhang, Hui Liu et al.
Security Challenges in AI Agent Deployment: Insights from a Large Scale Public Competition
Andy Zou, Maxwell Lin, Eliot Jones et al.
Generative Distribution Embeddings
Nic Fishman, Gokul Gowri, Peng Yin et al.
Pretraining on the Test Set Is No Longer All You Need: A Debate-Driven Approach to QA Benchmarks
Linbo Cao, Jinman Zhao
HawkBench: Investigating Resilience of RAG Methods on Stratified Information-Seeking Tasks
Hongjin Qian, Zheng Liu, Chao Gao et al.
Breaking the Batch Barrier (B3) of Contrastive Learning via Smart Batch Mining
Raghuveer Thirukovalluru, Rui Meng, Ye Liu et al.
Toward Real-world Text Image Forgery Localization: Structured and Interpretable Data Synthesis
Zeqin Yu, Haotao Xie, Jian Zhang et al.
Multi-Objective One-Shot Pruning for Large Language Models
Weiyu Chen, Hansi Yang, Yunhao Gou et al.
EngiBench: A Framework for Data-Driven Engineering Design Research
Florian Felten, Gabriel Apaza, Gerhard Bräunlich et al.
Contextual Tokenization for Graph Inverted Indices
Pritish Chakraborty, Indradyumna Roy, Soumen Chakrabarti et al.
Mind the Quote: Enabling Quotation-Aware Dialogue in LLMs via Plug-and-Play Modules
Yueqi Zhang, Peiwen Yuan, Yiwei Li et al.
DermaCon-IN: A Multiconcept-Annotated Dermatological Image Dataset of Indian Skin Disorders for Clinical AI Research
Shanawaj Sahebpatel Madarkar, Mahajabeen Madarkar, Madhumitha Venkatesh et al.
LV-Eval: A Balanced Long-Context Benchmark with 5 Length Levels Up to 256K
Tao Yuan, Xuefei Ning, Dong Zhou et al.
Words That Unite The World: A Unified Framework for Deciphering Central Bank Communications
Agam Shah, Siddhant Sukhani, Huzaifa Pardawala et al.
PMLF: A Physics-Guided Multiscale Loss Framework for Structurally Heterogeneous Time Series
Xinghong Chen, Weilin Wu, Kunping Yang et al.
What’s in Common? Multimodal Models Hallucinate When Reasoning Across Scenes
Candace Ross, Florian Bordes, Adina Williams et al.
Deep Binding of Language Model Virtual Personas: a Study on Approximating Political Partisan Misperceptions
Minwoo Kang, Suhong Moon, Seung Hyeong Lee et al.
Siegel Neural Networks
Xuan Son Nguyen, Aymeric Histace, Nistor Grozavu
QuestBench: Can LLMs ask the right question to acquire information in reasoning tasks?
Belinda Li, Been Kim, Zi Wang
AGI-Elo: How Far Are We From Mastering A Task?
Shuo Sun, Yimin Zhao, Christina Lee et al.
LCDB 1.1: A Database Illustrating Learning Curves Are More Ill-Behaved Than Previously Thought
Cheng Yan, Felix Mohr, Tom Viering
All that structure matches does not glitter
Maya Martirossyan, Thomas Egg, Philipp Höllmer et al.
MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems
Yinsicheng Jiang, Yao Fu, Yeqi Huang et al.
Impromptu VLA: Open Weights and Open Data for Driving Vision-Language-Action Models
Haohan Chi, Huan-ang Gao, Ziming Liu et al.
Chain-of-Model Learning for Language Model
Xiaohua Wang, Kaitao Song, Xu Tan et al.
Training-Free Diffusion Model Alignment with Sampling Demons
Po-Hung Yeh, Kuang-Huei Lee, Jun-Cheng Chen
Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization
Wenkai Yang, Shiqi Shen, Guangyao Shen et al.
Bilinear MLPs enable weight-based mechanistic interpretability
Michael Pearce, Thomas Dooms, Alice Rigg et al.
Dynamic Negative Guidance of Diffusion Models
Felix Koulischer, Johannes Deleu, Gabriel Raya et al.
Prompting Fairness: Integrating Causality to Debias Large Language Models
Jingling Li, Zeyu Tang, Xiaoyu Liu et al.
Statistical Advantages of Perturbing Cosine Router in Mixture of Experts
Huy Nguyen, Pedram Akbarian Saravi, Trang Pham et al.
Locality Alignment Improves Vision-Language Models
Ian Covert, Tony Sun, James Y Zou et al.
LDAdam: Adaptive Optimization from Low-Dimensional Gradient Statistics
Thomas Robert, Mher Safaryan, Ionut-Vlad Modoranu et al.
To Clip or not to Clip: the Dynamics of SGD with Gradient Clipping in High-Dimensions
Noah Marshall, Ke Liang Xiao, Atish Agarwala et al.
ClinBench: A Standardized Multi-Domain Framework for Evaluating Large Language Models in Clinical Information Extraction
Ismael Villanueva Miranda, Zifan Gu, Donghan Yang et al.
Self-Normalized Resets for Plasticity in Continual Learning
Vivek Farias, Adam Jozefiak
Scalable Best-of-N Selection for Large Language Models via Self-Certainty
Zhewei Kang, Xuandong Zhao, Dawn Song
Regularizing Energy among Training Samples for Out-of-Distribution Generalization
Yiting Chen, Qitian Wu, Junchi Yan
Parameter and Memory Efficient Pretraining via Low-rank Riemannian Optimization
Zhanfeng Mo, Long-Kai Huang, Sinno Jialin Pan
Training on the Test Task Confounds Evaluation and Emergence
Ricardo Dominguez-Olmedo, Florian Eddie Dorner, Moritz Hardt
Efficient Low-Bit Quantization with Adaptive Scales for Multi-Task Co-Training
Boyu Liu, Haoyu Huang, Linlin Yang et al.
Towards Unified Human Motion-Language Understanding via Sparse Interpretable Characterization
guangtao lyu, Chenghao Xu, Jiexi Yan et al.
Disentangled Representation Learning with the Gromov-Monge Gap
Théo Uscidda, Luca Eyring, Karsten Roth et al.
COME: Test-time Adaption by Conservatively Minimizing Entropy
Qingyang Zhang, Yatao Bian, Xinke Kong et al.
Spherical Tree-Sliced Wasserstein Distance
Viet-Hoang Tran, Thanh Chu, Minh-Khoi Nguyen-Nhat et al.
SVBench: A Benchmark with Temporal Multi-Turn Dialogues for Streaming Video Understanding
Zhenyu Yang, Yuhang Hu, Zemin Du et al.
Oracle efficient truncated statistics
Konstantinos Karatapanis, Vasilis Kontonis, Christos Tzamos
SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation
Mingjie Li, Wai Man Si, Michael Backes et al.
DeeperForward: Enhanced Forward-Forward Training for Deeper and Better Performance
Liang Sun, Yang Zhang, Weizhao He et al.
Conformalized Interactive Imitation Learning: Handling Expert Shift and Intermittent Feedback
Michelle Zhao, Henny Admoni, Reid Simmons et al.
A Computational Framework for Modeling Emergence of Color Vision in the Human Brain
Atsunobu Kotani, Yi-Ren Ng
Smooth Quadratic Prediction Markets
Enrique Nueve, Bo Waggoner
Unsupervised Multiple Kernel Learning for Graphs via Ordinality Preservation
Yan Sun, Stanley Kok
Privacy-Preserving V2X Collaborative Perception Integrating Unknown Collaborators
Bin Lu, Xinyu Xiao, Changzhou Zhang et al.
SPA-BENCH: A COMPREHENSIVE BENCHMARK FOR SMARTPHONE AGENT EVALUATION
Jingxuan Chen, Derek Yuen, Bin Xie et al.
Leveraging Flatness to Improve Information-Theoretic Generalization Bounds for SGD
Ze Peng, Jian Zhang, Yisen Wang et al.
Policy Design in Long-run Welfare Dynamics
Jiduan Wu, Rediet Abebe, Moritz Hardt et al.
Flaws of ImageNet, Computer Vision's Favourite Dataset
Nikita Kisel, Illia Volkov, Kateřina Hanzelková et al.
Lossy Compression with Pretrained Diffusion Models
jeremy vonderfecht, Feng Liu
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Haozhe Ma, Zhengding Luo, Thanh Vinh Vo et al.
SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration
Jintao Zhang, Jia wei, Pengle Zhang et al.
Evidential Learning-based Certainty Estimation for Robust Dense Feature Matching
Lile Cai, Chuan Sheng Foo, Xun Xu et al.
Maintaining Structural Integrity in Parameter Spaces for Parameter Efficient Fine-tuning
Chongjie Si, Xuehui Wang, Xue Yang et al.
Rationalizing and Augmenting Dynamic Graph Neural Networks
Guibin Zhang, Yiyan Qi, Ziyang Cheng et al.
PIN: Prolate Spheroidal Wave Function-based Implicit Neural Representations
Viraj Dhananjaya Bandara Jayasundara Jayasundara Mudiyanselage, Heng Zhao, Demetrio Labate et al.
DeMo: Deep Motion Field Consensus with Learnable Kernels for Two-view Correspondence Learning
Yifan Lu, Jiajun Le, Zizhuo Li et al.
Efficient Dictionary Learning with Switch Sparse Autoencoders
Anish Mudide, Josh Engels, Eric Michaud et al.
LeanVec: Searching vectors faster by making them fit
Ishwar Bhati, Cecilia Aguerrebere, Mark Hildebrand et al.
Extendable and Iterative Structure Learning Strategy for Bayesian Networks
Hamid Kalantari, Russell Greiner, Pouria Ramazi
Transformers Provably Solve Parity Efficiently with Chain of Thought
Juno Kim, Taiji Suzuki
On Evaluating the Durability of Safeguards for Open-Weight LLMs
Xiangyu Qi, Boyi Wei, Nicholas Carlini et al.
Tailoring Mixup to Data for Calibration
Quentin Bouniot, Pavlo Mozharovskyi, Florence d'Alché-Buc
Generalized Behavior Learning from Diverse Demonstrations
Varshith Sreeramdass, Rohan Paleja, Letian Chen et al.
DPaI: Differentiable Pruning at Initialization with Node-Path Balance Principle
Lichuan Xiang, Quan Nguyen-Tri, Lan-Cuong Nguyen et al.
Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization
Yuxin Jiang, Bo Huang, Yufei Wang et al.
ILLUSION: Unveiling Truth with a Comprehensive Multi-Modal, Multi-Lingual Deepfake Dataset
Kartik Thakral, Rishabh Ranjan, Akanksha Singh et al.
Large Convolutional Model Tuning via Filter Subspace
Wei Chen, Zichen Miao, Qiang Qiu
What is Your Data Worth to GPT? LLM-Scale Data Valuation with Influence Functions
Sang Choe, Hwijeen Ahn, Juhan Bae et al.
RuAG: Learned-rule-augmented Generation for Large Language Models
Yudi Zhang, Pei Xiao, Lu Wang et al.
SSOLE: Rethinking Orthogonal Low-rank Embedding for Self-Supervised Learning
Lun Huang, Qiang Qiu, Guillermo Sapiro
TopoLM: brain-like spatio-functional organization in a topographic language model
Neil Rathi, Johannes Mehrer, Badr AlKhamissi et al.
Looking into User’s Long-term Interests through the Lens of Conservative Evidential Learning
Dingrong Wang, Krishna Neupane, Ervine Zheng et al.
Near-Exact Privacy Amplification for Matrix Mechanisms
Christopher Choquette-Choo, Arun Ganesh, Saminul Haque et al.
The Last Iterate Advantage: Empirical Auditing and Principled Heuristic Analysis of Differentially Private SGD
Milad Nasr, Thomas Steinke, Borja Balle et al.
AnalogGenie: A Generative Engine for Automatic Discovery of Analog Circuit Topologies
Jian Gao, Weidong Cao, Junyi Yang et al.
IterGen: Iterative Semantic-aware Structured LLM Generation with Backtracking
Shubham Dipak Ugare, Rohan Gumaste, Tarun Suresh et al.
Improving Deep Regression with Tightness
Shihao Zhang, Yuguang Yan, Angela Yao
GSE: Group-wise Sparse and Explainable Adversarial Attacks
Shpresim Sadiku, Moritz Wagner, Sebastian Pokutta
Verifying Properties of Binary Neural Networks Using Sparse Polynomial Optimization
Jianting Yang, Srecko Durasinovic, Jean Bernard Lasserre et al.
The impact of allocation strategies in subset learning on the expressive power of neural networks
Ofir Schlisselberg, Ran Darshan
A Temporal Difference Method for Stochastic Continuous Dynamics
Haruki Settai, Naoya Takeishi, Takehisa Yairi
SelectFormer in Data Markets: Privacy-Preserving and Efficient Data Selection for Transformers with Multi-Party Computation
Xu Ouyang, Felix Xiaozhu Lin, Yangfeng Ji
Inverse Attention Agents for Multi-Agent Systems
Qian Long, Ruoyan Li, Minglu Zhao et al.
Wavelet Diffusion Neural Operator
Peiyan Hu, Rui Wang, Xiang Zheng et al.
VCR: A Task for Pixel-Level Complex Reasoning in Vision Language Models via Restoring Occluded Text
Tianyu Zhang, Suyuchen Wang, Lu Li et al.
OccProphet: Pushing the Efficiency Frontier of Camera-Only 4D Occupancy Forecasting with an Observer-Forecaster-Refiner Framework
Junliang Chen, Huaiyuan Xu, Yi Wang et al.
Agree to Disagree: Demystifying Homogeneous Deep Ensembles through Distributional Equivalence
Yipei Wang, Xiaoqian Wang
Discovering Clone Negatives via Adaptive Contrastive Learning for Image-Text Matching
Renjie Pan, Jihao Dong, Hua Yang
RAG-SR: Retrieval-Augmented Generation for Neural Symbolic Regression
Hengzhe Zhang, Qi Chen, Bing XUE et al.
Mitigating Reward Over-Optimization in RLHF via Behavior-Supported Regularization
Juntao Dai, Taiye Chen, Yaodong Yang et al.
Transformer Meets Twicing: Harnessing Unattended Residual Information
Laziz Abdullaev, Tan Nguyen
MoLEx: Mixture of Layer Experts for Fine-tuning with Sparse Upcycling
Rachel Teo, Tan Nguyen