Most Cited NeurIPS "multimodal transformers" Papers
Optimal community detection in dense bipartite graphs
Julien Chhor, Parker Knight
RETRO SYNFLOW: Discrete Flow-Matching for Accurate and Diverse Single-Step Retrosynthesis
Robin Yadav, Qi Yan, Guy Wolf et al.
Fast MRI for All: Bridging Access Gaps by Training without Raw Data
Yasar Utku Alcalar, Merve Gulle, Mehmet Akcakaya
VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play
Zelai Xu, Ruize Zhang, Chao Yu et al.
Towards Prospective Medical Image Reconstruction via Knowledge-Informed Dynamic Optimal Transport
Taoran Zheng, Yan Yang, Xing Li et al.
DeepASA: An Object-Oriented Multi-Purpose Network for Auditory Scene Analysis
Dongheon Lee, Younghoo Kwon, Jung-Woo Choi
Probing Neural Combinatorial Optimization Models
Zhiqin Zhang, Yining Ma, Zhiguang Cao et al.
Balancing Gradient and Hessian Queries in Non-Convex Optimization
Deeksha Adil, Brian Bullins, Aaron Sidford et al.
MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems
Yinsicheng Jiang, Yao Fu, Yeqi Huang et al.
From Euler to AI: Unifying Formulas for Mathematical Constants
Tomer Raz, Michael Shalyt, Elyasheev Leibtag et al.
Geometry-Aware Edge Pooling for Graph Neural Networks
Katharina Limbeck, Lydia Mezrag, Guy Wolf et al.
Recursive Inference Scaling: A Winning Path to Scalable Inference in Language and Multimodal Systems
Ibrahim Alabdulmohsin, Xiaohua Zhai
Any-stepsize Gradient Descent for Separable Data under Fenchel–Young Losses
Han Bao, Shinsaku Sakaue, Yuki Takezawa
How Many Tokens Do 3D Point Cloud Transformer Architectures Really Need?
Tuan Tran Anh, Duy M. H. Nguyen, Hoai-Chau Tran et al.
Exploiting LLMs for Automatic Hypothesis Assessment via a Logit-Based Calibrated Prior
Yue Gong, Raul Fernandez
Restoring Pruned Large Language Models via Lost Component Compensation
Zijian Feng, Hanzhang Zhou, Zixiao Zhu et al.
Are Large Reasoning Models Good Translation Evaluators? Analysis and Performance Boost
Runzhe Zhan, Zhihong Huang, Xinyi Yang et al.
DISCO: Disentangled Communication Steering for Large Language Models
Max Torop, Aria Masoomi, Masih Eskandar et al.
Deep Value Benchmark: Measuring Whether Models Generalize Deep Values or Shallow Preferences
Joshua Ashkinaze, Hua Shen, Saipranav Avula et al.
HOI-Dyn: Learning Interaction Dynamics for Human-Object Motion Diffusion
Lin Wu, Zhixiang Chen, Jianglin Lan
Multi-Agent Learning under Uncertainty: Recurrence vs. Concentration
Kyriakos Lotidis, Panayotis Mertikopoulos, Nicholas Bambos et al.
LCDB 1.1: A Database Illustrating Learning Curves Are More Ill-Behaved Than Previously Thought
Cheng Yan, Felix Mohr, Tom Viering
Neurons as Detectors of Coherent Sets in Sensory Dynamics
Joshua L Pughe-Sanford, Xuehao Ding, Jason Moore et al.
GreenHyperSpectra: A multi-source hyperspectral dataset for global vegetation trait prediction
Eya Cherif, Arthur Ouaknine, Luke Brown et al.
What One Cannot, Two Can: Two-Layer Transformers Provably Represent Induction Heads on Any-Order Markov Chains
Chanakya Ekbote, Ashok Vardhan Makkuva, Marco Bondaschi et al.
On Minimax Estimation of Parameters in Softmax-Contaminated Mixture of Experts
Fanqi Yan, Huy Nguyen, Le Dung et al.
Graph Diffusion that can Insert and Delete
Matteo Ninniri, Marco Podda, Davide Bacciu
Impact of Layer Norm on Memorization and Generalization in Transformers
Rishi Singhal, Jung-Eun Kim
VLMs have Tunnel Vision: Evaluating Nonlocal Visual Reasoning in Leading VLMs
Shmuel Berman, Jia Deng
Perturbation Bounds for Low-Rank Inverse Approximations under Noise
Phuc Tran, Nisheeth K. Vishnoi
TRACE: Contrastive learning for multi-trial time series data in neuroscience
Lisa Schmors, Dominic Gonschorek, Jan Niklas Böhm et al.
VideoCAD: A Dataset and Model for Learning Long-Horizon 3D CAD UI Interactions from Video
King Yiu Brandon Man, Ghadi Nehme, Md Ferdous Alam et al.
On the Entropy Calibration of Language Models
Steven Cao, Gregory Valiant, Percy Liang
MACS: Multi-Agent Reinforcement Learning for Optimization of Crystal Structures
Elena Zamaraeva, Christopher Collins, George Darling et al.
Channel Simulation and Distributed Compression with Ensemble Rejection Sampling
Truong Buu Phan, Ashish Khisti
Learning Intractable Multimodal Policies with Reparameterization and Diversity Regularization
Ziqi Wang, Jiashun Liu, Ling Pan
Repurposing Marigold for Zero-Shot Metric Depth Estimation via Defocus Blur Cues
Chinmay Talegaonkar, Nikhil Gandudi Suresh, Zachary Novack et al.
VIBE: Annotation-Free Video-to-Text Information Bottleneck Evaluation for TL;DR
Shenghui Chen, Po-han Li, Sandeep Chinchali et al.
Oracle-Efficient Combinatorial Semi-Bandits
Jung-hun Kim, Milan Vojnovic, Min-hwan Oh
AiDE-Q: Synthetic Labeled Datasets Can Enhance Learning Models for Quantum Property Estimation
Xinbiao Wang, Yuxuan Du, Zihan Lou et al.
Multitask Learning with Stochastic Interpolants
Hugo Negrel, Florentin Coeurdoux, Michael Albergo et al.
Reinventing Multi-Agent Collaboration through Gaussian-Image Synergy in Diffusion Policies
Ziye Wang, Li Kang, Yiran Qin et al.
Adjusted Count Quantification Learning on Graphs
Clemens Damke, Eyke Hüllermeier
What’s in Common? Multimodal Models Hallucinate When Reasoning Across Scenes
Candace Ross, Florian Bordes, Adina Williams et al.
Thresholds for sensitive optimality and Blackwell optimality in stochastic games
Stephane Gaubert, Julien Grand-Clément, Ricardo Katz
Adversarial Diffusion for Robust Reinforcement Learning
Daniele Foffano, Alessio Russo, Alexandre Proutiere
QSVD: Efficient Low-rank Approximation for Unified Query-Key-Value Weight Compression in Low-Precision Vision-Language Models
Yutong Wang, Haiyu Wang, Sai Qian Zhang
Words That Unite The World: A Unified Framework for Deciphering Central Bank Communications
Agam Shah, Siddhant Sukhani, Huzaifa Pardawala et al.
Projection-based Lyapunov method for fully heterogeneous weakly-coupled MDPs
Xiangcheng Zhang, Yige Hong, Weina Wang
FlashBias: Fast Computation of Attention with Bias
Haixu Wu, Minghao Guo, Yuezhou Ma et al.
Concentration and excess risk bounds for imbalanced classification with synthetic oversampling
Touqeer Ahmad, Mohammadreza Mousavi Kalan, François Portier et al.
VADB: A Large-Scale Video Aesthetic Database with Professional and Multi-Dimensional Annotations
Qianqian Qiao, DanDan Zheng, Yihang Bo et al.
Class-wise Balancing Data Replay for Federated Class-Incremental Learning
Zhuang Qi, Ying-Peng Tang, Lei Meng et al.
Contribution of task-irrelevant stimuli to drift of neural representations
Farhad Pashakhanloo
Revisiting Agnostic Boosting
Arthur da Cunha, Mikael Møller Høgsgaard, Andrea Paudice et al.
Creativity or Brute Force? Using Brainteasers as a Window into the Problem-Solving Abilities of Large Language Models
Sophia Han, Howard Dai, Stephen Xia et al.
Are Greedy Task Orderings Better Than Random in Continual Linear Regression?
Matan Tsipory, Ran Levinstein, Itay Evron et al.
Non-convex entropic mean-field optimization via Best Response flow
Razvan-Andrei Lascu, Mateusz Majka
Efficient Large Language Model Inference with Neural Block Linearization
Mete Erdogan, Francesco Tonin, Volkan Cevher
Differentiable extensions with rounding guarantees for combinatorial optimization over permutations
Robert (Riley) Nerem, Zhishang Luo, Akbar Rafiey et al.
EnzyControl: Adding Functional and Substrate-Specific Control for Enzyme Backbone Generation
Chao Song, ZHIYUAN LIU, Han Huang et al.
Rethinking Approximate Gaussian Inference in Classification
Bálint Mucsányi, Nathaël Da Costa, Philipp Hennig
Fast Training of Large Kernel Models with Delayed Projections
Amirhesam Abedsoltan, Siyuan Ma, Parthe Pandit et al.
GPAS: Accelerating Convergence of LLM Pretraining via Gradient-Preserving Activation Scaling
Tianhao Chen, Xin Xu, Zijing Liu et al.
Personalized Image Editing in Text-to-Image Diffusion Models via Collaborative Direct Preference Optimization
Connor Dunlop, Matthew Zheng, Kavana Venkatesh et al.
The Burden of Interactive Alignment with Inconsistent Preferences
Ali Shirali
GUI-Rise: Structured Reasoning and History Summarization for GUI Navigation
Tao Liu, Chongyu Wang, Rongjie Li et al.
Transferring Linear Features Across Language Models With Model Stitching
Alan Chen, Jack Merullo, Alessandro Stolfo et al.
In Silico Mapping of Visual Categorical Selectivity Across the Whole Brain
Ethan Hwang, Hossein Adeli, Wenxuan Guo et al.
Towards Understanding Transformers in Learning Random Walks
Wei Shi, Yuan Cao
DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling
Kairun Wen, Yuzhihuang, Runyu Chen et al.
Contextual Dynamic Pricing with Heterogeneous Buyers
Thodoris Lykouris, Sloan Nietert, Princewill Okoroafor et al.
Scalable Valuation of Human Feedback through Provably Robust Model Alignment
Masahiro Fujisawa, Masaki Adachi, Michael A Osborne
SyncHuman: Synchronizing 2D and 3D Generative Models for Single-view Human Reconstruction
Wenyue Chen, Peng Li, Wangguandong Zheng et al.
NegoCollab: A Common Representation Negotiation Approach for Heterogeneous Collaborative Perception
CONGZHANG SHAO, Quan Yuan, Guiyang Luo et al.
ShortListing Model: A Streamlined Simplex Diffusion for Discrete Variable Generation
Yuxuan Song, Zhe Zhang, Yu Pei et al.
Hyper-GoalNet: Goal-Conditioned Manipulation Policy Learning with HyperNetworks
Pei Zhou, Wanting Yao, Qian Luo et al.
Revisiting Orbital Minimization Method for Neural Operator Decomposition
Jongha (Jon) Ryu, Samuel Zhou, Gregory Wornell
Neural Collapse under Gradient Flow on Shallow ReLU Networks for Orthogonally Separable Data
Hancheng Min, Zhihui Zhu, Rene Vidal
Neuro-Spectral Architectures for Causal Physics-Informed Networks
Arthur Bizzi, Leonardo Moreira, Márcio Marques et al.
CTRL-ALT-DECEIT: Sabotage Evaluations for Automated AI R&D
Francis Ward, Teun van der Weij, Hanna Gábor et al.
Causal Climate Emulation with Bayesian Filtering
Sebastian H. M. Hickman, Ilija Trajković, Julia Kaltenborn et al.
Disentangling Superpositions: Interpretable Brain Encoding Model with Sparse Concept Atoms
Alicia Zeng, Jack Gallant
Evaluating Program Semantics Reasoning with Type Inference in System F
Yifeng He, Luning Yang, Christopher Gonzalo et al.
Aligning Evaluation with Clinical Priorities: Calibration, Label Shift, and Error Costs
Gerardo Flores, Alyssa H. Smith, Julia Fukuyama et al.
Regret-Optimal Q-Learning with Low Cost for Single-Agent and Federated Reinforcement Learning
Haochen Zhang, Zhong Zheng, Lingzhou Xue
Optimal kernel regression bounds under energy-bounded noise
Amon Lahr, Johannes Köhler, Anna Scampicchio et al.
Sequential Multi-Agent Dynamic Algorithm Configuration
Chen Lu, Ke Xue, Lei Yuan et al.
PhysVLM-AVR: Active Visual Reasoning for Multimodal Large Language Models in Physical Environments
Weijie Zhou, Xuantang Xiong, Yi Peng et al.
Sketched Gaussian Mechanism for Private Federated Learning
Qiaobo Li, Zhijie Chen, Arindam Banerjee
Two-Steps Diffusion Policy for Robotic Manipulation via Genetic Denoising
Mateo Clémente, Leo Brunswic, Yang et al.
Fast Rate Bounds for Multi-Task and Meta-Learning with Different Sample Sizes
Hossein Zakerinia, Christoph Lampert
Robust Ego-Exo Correspondence with Long-Term Memory
Yijun Hu, Bing Fan, Xin Gu et al.
IF-Guide: Influence Function-Guided Detoxification of LLMs
Zachary Coalson, Juhan Bae, Nicholas Carlini et al.
Formal Models of Active Learning from Contrastive Examples
Farnam Mansouri, Hans Simon, Adish Singla et al.
Spike-timing-dependent Hebbian learning as noisy gradient descent
Niklas Dexheimer, Sascha Gaudlitz, Johannes Schmidt-Hieber
Opinion Maximization in Social Networks by Modifying Internal Opinions
Gengyu Wang, Runze Zhang, Zhongzhi Zhang
Robust Distortion-Free Watermark for Autoregressive Audio Generation Models
Yihan Wu, Georgios Milis, Ruibo Chen et al.
Abstract Counterfactuals for Language Model Agents
Edoardo Pona, Milad Kazemi Mehrabadi, Yali Du et al.
Mitigating the Privacy–Utility Trade-off in Decentralized Federated Learning via f-Differential Privacy
Xiang Li, Chendi Wang, Buxin Su et al.
JADE: Joint Alignment and Deep Embedding for Multi-Slice Spatial Transcriptomics
Yuanchuan Guo, Jun Liu, Huimin Cheng et al.
Unifying Symbolic Music Arrangement: Track-Aware Reconstruction and Structured Tokenization
Longshen Ou, Jingwei Zhao, Ziyu Wang et al.
Competitive Advantage Attacks to Decentralized Federated Learning
Yuqi Jia, Minghong Fang, Neil Gong
Rotary Masked Autoencoders are Versatile Learners
Uros Zivanovic, Serafina Di Gioia, Andre Scaffidi et al.
Inferring stochastic dynamics with growth from cross-sectional data
Stephen Zhang, Suryanarayana Maddu, Xiaojie Qiu et al.
On the Empirical Power of Goodness-of-Fit Tests in Watermark Detection
Weiqing He, Xiang Li, Tianqi Shang et al.
Task-Optimized Convolutional Recurrent Networks Align with Tactile Processing in the Rodent Brain
Trinity Chung, Yuchen Shen, Nathan Kong et al.
AutoOpt: A Dataset and a Unified Framework for Automating Optimization Problem Solving
Ankur Sinha, Shobhit Arora, Dhaval Pujara
From Shortcut to Induction Head: How Data Diversity Shapes Algorithm Selection in Transformers
Ryotaro Kawata, Yujin Song, Alberto Bietti et al.
Strategic Classification with Non-Linear Classifiers
Benyamin Trachtenberg, Nir Rosenfeld
Thumb on the Scale: Optimal Loss Weighting in Last Layer Retraining
Nathan Stromberg, Christos Thrampoulidis, Lalitha Sankar
Simulating Society Requires Simulating Thought
Chance Jiajie Li, Jiayi Wu, Zhenze MO et al.
Precise Asymptotics and Refined Regret of Variance-Aware UCB
Yingying Fan, Yuxuan Han, Jinchi Lv et al.
Sharp Gap-Dependent Variance-Aware Regret Bounds for Tabular MDPs
Shulun Chen, Runlong Zhou, Zihan Zhang et al.
STNet: Spectral Transformation Network for Solving Operator Eigenvalue Problem
Hong Wang, Yixuan Jiang, Jie Wang et al.
Automaton Constrained Q-Learning
Anastasios Manganaris, Vittorio Giammarino, Ahmed Qureshi
An Adaptive Algorithm for Bilevel Optimization on Riemannian Manifolds
Xu Shi, Rufeng Xiao, Rujun Jiang
Linear Transformers Implicitly Discover Unified Numerical Algorithms
Patrick Lutz, Aditya Gangrade, Hadi Daneshmand et al.
Minimax Adaptive Online Nonparametric Regression over Besov spaces
Paul Liautaud, Pierre Gaillard, Olivier Wintenberger
U-CAN: Unsupervised Point Cloud Denoising with Consistency-Aware Noise2Noise Matching
Junsheng Zhou, XingYu Shi, Haichuan Song et al.
3DID: Direct 3D Inverse Design for Aerodynamics with Physics-Aware Optimization
Yuze Hao, Linchao Zhu, Yi Yang
The Unseen Threat: Residual Knowledge in Machine Unlearning under Perturbed Samples
Hsiang Hsu, Pradeep Niroula, Zichang He et al.
Towards Pre-trained Graph Condensation via Optimal Transport
Yeyu Yan, Shuai Zheng, Wenjun Hui et al.
QiMeng-MuPa: Mutual-Supervised Learning for Sequential-to-Parallel Code Translation
Changxin Ke, Rui Zhang, Shuo Wang et al.
scSplit: Bringing Severity Cognizance to Image Decomposition in Fluorescence Microscopy
Ashesh Ashesh, Florian Jug
GSAlign: Geometric and Semantic Alignment Network for Aerial-Ground Person Re-Identification
Qiao Li, Jie Li, Yukang Zhang et al.
Structure Matters: Dynamic Policy Gradient
Sara Klein, Xiangyuan Zhang, Tamer Basar et al.
ML4CFD Competition: Results and Retrospective Analysis
Mouadh Yagoubi, David Danan, Milad LEYLI ABADI et al.
GUI Exploration Lab: Enhancing Screen Navigation in Agents via Multi-Turn Reinforcement Learning
Haolong Yan, Yeqing Shen, Xin Huang et al.
Learning to Add, Multiply, and Execute Algorithmic Instructions Exactly with Neural Networks
Artur Back de Luca, George Giapitzakis, Kimon Fountoulakis
Dataset Distillation for Pre-Trained Self-Supervised Vision Models
George Cazenavette, Antonio Torralba, Vincent Sitzmann
HyperET: Efficient Training in Hyperbolic Space for Multi-modal Large Language Models
Zelin Peng, Zhengqin Xu, Qingyang Liu et al.
Evolution of Information in Interactive Decision Making: A Case Study for Multi-Armed Bandits
Yuzhou Gu, Yanjun Han, Jian Qian
The Parameterized Complexity of Computing the VC-Dimension
Florent Foucaud, Harmender Gahlawat, Fionn Mc Inerney et al.
Attention-based clustering
Rodrigo Maulen Soto, Pierre Marion, Claire Boyer
On Evaluating LLM Alignment by Evaluating LLMs as Judges
Yixin Liu, Pengfei Liu, Arman Cohan
C3Po: Cross-View Cross-Modality Correspondence by Pointmap Prediction
Kuan Wei Huang, Brandon Li, Bharath Hariharan et al.
VERA: Variational Inference Framework for Jailbreaking Large Language Models
Anamika Lochab, Lu Yan, Patrick Pynadath et al.
DUAL: Learning Diverse Kernels for Aggregated Two-sample and Independence Testing
Zhijian Zhou, Xunye Tian, Liuhua Peng et al.
Evaluating LLMs in Open-Source Games
Swadesh Sistla, Max Kleiman-Weiner
Asymmetric Duos: Sidekicks Improve Uncertainty
Tim G. Zhou, Evan Shelhamer, Geoff Pleiss
Feature Distillation is the Better Choice for Model-Heterogeneous Federated Learning
Yichen Li, Xiuying Wang, Wenchao Xu et al.
Unveiling Chain of Step Reasoning for Vision-Language Models with Fine-grained Rewards
Honghao Chen, Xingzhou Lou, Xiaokun Feng et al.
CoFFT: Chain of Foresight-Focus Thought for Visual Language Models
Xinyu Zhang, Yuxuan Dong, Lingling Zhang et al.
Learning Relative Gene Expression Trends from Pathology Images in Spatial Transcriptomics
Kazuya Nishimura, Haruka Hirose, Ryoma Bise et al.
Contact Map Transfer with Conditional Diffusion Model for Generalizable Dexterous Grasp Generation
Yiyao Ma, Kai Chen, Kexin ZHENG et al.
Spurious-Aware Prototype Refinement for Reliable Out-of-Distribution Detection
Reihaneh Zohrabi, Hosein Hasani, Mahdieh Soleymani et al.
Linear Differential Vision Transformer: Learning Visual Contrasts via Pairwise Differentials
Yifan Pu, Jixuan Ying, Qixiu Li et al.
Unleashing Foundation Vision Models: Adaptive Transfer for Diverse Data-Limited Scientific Domains
Qiankun Li, Feng He, Huabao Chen et al.
Mitigating Semantic Collapse in Partially Relevant Video Retrieval
WonJun Moon, MinSeok Jung, Gilhan Park et al.
Image Token Matters: Mitigating Hallucination in Discrete Tokenizer-based Large Vision-Language Models via Latent Editing
Weixing Wang, Zifeng Ding, Jindong Gu et al.
Yggdrasil: Bridging Dynamic Speculation and Static Runtime for Latency-Optimal Tree-Based LLM Decoding
Yue Guan, Changming Yu, Shihan Fang et al.
NestedFP: High-Performance, Memory-Efficient Dual-Precision Floating Point Support for LLMs
Haeun Lee, Omin Kwon, Yeonhong Park et al.
VaMP: Variational Multi-Modal Prompt Learning for Vision-Language Models
Silin Cheng, Kai Han
DuetGraph: Coarse-to-Fine Knowledge Graph Reasoning with Dual-Pathway Global-Local Fusion
Jin Li, Zezhong Ding, Xike Xie
Rescaled Influence Functions: Accurate Data Attribution in High Dimension
Ittai Rubinstein, Samuel Hopkins
Cue3D: Quantifying the Role of Image Cues in Single-Image 3D Generation
Xiang Li, Zirui Wang, Zixuan Huang et al.
AugGen: Synthetic Augmentation using Diffusion Models Can Improve Recognition
Parsa Rahimi, Damien Teney, Sébastien Marcel
HyPINO: Multi-Physics Neural Operators via HyperPINNs and the Method of Manufactured Solutions
Rafael Bischof, Michal Piovarci, Michael Kraus et al.
Improved Regret and Contextual Linear Extension for Pandora's Box and Prophet Inequality
Junyan Liu, Ziyun Chen, Kun Wang et al.
Strategic Cost Selection in Participatory Budgeting
Piotr Faliszewski, Łukasz Janeczko, Andrzej Kaczmarczyk et al.
Continual Optimization with Symmetry Teleportation for Multi-Task Learning
Zhipeng Zhou, Ziqiao Meng, Pengcheng Wu et al.
Finite Sample Analysis of Linear Temporal Difference Learning with Arbitrary Features
Zixuan Xie, Xinyu Liu, Rohan Chandra et al.
Masked Diffusion Models as Energy Minimization
Sitong Chen, Shen Nie, Jiacheng Sun et al.
Joint Hierarchical Representation Learning of Samples and Features via Informed Tree-Wasserstein Distance
Ya-Wei Eileen Lin, Ronald Coifman, Gal Mishne et al.
Amortized Variational Transdimensional Inference
Laurence Davies, Daniel MacKinlay, Rafael Oliveira et al.
UrbanIng-V2X: A Large-Scale Multi-Vehicle, Multi-Infrastructure Dataset Across Multiple Intersections for Cooperative Perception
Karthikeyan Chandra Sekaran, Markus Geisler, Dominik Rößle et al.
UGM2N: An Unsupervised and Generalizable Mesh Movement Network via M-Uniform Loss
Zhichao Wang, Xinhai Chen, Qinglin Wang et al.
ModuLM: Enabling Modular and Multimodal Molecular Relational Learning with Large Language Models
Zhuo Chen, YIZHEN ZHENG, Huan Yee Koh et al.
Unlabeled Data Improves Fine-Grained Image Zero-shot Classification with Multimodal LLMs
Yunqi Hong, Sohyun An, Andrew Bai et al.
Effects of Dropout on Performance in Long-range Graph Learning Tasks
Jasraj Singh, Keyue Jiang, Brooks Paige et al.
Manipulating 3D Molecules in a Fixed-Dimensional E(3)-Equivariant Latent Space
Zitao Chen, Yinjun Jia, Zitong Tian et al.
Non-Adaptive Adversarial Face Generation
Sunpill Kim, Seunghun Paik, Chanwoo Hwang et al.
Directed-Tokens: A Robust Multi-Modality Alignment Approach to Large Language-Vision Models
Thanh-Dat Truong, Huu-Thien Tran, Tran Son et al.
On topological descriptors for graph products
Mattie Ji, Amauri Souza, Vikas Garg
On the Universal Near Optimality of Hedge in Combinatorial Settings
Zhiyuan Fan, Arnab Maiti, Lillian Ratliff et al.
Native Segmentation Vision Transformers
Guillem Brasó, Aljosa Osep, Laura Leal-Taixé
Personalized Bayesian Federated Learning with Wasserstein Barycenter Aggregation
Ting Wei, Biao Mei, Junliang Lyu et al.
Hyperphantasia: A Benchmark for Evaluating the Mental Visualization Capabilities of Multimodal LLMs
Mohammad Shahab Sepehri, Berk Tinaz, Zalan Fabian et al.
Learning Interestingness in Automated Mathematical Theory Formation
George Tsoukalas, Rahul Saha, Amitayush Thakur et al.
On Universality Classes of Equivariant Networks
Marco Pacini, Gabriele Santin, Bruno Lepri et al.
Breaking the Gradient Barrier: Unveiling Large Language Models for Strategic Classification
Xinpeng Lv, Yunxin Mao, Haoxuan Li et al.
Graph Your Own Prompt
Xi Ding, Lei Wang, Piotr Koniusz et al.
Tight analyses of first-order methods with error feedback
Daniel Berg Thomsen, Adrien Taylor, Aymeric Dieuleveut
DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling
Yuang Ai, Qihang Fan, Xuefeng Hu et al.
DPA: A one-stop metric to measure bias amplification in classification datasets
Bhanu Tokas, Rahul Nair, Hannah Kerner
Degrees of Freedom for Linear Attention: Distilling Softmax Attention with Optimal Feature Efficiency
Naoki Nishikawa, Rei Higuchi, Taiji Suzuki
One SPACE to Rule Them All: Jointly Mitigating Factuality and Faithfulness Hallucinations in LLMs
Pengbo Wang, Chaozhuo Li, Chenxu Wang et al.
Fast and Fluent Diffusion Language Models via Convolutional Decoding and Rejective Fine-tuning
Yeongbin Seo, Dongha Lee, Jaehyung Kim et al.
Scaling Up Active Testing to Large Language Models
Gabrielle Berrada, Jannik Kossen, Freddie Bickford Smith et al.
OpenBox: Annotate Any Bounding Boxes in 3D
In-Jae Lee, Mungyeom Kim, Kwonyoung Ryu et al.
Comparison requires valid measurement: Rethinking attack success rate comparisons in AI red teaming
Alex Chouldechova, A. Feder Cooper, Solon Barocas et al.
Channel Matters: Estimating Channel Influence for Multivariate Time Series
Muyao Wang, Zeke Xie, Bo Chen et al.
Point or Line? Using Line-based Representation for Panoptic Symbol Spotting in CAD Drawings
Xingguang Wei, Haomin Wang, Shenglong Ye et al.
EA3D: Online Open-World 3D Object Extraction from Streaming Videos
Xiaoyu Zhou, Jingqi Wang, Yuang Jia et al.
Non-Markovian Discrete Diffusion with Causal Language Models
Yangtian Zhang, Sizhuang He, Daniel Levine et al.
Online Segment Any 3D Thing as Instance Tracking
Hanshi Wang, Cai Zijian, Jin Gao et al.
From Objects to Anywhere: A Holistic Benchmark for Multi-level Visual Grounding in 3D Scenes
Tianxu Wang, Zhuofan Zhang, Ziyu Zhu et al.
Eluder dimension: localise it!
Alireza Bakhtiari, Alex Ayoub, Samuel Robertson et al.
PASS: Path-selective State Space Model for Event-based Recognition
Jiazhou Zhou, Kanghao Chen, Lei Zhang et al.