Most Cited NEURIPS "monocular video input" Papers
5,858 papers found • Page 13 of 30
Conference
YOLOv12: Attention-Centric Real-Time Object Detectors
Yunjie Tian, Qixiang Ye, DAVID DOERMANN
One Token Embedding Is Enough to Deadlock Your Large Reasoning Model
Mohan Zhang, Yihua Zhang, Jinghan Jia et al.
Boosting Resilience of Large Language Models through Causality-Driven Robust Optimization
Xiaoling Zhou, Mingjie Zhang, Zhemg Lee et al.
NeuroH-TGL: Neuro-Heterogeneity Guided Temporal Graph Learning Strategy for Brain Disease Diagnosis
Shengrong Li, Qi Zhu, Chunwei Tian et al.
REOrdering Patches Improves Vision Models
Declan Kutscher, David Chan, Yutong Bai et al.
Learning Parameterized Skills from Demonstrations
Vedant Gupta, Haotian Fu, Calvin Luo et al.
Investigating and Mitigating Catastrophic Forgetting in Medical Knowledge Injection through Internal Knowledge Augmentation Learning
Yuxuan Zhou, Xien Liu, Xiao Zhang et al.
Cloud4D: Estimating Cloud Properties at a High Spatial and Temporal Resolution
Jacob Lin, Edward Gryspeerdt, Ronald Clark
Tracking and Understanding Object Transformations
Yihong Sun, Xinyu Yang, Jennifer Sun et al.
A-Mem: Agentic Memory for LLM Agents
Wujiang Xu, Zujie Liang, Kai Mei et al.
Adaptive Cannistraci-Hebb Network Automata Modelling of Complex Networks for Path-based Link Prediction
Jialin Zhao, Alessandro Muscoloni, Umberto Michieli et al.
Meta CLIP 2: A Worldwide Scaling Recipe
Yung-Sung Chuang, Yang Li, Dong Wang et al.
MMOT: The First Challenging Benchmark for Drone-based Multispectral Multi-Object Tracking
Tianhao Li, Tingfa Xu, Ying Wang et al.
On Learning Verifiers and Implications to Chain-of-Thought Reasoning
Maria-Florina Balcan, Avrim Blum, Zhiyuan Li et al.
Robustly Learning Monotone Single-Index Models
Puqian Wang, Nikos Zarifis, Ilias Diakonikolas et al.
Distributed mediation analysis with communication efficiency
Shaomin Li
Edit Flows: Variable Length Discrete Flow Matching with Sequence-Level Edit Operations
Marton Havasi, Brian Karrer, Itai Gat et al.
Revitalizing SVD for Global Covariance Pooling: Halley’s Method to Overcome Over-Flattening
Jiawei Gu, Ziyue Qiao, Xinming Li et al.
Self-Boost via Optimal Retraining: An Analysis via Approximate Message Passing
Adel Javanmard, Rudrajit Das, Alessandro Epasto et al.
Structured Initialization for Vision Transformers
Jianqiao Zheng, Xueqian Li, Hemanth Saratchandran et al.
SpectraLDS: Provable Distillation for Linear Dynamical Systems
Devan Shah, Shlomo Fortgang, Sofiia Druchyna et al.
FLiP: Towards Comprehensive and Reliable Evaluation of Federated Prompt Learning
Dongping Liao, Xitong Gao, Cheng-Zhong Xu
On Hierarchies of Fairness Notions in Cake Cutting: From Proportionality to Super Envy-Freeness
Arnav Mehra, Alexandros Psomas
STAR: Spatial-Temporal Tracklet Matching for Multi-Object Tracking
Xuewei Bai, Yongcai Wang, Deying Li et al.
BrainFlow: A Holistic Pathway of Dynamic Neural System on Manifold
Zhixuan Zhou, Tingting Dan, Guorong Wu
Towards General Modality Translation with Contrastive and Predictive Latent Diffusion Bridge
Nimrod Berman, Omkar Joglekar, Eitan Kosman et al.
Learning from Delayed Feedback in Games via Extra Prediction
Yuma Fujimoto, Kenshi Abe, Kaito Ariu
ArchPower: Dataset for Architecture-Level Power Modeling of Modern CPU Design
Qijun Zhang, Yao Lu, Mengming Li et al.
ZEUS: Zero-shot Embeddings for Unsupervised Separation of Tabular Data
Patryk Marszałek, Tomasz Kuśmierczyk, Witold Wydmański et al.
World-aware Planning Narratives Enhance Large Vision-Language Model Planner
Junhao Shi, Zhaoye Fei, Siyin Wang et al.
Automatic Auxiliary Task Selection and Adaptive Weighting Boost Molecular Property Prediction
Zhiqiang Zhong, Davide Mottin
MoESD: Unveil Speculative Decoding's Potential for Accelerating Sparse MoE
Zongle Huang, Lei Zhu, ZongYuan Zhan et al.
TAPVid-360: Tracking Any Point in 360 from Narrow Field of View Video
Finlay Hudson, James Gardner, William Smith
Consistency of the $k_n$-nearest neighbor rule under adaptive sampling
Robi Bhattacharjee, Geelon So, Sanjoy Dasgupta
Enhancing Privacy in Multimodal Federated Learning with Information Theory
Tianzhe Xiao, Yichen Li, Yining Qi et al.
THD-BAR: Topology Hierarchical Derived Brain Autoregressive Modeling for EEG Generic Representations
Wenchao Yang, Weidong Yan, Wenkang Liu et al.
Hybrid-Collaborative Augmentation and Contrastive Sample Adaptive-Differential Awareness for Robust Attributed Graph Clustering
Tianxiang Zhao, Youqing Wang, Jinlu Wang et al.
Computational Efficiency under Covariate Shift in Kernel Ridge Regression
Andrea Della Vecchia, Arnaud Mavakala Watusadisi, Ernesto De Vito et al.
Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback
Wei Shen, Guanlin Liu, Yu Yue et al.
Adversarial Paraphrasing: A Universal Attack for Humanizing AI-Generated Text
Yize Cheng, Vinu Sankar Sadasivan, Mehrdad Saberi et al.
Think before Recommendation: Autonomous Reasoning-enhanced Recommender
Xiaoyu Kong, Junguang Jiang, Bin Liu et al.
Non-Stationary Structural Causal Bandits
Yeahoon Kwon, Yesong Choe, Soungmin Park et al.
MMTU: A Massive Multi-Task Table Understanding and Reasoning Benchmark
Junjie Xing, Yeye He, Mengyu Zhou et al.
Conditional Representation Learning for Customized Tasks
Honglin Liu, Chao Sun, Peng Hu et al.
RrED: Black-box Unsupervised Domain Adaptation via Rectifying-reasoning Errors of Diffusion
Yuwu Lu, Chunzhi Liu
Normalizing Flows are Capable Models for Continuous Control
Raj Ghugare, Benjamin Eysenbach
Statistical Guarantees for High-Dimensional Stochastic Gradient Descent
Jiaqi Li, Zhipeng Lou, Johannes Schmidt-Hieber et al.
Quantifying Generalisation in Imitation Learning
Nathan Gavenski, Odinaldo Rodrigues
Self-Supervised Learning of Motion Concepts by Optimizing Counterfactuals
Stefan Stojanov, David Wendt, Seungwoo Kim et al.
Continuous Soft Actor-Critic: An Off-Policy Learning Method Robust to Time Discretization
Huimin Han, Shaolin Ji
Stochastic Optimization in Semi-Discrete Optimal Transport: Convergence Analysis and Minimax Rate
Ferdinand Genans, Antoine Godichon-Baggioni, François-Xavier Vialard et al.
STaRFormer: Semi-Supervised Task-Informed Representation Learning via Dynamic Attention-Based Regional Masking for Sequential Data
Maximilian Forstenhäusler, Daniel Külzer, Christos Anagnostopoulos et al.
What Expressivity Theory Misses: Message Passing Complexity for GNNs
Niklas Kemper, Tom Wollschläger, Stephan Günnemann
NerfBaselines: Consistent and Reproducible Evaluation of Novel View Synthesis Methods
Jonas Kulhanek, Torsten Sattler
FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving
Shuang Zeng, Xinyuan Chang, Mengwei Xie et al.
STARC-9: A Large-scale Dataset for Multi-Class Tissue Classification for CRC Histopathology
Barathi Subramanian, Rathinaraja Jeyaraj, Mitchell Peterson et al.
3D Gaussian Splatting based Scene-independent Relocalization with Unidirectional and Bidirectional Feature Fusion
Junyi Wang, Yuze Wang, Wantong Duan et al.
Split conformal classification with unsupervised calibration
Santiago Mazuelas
CQ-DINO: Mitigating Gradient Dilution via Category Queries for Vast Vocabulary Object Detection
Zhichao Sun, Huazhang Hu, Yidong Ma et al.
ARIA: Training Language Agents with Intention-driven Reward Aggregation
Ruihan Yang, yikai zhang, Aili Chen et al.
Dual Alignment Framework for Few-shot Learning with Inter-Set and Intra-Set Shifts
Siyang Jiang, Rui Fang, Hsi-Wen Chen et al.
Adaptive LoRA Experts Allocation and Selection for Federated Fine-Tuning
Lei Wang, Jieming Bian, Letian Zhang et al.
When Can Model-Free Reinforcement Learning be Enough for Thinking?
Josiah Hanna, Nicholas Corrado
Out-of-Distribution Generalized Graph Anomaly Detection with Homophily-aware Environment Mixup
Sibo Tian, Xin Wang, Zeyang Zhang et al.
Measure-Theoretic Anti-Causal Representation Learning
Arman Behnam, Binghui Wang
A Private Approximation of the 2nd-Moment Matrix of Any Subsamplable Input
Bar Mahpud, Or Sheffet
Contrastive Learning with Data Misalignment: Feature Purity, Training Dynamics and Theoretical Generalization Guarantees
Jiawei Sun, Shuai Zhang, Hongkang Li et al.
A Minimalistic Unified Framework for Incremental Learning across Image Restoration Tasks
Xiaoxuan Gong, Jie Ma
MoniTor: Exploiting Large Language Models with Instruction for Online Video Anomaly Detection
shengtian yang, Yue Feng, Yingshi Liu et al.
OpenGU: A Comprehensive Benchmark for Graph Unlearning
Bowen Fan, Yuming Ai, Xunkai Li et al.
Beyond Scalar Rewards: An Axiomatic Framework for Lexicographic MDPs
Mehran Shakerinava, Siamak Ravanbakhsh, Adam Oberman
On the $O(\frac{\sqrt{d}}{K^{1/4}})$ Convergence Rate of AdamW Measured by $\ell_1$ Norm
Huan Li, Yiming Dong, Zhouchen Lin
Joint‑Embedding vs Reconstruction: Provable Benefits of Latent Space Prediction for Self‑Supervised Learning
Hugues Van Assel, Mark Ibrahim, Tommaso Biancalani et al.
Optimal Mistake Bounds for Transductive Online Learning
Zachary Chase, Steve Hanneke, Shay Moran et al.
PathVQ: Reforming Computational Pathology Foundation Model for Whole Slide Image Analysis via Vector Quantization
Honglin Li, Zhongyi Shui, Yunlong Zhang et al.
NoBOOM: Chemical Process Datasets for Industrial Anomaly Detection
Dennis Wagner, Fabian Hartung, Justus Arweiler et al.
Test-Time Spectrum-Aware Latent Steering for Zero-Shot Generalization in Vision-Language Models
Konstantinos Dafnis, Dimitris Metaxas
SPOT-Trip: Dual-Preference Driven Out-of-Town Trip Recommendation
Yinghui Liu, Hao Miao, Guojiang Shen et al.
Robust Satisficing Gaussian Process Bandits Under Adversarial Attacks
Artun Saday, Yaşar Cahit Yıldırım, Cem Tekin
Struct2D: A Perception-Guided Framework for Spatial Reasoning in MLLMs
Fangrui Zhu, Hanhui Wang, Yiming Xie et al.
Don’t Trade Off Safety: Diffusion Regularization for Constrained Offline RL
Junyu guo, Zhi Zheng, Donghao Ying et al.
How to Scale Second-Order Optimization
Charlie Chen, Shikai Qiu, Hoang Phan et al.
Causal-R: A Causal-Reasoning Geometry Problem Solver for Optimized Solution Exploration
Wenjun Wu, Lingling Zhang, Bo Zhao et al.
Negative Feedback Really Matters: Signed Dual-Channel Graph Contrastive Learning Framework for Recommendation
Leqi Zheng, Chaokun Wang, Zixin Song et al.
VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforcement Learning
Qiuchen Wang, Ruixue Ding, Yu Zeng et al.
FairDICE: Fairness-Driven Offline Multi-Objective Reinforcement Learning
Woosung Kim, Jinho Lee, Jongmin Lee et al.
Accelerated Distance-adaptive Methods for Hölder Smooth and Convex Optimization
Yijin Ren, Haifeng Xu, Qi Deng
Complexity Scaling Laws for Neural Models using Combinatorial Optimization
Lowell Weissman, Michael Krumdick, A. Abbott
RayFusion: Ray Fusion Enhanced Collaborative Visual Perception
Shaohong Wang, Lu Bin, Xinyu Xiao et al.
Synergistic Tensor and Pipeline Parallelism
Mengshi Qi, Jiaxuan Peng, Jie Zhang et al.
Non-Singularity of the Gradient Descent Map for Neural Networks with Piecewise Analytic Activations
Alexandru Crăciun, Debarghya Ghoshdastidar
Imbalances in Neurosymbolic Learning: Characterization and Mitigating Strategies
Efthymia Tsamoura, Kaifu Wang, Dan Roth
Variance-Aware Feel-Good Thompson Sampling for Contextual Bandits
Xuheng Li, Quanquan Gu
Place Cells as Multi-Scale Position Embeddings: Random Walk Transition Kernels for Path Planning
Minglu Zhao, Dehong Xu, Deqian Kong et al.
DepthVanish: Optimizing Adversarial Interval Structures for Stereo-Depth-Invisible Patches
Yun Xing, Yue Cao, Nhat Chung et al.
Scaling Language-centric Omnimodal Representation Learning
Chenghao Xiao, Hou Pong (Ken) Chan, Hao Zhang et al.
Understanding the Generalization of Stochastic Gradient Adam in Learning Neural Networks
Xuan Tang, Han Zhang, Yuan Cao et al.
T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning
UniLumos: Fast and Unified Image and Video Relighting with Physics-Plausible Feedback
Pengwei Liu, Hangjie Yuan, Bo Dong et al.
NaDRO: Leveraging Dual-Reward Strategies for LLMs Training on Noisy Data
Haolong Qian, Xianliang Yang, Ling Zhang et al.
Controlling Thinking Speed in Reasoning Models
Zhengkai Lin, Zhihang Fu, Ze Chen et al.
ForceFM: Enhancing Protein-Ligand Predictions through Force-Guided Flow Matching
HUANLEI GUO, Song LIU, Bingyi Jing
Restage4D: Reanimating Deformable 3D Reconstruction from a Single Video
Jixuan He, Chieh Lin, Lu Qi et al.
Geometric Algorithms for Neural Combinatorial Optimization with Constraints
Nikolaos Karalias, Akbar Rafiey, Yifei Xu et al.
EDBench: Large-Scale Electron Density Data for Molecular Modeling
Hongxin Xiang, Ke Li, Mingquan Liu et al.
C3PO: Optimized Large Language Model Cascades with Probabilistic Cost Constraints for Reasoning
Antonios Valkanas, Soumyasundar Pal, Pavel Rumiantsev et al.
Sequence Modeling with Spectral Mean Flows
Jinwoo Kim, Max Beier, Petar Bevanda et al.
Kuramoto Orientation Diffusion Models
Yue Song, Andy Keller, Sevan Brodjian et al.
Nearly Dimension-Independent Convergence of Mean-Field Black-Box Variational Inference
Kyurae Kim, Yian Ma, Trevor Campbell et al.
MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations
Wenxiang Guo, Changhao Pan, Zhiyuan Zhu et al.
Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training
Brian Bartoldson, Siddarth Venkatraman, James Diffenderfer et al.
BAM-ICL: Causal Hijacking In-Context Learning with Budgeted Adversarial Manipulation
Rui Chu, Bingyin Zhao, Hanling Jiang et al.
Enhanced Cyclic Coordinate Descent Methods for Elastic Net Penalized Linear Models
Yixiao Wang, Zishan Shao, Ting Jiang et al.
Where Graph Meets Heterogeneity: Multi-View Collaborative Graph Experts
Zhihao Wu, Jinyu Cai, Yunhe Zhang et al.
Fading to Grow: Growing Preference Ratios via Preference Fading Discrete Diffusion for Recommendation
Guoqing Hu, An Zhang, Shuchang Liu et al.
ViewPoint: Panoramic Video Generation with Pretrained Diffusion Models
Zixun Fang, Kai Zhu, Zhiheng Liu et al.
Human-assisted Robotic Policy Refinement via Action Preference Optimization
Wenke Xia, Yichu Yang, Hongtao Wu et al.
You Only Spectralize Once: Taking a Spectral Detour to Accelerate Graph Neural Network
Yi Li, Zhichun Guo, Guanpeng Li et al.
The Graphon Limit Hypothesis: Understanding Neural Network Pruning via Infinite Width Analysis
Hoang Pham, The Anh Ta, Tom Jacobs et al.
Convergence of the Gradient Flow for Shallow ReLU Networks on Weakly Interacting Data
Léo Dana, Loucas Pillaud-Vivien, Francis Bach
Uniform Wrappers: Bridging Concave to Quadratizable Functions in Online Optimization
Mohammad Pedramfar, Christopher Quinn, Vaneet Aggarwal
LLM Meets Diffusion: A Hybrid Framework for Crystal Material Generation
Subhojyoti Khastagir, KISHALAY DAS, Pawan Goyal et al.
3EED: Ground Everything Everywhere in 3D
Rong Li, Yuhao Dong, Tianshuai Hu et al.
Causality Meets Locality: Provably Generalizable and Scalable Policy Learning for Networked Systems
Hao Liang, shuqing shi, Yudi Zhang et al.
Efficient Data Selection at Scale via Influence Distillation
Mahdi Nikdan, Vincent Cohen-Addad, Dan Alistarh et al.
LoRO: Real-Time on-Device Secure Inference for LLMs via TEE-Based Low Rank Obfuscation
Gaojian Xiong, Yu Sun, Jianhua Liu et al.
FedMGP: Personalized Federated Learning with Multi-Group Text-Visual Prompts
Weihao Bo, Yanpeng Sun, Yu Wang et al.
EVOREFUSE: Evolutionary Prompt Optimization for Evaluation and Mitigation of LLM Over-Refusal to Pseudo-Malicious Instructions
Xiaorui Wu, Fei Li, Xiaofeng Mao et al.
OCN: Effectively Utilizing Higher-Order Common Neighbors for Better Link Prediction
Juntong Wang, Xiyuan Wang, Muhan Zhang
Torch-Uncertainty: Deep Learning Uncertainty Quantification
Adrien Lafage, Olivier Laurent, Firas Gabetni et al.
A Temporal Difference Method for Stochastic Continuous Dynamics
Haruki Settai, Naoya Takeishi, Takehisa Yairi
Rethinking Fair Federated Learning from Parameter and Client View
Kaiqi Guan, Wenke Huang, Xianda Guo et al.
Less is More: Local Intrinsic Dimensions of Contextual Language Models
Benjamin Matthias Ruppik, Julius von Rohrscheidt, Carel van Niekerk et al.
Weaver: Shrinking the Generation-Verification Gap by Scaling Compute for Verification
Jon Saad-Falcon, Estefany Kelly Buchanan, Mayee Chen et al.
Tree Ensemble Explainability through the Hoeffding Functional Decomposition and TreeHFD Algorithm
Clément Bénard
Equivariant Eikonal Neural Networks: Grid-Free, Scalable Travel-Time Prediction on Homogeneous Spaces
Alejandro García-Castellanos, David Wessels, Nicky J. van den Berg et al.
When Models Don’t Collapse: On the Consistency of Iterative MLE
Daniel Barzilai, Ohad Shamir
Near-Optimal Regret-Queue Length Tradeoff in Online Learning for Two-Sided Markets
Zixian Yang, Sushil Varma, Lei Ying
Amplifying Prominent Representations in Multimodal Learning via Variational Dirichlet Process
Tsai Hor Chan, Feng Wu, Yihang Chen et al.
Frequency-Aware Token Reduction for Efficient Vision Transformer
DongJae Lee, Jiwan Hur, Jaehyun Choi et al.
Machine Unlearning under Overparameterization
Jacob Block, Aryan Mokhtari, Sanjay Shakkottai
What is Your Data Worth to GPT? LLM-Scale Data Valuation with Influence Functions
Sang Choe, Hwijeen Ahn, Juhan Bae et al.
Stop the Nonconsensual Use of Nude Images in Research
Princessa Cintaqia, Arshia Arya, Elissa Redmiles et al.
Class conditional conformal prediction for multiple inputs by p-value aggregation
Jean-Baptiste Fermanian, Mohamed Hebiri, Joseph Salmon
TreeGen: A Bayesian Generative Model for Hierarchies
Marcel Kollovieh, Nils Fleischmann, Filippo Guerranti et al.
Gaussian Regression-Driven Tensorized Incomplete Multi-View Clustering with Dual Manifold Regularization
Zhenhao Zhong, Zhibin Gu, Pengpeng Yang et al.
FedRW: Efficient Privacy-Preserving Data Reweighting for Enhancing Federated Learning of Language Models
Pukang Ye, Luo Junwei, Jiachen Shen et al.
Convergence Rates of Constrained Expected Improvement
Haowei Wang, Jingyi Wang, Zhongxiang Dai et al.
A CLT for Polynomial GNNs on Community-Based Graphs
Luciano Vinas, Arash Amini
Videos are Sample-Efficient Supervisions: Behavior Cloning from Videos via Latent Representations
Xin Liu, Haoran Li, Dongbin Zhao
Guiding LLM Decision-Making with Fairness Reward Models
Zara Hall, Melanie Subbiah, Thomas Zollo et al.
ClinBench: A Standardized Multi-Domain Framework for Evaluating Large Language Models in Clinical Information Extraction
Ismael Villanueva Miranda, Zifan Gu, Donghan Yang et al.
Generalized and Invariant Single-Neuron In-Vivo Activity Representation Learning
Wei Wu, Yuxing Lu, Zhengrui Guo et al.
Crucible: Quantifying the Potential of Control Algorithms through LLM Agents
Lianchen Jia, Chaoyang Li, Qian Houde et al.
Transductive Conformal Inference for Full Ranking
Jean-Baptiste Fermanian, Pierre Humbert, Gilles Blanchard
ESCA: Contextualizing Embodied Agents via Scene-Graph Generation
Jiani Huang, Amish Sethi, Matthew Kuo et al.
Assignments for Congestion-Averse Agents: Seeking Competitive and Envy-Free Solutions
Jiehua Chen, Jiong Guo, Yinghui Wen
Chain-of-Model Learning for Language Model
Xiaohua Wang, Kaitao Song, Xu Tan et al.
Mind the Gap: Removing the Discretization Gap in Differentiable Logic Gate Networks
Shakir Yousefi, Andreas Plesner, Till Aczel et al.
FreqExit: Enabling Early-Exit Inference for Visual Autoregressive Models via Frequency-Aware Guidance
Ying Li, Chengfei Lyu, Huan Wang
StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant
Haibo Wang, Bo Feng, Zhengfeng Lai et al.
Finite-Time Analysis of Stochastic Nonconvex Nonsmooth Optimization on the Riemannian Manifolds
Emre Sahinoglu, Youbang Sun, Shahin Shahrampour
Robust LLM Alignment via Distributionally Robust Direct Preference Optimization
Zaiyan Xu, Sushil Vemuri, Kishan Panaganti et al.
Quantifying Task-relevant Similarities in Representations Using Decision Variable Correlations
Yu (Eric) Qian, Wilson Geisler, Xue-Xin Wei
OmniFC: Rethinking Federated Clustering via Lossless and Secure Distance Reconstruction
Jie Yan, Jing Liu, Zhong-Yuan Zhang
Balancing Positive and Negative Classification Error Rates in Positive-Unlabeled Learning
Ximing Li, Yuanchao Dai, Bing Wang et al.
COGNAC: Cooperative Graph-based Networked Agent Challenges for Multi-Agent Reinforcement Learning
Jules Sintes, Ana Busic
TV-Rec: Time-Variant Convolutional Filter for Sequential Recommendation
Yehjin Shin, Jeongwhan Choi, Seojin Kim et al.
Towards Multiscale Graph-based Protein Learning with Geometric Secondary Structural Motifs
Shih-Hsin Wang, Yuhao Huang, Taos Transue et al.
ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback
Litao Guo, Xinli Xu, Luozhou Wang et al.
Understanding and Enhancing Message Passing on Heterophilic Graphs via Compatibility Matrix
Zhuonan Zheng, Yuanchen Bei, Zhiyao Zhou et al.
Diversity-oriented Deep Multi-modal Clustering
Wang Yanzheng, Xin Yang, Yujun Wang et al.
Combinatorial Ski Rental Problem: Robust and Learning-Augmented Algorithms
Ziwei Li, Bo Sun, Zhiqiu Zhang et al.
MVSMamba: Multi-View Stereo with State Space Model
Jianfei Jiang, Qiankun Liu, Hongyuan Liu et al.
Alchemist: Turning Public Text-to-Image Data into Generative Gold
Valerii Startsev, Alexander Ustyuzhanin, Alexey Kirillov et al.
Repurposing AlphaFold3-like Protein Folding Models for Antibody Sequence and Structure Co-design
Nianzu Yang, Songlin Jiang, Jian Ma et al.
IMPACT: Irregular Multi-Patch Adversarial Composition Based on Two‑Phase Optimization
Zenghui Yang, Xingquan Zuo, Hai Huang et al.
Adaptive Sigmoid Clipping for Balancing the Direction–Magnitude Mismatch Trade-off in Differentially Private Learning
Faeze Moradi Kalarde, Ali Bereyhi, Ben Liang et al.
Robust Minimax Boosting with Performance Guarantees
Santiago Mazuelas, Veronica Alvarez
IntrinsiX: High-Quality PBR Generation using Image Priors
Peter Kocsis, Lukas Höllein, Matthias Niessner
Spatiotemporal Consensus with Scene Prior for Unsupervised Domain Adaptive Person Search
Yimin Jiang, Huibing Wang, Jinjia peng
Impromptu VLA: Open Weights and Open Data for Driving Vision-Language-Action Models
Haohan Chi, Huan-ang Gao, Ziming Liu et al.
Recursive Inference Scaling: A Winning Path to Scalable Inference in Language and Multimodal Systems
Ibrahim Alabdulmohsin, Xiaohua Zhai
The Lighthouse of Language: Enhancing LLM Agents via Critique-Guided Improvement
Ruihan Yang, Fanghua Ye, Jian Li et al.
Axial Neural Networks for Dimension-Free Foundation Models
Hyunsu Kim, Jonggeon Park, Joan Bruna et al.
Diffusion Adaptive Text Embedding for Text-to-Image Diffusion Models
Byeonghu Na, Minsang Park, Gyuwon Sim et al.
MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems
Yinsicheng Jiang, Yao Fu, Yeqi Huang et al.
Heterogeneous Graph Transformers for Simultaneous Mobile Multi-Robot Task Allocation and Scheduling under Temporal Constraints
Batuhan Altundas, Shengkang Chen, Shivika Singh et al.
BlockDecoder: Boosting ASR Decoders with Context and Merger Modules
Darshan Prabhu, Preethi Jyothi
Restoring Pruned Large Language Models via Lost Component Compensation
Zijian Feng, Hanzhang Zhou, Zixiao Zhu et al.
Learn and Ensemble Bridge Adapters for Multi-domain Task Incremental Learning
Ziqi Gu, Chunyan Xu, Wenxuan Fang et al.
All that structure matches does not glitter
Maya Martirossyan, Thomas Egg, Philipp Höllmer et al.
ForgerySleuth: Empowering Multimodal Large Language Models for Image Manipulation Detection
Zhihao Sun, Haoran Jiang, Haoran Chen et al.
Physics-informed Reduced Order Modeling of Time-dependent PDEs via Differentiable Solvers
Nima Hosseini Dashtbayaz, Hesam Salehipour, Adrian Butscher et al.
SHGR: A Generalized Maximal Correlation Coefficient
Samuel Stocksieker, Denys Pommeret
Surface-Aware Feed-Forward Quadratic Gaussian for Frame Interpolation with Large Motion
Zaoming Yan, Yaomin Huang, Pengcheng Lei et al.
CoCoA: A Minimum Bayes Risk Framework Bridging Confidence and Consistency for Uncertainty Quantification in LLMs
Roman Vashurin, Maiya Goloburda, Albina Ilina et al.
Deep Value Benchmark: Measuring Whether Models Generalize Deep values or Shallow Preferences
Joshua Ashkinaze, Hua Shen, Saipranav Avula et al.
Exponential Convergence Guarantees for Iterative Markovian Fitting
Marta Gentiloni Silveri, Giovanni Conforti, Alain Durmus
Dynamic Diameter in High-Dimensions against Adaptive Adversary and Beyond
Kiarash Banihashem, Jeff Giliberti, Samira Goudarzi et al.