Most Cited 2025 "overthinking" Papers
22,274 papers found • Page 78 of 112
Conference
Neurons as Detectors of Coherent Sets in Sensory Dynamics
Joshua L Pughe-Sanford, Xuehao Ding, Jason Moore et al.
Understanding Museum Exhibits using Vision-Language Reasoning
Ada-Astrid Balauca, Sanjana Garai, Stefan Balauca et al.
Majority of the Bests: Improving Best-of-N via Bootstrapping
Amin Rakhsha, Kanika Madan, Tianyu Zhang et al.
Safety Depth in Large Language Models: A Markov Chain Perspective
Ching-Chia Kao, Chia-Mu Yu, Chun-Shien Lu et al.
Unlabeled Data Improves Fine-Grained Image Zero-shot Classification with Multimodal LLMs
Yunqi Hong, Sohyun An, Andrew Bai et al.
SketchFusion: Learning Universal Sketch Features through Fusing Foundation Models
Subhadeep Koley, Tapas Kumar Dutta, Aneeshan Sain et al.
Zero-shot Denoising via Neural Compression: Theoretical and algorithmic framework
Ali Zafari, Xi Chen, Shirin Jalali
Self-Supervised Sparse Sensor Fusion for Long Range Perception
Edoardo Palladin, Samuel Brucker, Filippo Ghilotti et al.
EditInfinity: Image Editing with Binary-Quantized Generative Models
Jiahuan Wang, Yuxin Chen, Jun Yu et al.
QiMeng-SALV: Signal-Aware Learning for Verilog Code Generation
Yang Zhang, Rui Zhang, Jiaming Guo et al.
Leveraging Depth and Language for Open-Vocabulary Domain-Generalized Semantic Segmentation
Siyu Chen, Ting Han, Chengzheng Fu et al.
Are Large Reasoning Models Good Translation Evaluators? Analysis and Performance Boost
Runzhe Zhan, Zhihong Huang, Xinyi Yang et al.
DETree: DEtecting Human-AI Collaborative Texts via Tree-Structured Hierarchical Representation Learning
Yongxin He, Shan Zhang, Yixuan Cao et al.
Joint Hierarchical Representation Learning of Samples and Features via Informed Tree-Wasserstein Distance
Ya-Wei Eileen Lin, Ronald Coifman, Gal Mishne et al.
PatchScaler: An Efficient Patch-Independent Diffusion Model for Image Super-Resolution
Yong Liu, Hang Dong, Jinshan Pan et al.
ForCenNet: Foreground-Centric Network for Document Image Rectification
Peng Cai, liqiang liqiang, Kaicheng Yang et al.
Learning Juntas under Markov Random Fields
Gautam Chandrasekaran, Adam Klivans
VidEmo: Affective-Tree Reasoning for Emotion-Centric Video Foundation Models
Zhicheng Zhang, Weicheng Wang, Yongjie Zhu et al.
EvOcc: Accurate Semantic Occupancy for Automated Driving Using Evidence Theory
Jonas Kälble, Sascha Wirges, Maxim Tatarchenko et al.
NegoCollab: A Common Representation Negotiation Approach for Heterogeneous Collaborative Perception
CONGZHANG SHAO, Quan Yuan, Guiyang Luo et al.
SYMPHONY: Synergistic Multi-agent Planning with Heterogeneous Language Model Assembly
Wei Zhu, Zhiwen Tang, Kun Yue
What One Cannot, Two Can: Two-Layer Transformers Provably Represent Induction Heads on Any-Order Markov Chains
Chanakya Ekbote, Ashok Vardhan Makkuva, Marco Bondaschi et al.
Self-diffusion for Solving Inverse Problems
Guanxiong Luo, Shoujin Huang
VideoSPatS: Video SPatiotemporal Splines for Disentangled Occlusion, Appearance and Motion Modeling and Editing
Juan Luis Gonzalez Bello, Xu Yao, Alex Whelan et al.
PoseAnchor: Robust Root Position Estimation for 3D Human Pose Estimation
Jun-Hee Kim, Jumin Han, Seong-Whan Lee
Implicit Correspondence Learning for Image-to-Point Cloud Registration
Xinjun Li, Wenfei Yang, Jiacheng Deng et al.
Verbalized Representation Learning for Interpretable Few-Shot Generalization
Cheng-Fu Yang, Da Yin, Wenbo Hu et al.
A Generalized Binary Tree Mechanism for Private Approximation of All-Pair Shortest Distances
Zongrui Zou, Chenglin Fan, Michael Dinitz et al.
CaMuViD: Calibration-Free Multi-View Detection
Amir Etefaghi Daryani, M. Usman Maqbool Bhutta, Byron Hernandez et al.
Prior-Guided Flow Matching for Target-Aware Molecule Design with Learnable Atom Number
Jingyuan Zhou, Hao Qian, Shikui Tu et al.
Optimize the Unseen - Fast NeRF Cleanup with Free Space Prior
Leo Segre, Shai Avidan
HyPINO: Multi-Physics Neural Operators via HyperPINNs and the Method of Manufactured Solutions
Rafael Bischof, Michal Piovarci, Michael Kraus et al.
ReplaceMe: Network Simplification via Depth Pruning and Transformer Block Linearization
Dmitriy Shopkhoev, Ammar Ali, Magauiya Zhussip et al.
On the Expressive Power of Mixture-of-Experts for Structured Complex Tasks
Mingze Wang, Weinan E
Deep Compositional Phase Diffusion for Long Motion Sequence Generation
Ho Yin Au, Jie Chen, Junkun Jiang et al.
Benchmarking Object Detectors under Real-World Distribution Shifts in Satellite Imagery
Sara Al-Emadi, Yin Yang, Ferda Ofli
Fewer Denoising Steps or Cheaper Per-Step Inference: Towards Compute-Optimal Diffusion Model Deployment
Zhenbang Du, Yonggan Fu, Lifu Wang et al.
Towards Interpretability Without Sacrifice: Faithful Dense Layer Decomposition with Mixture of Decoders
James Oldfield, Shawn Im, Sharon Li et al.
Fitting Networks with a Cancellation Trick
Jiashun Jin, Jingming Wang
A Linear N-Point Solver for Structure and Motion from Asynchronous Tracks
Hang Su, Yunlong Feng, Daniel Gehrig et al.
Benchmarking Burst Super-Resolution for Polarization Images: Noise Dataset and Analysis
Inseung Hwang, Kiseok Choi, Hyunho Ha et al.
NestedFP: High-Performance, Memory-Efficient Dual-Precision Floating Point Support for LLMs
Haeun Lee, Omin Kwon, Yeonhong Park et al.
Multidimensional Byte Pair Encoding: Shortened Sequences for Improved Visual Data Generation
Tim Elsner, Paula Usinger, Julius Nehring-Wirxel et al.
Sparse Diffusion Autoencoder for Test-time Adapting Prediction of Complex Systems
Jingwen Cheng, Ruikun Li, Huandong Wang et al.
Image Token Matters: Mitigating Hallucination in Discrete Tokenizer-based Large Vision-Language Models via Latent Editing
Weixing Wang, Zifeng Ding, Jindong Gu et al.
Neural Attention Search
Difan Deng, Marius Lindauer
Provable Gradient Editing of Deep Neural Networks
Zhe Tao, Aditya V Thakur
PlaceIt3D: Language-Guided Object Placement in Real 3D Scenes
Ahmed Abdelreheem, Filippo Aleotti, Jamie Watson et al.
RESCUE: Crowd Evacuation Simulation via Controlling SDM-United Characters
Xiaolin Liu, Tianyi zhou, Hongbo Kang et al.
AC-LoRA: (Almost) Training-Free Access Control Aware Multi-Modal LLMs
Lara Magdalena Lazier, Aritra Dhar, Vasilije Stambolic et al.
Probing Neural Combinatorial Optimization Models
Zhiqin Zhang, Yining Ma, Zhiguang Cao et al.
Failure by Interference: Language Models Make Balanced Parentheses Errors When Faulty Mechanisms Overshadow Sound Ones
Daking Rai, Samuel Miller, Kevin Moran et al.
PrimHOI: Compositional Human-Object Interaction via Reusable Primitives
Kai Jia, Tengyu Liu, Mingtao Pei et al.
Mitigating Semantic Collapse in Partially Relevant Video Retrieval
WonJun Moon, MinSeok Jung, Gilhan Park et al.
Data Distributional Properties As Inductive Bias for Systematic Generalization
Felipe del Rio, Alain Raymond, Daniel Florea et al.
Unleashing Foundation Vision Models: Adaptive Transfer for Diverse Data-Limited Scientific Domains
Qiankun Li, Feng He, Huabao Chen et al.
RETRO SYNFLOW: Discrete Flow-Matching for Accurate and Diverse Single-Step Retrosynthesis
Robin Yadav, Qi Yan, Guy Wolf et al.
Statistical Inference for Gradient Boosting Regression
Haimo Fang, Kevin Tan, Giles Hooker
When Less Language is More: Language-Reasoning Disentanglement Makes LLMs Better Multilingual Reasoners
Weixiang Zhao, Jiahe Guo, Yang Deng et al.
Efficient Event-Based Object Detection: A Hybrid Neural Network with Spatial and Temporal Attention
Soikat Hasan Ahmed, Jan Finkbeiner, Emre Neftci
Beyond Random: Automatic Inner-loop Optimization in Dataset Distillation
Muquan Li, Hang Gou, Dongyang Zhang et al.
CODE-CL: Conceptor-Based Gradient Projection for Deep Continual Learning
Marco P. Apolinario, Sakshi Choudhary, Kaushik Roy
VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play
Zelai Xu, Ruize Zhang, Chao Yu et al.
Certifiably Optimal Anisotropic Rotation Averaging
Carl Olsson, Yaroslava Lochman, Johan Malmport et al.
Efficient Input-level Backdoor Defense on Text-to-Image Synthesis via Neuron Activation Variation
Shengfang ZHAI, Jiajun Li, Yue Liu et al.
Learning Relative Gene Expression Trends from Pathology Images in Spatial Transcriptomics
Kazuya Nishimura, Haruka Hirose, Ryoma Bise et al.
CoFFT: Chain of Foresight-Focus Thought for Visual Language Models
Xinyu Zhang, Yuxuan Dong, Lingling Zhang et al.
ReCon: Region-Controllable Data Augmentation with Rectification and Alignment for Object Detection
Haowei Zhu, Tianxiang Pan, Rui Qin et al.
PURA: Parameter Update-Recovery Test-Time Adaption for RGB-T Tracking
Zekai Shao, Yufan Hu, Bin Fan et al.
QR-LoRA: Efficient and Disentangled Fine-tuning via QR Decomposition for Customized Generation
Jiahui Yang, Yongjia Ma, Donglin Di et al.
Evaluating LLMs in Open-Source Games
Swadesh Sistla, Max Kleiman-Weiner
VERA: Variational Inference Framework for Jailbreaking Large Language Models
Anamika Lochab, Lu Yan, Patrick Pynadath et al.
CanFields: Consolidating Diffeomorphic Flows for Non-Rigid 4D Interpolation from Arbitrary-Length Sequences
Miaowei Wang, Changjian Li, Amir Vaxman
Optimal community detection in dense bipartite graphs
Julien Chhor, Parker Knight
Fast MRI for All: Bridging Access Gaps by Training without Raw Data
Yasar Utku Alcalar, Merve Gulle, Mehmet Akcakaya
MIORe & VAR-MIORe: Benchmarks to Push the Boundaries of Restoration
George Ciubotariu, Zhuyun Zhou, Zongwei Wu et al.
Universally Invariant Learning in Equivariant GNNs
Jiacheng Cen, Anyi Li, Ning Lin et al.
Towards Prospective Medical Image Reconstruction via Knowledge-Informed Dynamic Optimal Transport
Taoran Zheng, Yan Yang, Xing Li et al.
Classic Video Denoising in a Machine Learning World: Robust, Fast, and Controllable
Xin Jin, Simon Niklaus, Zhoutong Zhang et al.
EMatch: A Unified Framework for Event-based Optical Flow and Stereo Matching
Pengjie Zhang, Lin Zhu, Xiao Wang et al.
On Evaluating LLM Alignment by Evaluating LLMs as Judges
Yixin Liu, Pengfei Liu, Arman Cohan
Sampling by averaging: A multiscale approach to score estimation
Paula Cordero-Encinar, Andrew Duncan, Sebastian Reich et al.
FP64 is All You Need: Rethinking Failure Modes in Physics-Informed Neural Networks
Chenhui Xu, Dancheng Liu, Amir Nassereldine et al.
Meta-Learning Dynamic Center Distance: Hard Sample Mining for Learning with Noisy Labels
Chenyu Mu, Yijun Qu, Jiexi Yan et al.
Limited Preference Data? Learning Better Reward Model with Latent Space Synthesis
Leitian Tao, Xuefeng Du, Sharon Li
LCDB 1.1: A Database Illustrating Learning Curves Are More Ill-Behaved Than Previously Thought
Cheng Yan, Felix Mohr, Tom Viering
Evolution of Information in Interactive Decision Making: A Case Study for Multi-Armed Bandits
Yuzhou Gu, Yanjun Han, Jian Qian
M-Net: MRI Brain Tumor Sequential Segmentation Network via Mesh-Cast
Jiacheng Lu, Hui Ding, Shiyu Zhang et al.
A Large-scale Dataset and Benchmark for Commuting Origin-Destination Flow Generation
Can Rong, Jingtao Ding, Yan Liu et al.
CycleVAR: Repurposing Autoregressive Model for Unsupervised One-Step Image Translation
Yi Liu, Shengqian Li, Zuzeng Lin et al.
Streamlining Image Editing with Layered Diffusion Brushes
Peyman Gholami, Robert Xiao
It’s Hard to Be Normal: The Impact of Noise on Structure-agnostic Estimation
Jikai Jin, Lester Mackey, Vasilis Syrgkanis
GUI Exploration Lab: Enhancing Screen Navigation in Agents via Multi-Turn Reinforcement Learning
Haolong Yan, Yeqing Shen, Xin Huang et al.
SPOT-Trip: Dual-Preference Driven Out-of-Town Trip Recommendation
Yinghui Liu, Hao Miao, Guojiang Shen et al.
Structure Matters: Dynamic Policy Gradient
Sara Klein, Xiangyuan Zhang, Tamer Basar et al.
Twinner: Shining Light on Digital Twins in a Few Snaps
Jesus Zarzar, Tom Monnier, Roman Shapovalov et al.
Activated LoRA: Fine-tuned LLMs for Intrinsics
Kristjan Greenewald, Luis Lastras, Thomas Parnell et al.
Noise Consistency Training: A Native Approach for One-step Generator in Learning Additional Controls
Yihong Luo, Shuchen Xue, Tianyang Hu et al.
UMFN: Unified Multi-Domain Face Normalization for Joint Cross-domain Prototype Learning and Heterogeneous Face Recognition
Meng Pang, Wenjun Zhang, Nanrun Zhou et al.
Dual-Flow: Transferable Multi-Target, Instance-Agnostic Attacks via $\textit{In-the-wild}$ Cascading Flow Optimization
Yixiao Chen, Shikun Sun, Jianshu Li et al.
MVQA: Mamba with Unified Sampling for Efficient Video Quality Assessment
Yachun Mi, Yu Li, Weicheng Meng et al.
Subsampled Ensemble Can Improve Generalization Tail Exponentially
Huajie Qian, Donghao Ying, Henry Lam et al.
Sequentially Auditing Differential Privacy
Tomás González Lara, Mateo Dulce Rubio, Aaditya Ramdas et al.
Semantic Representation Attack against Aligned Large Language Models
Jiawei Lian, Jianhong Pan, Lefan Wang et al.
STRIDER: Navigation via Instruction-Aligned Structural Decision Space Optimization
Diqi He, Xuehao Gao, Hao Li et al.
Intermediate Connectors and Geometric Priors for Language-Guided Affordance Segmentation on Unseen Object Categories
Yicong Li, Yiyang Chen, Zhenyuan Ma et al.
Devil is in the Uniformity: Exploring Diverse Learners within Transformer for Image Restoration
Shihao Zhou, Dayu Li, Jinshan Pan et al.
NaViL: Rethinking Scaling Properties of Native Multimodal Large Language Models under Data Constraints
Changyao Tian, Hao Li, Gen Luo et al.
FNOPE: Simulation-based inference on function spaces with Fourier Neural Operators
Guy Moss, Leah Muhle, Reinhard Drews et al.
PARROT: A Benchmark for Evaluating LLMs in Cross-System SQL Translation
Wei Zhou, Guoliang Li, Haoyu Wang et al.
Integration Matters for Learning PDEs with Backwards SDEs
Sungje Park, Stephen Tu
MS-GS: Multi-Appearance Sparse-View 3D Gaussian Splatting in the Wild
Deming Li, Kaiwen Jiang, Yutao Tang et al.
Linear Transformers Implicitly Discover Unified Numerical Algorithms
Patrick Lutz, Aditya Gangrade, Hadi Daneshmand et al.
Automaton Constrained Q-Learning
Anastasios Manganaris, Vittorio Giammarino, Ahmed Qureshi
DAMap: Distance-aware MapNet for High Quality HD Map Construction
JINPENG DONG, Chen Li, Yutong Lin et al.
Achilles' Heel of Mamba: Essential difficulties of the Mamba architecture demonstrated by synthetic data
Tianyi Chen, Pengxiao Lin, Zhiwei Wang et al.
Graph Alignment via Birkhoff Relaxation
Sushil Varma, Irène Waldspurger, Laurent Massoulié
DAVE: Diagnostic benchmark for Audio Visual Evaluation
Gorjan Radevski, Teodora Popordanoska, Matthew Blaschko et al.
NeuroGenPoisoning: Neuron-Guided Attacks on Retrieval-Augmented Generation of LLM via Genetic Optimization of External Knowledge
Hanyu Zhu, Lance Fiondella, Jiawei Yuan et al.
ARGenSeg: Image Segmentation with Autoregressive Image Generation Model
Xiaolong Wang, Lixiang Ru, Ziyuan Huang et al.
Cooperative Retrieval-Augmented Generation for Question Answering: Mutual Information Exchange and Ranking by Contrasting Layers
Youmin Ko, Sungjong Seo, Hyunjoon Kim
Learning with Restricted Boltzmann Machines: Asymptotics of AMP and GD in High Dimensions
Yizhou Xu, Florent Krzakala, Lenka Zdeborová
OpticalNet: An Optical Imaging Dataset and Benchmark Beyond the Diffraction Limit
Benquan Wang, Ruyi An, Jin-Kyu So et al.
Balancing Gradient and Hessian Queries in Non-Convex Optimization
Deeksha Adil, Brian Bullins, Aaron Sidford et al.
Strategic Classification with Non-Linear Classifiers
Benyamin Trachtenberg, Nir Rosenfeld
LLM-enhanced Action-aware Multi-modal Prompt Tuning for Image-Text Matching
Meng Tian, Shuo Yang, Xinxiao Wu
pLSTM: parallelizable Linear Source Transition Mark networks
Korbinian Pöppel, Richard Freinschlag, Thomas Schmied et al.
TreeSynth: Synthesizing Diverse Data from Scratch via Tree-Guided Subspace Partitioning
Sheng Wang, Pengan CHEN, Jingqi Zhou et al.
Reminiscence Attack on Residuals: Exploiting Approximate Machine Unlearning for Privacy
Yaxin Xiao, Qingqing Ye, Li Hu et al.
Task-Optimized Convolutional Recurrent Networks Align with Tactile Processing in the Rodent Brain
Trinity Chung, Yuchen Shen, Nathan Kong et al.
On the Empirical Power of Goodness-of-Fit Tests in Watermark Detection
Weiqing He, Xiang Li, Tianqi Shang et al.
Unifying Symbolic Music Arrangement: Track-Aware Reconstruction and Structured Tokenization
Longshen Ou, Jingwei Zhao, Ziyu Wang et al.
Future Link Prediction Without Memory or Aggregation
Lu Yi, Runlin Lei, Fengran Mo et al.
MoORE: SVD-based Model MoE-ization for Conflict- and Oblivion-Resistant Multi-Task Adaptation
Shen Yuan, Yin Zheng, Taifeng Wang et al.
Language Decoupling with Fine-grained Knowledge Guidance for Referring Multi-object Tracking
guangyao Li, Siping Zhuang, Yajun Jian et al.
Mitigating the Privacy–Utility Trade-off in Decentralized Federated Learning via f-Differential Privacy
Xiang Li, Chendi Wang, Buxin Su et al.
ViT-EnsembleAttack: Augmenting Ensemble Models for Stronger Adversarial Transferability in Vision Transformers
Hanwen Cao, Haobo Lu, Xiaosen Wang et al.
Accelerating Visual-Policy Learning through Parallel Differentiable Simulation
Haoxiang You, Yilang Liu, Ian Abraham
Leveraging Importance Sampling to Detach Alignment Modules from Large Language Models
Yi Liu, Dianqing Liu, Mingye Zhu et al.
Robust Distortion-Free Watermark for Autoregressive Audio Generation Models
Yihan Wu, Georgios Milis, Ruibo Chen et al.
CLIPTTA: Robust Contrastive Vision-Language Test-Time Adaptation
Marc Lafon, Gustavo Vargas Hakim, Clément Rambour et al.
CodeAssistBench (CAB): Dataset & Benchmarking for Multi-turn Chat-Based Code Assistance
Myeongsoo Kim, Shweta Garg, Baishakhi Ray et al.
Opinion Maximization in Social Networks by Modifying Internal Opinions
Gengyu Wang, Runze Zhang, Zhongzhi Zhang
Spike-timing-dependent Hebbian learning as noisy gradient descent
Niklas Dexheimer, Sascha Gaudlitz, Johannes Schmidt-Hieber
Uncovering the Spectral Bias in Diagonal State Space Models
Ruben Solozabal, Velibor Bojkovic, Hilal AlQuabeh et al.
How Many Domains Suffice for Domain Generalization? A Tight Characterization via the Domain Shattering Dimension
Cynthia Dwork, Lunjia Hu, Han Shao
Diffusion-based 3D Hand Motion Recovery with Intuitive Physics
Yufei Zhang, Zijun Cui, Jeffrey Kephart et al.
Robust Ego-Exo Correspondence with Long-Term Memory
Yijun Hu, Bing Fan, Xin Gu et al.
PARC: A Quantitative Framework Uncovering the Symmetries within Vision Language Models
Jenny Schmalfuss, Nadine Chang, Vibashan VS et al.
Two-Steps Diffusion Policy for Robotic Manipulation via Genetic Denoising
Mateo Clémente, Leo Brunswic, Yang et al.
CSPCL: Category Semantic Prior Contrastive Learning for Deformable DETR-Based Prohibited Item Detectors
Mingyuan Li, Tong Jia, Hao Wang et al.
Explaining and Mitigating Crosslingual Tokenizer Inequities
Catherine Arnett, Tyler Chang, Stella Biderman et al.
HazeFlow: Revisit Haze Physical Model as ODE and Non-Homogeneous Haze Generation for Real-World Dehazing
Junseong Shin, Seungwoo Chung, Yunjeong Yang et al.
ImageNet-trained CNNs are not biased towards texture: Revisiting feature reliance through controlled suppression
Tom Burgert, Oliver Stoll, Paolo Rota et al.
Tight Generalization Bounds for Large-Margin Halfspaces
Kasper Green Larsen, Natascha Schalburg
Zero-cost Proxy for Adversarial Robustness Evaluation
Yuqi Feng, Yuwei Ou, Jiahao Fan et al.
IM360: Large-scale Indoor Mapping with 360 Cameras
Dongki Jung, Jaehoon Choi, Yonghan Lee et al.
Towards Natural Language-Based Document Image Retrieval: New Dataset and Benchmark
Hao Guo, Xugong Qin, Jun Jie Ou Yang et al.
Reliably detecting model failures in deployment without labels
Viet Nguyen, Changjian Shui, Vijay Giri et al.
Type-R: Automatically Retouching Typos for Text-to-Image Generation
Wataru Shimoda, Naoto Inoue, Daichi Haraguchi et al.
Dual Reciprocal Learning of Language-based Human Motion Understanding and Generation
CHEN LIANG, Zhicheng Shi, Wenguan Wang et al.
Long-tailed Recognition with Model Rebalancing
JIAAN LUO, Feng Hong, Qiang Hu et al.
Demystifying the Token Dynamics of Deep Selective State Space Models
Thieu Vo, Duy-Tung Pham, Xin Tong et al.
Causal Climate Emulation with Bayesian Filtering
Sebastian H. M. Hickman, Ilija Trajković, Julia Kaltenborn et al.
CTRL-ALT-DECEIT Sabotage Evaluations for Automated AI R&D
Francis Ward, Teun van der Weij, Hanna Gábor et al.
Learning Efficient Fuse-and-Refine for Feed-Forward 3D Gaussian Splatting
Yiming Wang, Lucy Chai, Xuan Luo et al.
Empower Words: DualGround for Structured Phrase and Sentence-Level Temporal Grounding
Minseok Kang, Minhyeok Lee, Minjung Kim et al.
Argus: A Compact and Versatile Foundation Model for Vision
Weiming Zhuang, Chen Chen, Zhizhong Li et al.
S$^3$E: Self-Supervised State Estimation for Radar-Inertial System
Shengpeng Wang, Yulong Xie, Qing Liao et al.
Robust Low-light Scene Restoration via Illumination Transition
Ze Li, Feng Zhang, Xiatian Zhu et al.
DuoCLR: Dual-Surrogate Contrastive Learning for Skeleton-based Human Action Segmentation
Haitao Tian
ShortListing Model: A Streamlined Simplex Diffusion for Discrete Variable Generation
Yuxuan Song, Zhe Zhang, Yu Pei et al.
SolverLLM: Leveraging Test-Time Scaling for Optimization Problem via LLM-Guided Search
Dong Li, Xujiang Zhao, Linlin Yu et al.
Generalizable Non-Line-of-Sight Imaging with Learnable Physical Priors
Shida Sun, Yue Li, Yueyi Zhang et al.
Can Generative AI Solve Your In-Context Learning Problem? A Martingale Perspective
Andrew Jesson, Nicolas Beltran-Velez, David Blei
Optimal Learning of Kernel Logistic Regression for Complex Classification Scenarios
Hongwei Wen, Annika Betken, Hanyuan Hang
Visual Intention Grounding for Egocentric Assistants
Pengzhan Sun, Junbin Xiao, Tze Ho Elden Tse et al.
REOrdering Patches Improves Vision Models
Declan Kutscher, David Chan, Yutong Bai et al.
Guiding Cross-Modal Representations with MLLM Priors via Preference Alignment
Pengfei Zhao, Rongbo Luan, Wei Zhang et al.
Robust Equilibria in Continuous Games: From Strategic to Dynamic Robustness
Kyriakos Lotidis, Panayotis Mertikopoulos, Nicholas Bambos et al.
How Would It Sound? Material-Controlled Multimodal Acoustic Profile Generation for Indoor Scenes
Mahnoor Saad, Ziad Al-Halah
InvisibleInk: High-Utility and Low-Cost Text Generation with Differential Privacy
Vishnu Vinod, Krishna Pillutla, Abhradeep Guha Thakurta
Adjusting Initial Noise to Mitigate Memorization in Text-to-Image Diffusion Models
Hyeonggeun Han, Sehwan Kim, Hyungjun Joo et al.
DeltaFlow: An Efficient Multi-frame Scene Flow Estimation Method
Qingwen Zhang, Xiaomeng Zhu, Yushan Zhang et al.
ContraGS: Codebook-Condensed and Trainable Gaussian Splatting for Fast, Memory-Efficient Reconstruction
Sankeerth Durvasula, Sharanshangar Muhunthan, Zain Moustafa et al.
Towards Understanding Transformers in Learning Random Walks
Wei Shi, Yuan Cao
MoME: Mixture of Matryoshka Experts for Audio-Visual Speech Recognition
Umberto Cappellazzo, Minsu Kim, Pingchuan Ma et al.
Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework
Yi-Ting Chen, Ting-Hsuan Liao, Pengsheng Guo et al.
A Unified Reasoning Framework for Holistic Zero-Shot Video Anomaly Analysis
Dongheng Lin, Mengxue Qu, Kunyang Han et al.
UnCLe: Towards Scalable Dynamic Causal Discovery in Non-linear Temporal Systems
Tingzhu Bi, Yicheng Pan, Xinrui Jiang et al.
Automated Composition of Agents: A Knapsack Approach for Agentic Component Selection
Michelle Yuan, Khushbu Pahwa, Shuaichen Chang et al.
High Resolution UDF Meshing via Iterative Networks
Federico Stella, Nicolas Talabot, Hieu Le et al.
InterMT: Multi-Turn Interleaved Preference Alignment with Human Feedback
Boyuan Chen, Donghai Hong, Jiaming Ji et al.
TAG-WM: Tamper-Aware Generative Image Watermarking via Diffusion Inversion Sensitivity
Yuzhuo Chen, Zehua Ma, Han Fang et al.
Tiling artifacts and trade-offs of feature normalization in the segmentation of large biological images
Elena Buglakova, Anwai Archit, Edoardo D'Imprima et al.
DanceEditor: Towards Iterative Editable Music-driven Dance Generation with Open-Vocabulary Descriptions
Hengyuan Zhang, Zhe Li, Xingqun Qi et al.
ShapeShifter: 3D Variations Using Multiscale and Sparse Point-Voxel Diffusion
Nissim Maruani, Wang Yifan, Matthew Fisher et al.
Utilitarian Algorithm Configuration for Infinite Parameter Spaces
Devon Graham, Kevin Leyton-Brown
Asymptotically exact variational flows via involutive MCMC kernels
Zuheng (David) Xu, Trevor Campbell
NRGBoost: Energy-Based Generative Boosted Trees
João Bravo