Most Cited 2025 "key-value state reuse" Papers
22,274 papers found • Page 76 of 112
Conference
3D-Prover: Diversity Driven Theorem Proving With Determinantal Point Processes
Sean Lamont, Christian Walder, Amir Dezfouli et al.
Analyzing Similarity Metrics for Data Selection for Language Model Pretraining
Dylan Sam, Ayan Chakrabarti, Afshin Rostamizadeh et al.
Normal and Abnormal Pathology Knowledge-Augmented Vision-Language Model for Anomaly Detection in Pathology Images
Jinsol Song, Jiamu Wang, Anh Nguyen et al.
The Impact of Coreset Selection on Spurious Correlations and Group Robustness
Amaya Dharmasiri, William Yang, Polina Kirichenko et al.
UMFN: Unified Multi-Domain Face Normalization for Joint Cross-domain Prototype Learning and Heterogeneous Face Recognition
Meng Pang, Wenjun Zhang, Nanrun Zhou et al.
Concept Incongruence: An Exploration of Time and Death in Role Playing
Xiaoyan Bai, Ike Peng, Aditya Singh et al.
OpticalNet: An Optical Imaging Dataset and Benchmark Beyond the Diffraction Limit
Benquan Wang, Ruyi An, Jin-Kyu So et al.
GraphChain: Large Language Models for Large-scale Graph Analysis via Tool Chaining
Chunyu Wei, Wenji Hu, Xingjia Hao et al.
Online Feedback Efficient Active Target Discovery in Partially Observable Environments
Anindya Sarkar, Binglin Ji, Yevgeniy Vorobeychik
Plug-in Feedback Self-adaptive Attention in CLIP for Training-free Open-Vocabulary Segmentation
Zhixiang Chi, Yanan Wu, Li Gu et al.
How Does Label Noise Gradient Descent Improve Generalization in the Low SNR Regime?
Wei Huang, Andi Han, Yujin Song et al.
Improving the Generation and Evaluation of Synthetic Data for Downstream Medical Causal Inference
Harry Amad, Zhaozhi Qian, Dennis Frauen et al.
Graph Neural Network Based Action Ranking for Planning
Rajesh Mangannavar, Stefan Lee, Alan Fern et al.
Self-Supervised Learning for Color Spike Camera Reconstruction
Yanchen Dong, Ruiqin Xiong, Xiaopeng Fan et al.
CCS: Controllable and Constrained Sampling with Diffusion Models via Initial Noise Perturbation
Bowen Song, Zecheng Zhang, Zhaoxu Luo et al.
Whitened Score Diffusion: A Structured Prior for Imaging Inverse Problems
Jeffrey Alido, Tongyu Li, Yu Sun et al.
Metropolis-Hastings Sampling for 3D Gaussian Reconstruction
Hyunjin Kim, Haebeom Jung, Jaesik Park
LLM-Assisted Semantic Guidance for Sparsely Annotated Remote Sensing Object Detection
Wei Liao, Chunyan Xu, Chenxu Wang et al.
BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation
Yuanhong Yu, Xingyi He, Chen Zhao et al.
Blockwise Flow Matching: Improving Flow Matching Models For Efficient High-Quality Generation
Dogyun Park, Taehoon Lee, Minseok Joo et al.
Efficient Spiking Point Mamba for Point Cloud Analysis
Peixi Wu, Bosong Chai, Menghua Zheng et al.
Parameterized Synthetic Text Generation with SimpleStories
Lennart Finke, Chandan Sreedhara, Thomas Dooms et al.
Enhancing Optimizer Stability: Momentum Adaptation of The NGN Step-size
Rustem Islamov, Niccolò Ajroldi, Antonio Orvieto et al.
DexH2R: A Benchmark for Dynamic Dexterous Grasping in Human-to-Robot Handover
Youzhuo Wang, jiayi ye, Chuyang Xiao et al.
Latent Expression Generation for Referring Image Segmentation and Grounding
Seonghoon Yu, Junbeom Hong, Joonseok Lee et al.
ReMindRAG: Low-Cost LLM-Guided Knowledge Graph Traversal for Efficient RAG
Yikuan Hu, Jifeng Zhu, Lanrui Tang et al.
Intermediate Connectors and Geometric Priors for Language-Guided Affordance Segmentation on Unseen Object Categories
Yicong Li, Yiyang Chen, Zhenyuan Ma et al.
BASIC: Boosting Visual Alignment with Intrinsic Refined Embeddings in Multimodal Large Language Models
Jianting Tang, Yubo Wang, Haoyu Cao et al.
STEPS: Sequential Probability Tensor Estimation for Text-to-Image Hard Prompt Search
Yuning Qiu, Andong Wang, Chao Li et al.
Agnostic Learning under Targeted Poisoning: Optimal Rates and the Role of Randomness
Bogdan Chornomaz, Yonatan Koren, Shay Moran et al.
Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework
Yi-Ting Chen, Ting-Hsuan Liao, Pengsheng Guo et al.
Flow Matching Neural Processes
Hussen Abu Hamad, Dan Rosenbaum
Keep the Balance: A Parameter-Efficient Symmetrical Framework for RGB+X Semantic Segmentation
Jiaxin Cai, Jingze Su, Qi Li et al.
A Unified Framework for Provably Efficient Algorithms to Estimate Shapley Values
Tyler Chen, Akshay Seshadri, Mattia Jacopo Villani et al.
Preference Learning with Response Time: Robust Losses and Guarantees
Ayush Sawarni, Sahasrajit Sarmasarkar, Vasilis Syrgkanis
Doubly-Robust Estimation of Counterfactual Policy Mean Embeddings
Houssam Zenati, Bariscan Bozkurt, Arthur Gretton
Securing the Language of Life: Inheritable Watermarks from DNA Language Models to Proteins
ZAIXI ZHANG, Ruofan Jin, Le Cong et al.
Heterogeneous Adversarial Play in Interactive Environments
Manjie Xu, Xinyi Yang, Jiayu Zhan et al.
SRefiner: Soft-Braid Attention for Multi-Agent Trajectory Refinement
Liwen Xiao, Zhiyu Pan, Zhicheng Wang et al.
SimWorld: An Open-ended Simulator for Agents in Physical and Social Worlds
Xiaokang Ye, Jiawei Ren, Yan Zhuang et al.
Knowledge Graph Enhanced Generative Multi-modal Models for Class-Incremental Learning
Xusheng Cao, Haori Lu, Linlan Huang et al.
VRM: Knowledge Distillation via Virtual Relation Matching
Weijia Zhang, Fei Xie, Weidong Cai et al.
Efficient Input-level Backdoor Defense on Text-to-Image Synthesis via Neuron Activation Variation
Shengfang ZHAI, Jiajun Li, Yue Liu et al.
OmniSegmentor: A Flexible Multi-Modal Learning Framework for Semantic Segmentation
Bo-Wen Yin, Jiao-Long Cao, Xuying Zhang et al.
Visual Surface Wave Elastography: Revealing Subsurface Physical Properties via Visible Surface Waves
Alexander Ogren, Berthy Feng, Jihoon Ahn et al.
Is This Tracker On? A Benchmark Protocol for Dynamic Tracking
Ilona Demler, Saumya Chauhan, Georgia Gkioxari
ImHead: A Large-scale Implicit Morphable Model for Localized Head Modeling
Rolandos Alexandros Potamias, Stathis Galanakis, Jiankang Deng et al.
VL-SAM-V2: Open-World Object Detection with General and Specific Query Fusion
Zhiwei Lin, Yongtao Wang
Seam360GS: Seamless 360° Gaussian Splatting from Real-World Omnidirectional Images
Changha Shin, Woong Oh Cho, Seon Joo Kim
Smooth Sailing: Lipschitz-Driven Uncertainty Quantification for Spatial Associations
David Burt, Renato Berlinghieri, Stephen Bates et al.
Stabilizing LTI Systems under Partial Observability: Sample Complexity and Fundamental Limits
Ziyi Zhang, Yorie Nakahira, Guannan Qu
Adaptive Divergence Regularized Policy Optimization for Fine-tuning Generative Models
Jiajun Fan, Tong Wei, Chaoran Cheng et al.
MaDCoW: Marginal Distortion Correction for Wide-Angle Photography with Arbitrary Objects
Kevin Zhang, Jia-Bin Huang, Jose Echevarria et al.
Incorporating Dense Knowledge Alignment into Unified Multimodal Representation Models
Yuhao Cui, Xinxing Zu, Wenhua Zhang et al.
Continuous Domain Generalization
Zekun CAI, Yiheng YAO, Guangji Bai et al.
Balancing Performance and Costs in Best Arm Identification
Michael Harding, Kirthevasan Kandasamy
Polarized Color Screen Matting
Kenji Enomoto, Scott Cohen, Brian Price et al.
Fast exact recovery of noisy matrix from few entries: the infinity norm approach
BaoLinh Tran, Van Vu
D-Attn: Decomposed Attention for Large Vision-and-Language Model
Chia-Wen Kuo, Sijie Zhu, Fan Chen et al.
Beyond the Seen: Bounded Distribution Estimation for Open-Vocabulary Learning
Xiaomeng Fan, Yuchuan Mao, Zhi Gao et al.
Learning to Generate Human-Human-Object Interactions from Textual Descriptions
Jeonghyeon Na, Sangwon Baik, Inhee Lee et al.
MMAT-1M: A Large Reasoning Dataset for Multimodal Agent Tuning
Tianhong Gao, Yannian Fu, Weiqun Wu et al.
PolarAnything: Diffusion-based Polarimetric Image Synthesis
Kailong Zhang, Youwei Lyu, Heng Guo et al.
From Programs to Poses: Factored Real-World Scene Generation via Learned Program Libraries
Joy Hsu, Emily Jin, Jiajun Wu et al.
MAC-Ego3D: Multi-Agent Gaussian Consensus for Real-Time Collaborative Ego-Motion and Photorealistic 3D Reconstruction
Xiaohao Xu, Feng Xue, Shibo Zhao et al.
On Epistemic Uncertainty of Visual Tokens for Object Hallucinations in Large Vision-Language Models
Hoigi Seo, Dong Un Kang, Hyunjin Cho et al.
HiERO: Understanding the Hierarchy of Human Behavior Enhances Reasoning on Egocentric Videos
Simone Alberto Peirone, Francesca Pistilli, Giuseppe Averta
REMI: Reconstructing Episodic Memory During Internally Driven Path Planning
Zhaoze Wang, Genela Morris, Dori Derdikman et al.
FRAME: Floor-aligned Representation for Avatar Motion from Egocentric Video
Andrea Boscolo Camiletto, Jian Wang, Eduardo Alvarado et al.
Physics-informed Value Learner for Offline Goal-Conditioned Reinforcement Learning
Vittorio Giammarino, Ruiqi Ni, Ahmed Qureshi
Causal Disentanglement and Cross-Modal Alignment for Enhanced Few-Shot Learning
Tianjiao Jiang, Zhen Zhang, Yuhang Liu et al.
Enhancing CLIP Robustness via Cross-Modality Alignment
Xingyu Zhu, Beier Zhu, Shuo Wang et al.
SpikePack: Enhanced Information Flow in Spiking Neural Networks with High Hardware Compatibility
Guobin Shen, Jindong Li, Tenglong Li et al.
Breaking Rectangular Shackles: Cross-View Object Segmentation for Fine-Grained Object Geo-Localization
Qingwang Zhang, Yingying Zhu
Data-free Universal Adversarial Perturbation with Pseudo-semantic Prior
Chanhui Lee, Yeonghwan Song, Jeany Son
MergeOcc: Bridge the Domain Gap between Different LiDARs for Robust Occupancy Prediction
Zikun Xu, Shaobing Xu
DartQuant: Efficient Rotational Distribution Calibration for LLM Quantization
YUANTIAN SHAO, Yuanteng Chen, Peisong Wang et al.
Plenodium: Underwater 3D Scene Reconstruction with Plenoptic Medium Representation
Changguang WU, Jiangxin Dong, Chengjian Li et al.
TADA: Improved Diffusion Sampling with Training-free Augmented DynAmics
Tianrong Chen, Huangjie Zheng, David Berthelot et al.
Sampling Innovation-Based Adaptive Compressive Sensing
Zhifu Tian, Tao Hu, Chaoyang Niu et al.
Learning Latent Variable Models via Jarzynski-adjusted Langevin Algorithm
James Cuin, Davide Carbone, O. Deniz Akyildiz
Mechanism Design via the Interim Relaxation
Kshipra Bhawalkar, Marios Mertzanidis, Divyarthi Mohan et al.
HiLoTs: High-Low Temporal Sensitive Representation Learning for Semi-Supervised LiDAR Segmentation in Autonomous Driving
R.D. Lin, Pengcheng Weng, Yinqiao Wang et al.
Mind the GAP! The Challenges of Scale in Pixel-based Deep Reinforcement Learning
Ghada Sokar, Pablo Samuel Castro
Discrete Spatial Diffusion: Intensity-Preserving Diffusion Modeling
Javier E. Santos, Agnese Marcato, Roman Colman et al.
ESC: Erasing Space Concept for Knowledge Deletion
Tae-Young Lee, Sundong Park, Minwoo Jeon et al.
LeapFactual: Reliable Visual Counterfactual Explanation Using Conditional Flow Matching
Zhuo Cao, Xuan Zhao, Lena Krieger et al.
SALAD -- Semantics-Aware Logical Anomaly Detection
Matic Fučka, Vitjan Zavrtanik, Danijel Skocaj
Exploring Structural Degradation in Dense Representations for Self-supervised Learning
Siran Dai, Qianqian Xu, Peisong Wen et al.
Practical do-Shapley Explanations with Estimand-Agnostic Causal Inference
Álvaro Parafita, Tomas Garriga, Axel Brando et al.
Unbiasing through Textual Descriptions: Mitigating Representation Bias in Video Benchmarks
Nina Shvetsova, Arsha Nagrani, Bernt Schiele et al.
Foveated Instance Segmentation
Hongyi Zeng, Wenxuan Liu, Tianhua Xia et al.
Active Event-based Stereo Vision
Jianing Li, Yunjian Zhang, Haiqian Han et al.
GraphTOP: Graph Topology-Oriented Prompting for Graph Neural Networks
Xingbo Fu, Zhenyu Lei, Zihan Chen et al.
FA: Forced Prompt Learning of Vision-Language Models for Out-of-Distribution Detection
Xinhua Lu, Runhe Lai, Yanqi Wu et al.
Optimize the Unseen - Fast NeRF Cleanup with Free Space Prior
Leo Segre, Shai Avidan
Deep Compositional Phase Diffusion for Long Motion Sequence Generation
Ho Yin Au, Jie Chen, Junkun Jiang et al.
ARMO: Autoregressive Rigging for Multi-Category Objects
mingze sun, Shiwei Mao, Keyi Chen et al.
Hawk: Leveraging Spatial Context for Faster Autoregressive Text-to-Image Generation
Zhi-Kai Chen, Jun-Peng Jiang, Han-Jia Ye et al.
Sparse Diffusion Autoencoder for Test-time Adapting Prediction of Complex Systems
Jingwen Cheng, Ruikun Li, Huandong Wang et al.
PhysicsGen: Can Generative Models Learn from Images to Predict Complex Physical Relations?
Martin Spitznagel, Jan Vaillant, Janis Keuper
Failure by Interference: Language Models Make Balanced Parentheses Errors When Faulty Mechanisms Overshadow Sound Ones
Daking Rai, Samuel Miller, Kevin Moran et al.
CA-I2P: Channel-Adaptive Registration Network with Global Optimal Selection
Zhixin Cheng, Jiacheng Deng, Xinjun Li et al.
Hierarchical Compact Clustering Attention (COCA) for Unsupervised Object-Centric Learning
Can Küçüksözen, Yucel Yemez
TIDE: Training Locally Interpretable Domain Generalization Models Enables Test-time Correction
Aishwarya Agarwal, Srikrishna Karanam, Vineet Gandhi
Activated LoRA: Fine-tuned LLMs for Intrinsics
Kristjan Greenewald, Luis Lastras, Thomas Parnell et al.
Subsampled Ensemble Can Improve Generalization Tail Exponentially
Huajie Qian, Donghao Ying, Henry Lam et al.
Semantic Representation Attack against Aligned Large Language Models
Jiawei Lian, Jianhong Pan, Lefan Wang et al.
Visual Relation Diffusion for Human-Object Interaction Detection
Ping Cao, Yepeng Tang, Chunjie Zhang et al.
FNOPE: Simulation-based inference on function spaces with Fourier Neural Operators
Guy Moss, Leah Muhle, Reinhard Drews et al.
The Cost of Compression: Tight Quadratic Black-Box Attacks on Sketches for $\ell_2$ Norm Estimation
Sara Ahmadian, Edith Cohen, Uri Stemmer
Graph Alignment via Birkhoff Relaxation
Sushil Varma, Irène Waldspurger, Laurent Massoulié
Acceleration via silver step-size on Riemannian manifolds with applications to Wasserstein space
Jiyoung Park, Abhishek Roy, Jonathan W. Siegel et al.
Poly-Autoregressive Prediction for Modeling Interactions
Neerja Thakkar, Tara Sadjadpour, Jathushan Rajasegaran et al.
Uncoupled and Convergent Learning in Monotone Games under Bandit Feedback
Jing Dong, Baoxiang Wang, Yaoliang Yu
Pseudo Visible Feature Fine-Grained Fusion for Thermal Object Detection
Ting Li, Mao Ye, Tianwen Wu et al.
Future Link Prediction Without Memory or Aggregation
Lu Yi, Runlin Lei, Fengran Mo et al.
MAGE : Single Image to Material-Aware 3D via the Multi-View G-Buffer Estimation Model
Haoyuan Wang, Zhenwei Wang, Xiaoxiao Long et al.
WildCAT3D: Appearance-Aware Multi-View Diffusion in the Wild
Morris Alper, David Novotny, Filippos Kokkinos et al.
Sparse Point Cloud Patches Rendering via Splitting 2D Gaussians
Changfeng Ma, Ran Bi, Jie Guo et al.
LoD-Loc v2: Aerial Visual Localization over Low Level-of-Detail City Models using Explicit Silhouette Alignment
Juelin Zhu, Shuaibang Peng, Long Wang et al.
LANGTRAJ: Diffusion Model and Dataset for Language-Conditioned Trajectory Simulation
WEI-JER Chang, Masayoshi Tomizuka, Wei Zhan et al.
Learning Efficient Fuse-and-Refine for Feed-Forward 3D Gaussian Splatting
Yiming Wang, Lucy Chai, Xuan Luo et al.
Towards Scalable Human-aligned Benchmark for Text-guided Image Editing
Suho Ryu, Kihyun Kim, Eugene Baek et al.
WIR3D: Visually-Informed and Geometry-Aware 3D Shape Abstraction
Richard Liu, Daniel Fu, Noah Tan et al.
HeatFormer: A Neural Optimizer for Multiview Human Mesh Recovery
Yuto Matsubara, Ko Nishino
InvisibleInk: High-Utility and Low-Cost Text Generation with Differential Privacy
Vishnu Vinod, Krishna Pillutla, Abhradeep Guha Thakurta
GSV3D: Gaussian Splatting-based Geometric Distillation with Stable Video Diffusion for Single-Image 3D Object Generation
Ye Tao, jiawei zhang, Yahao Shi et al.
On the Bias of Next-Token Predictors Toward Systematically Inefficient Reasoning: A Shortest-Path Case Study
Riccardo Alberghi, Elizaveta Demyanenko, Luca Biggio et al.
State Space Prompting via Gathering and Spreading Spatio-Temporal Information for Video Understanding
Jiahuan Zhou, Kai Zhu, Zhenyu Cui et al.
Multimodal Negative Learning
Baoquan Gong, Xiyuan Gao, Pengfei Zhu et al.
Removing Cost Volumes from Optical Flow Estimators
Simon Kiefhaber, Stefan Roth, Simone Schaub-Meyer
Resolution of Simpson's paradox via the common cause principle
Arshak Hovhannisyan, Armen Allahverdyan
AirRoom: Objects Matter in Room Reidentification
Runmao Yao, Yi Du, Zhuoqun Chen et al.
Simulation-Based Inference for Adaptive Experiments
Brian Cho, Aurelien Bibaut, Nathan Kallus
From Noise to Narrative: Tracing the Origins of Hallucinations in Transformers
Praneet Suresh, Jack Stanley, Sonia Joseph et al.
ACE-G: Improving Generalization of Scene Coordinate Regression Through Query Pre-Training
Leonard Bruns, Axel Barroso-Laguna, Tommaso Cavallari et al.
FaCT: Faithful Concept Traces for Explaining Neural Network Decisions
Amin Parchami-Araghi, Sukrut Rao, Jonas Fischer et al.
Controllable Human-centric Keyframe Interpolation with Generative Prior
Zujin Guo, Size Wu, Zhongang Cai et al.
Pan-LUT: Efficient Pan-sharpening via Learnable Look-Up Tables
Zhongnan Cai, Yingying Wang, Hui Zheng et al.
Embracing Trustworthy Brain-Agent Collaboration as Paradigm Extension for Intelligent Assistive Technologies
Yankai Chen, Xinni Zhang, Yifei Zhang et al.
SuperEvent: Cross-Modal Learning of Event-based Keypoint Detection for SLAM
Yannick Burkhardt, Simon Schaefer, Stefan Leutenegger
Position: Benchmarking is Broken - Don't Let AI be Its Own Judge
Zerui Cheng, Stella Wohnig, Ruchika Gupta et al.
The Narrow Gate: Localized Image-Text Communication in Native Multimodal Models
Alessandro Serra, Francesco Ortu, Emanuele Panizon et al.
Learning Dynamics of RNNs in Closed-Loop Environments
Yoav Ger, Omri Barak
PersPose: 3D Human Pose Estimation with Perspective Encoding and Perspective Rotation
Xiaoyang Hao, Han Li
InstanceAssemble: Layout-Aware Image Generation via Instance Assembling Attention
Qiang Xiang, Shuang Sun, Binglei Li et al.
Structure-Aware Fusion with Progressive Injection for Multimodal Molecular Representation Learning
Zihao Jing, Yan Sun, Yan Yi Li et al.
End-to-End HOI Reconstruction Transformer with Graph-based Encoding
Zhenrong Wang, Qi Zheng, Sihan Ma et al.
METEOR: Multi-Encoder Collaborative Token Pruning for Efficient Vision Language Models
Yuchen Liu, Yaoming Wang, Bowen Shi et al.
LoKi: Low-dimensional KAN for Efficient Fine-tuning Image Models
Xuan Cai, Renjie Pan, Hua Yang
RoFt-Mol: Benchmarking Robust Fine-tuning with Molecular Graph Foundation Models
Shikun Liu, Deyu Zou, Nima Shoghi et al.
Unlocking Generalization Power in LiDAR Point Cloud Registration
Zhenxuan Zeng, Qiao Wu, Xiyu Zhang et al.
RGNMR: A Gauss-Newton method for robust matrix completion with theoretical guarantees
Eilon Vaknin Laufer, Boaz Nadler
RAM-W600: A Multi-Task Wrist Dataset and Benchmark for Rheumatoid Arthritis
YANG SONGXIAO, Haolin Wang, Yao Fu et al.
Adversarial Exploitation of Data Diversity Improves Visual Localization
Sihang Li, Siqi Tan, Bowen Chang et al.
DyGS-SLAM: Real-Time Accurate Localization and Gaussian Reconstruction for Dynamic Scenes
Xinggang Hu, Chenyangguang Zhang, Mingyuan Zhao et al.
F^3OCUS - Federated Finetuning of Vision-Language Foundation Models with Optimal Client Layer Updating Strategy via Multi-objective Meta-Heuristics
Pramit Saha, Felix Wagner, Divyanshu Mishra et al.
Beyond Greedy Exits: Improved Early Exit Decisions for Risk Control and Reliability
Divya Jyoti Bajpai, Manjesh Kumar Hanawal
TAROT: Towards Essentially Domain-Invariant Robustness with Theoretical Justification
Dongyoon Yang, Jihu Lee, Yongdai Kim
Mind the Cost of Scaffold! Benign Clients May Even Become Accomplices of Backdoor Attack
Xingshuo Han, Xuanye Zhang, Xiang Lan et al.
VASA-3D: Lifelike Audio-Driven Gaussian Head Avatars from a Single Image
Sicheng Xu, Guojun Chen, Jiaolong Yang et al.
DiSRT-In-Bed: Diffusion-Based Sim-to-Real Transfer Framework for In-Bed Human Mesh Recovery
Jing Gao, Ce Zheng, Laszlo Jeni et al.
Perturb-and-Revise: Flexible 3D Editing with Generative Trajectories
Susung Hong, Johanna Suvi Karras, Ricardo Martin et al.
Inductive Domain Transfer In Misspecified Simulation-Based Inference
Ortal Senouf, Antoine Wehenkel, Cédric Vincent-Cuaz et al.
Leader360V: A Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment
WEIMING ZHANG, Dingwen Xiao, Aobotao DAI et al.
Compositional Discrete Latent Code for High Fidelity, Productive Diffusion Models
Samuel Lavoie, Michael Noukhovitch, Aaron Courville
AlignDiff: Learning Physically-Grounded Camera Alignment via Diffusion
Liuyue Xie, Jiancong Guo, Ozan Cakmakci et al.
REP: Resource-Efficient Prompting for Rehearsal-Free Continual Learning
Sungho Jeon, Xinyue Ma, Kwang In Kim et al.
BlinkTrack: Feature Tracking over 80 FPS via Events and Images
Yichen Shen, Yijin Li, Shuo Chen et al.
DICE: Staleness-Centric Optimizations for Parallel Diffusion MoE Inference
Jiajun Luo, Lizhuo Luo, Jianru Xu et al.
Additive Models Explained: A Computational Complexity Approach
Shahaf Bassan, Michal Moshkovitz, Guy Katz
Sound Logical Explanations for Mean Aggregation Graph Neural Networks
Matthew Morris, Ian Horrocks
Confounding Robust Deep Reinforcement Learning: A Causal Approach
Mingxuan Li, Junzhe Zhang, Elias Bareinboim
Decomposing stimulus-specific sensory neural information via diffusion models
Steeve Laquitaine, Simone Azeglio, Carlo Paris et al.
SCSA: A Plug-and-Play Semantic Continuous-Sparse Attention for Arbitrary Semantic Style Transfer
Chunnan Shang, Zhizhong Wang, Hongwei Wang et al.
The Logical Expressiveness of Temporal GNNs via Two-Dimensional Product Logics
Marco Sälzer, Przemyslaw Walega, Martin Lange
Web-Scale Collection of Video Data for 4D Animal Reconstruction
Brian Nlong Zhao, Jiajun Wu, Shangzhe Wu
Least squares variational inference
Yvann Le Fay, Nicolas Chopin, Simon Barthelmé
Diff2I2P: Differentiable Image-to-Point Cloud Registration with Diffusion Prior
Juncheng Mu, Chengwei REN, Weixiang Zhang et al.
CODE-CL: Conceptor-Based Gradient Projection for Deep Continual Learning
Marco P. Apolinario, Sakshi Choudhary, Kaushik Roy
LeMiCa: Lexicographic Minimax Path Caching for Efficient Diffusion-Based Video Generation
Huanlin Gao, Ping Chen, Fuyuan Shi et al.
Modeling Dynamic Neural Activity by combining Naturalistic Video Stimuli and Stimulus-independent Latent Factors
Finn Schmidt, Polina Turishcheva, Suhas Shrinivasan et al.
Why 1 + 1 < 1 in Visual Token Pruning: Beyond Naive Integration via Multi-Objective Balanced Covering
Yangfu Li, Hongjian Zhan, Tianyi Chen et al.
Bubbleformer: Forecasting Boiling with Transformers
Sheikh Md Shakeel Hassan, Xianwei Zou, Akash Dhruv et al.
SketchFusion: Learning Universal Sketch Features through Fusing Foundation Models
Subhadeep Koley, Tapas Kumar Dutta, Aneeshan Sain et al.
Sparse Gaussian Processes: Structured Approximations and Power-EP Revisited
Thang Bui, Michalis Titsias
Improving Progressive Generation with Decomposable Flow Matching
Moayed Haji-Ali, Willi Menapace, Ivan Skorokhodov et al.
PixelStitch: Structure-Preserving Pixel-Wise Bidirectional Warps for Unsupervised Image Stitching
Hengzhe Jin, Lang Nie, Chunyu Lin et al.
Type-R: Automatically Retouching Typos for Text-to-Image Generation
Wataru Shimoda, Naoto Inoue, Daichi Haraguchi et al.
Subgraph Federated Learning via Spectral Methods
Javad Aliakbari, Johan Oestman, Ashkan Panahi et al.
Bridging Critical Gaps in Convergent Learning: How Representational Alignment Evolves Across Layers, Training, and Distribution Shifts
Chaitanya Kapoor, Sudhanshu Srivastava, Meenakshi Khosla
Online Segment Any 3D Thing as Instance Tracking
Hanshi Wang, Cai Zijian, Jin Gao et al.
Eluder dimension: localise it!
Alireza Bakhtiari, Alex Ayoub, Samuel Robertson et al.
Non-Markovian Discrete Diffusion with Causal Language Models
Yangtian Zhang, Sizhuang He, Daniel Levine et al.
Fewer Denoising Steps or Cheaper Per-Step Inference: Towards Compute-Optimal Diffusion Model Deployment
Zhenbang Du, Yonggan Fu, Lifu Wang et al.
Measuring the Impact of Rotation Equivariance on Aerial Object Detection
Xiuyu Wu, Xinhao Wang, Xiubin Zhu et al.
Black Hole-Driven Identity Absorbing in Diffusion Models
Muhammad Shaheryar, Jong Taek Lee, Soon Ki Jung
Point or Line? Using Line-based Representation for Panoptic Symbol Spotting in CAD Drawings
Xingguang Wei, Haomin Wang, Shenglong Ye et al.
OFER: Occluded Face Expression Reconstruction
Pratheba Selvaraju, Victoria Abrevaya, Timo Bolkart et al.