2025 Papers
21,856 papers found • Page 432 of 438
When Model Knowledge meets Diffusion Model: Diffusion-assisted Data-free Image Synthesis with Alignment of Domain and Class
Yujin Kim, Hyunsoo Kim, Hyunwoo Kim et al.
When Models Don’t Collapse: On the Consistency of Iterative MLE
Daniel Barzilai, Ohad Shamir
When Models Know More Than They Can Explain: Quantifying Knowledge Transfer in Human-AI Collaboration
Quan Shi, Carlos Jimenez, Shunyu Yao et al.
When narrower is better: the narrow width limit of Bayesian parallel branching neural networks
Zechen Zhang, Haim Sompolinsky
When No Paths Lead to Rome: Benchmarking Systematic Neural Relational Reasoning
Anirban Das, Muhammad Irtaza Khalid, Rafael Peñaloza et al.
When One Eye Sees Less: Uncovering Perceptual Thresholds of Asymmetric Quality Degradation in 4K XR Displays
Haechan Lee, Namil Kim, Hoe Sung Ryu et al.
When One Moment Isn't Enough: Multi-Moment Retrieval with Cross-Moment Interactions
Zhuo Cao, Heming Du, Bingqing Zhang et al.
When Open-Vocabulary Visual Question Answering Meets Causal Adapter: Benchmark and Approach
Feifei Zhang, Zhaoyi Zhang, Xi Zhang et al.
When Pixel Difference Patterns Meet ViT: PiDiViT for Few-Shot Object Detection
Hongliang Zhou, Yongxiang Liu, Canyu Mo et al.
When Prompt Engineering Meets Software Engineering: CNL-P as Natural and Robust "APIs'' for Human-AI Interaction
Zhenchang Xing, Yang Liu, Zhuo Cheng et al.
When Schrödinger Bridge Meets Real-World Image Dehazing with Unpaired Training
Yunwei Lan, Zhigao Cui, Xin Luo et al.
When Selection Meets Intervention: Additional Complexities in Causal Discovery
Haoyue Dai, Ignavier Ng, Jianle Sun et al.
When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding
Yan Shu, Hangui Lin, Yexin Liu et al.
When Senses Collide: Investigating Modality Congruence and Interference Between Task and Notification in Augmented Reality
Mehakdeep Kaur, Hyeongil Nam, Ryan Kang et al.
When Shadow Removal Meets Intrinsic Image Decomposition: A Joint Learning Framework Using Unpaired Data
Rongjia Zheng, Qing Zhang, Yongwei Nie et al.
When Should We Prefer State-to-Visual DAgger over Visual Reinforcement Learning?
Tongzhou Mu, Zhaoyang Li, Stanisław Wiktor Strzelecki et al.
When the Future Becomes the Past: Taming Temporal Correspondence for Self-supervised Video Representation Learning
Yang Liu, Qianqian Xu, Peisong Wen et al.
When Thinking Drifts: Evidential Grounding for Robust Video Reasoning
Romy Luo, Zihui (Sherry) Xue, Alex Dimakis et al.
When Thinking Fails: The Pitfalls of Reasoning for Instruction-Following in LLMs
Xiaomin Li, Zhou Yu, Zhiwei Zhang et al.
When to Forget? Complexity Trade-offs in Machine Unlearning
Martin Van Waerebeke, Marco Lorenzi, Giovanni Neglia et al.
When to retrain a machine learning model
Florence Regol, Leo Schwinn, Kyle Sprague et al.
When, Where and Why to Average Weights?
Niccolò Ajroldi, Antonio Orvieto, Jonas Geiping
When Will It Fail?: Anomaly to Prompt for Forecasting Future Anomalies in Time Series
Min-Yeong Park, Won-Jeong Lee, Seong Tae Kim et al.
When Witnesses Defend: A Witness Graph Topological Layer for Adversarial Graph Learning
Naheed Anjum Arafat, Debabrota Basu, Yulia Gel et al.
When Worse is Better: Navigating the Compression Generation Trade-off In Visual Tokenization
Vivek Ramanujan, Kushal Tirumala, Armen Aghajanyan et al.
Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction
Junyi Chen, Di Huang, Weicai Ye et al.
Where am I? Cross-View Geo-localization with Natural Language Descriptions
Junyan Ye, Honglin Lin, Leyan Ou et al.
Where and How to Perturb: On the Design of Perturbation Guidance in Diffusion and Flow Models
Donghoon Ahn, Jiwon Kang, Sanghyun Lee et al.
Where Does It Exist from the Low-Altitude: Spatial Aerial Video Grounding
Yang Zhan, Yuan Yuan
Where Graph Meets Heterogeneity: Multi-View Collaborative Graph Experts
Zhihao Wu, Jinyu Cai, Yunhe Zhang et al.
Where is the Truth? The Risk of Getting Confounded in a Continual World
Florian Peter Busch, Roshni Ramanna Kamath, Rupert Mitchell et al.
Where Precision Meets Efficiency: Transformation Diffusion Model for Point Cloud Registration
Yongzhe Yuan, Yue Wu, Xiaolong Fan et al.
Where's the Liability in the Generative Era? Recovery-based Black-Box Detection of AI-Generated Content
Haoyue Bai, Yiyou Sun, Wei Cheng et al.
Where the Devil Hides: Deepfake Detectors Can No Longer Be Trusted
Shuaiwei Yuan, Junyu Dong, Yuezun Li
Where, What, Why: Towards Explainable Driver Attention Prediction
Yuchen Zhou, Jiayu Tang, Xiaoyan Xiao et al.
Which Agent Causes Task Failures and When? On Automated Failure Attribution of LLM Multi-Agent Systems
Shaokun Zhang, Ming Yin, Jieyu Zhang et al.
Which Algorithms Have Tight Generalization Bounds?
Michael Gastpar, Ido Nachum, Jonathan Shafer et al.
Which Attention Heads Matter for In-Context Learning?
Kayo Yin, Jacob Steinhardt
Which Data Attributes Stimulate Math and Code Reasoning? An Investigation via Influence Functions
Siqi Kou, Qingyuan Tian, Hanwen Xu et al.
Which Tasks Should Be Compressed Together? A Causal Discovery Approach for Efficient Multi-Task Representation Compression
Sha Guo, Jing Chen, Zixuan Hu et al.
Which Viewpoint Shows it Best? Language for Weakly Supervising View Selection in Multi-view Instructional Videos
Sagnik Majumder, Tushar Nagarajan, Ziad Al-Halah et al.
Whitened CLIP as a Likelihood Surrogate of Images and Captions
Roy Betser, Meir Yossef Levi, Guy Gilboa
Whitened Score Diffusion: A Structured Prior for Imaging Inverse Problems
Jeffrey Alido, Tongyu Li, Yu Sun et al.
Who Controls the Authorization? Invertible Networks for Copyright Protection in Text-to-Image Synthesis
Baoyue Hu, Yang Wei, Junhao Xiao et al.
Whoever Started the interference Should End It: Guiding Data-Free Model Merging via Task Vectors
Runxi Cheng, Feng Xiong, Yongxian Wei et al.
"Who experiences large model decay and why?" A Hierarchical Framework for Diagnosing Heterogeneous Performance Drift
Harvineet Singh, Fan Xia, Alexej Gossmann et al.
Who is a Better Talker: Subjective and Objective Quality Assessment for AI-Generated Talking Heads
Yingjie Zhou, Jiezhang Cao, Zicheng Zhang et al.
Whole-Body Conditioned Egocentric Video Prediction
Yutong Bai, Danny Tran, Amir Bar et al.
Whole Genome Transformer for Gene Interaction Effects in Microbiome Habitat Specificity
Zhufeng Li, Sandeep Suresh Cranganore, Nicholas Youngblut et al.
Who Reasons in the Large Language Models?
Jie Shao, Jianxin Wu