2025 Papers

21,856 papers found • Page 434 of 438

WIR3D: Visually-Informed and Geometry-Aware 3D Shape Abstraction

Richard Liu, Daniel Fu, Noah Tan et al.

ICCV 2025poster

WISA: World simulator assistant for physics-aware text-to-video generation

Jing Wang, Ao Ma, Ke Cao et al.

NeurIPS 2025spotlightarXiv:2503.08153
34
citations

Wisdom is Knowing What not to Say: Hallucination-Free LLMs Unlearning via Attention Shifting

Chenchen Tan, Youyang Qu, Xinghao Li et al.

NeurIPS 2025poster

WISE: A Framework for Gigapixel Whole-Slide-Image Lossless Compression

Yu Mao, Jun Wang, Nan Guan et al.

CVPR 2025poster
4
citations

WISH: Weakly Supervised Instance Segmentation using Heterogeneous Labels

Hyeokjun Kweon, Kuk-Jin Yoon

CVPR 2025highlight

WISNet: Pseudo Label Generation on Unbalanced and Patch Annotated Waste Images

Shifan Zhang, Hongzi Zhu, Yinan He et al.

CVPR 2025poster
1
citations

With Limited Data for Multimodal Alignment, Let the STRUCTURE Guide You

Fabian Gröger, Shuo Wen, Huyen Le et al.

NeurIPS 2025poster

Witty: An Efficient Solver for Computing Minimum-Size Decision Trees

Luca Pascal Staus, Christian Komusiewicz, Frank Sommer et al.

AAAI 2025paper

WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct

Haipeng Luo, Qingfeng Sun, Can Xu et al.

ICLR 2025posterarXiv:2308.09583
637
citations

WKV-sharing embraced random shuffle RWKV high-order modeling for pan-sharpening

man zhou, Xuanhua He, Danfeng Hong et al.

NeurIPS 2025poster

WMAdapter: Adding WaterMark Control to Latent Diffusion Models

Hai Ci, Yiren Song, Pei Yang et al.

ICML 2025poster

WMarkGPT: Watermarked Image Understanding via Multimodal Large Language Models

Tan Songbai, Xuerui Qiu, Yao Shu et al.

ICML 2025poster

WMCopier: Forging Invisible Watermarks on Arbitrary Images

Ziping Dong, Chao Shuai, Zhongjie Ba et al.

NeurIPS 2025poster

WolBanking77: Wolof Banking Speech Intent Classification Dataset

Abdou Karim KANDJI, Frederic Precioso, Cheikh BA et al.

NeurIPS 2025poster

Wolfpack Adversarial Attack for Robust Multi-Agent Reinforcement Learning

Sunwoo Lee, Jaebak Hwang, Yonghyeon Jo et al.

ICML 2025poster
1
citations

WOMD-Reasoning: A Large-Scale Dataset for Interaction Reasoning in Driving

Yiheng Li, Cunxin Fan, Chongjian GE et al.

ICML 2025poster

Wonderland: Navigating 3D Scenes from a Single Image

Hanwen Liang, Junli Cao, Vidit Goel et al.

CVPR 2025poster
54
citations

WonderPlay: Dynamic 3D Scene Generation from a Single Image and Actions

Zizhang Li, Hong-Xing Yu, Wei Liu et al.

ICCV 2025highlight

WonderTurbo: Generating Interactive 3D World in 0.72 Seconds

Chaojun Ni, Xiaofeng Wang, Zheng Zhu et al.

ICCV 2025posterarXiv:2504.02261

Wonder Wins Ways: Curiosity-Driven Exploration through Multi-Agent Contextual Calibration

Yiyuan Pan, Zhe Liu, Hesheng Wang

NeurIPS 2025posterarXiv:2509.20648

WonderWorld: Interactive 3D Scene Generation from a Single Image

Hong-Xing Yu, Haoyi Duan, Charles Herrmann et al.

CVPR 2025highlight
120
citations

Word-Level Emotional Expression Control in Zero-Shot Text-to-Speech Synthesis

Tianrui Wang, Haoyu Wang, Meng Ge et al.

NeurIPS 2025arXiv:2509.24629

Words in Motion: Extracting Interpretable Control Vectors for Motion Transformers

Omer Sahin Tas, Royden Wagner

ICLR 2025posterarXiv:2406.11624
4
citations

Words or Vision: Do Vision-Language Models Have Blind Faith in Text?

Ailin Deng, Tri Cao, Zhirui Chen et al.

CVPR 2025posterarXiv:2503.02199
33
citations

Words That Unite The World: A Unified Framework for Deciphering Central Bank Communications

Agam Shah, Siddhant Sukhani, Huzaifa Pardawala et al.

NeurIPS 2025oral

WorkflowLLM: Enhancing Workflow Orchestration Capability of Large Language Models

Shengda Fan, Xin Cong, Yuepeng Fu et al.

ICLR 2025poster
14
citations

World4Drive: End-to-End Autonomous Driving via Intention-aware Physical Latent World Model

Yupeng Zheng, Pengxuan Yang, Zebin Xing et al.

ICCV 2025poster

WorldAPIs: The World Is Worth How Many APIs? A Thought Experiment

Jiefu Ou, Arda Uzunoğlu, Benjamin Van Durme et al.

AAAI 2025paper

World-aware Planning Narratives Enhance Large Vision-Language Model Planner

Junhao Shi, Zhaoye Fei, Siyin Wang et al.

NeurIPS 2025poster

World-consistent Video Diffusion with Explicit 3D Modeling

Qihang Zhang, Shuangfei Zhai, Miguel Ángel Bautista et al.

CVPR 2025highlight

World Knowledge-Enhanced Reasoning Using Instruction-Guided Interactor in Autonomous Driving

Mingliang Zhai, Cheng Li, Zengyuan Guo et al.

AAAI 2025paper

WorldMem: Long-term Consistent World Simulation with Memory

Zeqi Xiao, Yushi LAN, Yifan Zhou et al.

NeurIPS 2025oralarXiv:2504.12369
48
citations

WorldModelBench: Judging Video Generation Models As World Models

Dacheng Li, Yunhao Fang, Yukang Chen et al.

NeurIPS 2025posterarXiv:2502.20694
31
citations

World Model Implanting for Test-time Adaptation of Embodied Agents

Minjong Yoo, Jinwoo Jang, Sihyung Yoon et al.

ICML 2025poster

World Model on Million-Length Video And Language With Blockwise RingAttention

Hao Liu, Wilson Yan, Matei Zaharia et al.

ICLR 2025oralarXiv:2402.08268
144
citations

World Models as Reference Trajectories for Rapid Motor Adaptation

Carlos Stein Brito, Daniel McNamee

NeurIPS 2025poster

World Models Should Prioritize the Unification of Physical and Social Dynamics

Xiaoyuan Zhang, Chengdong Ma, Yizhe Huang et al.

NeurIPS 2025posterarXiv:2510.21219

WorldScore: Unified Evaluation Benchmark for World Generation

Haoyi Duan, Hong-Xing Yu, Sirui Chen et al.

ICCV 2025poster

WorldSimBench: Towards Video Generation Models as World Simulators

Yiran Qin, Zhelun Shi, Jiwen Yu et al.

ICML 2025poster
806
citations

WorldWeaver: Generating Long-Horizon Video Worlds via Rich Perception

Zhiheng Liu, Xueqing Deng, Shoufa Chen et al.

NeurIPS 2025oralarXiv:2508.15720
5
citations

Worse than Zero-shot? A Fact-Checking Dataset for Evaluating the Robustness of RAG Against Misleading Retrievals

Linda Zeng, Rithwik Gupta, Divij Motwani et al.

NeurIPS 2025poster

W-PCA Based Gradient-Free Proxy for Efficient Search of Lightweight Language Models

Shang Wang

ICLR 2025posterarXiv:2504.15983

WPMixer: Efficient Multi-Resolution Mixing for Long-Term Time Series Forecasting

Md Mahmuddun Nabi Murad, Mehmet Aktukmak, Yasin Yilmaz

AAAI 2025paper
19
citations

Wrapped Gaussian on the manifold of Symmetric Positive Definite Matrices

Thibault de Surrel, Fabien Lotte, Sylvain Chevallier et al.

ICML 2025poster
3
citations

WritingBench: A Comprehensive Benchmark for Generative Writing

Yuning Wu, Jiahao Mei, Ming Yan et al.

NeurIPS 2025posterarXiv:2503.05244
41
citations

WSI-LLaVA: A Multimodal Large Language Model for Whole Slide Image

Yuci Liang, Xinheng Lyu, Meidan Ding et al.

ICCV 2025posterarXiv:2412.02141
10
citations

WST: Wavelet-Based Multi-scale Tuning for Visual Transfer Learning

Jia Zeng, Lan Huang, Kangping Wang

AAAI 2025paper

Wukong's 72 Transformations: High-fidelity Textured 3D Morphing via Flow Models

Minghao Yin, Yukang Cao, Kai Han

NeurIPS 2025posterarXiv:2511.22425
1
citations

WyckoffDiff -- A Generative Diffusion Model for Crystal Symmetry

Filip Ekström Kelvinius, Oskar Andersson, Abhijith Parackal et al.

ICML 2025poster

Wyckoff Transformer: Generation of Symmetric Crystals

Nikita Kazeev, Wei Nong, Ignat Romanov et al.

ICML 2025poster