Poster Papers

Do Vision-Language Models Really Understand Visual Language?

Yifan Hou, Buse Giledereli, Yilei Tu et al.

ICML 2025 • arXiv:2410.00193 • 17 citations

Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference under Ambiguities

Zheyuan Zhang, Fengyuan Hu, Jayjun Lee et al.

ICLR 2025 • arXiv:2410.17385 • 41 citations

Do vision models perceive objects like toddlers?

Arthur Aubret, Jochen Triesch

ICLR 2025

Do Visual Imaginations Improve Vision-and-Language Navigation Agents?

Akhil Perincherry, Jacob Krantz, Stefan Lee

CVPR 2025 • arXiv:2503.16394 • 9 citations

Do We Always Need the Simplicity Bias? Looking for Optimal Inductive Biases in the Wild

Damien Teney, Liangze Jiang, Florin Gogianu et al.

CVPR 2025 • arXiv:2503.10065 • 7 citations

Do We Need to Verify Step by Step? Rethinking Process Supervision from a Theoretical Perspective

Zeyu Jia, Alexander Rakhlin, Tengyang Xie

ICML 2025 • arXiv:2502.10581 • 8 citations

Do We Really Need Curated Malicious Data for Safety Alignment in Multi-modal Large Language Models?

Yanbo Wang, Jiyang Guan, Jian Liang et al.

CVPR 2025 • arXiv:2504.10000 • 7 citations

Do WGANs succeed because they minimize the Wasserstein Distance? Lessons from Discrete Generators

Ariel Elnekave, Yair Weiss

ICLR 2025

Do You Keep an Eye on What I Ask? Mitigating Multimodal Hallucination via Attention-Guided Ensemble Decoding

Yeongjae Cho, Keonwoo Kim, Taebaek Hwang et al.

ICLR 2025 • arXiv:2505.17529 • 7 citations

Do Your Best and Get Enough Rest for Continual Learning

Hankyul Kang, Gregor Seifer, Donghyun Lee et al.

CVPR 2025 • arXiv:2503.18371 • 2 citations

Do You Really Need Public Data? Surrogate Public Data for Differential Privacy on Tabular Data

Shlomi Hod, Lucas Rosenblatt, Julia Stoyanovich

NEURIPS 2025 • arXiv:2504.14368 • 2 citations

DP²O-SR: Direct Perceptual Preference Optimization for Real-World Image Super-Resolution

Rongyuan Wu, Lingchen Sun, Zhengqiang Zhang et al.

NEURIPS 2025

DPA: A one-stop metric to measure bias amplification in classification datasets

Bhanu Tokas, Rahul Nair, Hannah Kerner

NEURIPS 2025 • arXiv:2412.11060 • 1 citation

DPaI: Differentiable Pruning at Initialization with Node-Path Balance Principle

Lichuan Xiang, Quan Nguyen-Tri, Lan-Cuong Nguyen et al.

ICLR 2025

DPAIL: Training Diffusion Policy for Adversarial Imitation Learning without Policy Optimization

Yunseon Choi, Minchan Jeong, Soobin Um et al.

NEURIPS 2025

DPC: Dual-Prompt Collaboration for Tuning Vision-Language Models

Haoyang Li, Liang Wang, Chao Wang et al.

CVPR 2025 • arXiv:2503.13443 • 9 citations

DPCore: Dynamic Prompt Coreset for Continual Test-Time Adaptation

Yunbei Zhang, Akshay Mehra, Shuaicheng Niu et al.

ICML 2025 • arXiv:2406.10737 • 14 citations

DPFlow: Adaptive Optical Flow Estimation with a Dual-Pyramid Framework

Henrique Morimitsu, Xiaobin Zhu, Roberto M. Cesar Jr et al.

CVPR 2025 • arXiv:2503.14880 • 14 citations

DP-LLM: Runtime Model Adaptation with Dynamic Layer-wise Precision Assignment

Sangwoo Kwon, Seong Hoon Seo, Jae W. Lee et al.

NEURIPS 2025 • arXiv:2508.06041 • 1 citation

DPLM-2: A Multimodal Diffusion Protein Language Model

Xinyou Wang, Zaixiang Zheng, Fei Ye et al.

ICLR 2025 • arXiv:2410.13782 • 53 citations

DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior

Junzhe Lu, Jing Lin, Hongkun Dou et al.

ICCV 2025 • arXiv:2508.00599 • 2 citations

DPSeg: Dual-Prompt Cost Volume Learning for Open-Vocabulary Semantic Segmentation

Ziyu Zhao, Xiaoguang Li, Lingjia Shi et al.

CVPR 2025 • arXiv:2505.11676 • 4 citations

DQVis Dataset: Natural Language to Biomedical Visualization

Devin Lange, Pengwei Sui, Shanghua Gao et al.

NEURIPS 2025

Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

Zhiyuan Liang, Dongwen Tang, Yuhao Zhou et al.

NEURIPS 2025 • arXiv:2506.16406 • 3 citations

DRAG: Data Reconstruction Attack using Guided Diffusion

Wa-Kin Lei, Jun-Cheng Chen, Shang-Tse Chen

ICML 2025 • arXiv:2509.11724 • 1 citation

Dragin3D: Image Editing by Dragging in 3D Space

Weiran Guang, Xiaoguang Gu, Mengqi Huang et al.

CVPR 2025

DragLoRA: Online Optimization of LoRA Adapters for Drag-based Image Editing in Diffusion Model

Siwei Xia, Li Sun, Tiantian Sun et al.

ICML 2025 • arXiv:2505.12427 • 5 citations

DragSolver: A Multi-Scale Transformer for Real-World Automotive Drag Coefficient Estimation

Ye Liu, Yuntian Chen

ICML 2025 • 2 citations

Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient

Wenlong Wang, Ivana Dusparic, Yucheng Shi et al.

ICLR 2025 • arXiv:2410.08893 • 3 citations

DRaM-LHM: A Quaternion Framework for Iterative Camera Pose Estimation

Chen Lin, Weizhi Du, Zhixiang Min et al.

ICCV 2025

Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want

Weifeng Lin, Xinyu Wei, Ruichuan An et al.

ICLR 2025 • arXiv:2403.20271 • 87 citations

DRAWER: Digital Reconstruction and Articulation With Environment Realism

Hongchi Xia, Entong Su, Marius Memmel et al.

CVPR 2025 • arXiv:2504.15278 • 15 citations

Drawing Developmental Trajectory from Cortical Surface Reconstruction

Wenxuan Wu, Ruowen Qu, Zhongliang Liu et al.

ICCV 2025

Draw Your Mind: Personalized Generation via Condition-Level Modeling in Text-to-Image Diffusion Models

Hyungjin Kim, Seokho Ahn, Young-Duk Seo

ICCV 2025 • arXiv:2508.03481 • 2 citations

DreamActor-M1: Holistic, Expressive and Robust Human Image Animation with Hybrid Guidance

Yuxuan Luo, Zhengkun Rong, Lizhen Wang et al.

ICCV 2025 • arXiv:2504.01724 • 26 citations

DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation

Yuang Peng, Yuxin Cui, Haomiao Tang et al.

ICLR 2025 • arXiv:2406.16855 • 95 citations

DreamCache: Finetuning-Free Lightweight Personalized Image Generation via Feature Caching

Emanuele Aiello, Umberto Michieli, Diego Valsesia et al.

CVPR 2025 • arXiv:2411.17786 • 4 citations

DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity Preservation

Jiwook Kim, Seonho Lee, Jaeyo Shin et al.

ICLR 2025 • arXiv:2407.11394 • 5 citations

DreamCube: RGB-D Panorama Generation via Multi-plane Synchronization

Yukun Huang, Yanning Zhou, Jianan Wang et al.

ICCV 2025

DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses

Yatian Pang, Bin Zhu, Bin Lin et al.

ICCV 2025 • arXiv:2412.00397 • 12 citations

DreamDistribution: Learning Prompt Distribution for Diverse In-distribution Generation

Brian Nlong Zhao, Yuhang Xiao, Jiashu Xu et al.

ICLR 2025 • arXiv:2312.14216 • 9 citations

DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization

Zhenglin Zhou, Xiaobo Xia, Fan Ma et al.

ICML 2025 • arXiv:2502.04370 • 20 citations

DREAM: Drafting with Refined Target Features and Entropy-Adaptive Cross-Attention Fusion for Multimodal Speculative Decoding

Yunhai Hu, Tianhua Xia, Zining Liu et al.

NEURIPS 2025 • arXiv:2505.19201 • 4 citations

DreamFuse: Adaptive Image Fusion with Diffusion Transformer

Junjia Huang, Pengxiang Yan, Jiyang Liu et al.

ICCV 2025 • arXiv:2504.08291 • 6 citations

DreamLight: Towards Harmonious and Consistent Image Relighting

Yong Liu, Wenpeng Xiao, Qianqian Wang et al.

NEURIPS 2025 • arXiv:2506.14549 • 1 citation

DreamOmni: Unified Image Generation and Editing

Bin Xia, Yuechen Zhang, Jingyao Li et al.

CVPR 2025 • arXiv:2412.17098 • 16 citations

DreamPRM: Domain-reweighted Process Reward Model for Multimodal Reasoning

Qi Cao, Ruiyi Wang, Ruiyi Zhang et al.

NEURIPS 2025 • arXiv:2505.20241 • 9 citations

DreamRelation: Bridging Customization and Relation Generation

Qingyu Shi, Lu Qi, Jianzong Wu et al.

CVPR 2025 • arXiv:2410.23280 • 10 citations

DreamRelation: Relation-Centric Video Customization

Yujie Wei, Shiwei Zhang, Hangjie Yuan et al.

ICCV 2025 • arXiv:2503.07602 • 17 citations

DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models

Dewei Zhou, Mingwei Li, Zongxin Yang et al.

ICCV 2025 • arXiv:2503.12885 • 18 citations