Most Cited ICCV "prior distribution learning" Papers

2,701 papers found • Page 8 of 14

#1401

PRO-VPT: Distribution-Adaptive Visual Prompt Tuning via Prompt Relocation

Chikai Shang, Mengke Li, Yiqun Zhang et al.

ICCV 2025arXiv:2503.06901
1
citations
#1402

Granular Concept Circuits: Toward a Fine-Grained Circuit Discovery for Concept Representations

Dahee Kwon, Sehyun Lee, Jaesik Choi

ICCV 2025arXiv:2508.01728
1
citations
#1403

IM360: Large-scale Indoor Mapping with 360 Cameras

Dongki Jung, Jaehoon Choi, Yonghan Lee et al.

ICCV 2025arXiv:2502.12545
1
citations
#1404

DLFR-Gen: Diffusion-based Video Generation with Dynamic Latent Frame Rate

Zhihang Yuan, Rui Xie, Yuzhang Shang et al.

ICCV 2025
1
citations
#1405

Probabilistic Prototype Calibration of Vision-language Models for Generalized Few-shot Semantic Segmentation

Jie Liu, Jiayi Shen, Pan Zhou et al.

ICCV 2025arXiv:2506.22979
1
citations
#1406

Training-Free Class Purification for Open-Vocabulary Semantic Segmentation

Qi Chen, Lingxiao Yang, Yun Chen et al.

ICCV 2025arXiv:2508.00557
1
citations
#1407

Revisiting Point Cloud Completion: Are We Ready For The Real-World?

Stuti Pathak, Prashant Kumar, Dheeraj Baiju et al.

ICCV 2025arXiv:2411.17580
1
citations
#1408

Progressive Artwork Outpainting via Latent Diffusion Models

Dae-Young Song, Jung-Jae Yu, Donghyeon Cho

ICCV 2025
1
citations
#1409

A Conditional Probability Framework for Compositional Zero-shot Learning

Peng Wu, Qiuxia Lai, Hao Fang et al.

ICCV 2025arXiv:2507.17377
1
citations
#1410

An Efficient Post-hoc Framework for Reducing Task Discrepancy of Text Encoders for Composed Image Retrieval

Jaeseok Byun, Seokhyeon Jeong, Wonjae Kim et al.

ICCV 2025arXiv:2406.09188
1
citations
#1411

Federated Prompt-Tuning with Heterogeneous and Incomplete Multimodal Client Data

Hang Phung, Manh Nguyen, Thanh Huynh et al.

ICCV 2025
1
citations
#1412

Find a Scapegoat: Poisoning Membership Inference Attack and Defense to Federated Learning

Wenjin Mo, Zhiyuan Li, Minghong Fang et al.

ICCV 2025arXiv:2507.00423
1
citations
#1413

A Linear N-Point Solver for Structure and Motion from Asynchronous Tracks

Hang Su, Yunlong Feng, Daniel Gehrig et al.

ICCV 2025highlightarXiv:2507.22733
1
citations
#1414

VITAL: More Understandable Feature Visualization through Distribution Alignment and Relevant Information Flow

Ada Görgün, Bernt Schiele, Jonas Fischer

ICCV 2025arXiv:2503.22399
1
citations
#1415

Taming the Untamed: Graph-Based Knowledge Retrieval and Reasoning for MLLMs to Conquer the Unknown

Bowen Wang, Zhouqiang Jiang, Yasuaki Susumu et al.

ICCV 2025arXiv:2506.17589
1
citations
#1416

Causality-guided Prompt Learning for Vision-language Models via Visual Granulation

Mengyu Gao, Qiulei Dong

ICCV 2025arXiv:2509.03803
1
citations
#1417

Auxiliary Prompt Tuning of Vision-Language Models for Few-Shot Out-of-Distribution Detection

Wenjun Miao, Guansong Pang, Zihan Wang et al.

ICCV 2025
1
citations
#1418

Is Less More? Exploring Token Condensation as Training-free Test-time Adaptation

Zixin Wang, Dong Gong, Sen Wang et al.

ICCV 2025arXiv:2410.14729
1
citations
#1419

Generalized Tensor-based Parameter-Efficient Fine-Tuning via Lie Group Transformations

Chongjie Si, Zhiyi Shi, Xuehui Wang et al.

ICCV 2025arXiv:2504.00851
1
citations
#1420

Achieving More with Less: Additive Prompt Tuning for Rehearsal-Free Class-Incremental Learning

Haoran Chen, Ping Wang, Zihan Zhou et al.

ICCV 2025arXiv:2503.07979
1
citations
#1421

Meta-Learning Dynamic Center Distance: Hard Sample Mining for Learning with Noisy Labels

Chenyu Mu, Yijun Qu, Jiexi Yan et al.

ICCV 2025
1
citations
#1422

On the Robustness Tradeoff in Fine-Tuning

Kunyang Li, Jean-Charles Noirot Ferrand, Ryan Sheatsley et al.

ICCV 2025arXiv:2503.14836
1
citations
#1423

Dataset Distillation as Data Compression: A Rate-Utility Perspective

Youneng Bao, Yiping Liu, Zhuo Chen et al.

ICCV 2025arXiv:2507.17221
1
citations
#1424

Divide-and-Conquer for Enhancing Unlabeled Learning, Stability, and Plasticity in Semi-supervised Continual Learning

Yue Duan, Taicai Chen, Lei Qi et al.

ICCV 2025arXiv:2508.05316
1
citations
#1425

HumorDB: Can AI understand graphical humor?

Vedaant V Jain, Gabriel Kreiman, Felipe Feitosa

ICCV 2025arXiv:2406.13564
1
citations
#1426

Active Membership Inference Test (aMINT): Enhancing Model Auditability with Multi-Task Learning.

Daniel DeAlcala, Aythami Morales, Julian Fierrez et al.

ICCV 2025arXiv:2509.07879
1
citations
#1427

One-Shot Knowledge Transfer for Scalable Person Re-Identification

Longhua Li, Lei Qi, Xin Geng

ICCV 2025arXiv:2511.06016
1
citations
#1428

ShortFT: Diffusion Model Alignment via Shortcut-based Fine-Tuning

Xiefan Guo, Miaomiao Cui, Liefeng Bo et al.

ICCV 2025arXiv:2507.22604
1
citations
#1429

Seal Your Backdoor with Variational Defense

Ivan Sabolic, Matej Grcic, Siniša Šegvić

ICCV 2025arXiv:2503.08829
1
citations
#1430

CODE-CL: Conceptor-Based Gradient Projection for Deep Continual Learning

Marco P. Apolinario, Sakshi Choudhary, Kaushik Roy

ICCV 2025arXiv:2411.15235
1
citations
#1431

Causal Disentanglement and Cross-Modal Alignment for Enhanced Few-Shot Learning

Tianjiao Jiang, Zhen Zhang, Yuhang Liu et al.

ICCV 2025arXiv:2508.03102
1
citations
#1432

BabyVLM: Data-Efficient Pretraining of VLMs Inspired by Infant Learning

Shengao Wang, Arjun Chandra, Aoming Liu et al.

ICCV 2025arXiv:2504.09426
1
citations
#1433

Improving Large Vision and Language Models by Learning from a Panel of Peers

Jefferson Hernandez, Jing Shi, Simon Jenni et al.

ICCV 2025arXiv:2509.01610
1
citations
#1434

Verbalized Representation Learning for Interpretable Few-Shot Generalization

Cheng-Fu Yang, Da Yin, Wenbo Hu et al.

ICCV 2025arXiv:2411.18651
1
citations
#1435

Equipping Vision Foundation Model with Mixture of Experts for Out-of-Distribution Detection

Shizhen Zhao, Jiahui Liu, Xin Wen et al.

ICCV 2025arXiv:2510.10584
1
citations
#1436

Towards Privacy-preserved Pre-training of Remote Sensing Foundation Models with Federated Mutual-guidance Learning

Jieyi Tan, Chengwei Zhang, Bo Dang et al.

ICCV 2025arXiv:2503.11051
1
citations
#1437

Boosting Domain Generalized and Adaptive Detection with Diffusion Models: Fitness, Generalization, and Transferability

Boyong He, Yuxiang Ji, Zhuoyue Tan et al.

ICCV 2025highlightarXiv:2506.21042
1
citations
#1438

ViT-EnsembleAttack: Augmenting Ensemble Models for Stronger Adversarial Transferability in Vision Transformers

Hanwen Cao, Haobo Lu, Xiaosen Wang et al.

ICCV 2025arXiv:2508.12384
1
citations
#1439

Instruction-Oriented Preference Alignment for Enhancing Multi-Modal Comprehension Capability of MLLMs

Zitian Wang, Yue Liao, RONG KANG et al.

ICCV 2025arXiv:2503.20309
1
citations
#1440

Staining and Locking Computer Vision Models Without Retraining

Oliver Sutton, Qinghua Zhou, George Leete et al.

ICCV 2025arXiv:2507.22000
1
citations
#1441

The Inter-Intra Modal Measure: A Predictive Lens on Fine-Tuning Outcomes in Vision-Language Models

Laura Niss, Kevin Vogt-Lowell, Theodoros Tsiligkaridis

ICCV 2025arXiv:2407.15731
1
citations
#1442

PoseSyn: Synthesizing Diverse 3D Pose Data from In-the-Wild 2D Data

CHANGHEE YANG, Hyeonseop Song, Seokhun Choi et al.

ICCV 2025arXiv:2503.13025
1
citations
#1443

Human-in-the-Loop Local Corrections of 3D Scene Layouts via Infilling

Christopher Xie, Armen Avetisyan, Henry Howard-Jenkins et al.

ICCV 2025highlightarXiv:2503.11806
1
citations
#1444

AstroLoc: Robust Space to Ground Image Localizer

Gabriele Berton, Alex Stoken, Carlo Masone

ICCV 2025arXiv:2502.07003
1
citations
#1445

Toward Material-Agnostic System Identification from Videos

Yizhou Zhao, Haoyu Chen, Chunjiang Liu et al.

ICCV 2025arXiv:2508.01112
1
citations
#1446

Boosting Multi-View Indoor 3D Object Detection via Adaptive 3D Volume Construction

Runmin Zhang, Zhu Yu, Si-Yuan Cao et al.

ICCV 2025arXiv:2507.18331
1
citations
#1447

Robust Low-light Scene Restoration via Illumination Transition

Ze Li, Feng Zhang, Xiatian Zhu et al.

ICCV 2025arXiv:2507.03976
1
citations
#1448

Dual Reciprocal Learning of Language-based Human Motion Understanding and Generation

CHEN LIANG, Zhicheng Shi, Wenguan Wang et al.

ICCV 2025
1
citations
#1449

HazeFlow: Revisit Haze Physical Model as ODE and Non-Homogeneous Haze Generation for Real-World Dehazing

Junseong Shin, Seungwoo Chung, Yunjeong Yang et al.

ICCV 2025arXiv:2509.18190
1
citations
#1450

DAMap: Distance-aware MapNet for High Quality HD Map Construction

JINPENG DONG, Chen Li, Yutong Lin et al.

ICCV 2025arXiv:2510.22675
1
citations
#1451

PlaceIt3D: Language-Guided Object Placement in Real 3D Scenes

Ahmed Abdelreheem, Filippo Aleotti, Jamie Watson et al.

ICCV 2025arXiv:2505.05288
1
citations
#1452

Understanding Flatness in Generative Models: Its Role and Benefits

Taehwan Lee, Kyeongkook Seo, Jaejun Yoo et al.

ICCV 2025arXiv:2503.11078
1
citations
#1453

Princeton365: A Diverse Dataset with Accurate Camera Pose

Karhan Kayan, Stamatis Alexandropoulos, Rishabh Jain et al.

ICCV 2025arXiv:2506.09035
1
citations
#1454

Voyaging into Perpetual Dynamic Scenes from a Single View

Fengrui Tian, Tianjiao Ding, Jinqi Luo et al.

ICCV 2025arXiv:2507.04183
1
citations
#1455

Learning 3D Scene Analogies with Neural Contextual Scene Maps

Junho Kim, Gwangtak Bae, Eun Sun Lee et al.

ICCV 2025arXiv:2503.15897
1
citations
#1456

RegGS: Unposed Sparse Views Gaussian Splatting with 3DGS Registration

Chong Cheng, Yu Hu, Sicheng Yu et al.

ICCV 2025arXiv:2507.08136
1
citations
#1457

PersPose: 3D Human Pose Estimation with Perspective Encoding and Perspective Rotation

Xiaoyang Hao, Han Li

ICCV 2025arXiv:2508.17239
1
citations
#1458

Breaking Rectangular Shackles: Cross-View Object Segmentation for Fine-Grained Object Geo-Localization

Qingwang Zhang, Yingying Zhu

ICCV 2025
1
citations
#1459

MaskHand: Generative Masked Modeling for Robust Hand Mesh Reconstruction in the Wild

Muhammad Usama Saleem, Ekkasit Pinyoanuntapong, Mayur Patel et al.

ICCV 2025arXiv:2412.13393
1
citations
#1460

Fish2Mesh Transformer: 3D Human Mesh Recovery from Egocentric Vision

Tianma Shen, Aditya Shrish Puranik, James Vong et al.

ICCV 2025arXiv:2503.06089
1
citations
#1461

HoliTracer: Holistic Vectorization of Geographic Objects from Large-Size Remote Sensing Imagery

Yu Wang, Bo Dang, Wanchun Li et al.

ICCV 2025arXiv:2507.16251
1
citations
#1462

DialNav: Multi-turn Dialog Navigation with a Remote Guide

Leekyeung Han, Hyunji Min, Gyeom Hwangbo et al.

ICCV 2025arXiv:2509.12894
1
citations
#1463

Online Dense Point Tracking with Streaming Memory

Qiaole Dong, Yanwei Fu

ICCV 2025arXiv:2503.06471
1
citations
#1464

Hybrid-TTA: Continual Test-time Adaptation via Dynamic Domain Shift Detection

Hyewon Park, Hyejin Park, Jueun Ko et al.

ICCV 2025arXiv:2409.08566
1
citations
#1465

Learning on the Go: A Meta-learning Object Navigation Model

Xiaorong Qin, Xinhang Song, Sixian Zhang et al.

ICCV 2025
1
citations
#1466

ProGait: A Multi-Purpose Video Dataset and Benchmark for Transfemoral Prosthesis Users

Xiangyu Yin, Boyuan Yang, Weichen Liu et al.

ICCV 2025highlightarXiv:2507.10223
1
citations
#1467

After the Party: Navigating the Mapping From Color to Ambient Lighting

Florin-Alexandru Vasluianu, Tim Seizinger, Zongwei Wu et al.

ICCV 2025arXiv:2508.02168
1
citations
#1468

3D Gaussian Map with Open-Set Semantic Grouping for Vision-Language Navigation

Jianzhe Gao, Rui Liu, Wenguan Wang

ICCV 2025
1
citations
#1469

DyGS-SLAM: Real-Time Accurate Localization and Gaussian Reconstruction for Dynamic Scenes

Xinggang Hu, Chenyangguang Zhang, Mingyuan Zhao et al.

ICCV 2025
1
citations
#1470

EvRT-DETR: Latent Space Adaptation of Image Detectors for Event-based Vision

Dmitrii Torbunov, Yihui Ren, Animesh Ghose et al.

ICCV 2025arXiv:2412.02890
1
citations
#1471

A Hyperdimensional One Place Signature to Represent Them All: Stackable Descriptors For Visual Place Recognition

Connor Malone, Somayeh Hussaini, Tobias Fischer et al.

ICCV 2025arXiv:2412.06153
1
citations
#1472

MoMaps: Semantics-Aware Scene Motion Generation with Motion Maps

Jiahui Lei, Kyle Genova, George Kopanas et al.

ICCV 2025arXiv:2510.11107
1
citations
#1473

Expressive Talking Human from Single-Image with Imperfect Priors

Jun Xiang, Yudong Guo, Leipeng Hu et al.

ICCV 2025
1
citations
#1474

Beyond Label Semantics: Language-Guided Action Anatomy for Few-shot Action Recognition

Zefeng Qian, Xincheng Yao, Yifei Huang et al.

ICCV 2025arXiv:2507.16287
1
citations
#1475

Reverse Convolution and Its Applications to Image Restoration

Xuhong Huang, Shiqi Liu, Kai Zhang et al.

ICCV 2025arXiv:2508.09824
1
citations
#1476

MamTiff-CAD: Multi-Scale Latent Diffusion with Mamba+ for Complex Parametric Sequence

Liyuan Deng, Yunpeng Bai, Yongkang Dai et al.

ICCV 2025arXiv:2511.17647
1
citations
#1477

Im2Haircut: Single-view Strand-based Hair Reconstruction for Human Avatars

Vanessa Sklyarova, Egor Zakharov, Malte Prinzler et al.

ICCV 2025arXiv:2509.01469
1
citations
#1478

AFUNet: Cross-Iterative Alignment-Fusion Synergy for HDR Reconstruction via Deep Unfolding Paradigm

Xinyue Li, Zhangkai Ni, Wenhan Yang

ICCV 2025arXiv:2506.23537
1
citations
#1479

PINO: Person-Interaction Noise Optimization for Long-Duration and Customizable Motion Generation of Arbitrary-Sized Groups

Sakuya Ota, Qing Yu, Kent Fujiwara et al.

ICCV 2025arXiv:2507.19292
1
citations
#1480

TeRA: Rethinking Text-guided Realistic 3D Avatar Generation

Yanwen Wang, Yiyu Zhuang, Jiawei Zhang et al.

ICCV 2025arXiv:2509.02466
1
citations
#1481

Vulnerability-Aware Spatio-Temporal Learning for Generalizable Deepfake Video Detection

Dat NGUYEN, Marcella Astrid, Anis Kacem et al.

ICCV 2025arXiv:2501.01184
1
citations
#1482

Latent Swap Joint Diffusion for 2D Long-Form Latent Generation

Yusheng Dai, Chenxi Wang, Chang Li et al.

ICCV 2025arXiv:2502.05130
1
citations
#1483

Augmented and Softened Matching for Unsupervised Visible-Infrared Person Re-Identification

Zhiqi Pang, Chunyu Wang, Lingling Zhao et al.

ICCV 2025
1
citations
#1484

Balancing Task-invariant Interaction and Task-specific Adaptation for Unified Image Fusion

Xingyu Hu, Junjun Jiang, Chenyang Wang et al.

ICCV 2025arXiv:2504.05164
1
citations
#1485

PatchScaler: An Efficient Patch-Independent Diffusion Model for Image Super-Resolution

Yong Liu, Hang Dong, Jinshan Pan et al.

ICCV 2025arXiv:2405.17158
1
citations
#1486

PrimHOI: Compositional Human-Object Interaction via Reusable Primitives

Kai Jia, Tengyu Liu, Mingtao Pei et al.

ICCV 2025
1
citations
#1487

Event-Driven Storytelling with Multiple Lifelike Humans in a 3D Scene

Donggeun Lim, Jinseok Bae, Inwoo Hwang et al.

ICCV 2025arXiv:2507.19232
1
citations
#1488

Hipandas: Hyperspectral Image Joint Denoising and Super-Resolution by Image Fusion with the Panchromatic Image

Shuang Xu, Zixiang Zhao, Haowen Bai et al.

ICCV 2025arXiv:2412.04201
1
citations
#1489

Skeleton Motion Words for Unsupervised Skeleton-based Temporal Action Segmentation

Uzay Gökay, Federico Spurio, Dominik Bach et al.

ICCV 2025arXiv:2508.04513
1
citations
#1490

Towards Efficient General Feature Prediction in Masked Skeleton Modeling

Shengkai Sun, Zefan Zhang, Jianfeng Dong et al.

ICCV 2025arXiv:2509.03609
1
citations
#1491

How Would It Sound? Material-Controlled Multimodal Acoustic Profile Generation for Indoor Scenes

Mahnoor Saad, Ziad Al-Halah

ICCV 2025arXiv:2508.02905
1
citations
#1492

Occlusion-robust Stylization for Drawing-based 3D Animation

Sunjae Yoon, Gwanhyeong Koo, Younghwan Lee et al.

ICCV 2025arXiv:2508.00398
1
citations
#1493

DeepShield: Fortifying Deepfake Video Detection with Local and Global Forgery Analysis

Yinqi Cai, Jichang Li, Zhaolun Li et al.

ICCV 2025arXiv:2510.25237
1
citations
#1494

GeoAvatar: Adaptive Geometrical Gaussian Splatting for 3D Head Avatar

SeungJun Moon, Hah Min Lew, Seungeun Lee et al.

ICCV 2025arXiv:2507.18155
1
citations
#1495

Bitrate-Controlled Diffusion for Disentangling Motion and Content in Video

Xiao Li, Qi Chen, Xiulian Peng et al.

ICCV 2025arXiv:2509.08376
1
citations
#1496

Tiling artifacts and trade-offs of feature normalization in the segmentation of large biological images

Elena Buglakova, Anwai Archit, Edoardo D'Imprima et al.

ICCV 2025highlightarXiv:2503.19545
1
citations
#1497

Saliency-Aware Quantized Imitation Learning for Efficient Robotic Control

Seongmin Park, Hyungmin Kim, Sangwoo kim et al.

ICCV 2025arXiv:2505.15304
1
citations
#1498

Group-wise Scaling and Orthogonal Decomposition for Domain-Invariant Feature Extraction in Face Anti-Spoofing

Seungjin Jung, Kanghee Lee, Yonghyun Jeong et al.

ICCV 2025arXiv:2507.04006
1
citations
#1499

FakeRadar: Probing Forgery Outliers to Detect Unknown Deepfake Videos

Zhaolun Li, Jichang Li, Yinqi Cai et al.

ICCV 2025arXiv:2512.14601
1
citations
#1500

StrandHead: Text to Hair-Disentangled 3D Head Avatars Using Human-Centric Priors

Xiaokun Sun, Zeyu Cai, Ying Tai et al.

ICCV 2025arXiv:2412.11586
1
citations
#1501

T2Bs: Text-to-Character Blendshapes via Video Generation

Jiahao Luo, Chaoyang Wang, Michael Vasilkovsky et al.

ICCV 2025arXiv:2509.10678
1
citations
#1502

DuoCLR: Dual-Surrogate Contrastive Learning for Skeleton-based Human Action Segmentation

Haitao Tian

ICCV 2025arXiv:2509.05543
1
citations
#1503

IM-LUT: Interpolation Mixing Look-Up Tables for Image Super-Resolution

Sejin Park, Sangmin Lee, Kyong Hwan Jin et al.

ICCV 2025arXiv:2507.09923
1
citations
#1504

NoiseController: Towards Consistent Multi-view Video Generation via Noise Decomposition and Collaboration

Haotian Dong, Xin WANG, Di Lin et al.

ICCV 2025arXiv:2504.18448
1
citations
#1505

FED-PsyAU: Privacy-Preserving Micro-Expression Recognition via Psychological AU Coordination and Dynamic Facial Motion Modeling

Jingting Li, Yu Qian, Lin Zhao et al.

ICCV 2025arXiv:2507.20557
1
citations
#1506

PUMPS: Skeleton-Agnostic Point-based Universal Motion Pre-Training for Synthesis in Human Motion Tasks

Clinton A Mo, Kun Hu, Chengjiang Long et al.

ICCV 2025arXiv:2507.20170
1
citations
#1507

DISTA-Net: Dynamic Closely-Spaced Infrared Small Target Unmixing

Shengdong Han, Shangdong Yang, Yuxuan Li et al.

ICCV 2025arXiv:2505.19148
1
citations
#1508

ASCENT: Annotation-free Self-supervised Contrastive Embeddings for 3D Neuron Tracking in Fluorescence Microscopy

Haejun Han, Hang Lu

ICCV 2025
1
citations
#1509

VSRM: A Robust Mamba-Based Framework for Video Super-Resolution

Phu Tran Dinh, Hung Dao, Daeyoung Kim

ICCV 2025arXiv:2506.22762
1
citations
#1510

AnimalClue: Recognizing Animals by their Traces

Risa Shinoda, Nakamasa Inoue, Iro Laina et al.

ICCV 2025highlightarXiv:2507.20240
1
citations
#1511

SCFlow: Implicitly Learning Style and Content Disentanglement with Flow Models

Pingchuan Ma, Xiaopei Yang, Ming Gui et al.

ICCV 2025arXiv:2508.03402
1
citations
#1512

ForCenNet: Foreground-Centric Network for Document Image Rectification

Peng Cai, liqiang liqiang, Kaicheng Yang et al.

ICCV 2025arXiv:2507.19804
1
citations
#1513

CycleVAR: Repurposing Autoregressive Model for Unsupervised One-Step Image Translation

Yi Liu, Shengqian Li, Zuzeng Lin et al.

ICCV 2025arXiv:2506.23347
1
citations
#1514

Generating, Fast and Slow: Scalable Parallel Video Generation with Video Interface Networks

Bhishma Dedhia, David Bourgin, Krishna Kumar Singh et al.

ICCV 2025arXiv:2503.17539
1
citations
#1515

Text Embedding Knows How to Quantize Text-Guided Diffusion Models

Hongjae Lee, Myungjun Son, Dongjea Kang et al.

ICCV 2025arXiv:2507.10340
1
citations
#1516

SegmentDreamer: Towards High-fidelity Text-to-3D Synthesis with Segmented Consistency Trajectory Distillation

Jiahao Zhu, Zixuan Chen, Guangcong Wang et al.

ICCV 2025arXiv:2507.05256
1
citations
#1517

IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance

Jiayi Guo, Chuanhao Yan, Xingqian Xu et al.

ICCV 2025arXiv:2509.26231
1
citations
#1518

Outlier-Aware Post-Training Quantization for Image Super-Resolution

Hailing Wang, Jianglin Lu, Yitian Zhang et al.

ICCV 2025highlightarXiv:2511.00682
1
citations
#1519

Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction

Giuseppe Cartella, Vittorio Cuculo, Alessandro D'Amelio et al.

ICCV 2025arXiv:2507.23021
1
citations
#1520

MeshPad: Interactive Sketch-Conditioned Artist-Reminiscent Mesh Generation and Editing

Haoxuan Li, Ziya Erkoç, Lei Li et al.

ICCV 2025arXiv:2503.01425
1
citations
#1521

PLADIS: Pushing the Limits of Attention in Diffusion Models at Inference Time by Leveraging Sparsity

Kwanyoung Kim, Byeongsu Sim

ICCV 2025arXiv:2503.07677
1
citations
#1522

D3QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection

Yanran Zhang, Bingyao Yu, Yu Zheng et al.

ICCV 2025
1
citations
#1523

TITAN-Guide: Taming Inference-Time Alignment for Guided Text-to-Video Diffusion Models

Christian Simon, Masato Ishii, Akio Hayakawa et al.

ICCV 2025arXiv:2508.00289
1
citations
#1524

CompSlider: Compositional Slider for Disentangled Multiple-Attribute Image Generation

Zixin Zhu, Kevin Duarte, Mamshad Nayeem Rizve et al.

ICCV 2025arXiv:2509.01028
1
citations
#1525

FairHuman: Boosting Hand and Face Quality in Human Image Generation with Minimum Potential Delay Fairness in Diffusion Models

Yuxuan Wang, Tianwei Cao, Huayu Zhang et al.

ICCV 2025arXiv:2507.02714
1
citations
#1526

HypDAE: Hyperbolic Diffusion Autoencoders for Hierarchical Few-shot Image Generation

Lingxiao Li, Kaixuan Fan, Boqing Gong et al.

ICCV 2025arXiv:2411.17784
1
citations
#1527

V.I.P. : Iterative Online Preference Distillation for Efficient Video Diffusion Models

Jisoo Kim, Wooseok Seo, Junwan Kim et al.

ICCV 2025arXiv:2508.03254
1
citations
#1528

Streamlining Image Editing with Layered Diffusion Brushes

Peyman Gholami, Robert Xiao

ICCV 2025arXiv:2405.00313
1
citations
#1529

GlassWizard: Harvesting Diffusion Priors for Glass Surface Detection

Wenxue Li, Tian Ye, Xinyu Xiong et al.

ICCV 2025
1
citations
#1530

DACoN: DINO for Anime Paint Bucket Colorization with Any Number of Reference Images

Kazuma Nagata, Naoshi Kaneko

ICCV 2025arXiv:2509.14685
1
citations
#1531

SpecGuard: Spectral Projection-based Advanced Invisible Watermarking

Inzamamul Alam, Md Islam, Simon Woo et al.

ICCV 2025arXiv:2510.07302
1
citations
#1532

Preserve Anything: Controllable Image Synthesis with Object Preservation

Prasen Kumar Sharma, Neeraj Matiyali, Siddharth Srivastava et al.

ICCV 2025arXiv:2506.22531
1
citations
#1533

CompleteMe: Reference-based Human Image Completion

Yu-Ju Tsai, Brian Price, Qing Liu et al.

ICCV 2025arXiv:2504.20042
1
citations
#1534

REGEN: Learning Compact Video Embedding with (Re-)Generative Decoder

Yitian Zhang, Long Mai, Aniruddha Mahapatra et al.

ICCV 2025arXiv:2503.08665
1
citations
#1535

From Prompt to Progression: Taming Video Diffusion Models for Seamless Attribute Transition

Ling Lo, Kelvin Chan, Wen-Huang Cheng et al.

ICCV 2025arXiv:2509.19690
1
citations
#1536

Learning Implicit Features with Flow-Infused Transformations for Realistic Virtual Try-On

Delong Zhang, Qiwei Huang, Yang Sun et al.

ICCV 2025
1
citations
#1537

Early Timestep Zero-Shot Candidate Selection for Instruction-Guided Image Editing

Joowon Kim, Ziseok Lee, Donghyeon Cho et al.

ICCV 2025arXiv:2504.13490
1
citations
#1538

Context Guided Transformer Entropy Modeling for Video Compression

Junlong Tong, Wei Zhang, Yaohui Jin et al.

ICCV 2025arXiv:2508.01852
1
citations
#1539

UIP2P: Unsupervised Instruction-based Image Editing via Edit Reversibility Constraint

Enis Simsar, Alessio Tonioni, Yongqin Xian et al.

ICCV 2025arXiv:2412.15216
1
citations
#1540

Tune-Your-Style: Intensity-tunable 3D Style Transfer with Gaussian Splatting

Yian Zhao, rushi ye, Ruochong Zheng et al.

ICCV 2025
1
citations
#1541

Blended Point Cloud Diffusion for Localized Text-guided Shape Editing

Etai Sella, Noam Atia, Ron Mokady et al.

ICCV 2025highlightarXiv:2507.15399
1
citations
#1542

Pretrained Reversible Generation as Unsupervised Visual Representation Learning

Rongkun Xue, Jinouwen Zhang, Yazhe Niu et al.

ICCV 2025arXiv:2412.01787
1
citations
#1543

Attention to Neural Plagiarism: Diffusion Models Can Plagiarize Your Copyrighted Images!

zihang zou, Boqing Gong, Liqiang Wang

ICCV 2025
1
citations
#1544

HiERO: Understanding the Hierarchy of Human Behavior Enhances Reasoning on Egocentric Videos

Simone Alberto Peirone, Francesca Pistilli, Giuseppe Averta

ICCV 2025arXiv:2505.12911
1
citations
#1545

DiSCO-3D : Discovering and Segmenting Sub-Concepts from Open-vocabulary Queries in NeRF

Doriand Petit, Steve Bourgeois, Vincent Gay-Bellile et al.

ICCV 2025arXiv:2507.14596
1
citations
#1546

M-Net: MRI Brain Tumor Sequential Segmentation Network via Mesh-Cast

Jiacheng Lu, Hui Ding, Shiyu Zhang et al.

ICCV 2025arXiv:2507.20582
1
citations
#1547

Moment Quantization for Video Temporal Grounding

Xiaolong Sun, Le Wang, Sanping Zhou et al.

ICCV 2025arXiv:2504.02286
1
citations
#1548

S⁴M: Boosting Semi-Supervised Instance Segmentation with SAM

Heeji Yoon, Heeseong Shin, Eunbeen Hong et al.

ICCV 2025
1
citations
#1549

Describe, Adapt and Combine: Empowering CLIP Encoders for Open-set 3D Object Retrieval

Zhichuan Wang, Yang Zhou, Zhe Liu et al.

ICCV 2025arXiv:2507.21489
1
citations
#1550

Enhancing Zero-shot Object Counting via Text-guided Local Ranking and Number-evoked Global Attention

Shiwei Zhang, Qi Zhou, Wei Ke

ICCV 2025
1
citations
#1551

4D-Bench: Benchmarking Multi-modal Large Language Models for 4D Object Understanding

Wenxuan Zhu, Bing Li, Cheng Zheng et al.

ICCV 2025arXiv:2503.17827
1
citations
#1552

Referring Expression Comprehension for Small Objects

Kanoko Goto, Takumi Hirose, Mahiro Ukai et al.

ICCV 2025arXiv:2510.03701
1
citations
#1553

Text-guided Visual Prompt DINO for Generic Segmentation

Yuchen Guan, Chong Sun, Canmiao Fu et al.

ICCV 2025arXiv:2508.06146
1
citations
#1554

FLOSS: Free Lunch in Open-vocabulary Semantic Segmentation

Yasser Benigmim, Mohammad Fahes, Tuan-Hung Vu et al.

ICCV 2025arXiv:2504.10487
1
citations
#1555

Sparse-Dense Side-Tuner for efficient Video Temporal Grounding

David Pujol-Perich, Sergio Escalera, Albert Clapés

ICCV 2025arXiv:2507.07744
1
citations
#1556

Learning Yourself: Class-Incremental Semantic Segmentation with Language-Inspired Bootstrapped Disentanglement

Ruitao Wu, Yifan Zhao, Jia Li

ICCV 2025arXiv:2509.00527
1
citations
#1557

Aligning Information Capacity Between Vision and Language via Dense-to-Sparse Feature Distillation for Image-Text matching

Yang Liu, Wentao Feng, Zhuoyao Liu et al.

ICCV 2025arXiv:2503.14953
1
citations
#1558

PS3: A Multimodal Transformer Integrating Pathology Reports with Histology Images and Biological Pathways for Cancer Survival Prediction

Manahil Raza, Ayesha Azam, Talha Qaiser et al.

ICCV 2025arXiv:2509.20022
1
citations
#1559

Controllable-LPMoE: Adapting to Challenging Object Segmentation via Dynamic Local Priors from Mixture-of-Experts

Yanguang Sun, Jiawei Lian, jian Yang et al.

ICCV 2025arXiv:2510.21114
1
citations
#1560

Revisiting Efficient Semantic Segmentation: Learning Offsets for Better Spatial and Class Feature Alignment

Shi-Chen Zhang, Yunheng Li, Yu-Huan Wu et al.

ICCV 2025arXiv:2508.08811
1
citations
#1561

Pseudo-SD: Pseudo Controlled Stable Diffusion for Semi-Supervised and Cross-Domain Semantic Segmentation

Dong Zhao, Qi Zang, Shuang Wang et al.

ICCV 2025
1
citations
#1562

Free-MoRef: Instantly Multiplexing Context Perception Capabilities of Video-MLLMs within Single Inference

KUO WANG, Quanlong Zheng, Junlin Xie et al.

ICCV 2025arXiv:2508.02134
1
citations
#1563

Towards Fine-grained Interactive Segmentation in Images and Videos

Yuan Yao, Qiushi Yang, Miaomiao Cui et al.

ICCV 2025arXiv:2502.09660
1
citations
#1564

TAB: Transformer Attention Bottlenecks enable User Intervention and Debugging in Vision-Language Models

Pooyan Rahmanzadehgervi, Hung Nguyen, Rosanne Liu et al.

ICCV 2025arXiv:2412.18675
1
citations
#1565

Aligning Effective Tokens with Video Anomaly in Large Language Models

YINGXIAN Chen, Jiahui Liu, Ruidi Fan et al.

ICCV 2025arXiv:2508.06350
1
citations
#1566

No More Sibling Rivalry: Debiasing Human-Object Interaction Detection

Bin Yang, Yulin Zhang, Hong-Yu Zhou et al.

ICCV 2025arXiv:2509.00760
1
citations
#1567

Borrowing Eyes for the Blind Spot: Overcoming Data Scarcity in Malicious Video Detection via Cross-Domain Retrieval Augmentation

Rongpei Hong, Jian Lang, Ting Zhong et al.

ICCV 2025
1
citations
#1568

Plug-in Feedback Self-adaptive Attention in CLIP for Training-free Open-Vocabulary Segmentation

Zhixiang Chi, Yanan Wu, Li Gu et al.

ICCV 2025arXiv:2508.20265
1
citations
#1569

Intermediate Connectors and Geometric Priors for Language-Guided Affordance Segmentation on Unseen Object Categories

Yicong Li, Yiyang Chen, Zhenyuan Ma et al.

ICCV 2025
1
citations
#1570

How Do Optical Flow and Textual Prompts Collaborate to Assist in Audio-Visual Semantic Segmentation?

Yujian Lee, Peng Gao, Yongqi Xu et al.

ICCV 2025arXiv:2601.08133
1
citations
#1571

Scheduling Weight Transitions for Quantization-Aware Training

Junghyup Lee, Jeimin Jeon, Dohyung Kim et al.

ICCV 2025arXiv:2404.19248
1
citations
#1572

HarmonySeg: Tubular Structure Segmentation with Deep-Shallow Feature Fusion and Growth-Suppression Balanced Loss

Ke Zhang, Yi Huang, Wei Liu et al.

ICCV 2025arXiv:2504.07827
1
citations
#1573

Seeing the Unseen: A Semantic Alignment and Context-Aware Prompt Framework for Open-Vocabulary Camouflaged Object Segmentation

Peng Ren, Tian Bai, Jing Sun et al.

ICCV 2025
1
citations
#1574

DynImg: Key Frames with Visual Prompts are Good Representation for Multi-Modal Video Understanding

Xiaoyi Bao, Chen-Wei Xie, Hao Tang et al.

ICCV 2025arXiv:2507.15569
1
citations
#1575

SSVQ: Unleashing the Potential of Vector Quantization with Sign-Splitting

Shuaiting Li, Juncan Deng, Chengxuan Wang et al.

ICCV 2025arXiv:2503.08668
1
citations
#1576

VideoMiner: Iteratively Grounding Key Frames of Hour-Long Videos via Tree-based Group Relative Policy Optimization

Xinye Cao, Hongcan Guo, Jiawen Qian et al.

ICCV 2025arXiv:2510.06040
1
citations
#1577

LawDIS: Language-Window-based Controllable Dichotomous Image Segmentation

Xinyu Yan, Meijun Sun, Ge-Peng Ji et al.

ICCV 2025arXiv:2508.01152
1
citations
#1578

ModalTune: Fine-Tuning Slide-Level Foundation Models with Multi-Modal Information for Multi-task Learning in Digital Pathology

Vishwesh Ramanathan, Tony Xu, Pushpak Pati et al.

ICCV 2025arXiv:2503.17564
1
citations
#1579

Open-Vocabulary HOI Detection with Interaction-aware Prompt and Concept Calibration

Ting Lei, Shaofeng Yin, Qingchao Chen et al.

ICCV 2025arXiv:2508.03207
1
citations
#1580

LIRA: Inferring Segmentation in Large Multi-modal Models with Local Interleaved Region Assistance

Zhang Li, Biao Yang, Qiang Liu et al.

ICCV 2025arXiv:2507.06272
1
citations
#1581

Synchronizing Task Behavior: Aligning Multiple Tasks during Test-Time Training

Wooseong Jeong, Jegyeong Cho, Youngho Yoon et al.

ICCV 2025arXiv:2507.07778
1
citations
#1582

Conditional Latent Diffusion Models for Zero-Shot Instance Segmentation

Maximilian Ulmer, Wout Boerdijk, Rudolph Triebel et al.

ICCV 2025arXiv:2508.04122
1
citations
#1583

HyperGCT: A Dynamic Hyper-GNN-Learned Geometric Constraint for 3D Registration

Xiyu Zhang, Jiayi Ma, Jianwei Guo et al.

ICCV 2025arXiv:2503.02195
1
citations
#1584

All in One: Visual-Description-Guided Unified Point Cloud Segmentation

Zongyan Han, Mohamed El Amine Boudjoghra, Jiahua Dong et al.

ICCV 2025arXiv:2507.05211
1
citations
#1585

Benchmarking Burst Super-Resolution for Polarization Images: Noise Dataset and Analysis

Inseung Hwang, Kiseok Choi, Hyunho Ha et al.

ICCV 2025arXiv:2503.18705
1
citations
#1586

RESCUE: Crowd Evacuation Simulation via Controlling SDM-United Characters

Xiaolin Liu, Tianyi zhou, Hongbo Kang et al.

ICCV 2025highlightarXiv:2507.20117
1
citations
#1587

Generalizable Non-Line-of-Sight Imaging with Learnable Physical Priors

Shida Sun, Yue Li, Yueyi Zhang et al.

ICCV 2025arXiv:2409.14011
1
citations
#1588

AdaptiveAE: An Adaptive Exposure Strategy for HDR Capturing in Dynamic Scenes

Tianyi Xu, Fan Zhang, Boxin Shi et al.

ICCV 2025arXiv:2508.13503
1
citations
#1589

Benchmarking Egocentric Visual-Inertial SLAM at City Scale

Anusha Krishnan, Shaohui Liu, Paul-Edouard Sarlin et al.

ICCV 2025highlightarXiv:2509.26639
1
citations
#1590

A Real-world Display Inverse Rendering Dataset

Seokjun Choi, Hoon-Gyu Chung, Yujin Jeon et al.

ICCV 2025arXiv:2508.14411
1
citations
#1591

AutoScape: Geometry-Consistent Long-Horizon Scene Generation

Jiacheng Chen, Ziyu Jiang, Mingfu Liang et al.

ICCV 2025arXiv:2510.20726
1
citations
#1592

RGE-GS: Reward-Guided Expansive Driving Scene Reconstruction via Diffusion Priors

Sicong Du, Jiarun Liu, Qifeng Chen et al.

ICCV 2025arXiv:2506.22800
1
citations
#1593

Scene Coordinate Reconstruction Priors

Wenjing Bian, Axel Barroso-Laguna, Tommaso Cavallari et al.

ICCV 2025arXiv:2510.12387
1
citations
#1594

TeethGenerator: A two-stage framework for paired pre- and post-orthodontic 3D dental data generation

Changsong Lei, Yaqian Liang, Shaofeng Wang et al.

ICCV 2025arXiv:2507.04685
1
citations
#1595

Towards Accurate and Efficient 3D Object Detection for Autonomous Driving: A Mixture of Experts Computing System on Edge

Linshen Liu, Boyan Su, Junyue Jiang et al.

ICCV 2025arXiv:2507.04123
1
citations
#1596

ClaraVid: A Holistic Scene Reconstruction Benchmark From Aerial Perspective With Delentropy-Based Complexity Profiling

Radu Beche, Sergiu Nedevschi

ICCV 2025arXiv:2503.17856
1
citations
#1597

Discontinuity-aware Normal Integration for Generic Central Camera Models

Francesco Milano, Manuel Lopez-Antequera, Naina Dhingra et al.

ICCV 2025highlightarXiv:2507.06075
1
citations
#1598

SL2A-INR: Single-Layer Learnable Activation for Implicit Neural Representation

Reza Rezaeian, Moein Heidari, Reza Azad et al.

ICCV 2025
1
citations
#1599

DoppDrive: Doppler-Driven Temporal Aggregation for Improved Radar Object Detection

Yuval Haitman, Oded Bialer

ICCV 2025arXiv:2508.12330
1
citations
#1600

GVDepth: Zero-Shot Monocular Depth Estimation for Ground Vehicles based on Probabilistic Cue Fusion

Karlo Koledic, Luka Petrovic, Ivan Marković et al.

ICCV 2025arXiv:2412.06080
1
citations