Most Cited ICCV "adversarial parameter perturbation" Papers

2,701 papers found • Page 5 of 14

#801

Physics Context Builders: A Modular Framework for Physical Reasoning in Vision-Language Models

Vahid Balazadeh, Mohammadmehdi Ataei, Hyunmin Cheong et al.

ICCV 2025posterarXiv:2412.08619
2
citations
#802

Sim-DETR: Unlock DETR for Temporal Sentence Grounding

Jiajin Tang, Zhengxuan Wei, Yuchen Zhu et al.

ICCV 2025posterarXiv:2509.23867
2
citations
#803

DCT-Shield: A Robust Frequency Domain Defense against Malicious Image Editing

Aniruddha Bala, Rohit Chowdhury, Rohan Jaiswal et al.

ICCV 2025highlightarXiv:2504.17894
2
citations
#804

Color Matching Using Hypernetwork-Based Kolmogorov-Arnold Networks

Artem Nikonorov, Georgy Perevozchikov, Andrei Korepanov et al.

ICCV 2025posterarXiv:2503.11781
2
citations
#805

ReasonVQA: A Multi-hop Reasoning Benchmark with Structural Knowledge for Visual Question Answering

Duong T. Tran, Trung-Kien Tran, Manfred Hauswirth et al.

ICCV 2025posterarXiv:2507.16403
2
citations
#806

SciVid: Cross-Domain Evaluation of Video Models in Scientific Applications

Yana Hasson, Pauline Luc, Liliane Momeni et al.

ICCV 2025posterarXiv:2507.03578
2
citations
#807

TextSSR: Diffusion-based Data Synthesis for Scene Text Recognition

Xingsong Ye, Yongkun Du, Yunbo Tao et al.

ICCV 2025posterarXiv:2412.01137
2
citations
#808

CMT: A Cascade MAR with Topology Predictor for Multimodal Conditional CAD Generation

Jianyu Wu, Yizhou Wang, Xiangyu Yue et al.

ICCV 2025posterarXiv:2504.20830
2
citations
#809

VideoAds for Fast-Paced Video Understanding

Zheyuan Zhang, Wanying Dou, Linkai Peng et al.

ICCV 2025posterarXiv:2504.09282
2
citations
#810

ViCTr: Vital Consistency Transfer for Pathology Aware Image Synthesis

Onkar Susladkar, Gayatri Deshmukh, Yalcin Tur et al.

ICCV 2025posterarXiv:2505.04963
2
citations
#811

Disentangling Instance and Scene Contexts for 3D Semantic Scene Completion

Enyu Liu, En Yu, Sijia Chen et al.

ICCV 2025posterarXiv:2507.08555
2
citations
#812

GT-Loc: Unifying When and Where in Images through a Joint Embedding Space

David G. Shatwell, Ishan Rajendrakumar Dave, Swetha Sirnam et al.

ICCV 2025posterarXiv:2507.10473
2
citations
#813

Stepping Out of Similar Semantic Space for Open-Vocabulary Segmentation

Yong Liu, Song-Li Wu, Sule Bai et al.

ICCV 2025posterarXiv:2506.16058
2
citations
#814

ProbRes: Probabilistic Jump Diffusion for Open-World Egocentric Activity Recognition

Sanjoy Kundu, Shanmukha Vellamcheti, Sathyanarayanan Aakur

ICCV 2025posterarXiv:2504.03948
2
citations
#815

Hybrid-grained Feature Aggregation with Coare-to-fine Language Guidance for Self-supervised Monocular Depth Estimation

Wenyao Zhang, Hongsi Liu, Bohan Li et al.

ICCV 2025poster
2
citations
#816

PASTA: Part-Aware Sketch-to-3D Shape Generation with Text-Aligned Prior

Seunggwan Lee, Hwanhee Jung, ByoungSoo Koh et al.

ICCV 2025posterarXiv:2503.12834
2
citations
#817

Trust but Verify: Programmatic VLM Evaluation in the Wild

Viraj Prabhu, Senthil Purushwalkam, An Yan et al.

ICCV 2025posterarXiv:2410.13121
2
citations
#818

AnyPortal: Zero-Shot Consistent Video Background Replacement

Wenshuo Gao, Xicheng Lan, Shuai Yang

ICCV 2025posterarXiv:2509.07472
2
citations
#819

Supercharged One-step Text-to-Image Diffusion Models with Negative Prompts

Viet Nguyen, Anh Nguyen, Trung Dao et al.

ICCV 2025posterarXiv:2412.02687
2
citations
#820

InfGen: A Resolution-Agnostic Paradigm for Scalable Image Synthesis

Tao Han, Wanghan Xu, Junchao Gong et al.

ICCV 2025posterarXiv:2509.10441
2
citations
#821

Differentiable Room Acoustic Rendering with Multi-View Vision Priors

Derong Jin, Ruohan Gao

ICCV 2025posterarXiv:2504.21847
2
citations
#822

Identity-aware Language Gaussian Splatting for Open-vocabulary 3D Semantic Segmentation

SungMin Jang, Wonjun Kim

ICCV 2025poster
2
citations
#823

FedMVP: Federated Multimodal Visual Prompt Tuning for Vision-Language Models

Mainak Singha, Subhankar Roy, Sarthak Mehrotra et al.

ICCV 2025posterarXiv:2504.20860
2
citations
#824

DictAS: A Framework for Class-Generalizable Few-Shot Anomaly Segmentation via Dictionary Lookup

Zhen Qu, Xian Tao, Xinyi Gong et al.

ICCV 2025posterarXiv:2508.13560
2
citations
#825

Denoising Token Prediction in Masked Autoregressive Models

Ting Yao, Yehao Li, Yingwei Pan et al.

ICCV 2025poster
2
citations
#826

ViLU: Learning Vision-Language Uncertainties for Failure Prediction

Marc Lafon, Yannis Karmim, Julio Silva-Rodríguez et al.

ICCV 2025posterarXiv:2507.07620
2
citations
#827

EVT: Efficient View Transformation for Multi-Modal 3D Object Detection

Yongjin Lee, Hyeon-Mun Jeong, Yurim Jeon et al.

ICCV 2025posterarXiv:2411.10715
2
citations
#828

Trade-offs in Image Generation: How Do Different Dimensions Interact?

Sicheng Zhang, Binzhu Xie, Zhonghao Yan et al.

ICCV 2025posterarXiv:2507.22100
2
citations
#829

SplArt: Articulation Estimation and Part-Level Reconstruction with 3D Gaussian Splatting

Shengjie Lin, Jiading Fang, Muhammad Zubair Irshad et al.

ICCV 2025posterarXiv:2506.03594
2
citations
#830

DASH: 4D Hash Encoding with Self-Supervised Decomposition for Real-Time Dynamic Scene Rendering

Jie Chen, Zhangchi Hu, Peixi Wu et al.

ICCV 2025posterarXiv:2507.19141
2
citations
#831

RefEdit: A Benchmark and Method for Improving Instruction-based Image Editing Model on Referring Expressions

Bimsara Pathiraja, Maitreya Patel, Shivam Singh et al.

ICCV 2025posterarXiv:2506.03448
2
citations
#832

LV-MAE: Learning Long Video Representations through Masked-Embedding Autoencoders

Ilan Naiman, Emanuel Baruch Baruch, Oron Anschel et al.

ICCV 2025posterarXiv:2504.03501
2
citations
#833

MUSE: Multi-Subject Unified Synthesis via Explicit Layout Semantic Expansion

Fei Peng, Junqiang Wu, Yan Li et al.

ICCV 2025posterarXiv:2508.14440
2
citations
#834

Learning Deblurring Texture Prior from Unpaired Data with Diffusion Model

Chengxu Liu, Lu Qi, Jinshan Pan et al.

ICCV 2025posterarXiv:2507.13599
2
citations
#835

SDMatte: Grafting Diffusion Models for Interactive Matting

Longfei Huang, Yu Liang, Hao Zhang et al.

ICCV 2025posterarXiv:2508.00443
2
citations
#836

SVG-Head: Hybrid Surface-Volumetric Gaussians for High-Fidelity Head Reconstruction and Real-Time Editing

Heyi Sun, Cong Wang, Tian-Xing Xu et al.

ICCV 2025posterarXiv:2508.09597
2
citations
#837

Bring Your Rear Cameras for Egocentric 3D Human Pose Estimation

HIroyasu Akada, Jian Wang, Vladislav Golyanik et al.

ICCV 2025posterarXiv:2503.11652
2
citations
#838

Describe, Don’t Dictate: Semantic Image Editing with Natural Language Intent

En Ci, Shanyan Guan, Yanhao Ge et al.

ICCV 2025poster
2
citations
#839

What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models

Lorenzo Baraldi, Davide Bucciarelli, Federico Betti et al.

ICCV 2025posterarXiv:2505.20405
2
citations
#840

Online Language Splatting

Saimouli Katragadda, Cho-Ying Wu, Yuliang Guo et al.

ICCV 2025posterarXiv:2503.09447
2
citations
#841

Consensus-Driven Active Model Selection

Justin Kay, Grant Horn, Subhransu Maji et al.

ICCV 2025highlightarXiv:2507.23771
2
citations
#842

StyleKeeper: Prevent Content Leakage using Negative Visual Query Guidance

Jaeseok Jeong, Junho Kim, Youngjung Uh et al.

ICCV 2025posterarXiv:2510.06827
2
citations
#843

Differential-informed Sample Selection Accelerates Multimodal Contrastive Learning

Zihua Zhao, Feng Hong, Mengxi Chen et al.

ICCV 2025posterarXiv:2507.12998
2
citations
#844

Bootstrap3D: Improving Multi-view Diffusion Model with Synthetic Data

Zeyi Sun, Tong Wu, Pan Zhang et al.

ICCV 2025posterarXiv:2406.00093
2
citations
#845

Sequential keypoint density estimator: an overlooked baseline of skeleton-based video anomaly detection

Anja Delić, Matej Grcic, Siniša Šegvić

ICCV 2025highlightarXiv:2506.18368
2
citations
#846

Beyond Spatial Frequency: Pixel-wise Temporal Frequency-based Deepfake Video Detection

Taehoon Kim, Jongwook Choi, Yonghyun Jeong et al.

ICCV 2025highlightarXiv:2507.02398
2
citations
#847

Scale Your Instructions: Enhance the Instruction-Following Fidelity of Unified Image Generation Model by Self-Adaptive Attention Scaling

Chao Zhou, Tianyi Wei, Nenghai Yu

ICCV 2025posterarXiv:2507.16240
2
citations
#848

Demeter: A Parametric Model of Crop Plant Morphology from the Real World

Tianhang Cheng, Albert Zhai, Evan Chen et al.

ICCV 2025posterarXiv:2510.16377
2
citations
#849

Dark-ISP: Enhancing RAW Image Processing for Low-Light Object Detection

Jiasheng Guo, Xin Gao, Yuxiang Yan et al.

ICCV 2025posterarXiv:2509.09183
2
citations
#850

A Unified Framework for Motion Reasoning and Generation in Human Interaction

Jeongeun Park, Sungjoon Choi, Sangdoo Yun

ICCV 2025posterarXiv:2410.05628
2
citations
#851

Multi-modal Multi-platform Person Re-Identification: Benchmark and Method

Ruiyang Ha, Songyi Jiang, Bin Li et al.

ICCV 2025posterarXiv:2503.17096
2
citations
#852

Stable Score Distillation

Haiming Zhu, Yangyang Xu, Chenshu Xu et al.

ICCV 2025posterarXiv:2507.09168
2
citations
#853

PROGRESSOR: A Perceptually Guided Reward Estimator with Self-Supervised Online Refinement

Tewodros W. Ayalew, Xiao Zhang, Kevin Y Wu et al.

ICCV 2025posterarXiv:2411.17764
2
citations
#854

Multimodal Prompt Alignment for Facial Expression Recognition

Fuyan Ma, Yiran He, Bin Sun et al.

ICCV 2025posterarXiv:2506.21017
2
citations
#855

AdsQA: Towards Advertisement Video Understanding

Xinwei Long, Kai Tian, Peng Xu et al.

ICCV 2025posterarXiv:2509.08621
2
citations
#856

IntroStyle: Training-Free Introspective Style Attribution using Diffusion Features

Anand Kumar, Jiteng Mu, Nuno Vasconcelos

ICCV 2025posterarXiv:2412.14432
2
citations
#857

AHCPTQ: Accurate and Hardware-Compatible Post-Training Quantization for Segment Anything Model

Wenlun Zhang, Yunshan Zhong, Shimpei Ando et al.

ICCV 2025posterarXiv:2503.03088
2
citations
#858

Unlocking Constraints: Source-Free Occlusion-Aware Seamless Segmentation

Yihong Cao, Jiaming Zhang, Xu Zheng et al.

ICCV 2025posterarXiv:2506.21198
2
citations
#859

Leveraging Local Patch Alignment to Seam-cutting for Large Parallax Image Stitching

Tianli Liao, Chenyang Zhao, Lei Li et al.

ICCV 2025posterarXiv:2311.18564
2
citations
#860

Music-Aligned Holistic 3D Dance Generation via Hierarchical Motion Modeling

LI XIAOJIE, Ronghui Li, Shukai Fang et al.

ICCV 2025posterarXiv:2507.14915
2
citations
#861

Global Regulation and Excitation via Attention Tuning for Stereo Matching

Jiahao LI, Xinhong Chen, Zhengmin JIANG et al.

ICCV 2025posterarXiv:2509.15891
2
citations
#862

Towards Explicit Exoskeleton for the Reconstruction of Complicated 3D Human Avatars

Yifan Zhan, Qingtian Zhu, Muyao Niu et al.

ICCV 2025posterarXiv:2410.08082
2
citations
#863

What's in a Latent? Leveraging Diffusion Latent Space for Domain Generalization

Xavier Thomas, Deepti Ghadiyaram

ICCV 2025posterarXiv:2503.06698
2
citations
#864

GCRayDiffusion: Pose-Free Surface Reconstruction via Geometric Consistent Ray Diffusion

Li-Heng Chen, Zi-Xin Zou, Chang Liu et al.

ICCV 2025posterarXiv:2503.22349
2
citations
#865

SUB: Benchmarking CBM Generalization via Synthetic Attribute Substitutions

Jessica Bader, Leander Girrbach, Stephan Alaniz et al.

ICCV 2025posterarXiv:2507.23784
2
citations
#866

Noise2Score3D: Tweedie's Approach for Unsupervised Point Cloud Denoising

Xiangbin Wei, Yuanfeng Wang, Ao XU et al.

ICCV 2025posterarXiv:2503.09283
2
citations
#867

SiM3D: Single-instance Multiview Multimodal and Multisetup 3D Anomaly Detection Benchmark

Alex Costanzino, Pierluigi Zama Ramirez, Luigi Lella et al.

ICCV 2025posterarXiv:2506.21549
2
citations
#868

ChartCap: Mitigating Hallucination of Dense Chart Captioning

Junyoung Lim, Jaewoo Ahn, Gunhee Kim

ICCV 2025highlightarXiv:2508.03164
2
citations
#869

FG-OrIU: Towards Better Forgetting via Feature-Gradient Orthogonality for Incremental Unlearning

qian feng, Jiahang Tu, Mintong Kang et al.

ICCV 2025posterarXiv:2601.13578
2
citations
#870

Prior2Former - Evidential Modeling of Mask Transformers for Assumption-Free Open-World Panoptic Segmentation

Sebastian Schmidt, Julius Koerner, Dominik Fuchsgruber et al.

ICCV 2025highlightarXiv:2504.04841
2
citations
#871

Multi-Modal Few-Shot Temporal Action Segmentation

Zijia Lu, Ehsan Elhamifar

ICCV 2025poster
2
citations
#872

Object-level Correlation for Few-Shot Segmentation

chunlin wen, Yu Zhang, Jie Fan et al.

ICCV 2025posterarXiv:2509.07917
2
citations
#873

Self-Reinforcing Prototype Evolution with Dual-Knowledge Cooperation for Semi-Supervised Lifelong Person Re-Identification

Kunlun Xu, Fan Zhuo, Jiangmeng Li et al.

ICCV 2025posterarXiv:2507.01884
2
citations
#874

DEPTHOR: Depth Enhancement from a Practical Light-Weight dToF Sensor and RGB Image

Jijun Xiang, Xuan Zhu, Xianqi Wang et al.

ICCV 2025posterarXiv:2504.01596
2
citations
#875

Frequency-Guided Posterior Sampling for Diffusion-Based Image Restoration

Darshan Thaker, Abhishek Goyal, Rene Vidal

ICCV 2025posterarXiv:2411.15295
2
citations
#876

ETA: Energy-based Test-time Adaptation for Depth Completion

Younjoon Chung, Hyoungseob Park, Patrick Rim et al.

ICCV 2025posterarXiv:2508.05989
2
citations
#877

ODP-Bench: Benchmarking Out-of-Distribution Performance Prediction

Han Yu, Kehan Li, Dongbai Li et al.

ICCV 2025posterarXiv:2510.27263
2
citations
#878

Diffusion Image Prior

Hamadi Chihaoui, Paolo Favaro

ICCV 2025posterarXiv:2503.21410
2
citations
#879

RePoseD: Efficient Relative Pose Estimation With Known Depth Information

Yaqing Ding, Viktor Kocur, VACLAV VAVRA et al.

ICCV 2025posterarXiv:2501.07742
2
citations
#880

Bridging the Skeleton-Text Modality Gap: Diffusion-Powered Modality Alignment for Zero-shot Skeleton-based Action Recognition

Jeonghyeok Do, Munchurl Kim

ICCV 2025posterarXiv:2411.10745
2
citations
#881

Synchronization of Multiple Videos

Avihai Naaman, Ron Shapira Weber, Oren Freifeld

ICCV 2025posterarXiv:2510.14051
2
citations
#882

A Structure-aware and Motion-adaptive Framework for 3D Human Pose Estimation with Mamba

Ye Lu, Jie Wang, Jianjun Gao et al.

ICCV 2025posterarXiv:2507.19852
2
citations
#883

MOSAIC: Generating Consistent, Privacy-Preserving Scenes from Multiple Depth Views in Multi-Room Environments

Zhixuan Liu, Haokun Zhu, Rui Chen et al.

ICCV 2025posterarXiv:2503.13816
2
citations
#884

HUG: Hierarchical Urban Gaussian Splatting with Block-Based Reconstruction for Large-Scale Aerial Scenes

Mai Su, Zhongtao Wang, Huishan Au et al.

ICCV 2025posterarXiv:2504.16606
1
citations
#885

SKALD: Learning-Based Shot Assembly for Coherent Multi-Shot Video Creation

Chen Yi Lu, Mehrab Tanjim, Ishita Dasgupta et al.

ICCV 2025posterarXiv:2503.08010
1
citations
#886

Representing 3D Shapes With 64 Latent Vectors for 3D Diffusion Models

In Cho, Youngbeom Yoo, Subin Jeon et al.

ICCV 2025posterarXiv:2503.08737
1
citations
#887

Improving Large Vision and Language Models by Learning from a Panel of Peers

Jefferson Hernandez, Jing Shi, Simon Jenni et al.

ICCV 2025posterarXiv:2509.01610
1
citations
#888

Occlusion-robust Stylization for Drawing-based 3D Animation

Sunjae Yoon, Gwanhyeong Koo, Younghwan Lee et al.

ICCV 2025posterarXiv:2508.00398
1
citations
#889

BabyVLM: Data-Efficient Pretraining of VLMs Inspired by Infant Learning

Shengao Wang, Arjun Chandra, Aoming Liu et al.

ICCV 2025posterarXiv:2504.09426
1
citations
#890

S⁴M: Boosting Semi-Supervised Instance Segmentation with SAM

Heeji Yoon, Heeseong Shin, Eunbeen Hong et al.

ICCV 2025poster
1
citations
#891

Blended Point Cloud Diffusion for Localized Text-guided Shape Editing

Etai Sella, Noam Atia, Ron Mokady et al.

ICCV 2025highlightarXiv:2507.15399
1
citations
#892

How Would It Sound? Material-Controlled Multimodal Acoustic Profile Generation for Indoor Scenes

Mahnoor Saad, Ziad Al-Halah

ICCV 2025posterarXiv:2508.02905
1
citations
#893

Purge-Gate: Efficient Backpropagation-Free Test-Time Adaptation for Point Clouds via Token purging

Moslem Yazdanpanah, Ali Bahri, Mehrdad Noori et al.

ICCV 2025poster
1
citations
#894

UIP2P: Unsupervised Instruction-based Image Editing via Edit Reversibility Constraint

Enis Simsar, Alessio Tonioni, Yongqin Xian et al.

ICCV 2025posterarXiv:2412.15216
1
citations
#895

StealthAttack: Robust 3D Gaussian Splatting Poisoning via Density-Guided Illusions

Bo-Hsu Ke, You-Zhe Xie, Yu-Lun Liu et al.

ICCV 2025posterarXiv:2510.02314
1
citations
#896

ViT-EnsembleAttack: Augmenting Ensemble Models for Stronger Adversarial Transferability in Vision Transformers

Hanwen Cao, Haobo Lu, Xiaosen Wang et al.

ICCV 2025posterarXiv:2508.12384
1
citations
#897

Boosting Domain Generalized and Adaptive Detection with Diffusion Models: Fitness, Generalization, and Transferability

Boyong He, Yuxiang Ji, Zhuoyue Tan et al.

ICCV 2025highlightarXiv:2506.21042
1
citations
#898

PLADIS: Pushing the Limits of Attention in Diffusion Models at Inference Time by Leveraging Sparsity

Kwanyoung Kim, Byeongsu Sim

ICCV 2025posterarXiv:2503.07677
1
citations
#899

D3QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection

Yanran Zhang, Bingyao Yu, Yu Zheng et al.

ICCV 2025poster
1
citations
#900

Evading Data Provenance in Deep Neural Networks

Hongyu Zhu, Sichu Liang, Wenwen Wang et al.

ICCV 2025highlightarXiv:2508.01074
1
citations
#901

SPADE: Spatial-Aware Denoising Network for Open-vocabulary Panoptic Scene Graph Generation with Long- and Local-range Context Reasoning

XIN Hu, Ke Qin, Guiduo Duan et al.

ICCV 2025posterarXiv:2507.05798
1
citations
#902

SegmentDreamer: Towards High-fidelity Text-to-3D Synthesis with Segmented Consistency Trajectory Distillation

Jiahao Zhu, Zixuan Chen, Guangcong Wang et al.

ICCV 2025posterarXiv:2507.05256
1
citations
#903

AnimalClue: Recognizing Animals by their Traces

Risa Shinoda, Nakamasa Inoue, Iro Laina et al.

ICCV 2025highlightarXiv:2507.20240
1
citations
#904

VSRM: A Robust Mamba-Based Framework for Video Super-Resolution

Phu Tran Dinh, Hung Dao, Daeyoung Kim

ICCV 2025posterarXiv:2506.22762
1
citations
#905

Text Embedding Knows How to Quantize Text-Guided Diffusion Models

Hongjae Lee, Myungjun Son, Dongjea Kang et al.

ICCV 2025posterarXiv:2507.10340
1
citations
#906

Enhancing Image Restoration Transformer via Adaptive Translation Equivariance

JiaKui Hu, Zhengjian Yao, Lujia Jin et al.

ICCV 2025posterarXiv:2506.18520
1
citations
#907

Adversarial Exploitation of Data Diversity Improves Visual Localization

Sihang Li, Siqi Tan, Bowen Chang et al.

ICCV 2025posterarXiv:2412.00138
1
citations
#908

StrandHead: Text to Hair-Disentangled 3D Head Avatars Using Human-Centric Priors

Xiaokun Sun, Zeyu Cai, Ying Tai et al.

ICCV 2025posterarXiv:2412.11586
1
citations
#909

ASCENT: Annotation-free Self-supervised Contrastive Embeddings for 3D Neuron Tracking in Fluorescence Microscopy

Haejun Han, Hang Lu

ICCV 2025poster
1
citations
#910

IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance

Jiayi Guo, Chuanhao Yan, Xingqian Xu et al.

ICCV 2025posterarXiv:2509.26231
1
citations
#911

PrimHOI: Compositional Human-Object Interaction via Reusable Primitives

Kai Jia, Tengyu Liu, Mingtao Pei et al.

ICCV 2025poster
1
citations
#912

Saliency-Aware Quantized Imitation Learning for Efficient Robotic Control

Seongmin Park, Hyungmin Kim, Sangwoo kim et al.

ICCV 2025posterarXiv:2505.15304
1
citations
#913

PINO: Person-Interaction Noise Optimization for Long-Duration and Customizable Motion Generation of Arbitrary-Sized Groups

Sakuya Ota, Qing Yu, Kent Fujiwara et al.

ICCV 2025posterarXiv:2507.19292
1
citations
#914

Augmented and Softened Matching for Unsupervised Visible-Infrared Person Re-Identification

Zhiqi Pang, Chunyu Wang, Lingling Zhao et al.

ICCV 2025poster
1
citations
#915

PolarAnything: Diffusion-based Polarimetric Image Synthesis

Kailong Zhang, Youwei Lyu, Heng Guo et al.

ICCV 2025highlightarXiv:2507.17268
1
citations
#916

Auxiliary Prompt Tuning of Vision-Language Models for Few-Shot Out-of-Distribution Detection

Wenjun Miao, Guansong Pang, Zihan Wang et al.

ICCV 2025poster
1
citations
#917

PatchScaler: An Efficient Patch-Independent Diffusion Model for Image Super-Resolution

Yong Liu, Hang Dong, Jinshan Pan et al.

ICCV 2025posterarXiv:2405.17158
1
citations
#918

Efficient Spiking Point Mamba for Point Cloud Analysis

Peixi Wu, Bosong Chai, Menghua Zheng et al.

ICCV 2025posterarXiv:2504.14371
1
citations
#919

Probabilistic Prototype Calibration of Vision-language Models for Generalized Few-shot Semantic Segmentation

Jie Liu, Jiayi Shen, Pan Zhou et al.

ICCV 2025posterarXiv:2506.22979
1
citations
#920

Latent Swap Joint Diffusion for 2D Long-Form Latent Generation

Yusheng Dai, Chenxi Wang, Chang Li et al.

ICCV 2025posterarXiv:2502.05130
1
citations
#921

Discontinuity-aware Normal Integration for Generic Central Camera Models

Francesco Milano, Manuel Lopez-Antequera, Naina Dhingra et al.

ICCV 2025highlightarXiv:2507.06075
1
citations
#922

A Conditional Probability Framework for Compositional Zero-shot Learning

Peng Wu, Qiuxia Lai, Hao Fang et al.

ICCV 2025posterarXiv:2507.17377
1
citations
#923

After the Party: Navigating the Mapping From Color to Ambient Lighting

Florin-Alexandru Vasluianu, Tim Seizinger, Zongwei Wu et al.

ICCV 2025posterarXiv:2508.02168
1
citations
#924

Web Artifact Attacks Disrupt Vision Language Models

Maan Qraitem, Piotr Teterwak, Kate Saenko et al.

ICCV 2025posterarXiv:2503.13652
1
citations
#925

Revisiting Point Cloud Completion: Are We Ready For The Real-World?

Stuti Pathak, Prashant Kumar, Dheeraj Baiju et al.

ICCV 2025posterarXiv:2411.17580
1
citations
#926

Scene Coordinate Reconstruction Priors

Wenjing Bian, Axel Barroso-Laguna, Tommaso Cavallari et al.

ICCV 2025posterarXiv:2510.12387
1
citations
#927

AFUNet: Cross-Iterative Alignment-Fusion Synergy for HDR Reconstruction via Deep Unfolding Paradigm

Xinyue Li, Zhangkai Ni, Wenhan Yang

ICCV 2025posterarXiv:2506.23537
1
citations
#928

AutoScape: Geometry-Consistent Long-Horizon Scene Generation

Jiacheng Chen, Ziyu Jiang, Mingfu Liang et al.

ICCV 2025posterarXiv:2510.20726
1
citations
#929

Dual Reciprocal Learning of Language-based Human Motion Understanding and Generation

CHEN LIANG, Zhicheng Shi, Wenguan Wang et al.

ICCV 2025poster
1
citations
#930

DAMap: Distance-aware MapNet for High Quality HD Map Construction

JINPENG DONG, Chen Li, Yutong Lin et al.

ICCV 2025posterarXiv:2510.22675
1
citations
#931

Causal Disentanglement and Cross-Modal Alignment for Enhanced Few-Shot Learning

Tianjiao Jiang, Zhen Zhang, Yuhang Liu et al.

ICCV 2025posterarXiv:2508.03102
1
citations
#932

ShortFT: Diffusion Model Alignment via Shortcut-based Fine-Tuning

Xiefan Guo, Miaomiao Cui, Liefeng Bo et al.

ICCV 2025posterarXiv:2507.22604
1
citations
#933

Fast Globally Optimal and Geometrically Consistent 3D Shape Matching

Paul Roetzer, Florian Bernard

ICCV 2025highlightarXiv:2504.06385
1
citations
#934

A Real-world Display Inverse Rendering Dataset

Seokjun Choi, Hoon-Gyu Chung, Yujin Jeon et al.

ICCV 2025posterarXiv:2508.14411
1
citations
#935

DLFR-Gen: Diffusion-based Video Generation with Dynamic Latent Frame Rate

Zhihang Yuan, Rui Xie, Yuzhang Shang et al.

ICCV 2025poster
1
citations
#936

Normal and Abnormal Pathology Knowledge-Augmented Vision-Language Model for Anomaly Detection in Pathology Images

Jinsol Song, Jiamu Wang, Anh Nguyen et al.

ICCV 2025posterarXiv:2508.15256
1
citations
#937

VITAL: More Understandable Feature Visualization through Distribution Alignment and Relevant Information Flow

Ada Görgün, Bernt Schiele, Jonas Fischer

ICCV 2025posterarXiv:2503.22399
1
citations
#938

Progressive Artwork Outpainting via Latent Diffusion Models

Dae-Young Song, Jung-Jae Yu, Donghyeon Cho

ICCV 2025poster
1
citations
#939

MosaicDiff: Training-free Structural Pruning for Diffusion Model Acceleration Reflecting Pretraining Dynamics

Bowei Guo, Shengkun Tang, Cong Zeng et al.

ICCV 2025posterarXiv:2510.11962
1
citations
#940

FakeRadar: Probing Forgery Outliers to Detect Unknown Deepfake Videos

Zhaolun Li, Jichang Li, Yinqi Cai et al.

ICCV 2025posterarXiv:2512.14601
1
citations
#941

Sparse-Dense Side-Tuner for efficient Video Temporal Grounding

David Pujol-Perich, Sergio Escalera, Albert Clapés

ICCV 2025posterarXiv:2507.07744
1
citations
#942

Generalizable Non-Line-of-Sight Imaging with Learnable Physical Priors

Shida Sun, Yue Li, Yueyi Zhang et al.

ICCV 2025posterarXiv:2409.14011
1
citations
#943

Benchmarking Burst Super-Resolution for Polarization Images: Noise Dataset and Analysis

Inseung Hwang, Kiseok Choi, Hyunho Ha et al.

ICCV 2025posterarXiv:2503.18705
1
citations
#944

TrafficLoc: Localizing Traffic Surveillance Cameras in 3D Scenes

Yan Xia, Yunxiang Lu, Rui Song et al.

ICCV 2025posterarXiv:2412.10308
1
citations
#945

All in One: Visual-Description-Guided Unified Point Cloud Segmentation

Zongyan Han, Mohamed El Amine Boudjoghra, Jiahua Dong et al.

ICCV 2025posterarXiv:2507.05211
1
citations
#946

EvRT-DETR: Latent Space Adaptation of Image Detectors for Event-based Vision

Dmitrii Torbunov, Yihui Ren, Animesh Ghose et al.

ICCV 2025posterarXiv:2412.02890
1
citations
#947

Auto-Regressively Generating Multi-View Consistent Images

JiaKui Hu, Yuxiao Yang, Jialun Liu et al.

ICCV 2025posterarXiv:2506.18527
1
citations
#948

MoSiC: Optimal-Transport Motion Trajectory for Dense Self-Supervised Learning

Mohammadreza Salehi, Shashanka Venkataramanan, Ioana Simion et al.

ICCV 2025posterarXiv:2506.08694
1
citations
#949

Stylized-Face: A Million-level Stylized Face Dataset for Face Recognition

Zhengyuan Peng, Jianqing Xu, Yuge Huang et al.

ICCV 2025poster
1
citations
#950

Correspondence-Free Fast and Robust Spherical Point Pattern Registration

Anik Sarker, Alan Asbeck

ICCV 2025posterarXiv:2508.02339
1
citations
#951

Aligning Moments in Time using Video Queries

Yogesh Kumar, Uday Agarwal, Manish Gupta et al.

ICCV 2025posterarXiv:2508.15439
1
citations
#952

Granular Concept Circuits: Toward a Fine-Grained Circuit Discovery for Concept Representations

Dahee Kwon, Sehyun Lee, Jaesik Choi

ICCV 2025posterarXiv:2508.01728
1
citations
#953

CapeLLM: Support-Free Category-Agnostic Pose Estimation with Multimodal Large Language Models

Junho Kim, Hyungjin Chung, Byung-Hoon Kim

ICCV 2025posterarXiv:2411.06869
1
citations
#954

CODE-CL: Conceptor-Based Gradient Projection for Deep Continual Learning

Marco P. Apolinario, Sakshi Choudhary, Kaushik Roy

ICCV 2025posterarXiv:2411.15235
1
citations
#955

Perceiving and Acting in First-Person: A Dataset and Benchmark for Egocentric Human-Object-Human Interactions

Liang Xu, Chengqun Yang, Zili Lin et al.

ICCV 2025posterarXiv:2508.04681
1
citations
#956

Learning Pixel-adaptive Multi-layer Perceptrons for Real-time Image Enhancement

Junyu Lou, Xiaorui Zhao, Kexuan Shi et al.

ICCV 2025posterarXiv:2507.12135
1
citations
#957

MVQA: Mamba with Unified Sampling for Efficient Video Quality Assessment

Yachun Mi, Yu Li, Weicheng Meng et al.

ICCV 2025highlightarXiv:2504.16003
1
citations
#958

Visual Intention Grounding for Egocentric Assistants

Pengzhan Sun, Junbin Xiao, Tze Ho Elden Tse et al.

ICCV 2025posterarXiv:2504.13621
1
citations
#959

Scheduling Weight Transitions for Quantization-Aware Training

Junghyup Lee, Jeimin Jeon, Dohyung Kim et al.

ICCV 2025posterarXiv:2404.19248
1
citations
#960

Seal Your Backdoor with Variational Defense

Ivan Sabolic, Matej Grcic, Siniša Šegvić

ICCV 2025posterarXiv:2503.08829
1
citations
#961

A Plug-and-Play Physical Motion Restoration Approach for In-the-Wild High-Difficulty Motions

Youliang Zhang, Ronghui Li, Yachao Zhang et al.

ICCV 2025highlightarXiv:2412.17377
1
citations
#962

MotionDiff: Training-free Zero-shot Interactive Motion Editing via Flow-assisted Multi-view Diffusion

Yikun Ma, Yiqing Li, Jiawei Wu et al.

ICCV 2025posterarXiv:2503.17695
1
citations
#963

FPEM: Face Prior Enhanced Facial Attractiveness Prediction for Live Videos with Face Retouching

Hui Li, Xiaoyu Ren, Hongjiu Yu et al.

ICCV 2025highlight
1
citations
#964

MCOP: Multi-UAV Collaborative Occupancy Prediction

Zefu Lin, Wenbo Chen, Xiaojuan Jin et al.

ICCV 2025posterarXiv:2510.12679
1
citations
#965

PoseAnchor: Robust Root Position Estimation for 3D Human Pose Estimation

Jun-Hee Kim, Jumin Han, Seong-Whan Lee

ICCV 2025poster
1
citations
#966

DICE: Staleness-Centric Optimizations for Parallel Diffusion MoE Inference

Jiajun Luo, Lizhuo Luo, Jianru Xu et al.

ICCV 2025poster
1
citations
#967

Guiding Noisy Label Conditional Diffusion Models with Score-based Discriminator Correction

Dat Cong, Hieu Tran, Hoang Thanh-Tung

ICCV 2025posterarXiv:2508.19581
1
citations
#968

Mind the Cost of Scaffold! Benign Clients May Even Become Accomplices of Backdoor Attack

Xingshuo Han, Xuanye Zhang, Xiang Lan et al.

ICCV 2025posterarXiv:2411.16167
1
citations
#969

TrackAny3D: Transferring Pretrained 3D Models for Category-unified 3D Point Cloud Tracking

Mengmeng Wang, Haonan Wang, Yulong Li et al.

ICCV 2025posterarXiv:2507.19908
1
citations
#970

MeshMamba: State Space Models for Articulated 3D Mesh Generation and Reconstruction

Yusuke Yoshiyasu, Leyuan Sun, Ryusuke Sagawa

ICCV 2025posterarXiv:2507.15212
1
citations
#971

Learning Robust Image Watermarking with Lossless Cover Recovery

jiale chen, Wei Wang, Chongyang Shi et al.

ICCV 2025poster
1
citations
#972

Multi-View 3D Point Tracking

Frano Rajič, Haofei Xu, Marko Mihajlovic et al.

ICCV 2025posterarXiv:2508.21060
1
citations
#973

Decoding Correlation-Induced Misalignment in the Stable Diffusion Workflow for Text-to-Image Generation

Yunze Tong, Fengda Zhang, Didi Zhu et al.

ICCV 2025poster
1
citations
#974

DAViD: Data-efficient and Accurate Vision Models from Synthetic Data

Fatemeh Saleh, Sadegh Aliakbarian, Charlie Hewitt et al.

ICCV 2025posterarXiv:2507.15365
1
citations
#975

Enhancing Numerical Prediction of MLLMs with Soft Labeling

Pei Wang, Zhaowei Cai, Hao Yang et al.

ICCV 2025poster
1
citations
#976

ProGait: A Multi-Purpose Video Dataset and Benchmark for Transfemoral Prosthesis Users

Xiangyu Yin, Boyuan Yang, Weichen Liu et al.

ICCV 2025highlightarXiv:2507.10223
1
citations
#977

Plug-in Feedback Self-adaptive Attention in CLIP for Training-free Open-Vocabulary Segmentation

Zhixiang Chi, Yanan Wu, Li Gu et al.

ICCV 2025posterarXiv:2508.20265
1
citations
#978

ARMO: Autoregressive Rigging for Multi-Category Objects

mingze sun, Shiwei Mao, Keyi Chen et al.

ICCV 2025posterarXiv:2503.20663
1
citations
#979

FA: Forced Prompt Learning of Vision-Language Models for Out-of-Distribution Detection

Xinhua Lu, Runhe Lai, Yanqi Wu et al.

ICCV 2025posterarXiv:2507.04511
1
citations
#980

Stronger, Steadier & Superior: Geometric Consistency in Depth VFM Forges Domain Generalized Semantic Segmentation

Siyu Chen, Ting Han, Changshe Zhang et al.

ICCV 2025posterarXiv:2504.12753
1
citations
#981

HumorDB: Can AI understand graphical humor?

Vedaant V Jain, Gabriel Kreiman, Felipe Feitosa

ICCV 2025posterarXiv:2406.13564
1
citations
#982

Membership Inference Attacks with False Discovery Rate Control

Chenxu Zhao, Wei Qian, Aobo Chen et al.

ICCV 2025posterarXiv:2508.07066
1
citations
#983

Streaming VideoLLMs for Real-Time Procedural Video Understanding

Dibyadip Chatterjee, Edoardo Remelli, Yale Song et al.

ICCV 2025posterarXiv:2504.13915
1
citations
#984

TAB: Transformer Attention Bottlenecks enable User Intervention and Debugging in Vision-Language Models

Pooyan Rahmanzadehgervi, Hung Nguyen, Rosanne Liu et al.

ICCV 2025posterarXiv:2412.18675
1
citations
#985

OmniVTON: Training-Free Universal Virtual Try-On

Zhaotong Yang, Yuhui Li, Shengfeng He et al.

ICCV 2025posterarXiv:2507.15037
1
citations
#986

Towards Fine-grained Interactive Segmentation in Images and Videos

Yuan Yao, Qiushi Yang, Miaomiao Cui et al.

ICCV 2025posterarXiv:2502.09660
1
citations
#987

SpiLiFormer: Enhancing Spiking Transformers with Lateral Inhibition

Zeqi Zheng, Yanchen Huang, Yingchao Yu et al.

ICCV 2025posterarXiv:2503.15986
1
citations
#988

Free-MoRef: Instantly Multiplexing Context Perception Capabilities of Video-MLLMs within Single Inference

KUO WANG, Quanlong Zheng, Junlin Xie et al.

ICCV 2025posterarXiv:2508.02134
1
citations
#989

BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation

Yuanhong Yu, Xingyi He, Chen Zhao et al.

ICCV 2025posterarXiv:2504.07955
1
citations
#990

FreeDance: Towards Harmonic Free-Number Group Dance Generation via a Unified Framework

Yiwen Zhao, Yang Wang, Liting Wen et al.

ICCV 2025poster
1
citations
#991

CA-I2P: Channel-Adaptive Registration Network with Global Optimal Selection

Zhixin Cheng, Jiacheng Deng, Xinjun Li et al.

ICCV 2025posterarXiv:2506.21364
1
citations
#992

Ask and Remember: A Questions-Only Replay Strategy for Continual Visual Question Answering

Imad Eddine MAROUF, Enzo Tartaglione, Stéphane Lathuilière et al.

ICCV 2025posterarXiv:2502.04469
1
citations
#993

MUG: Pseudo Labeling Augmented Audio-Visual Mamba Network for Audio-Visual Video Parsing

Langyu Wang, Langyu Wang, Yingying Chen et al.

ICCV 2025posterarXiv:2507.01384
1
citations
#994

Triad: Empowering LMM-based Anomaly Detection with Expert-guided Region-of-Interest Tokenizer and Manufacturing Process

Yuanze Li, Shihao Yuan, Haolin Wang et al.

ICCV 2025poster
1
citations
#995

IAP: Invisible Adversarial Patch Attack through Perceptibility-Aware Localization and Perturbation Optimization

Subrat Kishore Dutta, Xiao Zhang

ICCV 2025posterarXiv:2507.06856
1
citations
#996

Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework

Yi-Ting Chen, Ting-Hsuan Liao, Pengsheng Guo et al.

ICCV 2025posterarXiv:2508.04090
1
citations
#997

On the Robustness Tradeoff in Fine-Tuning

Kunyang Li, Jean-Charles Noirot Ferrand, Ryan Sheatsley et al.

ICCV 2025posterarXiv:2503.14836
1
citations
#998

MMAT-1M: A Large Reasoning Dataset for Multimodal Agent Tuning

Tianhong Gao, Yannian Fu, Weiqun Wu et al.

ICCV 2025posterarXiv:2507.21924
1
citations
#999

Meta-Learning Dynamic Center Distance: Hard Sample Mining for Learning with Noisy Labels

Chenyu Mu, Yijun Qu, Jiexi Yan et al.

ICCV 2025poster
1
citations
#1000

Passing the Driving Knowledge Test

Maolin Wei, Wanzhou Liu, Eshed Ohn-Bar

ICCV 2025posterarXiv:2508.21824
1
citations