Most Cited ICCV "image editing manipulation" Papers

2,701 papers found • Page 6 of 14

#1001

Embodied Image Captioning: Self-supervised Learning Agents for Spatially Coherent Image Descriptions

Tommaso Galliena, Tommaso Apicella, Stefano Rosa et al.

ICCV 2025highlightarXiv:2504.08531
1
citations
#1002

FLOSS: Free Lunch in Open-vocabulary Semantic Segmentation

Yasser Benigmim, Mohammad Fahes, Tuan-Hung Vu et al.

ICCV 2025posterarXiv:2504.10487
1
citations
#1003

AIComposer: Any Style and Content Image Composition via Feature Integration

Haowen Li, Zhenfeng Fan, Zhang Wen et al.

ICCV 2025posterarXiv:2507.20721
1
citations
#1004

Text Embedding Knows How to Quantize Text-Guided Diffusion Models

Hongjae Lee, Myungjun Son, Dongjea Kang et al.

ICCV 2025posterarXiv:2507.10340
1
citations
#1005

Referring Expression Comprehension for Small Objects

Kanoko Goto, Takumi Hirose, Mahiro Ukai et al.

ICCV 2025posterarXiv:2510.03701
1
citations
#1006

PersPose: 3D Human Pose Estimation with Perspective Encoding and Perspective Rotation

Xiaoyang Hao, Han Li

ICCV 2025posterarXiv:2508.17239
1
citations
#1007

Achieving More with Less: Additive Prompt Tuning for Rehearsal-Free Class-Incremental Learning

Haoran Chen, Ping Wang, Zihan Zhou et al.

ICCV 2025posterarXiv:2503.07979
1
citations
#1008

Multidimensional Byte Pair Encoding: Shortened Sequences for Improved Visual Data Generation

Tim Elsner, Paula Usinger, Julius Nehring-Wirxel et al.

ICCV 2025posterarXiv:2411.10281
1
citations
#1009

Describe, Adapt and Combine: Empowering CLIP Encoders for Open-set 3D Object Retrieval

Zhichuan Wang, Yang Zhou, Zhe Liu et al.

ICCV 2025posterarXiv:2507.21489
1
citations
#1010

StrandHead: Text to Hair-Disentangled 3D Head Avatars Using Human-Centric Priors

Xiaokun Sun, Zeyu Cai, Ying Tai et al.

ICCV 2025posterarXiv:2412.11586
1
citations
#1011

FakeRadar: Probing Forgery Outliers to Detect Unknown Deepfake Videos

Zhaolun Li, Jichang Li, Yinqi Cai et al.

ICCV 2025posterarXiv:2512.14601
1
citations
#1012

Devil is in the Uniformity: Exploring Diverse Learners within Transformer for Image Restoration

Shihao Zhou, Dayu Li, Jinshan Pan et al.

ICCV 2025posterarXiv:2503.20174
1
citations
#1013

Diffusion-based 3D Hand Motion Recovery with Intuitive Physics

Yufei Zhang, Zijun Cui, Jeffrey Kephart et al.

ICCV 2025posterarXiv:2508.01835
1
citations
#1014

VisHall3D: Monocular Semantic Scene Completion from Reconstructing the Visible Regions to Hallucinating the Invisible Regions

Haoang Lu, Yuanqi Su, Xiaoning Zhang et al.

ICCV 2025posterarXiv:2507.19188
1
citations
#1015

M-Net: MRI Brain Tumor Sequential Segmentation Network via Mesh-Cast

Jiacheng Lu, Hui Ding, Shiyu Zhang et al.

ICCV 2025posterarXiv:2507.20582
1
citations
#1016

Activation Subspaces for Out-of-Distribution Detection

Barış Zöngür, Robin Hesse, Stefan Roth

ICCV 2025posterarXiv:2508.21695
1
citations
#1017

CULTURE3D: A Large-Scale and Diverse Dataset of Cultural Landmarks and Terrains for Gaussian-Based Scene Rendering

xinyi zheng, Steve Zhang, Weizhe Lin et al.

ICCV 2025posterarXiv:2501.06927
1
citations
#1018

Saliency-Aware Quantized Imitation Learning for Efficient Robotic Control

Seongmin Park, Hyungmin Kim, Sangwoo kim et al.

ICCV 2025posterarXiv:2505.15304
1
citations
#1019

ASCENT: Annotation-free Self-supervised Contrastive Embeddings for 3D Neuron Tracking in Fluorescence Microscopy

Haejun Han, Hang Lu

ICCV 2025poster
1
citations
#1020

Augmented and Softened Matching for Unsupervised Visible-Infrared Person Re-Identification

Zhiqi Pang, Chunyu Wang, Lingling Zhao et al.

ICCV 2025poster
1
citations
#1021

SPA: Efficient User-Preference Alignment against Uncertainty in Medical Image Segmentation

Jiayuan Zhu, Junde Wu, Cheng Ouyang et al.

ICCV 2025posterarXiv:2411.15513
1
citations
#1022

PINO: Person-Interaction Noise Optimization for Long-Duration and Customizable Motion Generation of Arbitrary-Sized Groups

Sakuya Ota, Qing Yu, Kent Fujiwara et al.

ICCV 2025posterarXiv:2507.19292
1
citations
#1023

TurboVSR: Fantastic Video Upscalers and Where to Find Them

Zhongdao Wang, Guodongfang Zhao, Jingjing Ren et al.

ICCV 2025highlightarXiv:2506.23618
1
citations
#1024

Towards Adversarial Robustness via Debiased High-Confidence Logit Alignment

Kejia Zhang, Juanjuan Weng, Zhiming Luo et al.

ICCV 2025posterarXiv:2408.06079
1
citations
#1025

Serialization based Point Cloud Oversegmentation

chenghui Lu, Dilong Li, Jianlong Kwan et al.

ICCV 2025poster
1
citations
#1026

Balancing Conservatism and Aggressiveness: Prototype-Affinity Hybrid Network for Few-Shot Segmentation

Tianyu Zou, Shengwu Xiong, Ruilin Yao et al.

ICCV 2025posterarXiv:2507.19140
1
citations
#1027

Beyond Blur: A Fluid Perspective on Generative Diffusion Models

Grzegorz Gruszczynski, Jakub Meixner, Michał Włodarczyk et al.

ICCV 2025posterarXiv:2506.16827
1
citations
#1028

LoRAverse: A Submodular Framework to Retrieve Diverse Adapters for Diffusion Models

Mert Sonmezer, Matthew Zheng, Pinar Yanardag

ICCV 2025posterarXiv:2510.15022
1
citations
#1029

DIMO: Diverse 3D Motion Generation for Arbitrary Objects

Linzhan Mou, Jiahui Lei, Chen Wang et al.

ICCV 2025highlightarXiv:2511.07409
1
citations
#1030

Global-Aware Monocular Semantic Scene Completion with State Space Models

Shijie Li, Zhongyao Cheng, Rong Li et al.

ICCV 2025posterarXiv:2503.06569
1
citations
#1031

VAFlow: Video-to-Audio Generation with Cross-Modality Flow Matching

Xihua Wang, Xin Cheng, Yuyue Wang et al.

ICCV 2025poster
1
citations
#1032

ContraGS: Codebook-Condensed and Trainable Gaussian Splatting for Fast, Memory-Efficient Reconstruction

Sankeerth Durvasula, Sharanshangar Muhunthan, Zain Moustafa et al.

ICCV 2025posterarXiv:2509.03775
1
citations
#1033

After the Party: Navigating the Mapping From Color to Ambient Lighting

Florin-Alexandru Vasluianu, Tim Seizinger, Zongwei Wu et al.

ICCV 2025posterarXiv:2508.02168
1
citations
#1034

Rep-MTL: Unleashing the Power of Representation-level Task Saliency for Multi-Task Learning

Zedong Wang, Siyuan Li, Dan Xu

ICCV 2025highlightarXiv:2507.21049
1
citations
#1035

VSRM: A Robust Mamba-Based Framework for Video Super-Resolution

Phu Tran Dinh, Hung Dao, Daeyoung Kim

ICCV 2025posterarXiv:2506.22762
1
citations
#1036

Revisiting Pool-based Prompt Learning for Few-shot Class-incremental Learning

Yongwei Jiang, Yixiong Zou, Yuhua Li et al.

ICCV 2025posterarXiv:2507.09183
1
citations
#1037

Tune-Your-Style: Intensity-tunable 3D Style Transfer with Gaussian Splatting

Yian Zhao, rushi ye, Ruochong Zheng et al.

ICCV 2025poster
1
citations
#1038

ULTHO: Ultra-Lightweight yet Efficient Hyperparameter Optimization in Deep Reinforcement Learning

Mingqi Yuan, Bo Li, Xin Jin et al.

ICCV 2025posterarXiv:2503.06101
1
citations
#1039

Bi-Level Optimization for Self-Supervised AI-Generated Face Detection

Mian Zou, Nan Zhong, Baosheng Yu et al.

ICCV 2025highlightarXiv:2507.22824
1
citations
#1040

Context Guided Transformer Entropy Modeling for Video Compression

Junlong Tong, Wei Zhang, Yaohui Jin et al.

ICCV 2025posterarXiv:2508.01852
1
citations
#1041

Early Timestep Zero-Shot Candidate Selection for Instruction-Guided Image Editing

Joowon Kim, Ziseok Lee, Donghyeon Cho et al.

ICCV 2025posterarXiv:2504.13490
1
citations
#1042

Causality-guided Prompt Learning for Vision-language Models via Visual Granulation

Mengyu Gao, Qiulei Dong

ICCV 2025posterarXiv:2509.03803
1
citations
#1043

STD-GS: Exploring Frame-Event Interaction for SpatioTemporal-Disentangled Gaussian Splatting to Reconstruct High-Dynamic Scene

Hanyu Zhou, Haonan Wang, Haoyue Liu et al.

ICCV 2025posterarXiv:2506.23157
1
citations
#1044

Rethinking Detecting Salient and Camouflaged Objects in Unconstrained Scenes

Zhangjun Zhou, Yiping Li, Chunlin Zhong et al.

ICCV 2025posterarXiv:2412.10943
1
citations
#1045

Taming the Untamed: Graph-Based Knowledge Retrieval and Reasoning for MLLMs to Conquer the Unknown

Bowen Wang, Zhouqiang Jiang, Yasuaki Susumu et al.

ICCV 2025posterarXiv:2506.17589
1
citations
#1046

Fusion Meets Diverse Conditions: A High-diversity Benchmark and Baseline for UAV-based Multimodal Object Detection with Condition Cues

Chen Chen, Kangcheng Bin, Hu Ting et al.

ICCV 2025posterarXiv:2510.13620
1
citations
#1047

Dual Reciprocal Learning of Language-based Human Motion Understanding and Generation

CHEN LIANG, Zhicheng Shi, Wenguan Wang et al.

ICCV 2025poster
1
citations
#1048

Learning Implicit Features with Flow-Infused Transformations for Realistic Virtual Try-On

Delong Zhang, Qiwei Huang, Yang Sun et al.

ICCV 2025poster
1
citations
#1049

DAMap: Distance-aware MapNet for High Quality HD Map Construction

JINPENG DONG, Chen Li, Yutong Lin et al.

ICCV 2025posterarXiv:2510.22675
1
citations
#1050

M-SpecGene: Generalized Foundation Model for RGBT Multispectral Vision

Kailai Zhou, Fuqiang Yang, Shixian Wang et al.

ICCV 2025posterarXiv:2507.16318
1
citations
#1051

PlaceIt3D: Language-Guided Object Placement in Real 3D Scenes

Ahmed Abdelreheem, Filippo Aleotti, Jamie Watson et al.

ICCV 2025posterarXiv:2505.05288
1
citations
#1052

SparseVILA: Decoupling Visual Sparsity for Efficient VLM Inference

Samir Khaki, Junxian Guo, Jiaming Tang et al.

ICCV 2025posterarXiv:2510.17777
1
citations
#1053

CompleteMe: Reference-based Human Image Completion

Yu-Ju Tsai, Brian Price, Qing Liu et al.

ICCV 2025posterarXiv:2504.20042
1
citations
#1054

PLMP - Point-Line Minimal Problems for Projective SfM

Kim Kiehn, Albin Ahlbäck, Kathlén Kohn

ICCV 2025highlightarXiv:2503.04351
1
citations
#1055

PCR-GS: COLMAP-Free 3D Gaussian Splatting via Pose Co-Regularizations

YU WEI, Jiahui Zhang, Xiaoqin Zhang et al.

ICCV 2025posterarXiv:2507.13891
1
citations
#1056

Causal Disentanglement and Cross-Modal Alignment for Enhanced Few-Shot Learning

Tianjiao Jiang, Zhen Zhang, Yuhang Liu et al.

ICCV 2025posterarXiv:2508.03102
1
citations
#1057

UMDATrack: Unified Multi-Domain Adaptive Tracking Under Adverse Weather Conditions

Siyuan Yao, Rui Zhu, Ziqi Wang et al.

ICCV 2025posterarXiv:2507.00648
1
citations
#1058

Fast Globally Optimal and Geometrically Consistent 3D Shape Matching

Paul Roetzer, Florian Bernard

ICCV 2025highlightarXiv:2504.06385
1
citations
#1059

FICGen: Frequency-Inspired Contextual Disentanglement for Layout-driven Degraded Image Generation

Wenzhuang Wang, Yifan Zhao, Mingcan Ma et al.

ICCV 2025posterarXiv:2509.01107
1
citations
#1060

From Holistic to Localized: Local Enhanced Adapters for Efficient Visual Instruction Fine-Tuning

Pengkun Jiao, Bin Zhu, Jingjing Chen et al.

ICCV 2025posterarXiv:2411.12787
1
citations
#1061

Dataset Ownership Verification for Pre-trained Masked Models

Yuechen Xie, Jie Song, Yicheng Shan et al.

ICCV 2025posterarXiv:2507.12022
1
citations
#1062

DAP-MAE: Domain-Adaptive Point Cloud Masked Autoencoder for Effective Cross-Domain Learning

Ziqi Gao, Qiufu Li, Linlin Shen

ICCV 2025highlightarXiv:2510.21635
1
citations
#1063

ShortFT: Diffusion Model Alignment via Shortcut-based Fine-Tuning

Xiefan Guo, Miaomiao Cui, Liefeng Bo et al.

ICCV 2025posterarXiv:2507.22604
1
citations
#1064

Balanced Sharpness-Aware Minimization for Imbalanced Regression

Yahao Liu, Qin Wang, Lixin Duan et al.

ICCV 2025posterarXiv:2508.16973
1
citations
#1065

SpecGuard: Spectral Projection-based Advanced Invisible Watermarking

Inzamamul Alam, Md Islam, Simon Woo et al.

ICCV 2025posterarXiv:2510.07302
1
citations
#1066

AnimalClue: Recognizing Animals by their Traces

Risa Shinoda, Nakamasa Inoue, Iro Laina et al.

ICCV 2025highlightarXiv:2507.20240
1
citations
#1067

Beyond Simple Edits: Composed Video Retrieval with Dense Modifications

Omkar Thawakar, Dmitry Demidov, Ritesh Thawkar et al.

ICCV 2025posterarXiv:2508.14039
1
citations
#1068

Robust Unfolding Network for HDR Imaging with Modulo Cameras

Zhile Chen, Hui Ji

ICCV 2025poster
1
citations
#1069

You Share Beliefs, I Adapt: Progressive Heterogeneous Collaborative Perception

hao si, Ehsan Javanmardi, Manabu Tsukada

ICCV 2025posterarXiv:2509.09310
1
citations
#1070

VITAL: More Understandable Feature Visualization through Distribution Alignment and Relevant Information Flow

Ada Görgün, Bernt Schiele, Jonas Fischer

ICCV 2025posterarXiv:2503.22399
1
citations
#1071

ConformalSAM: Unlocking the Potential of Foundational Segmentation Models in Semi-Supervised Semantic Segmentation with Conformal Prediction

Danhui Chen, Ziquan Liu, Chuxi Yang et al.

ICCV 2025posterarXiv:2507.15803
1
citations
#1072

HypDAE: Hyperbolic Diffusion Autoencoders for Hierarchical Few-shot Image Generation

Lingxiao Li, Kaixuan Fan, Boqing Gong et al.

ICCV 2025posterarXiv:2411.17784
1
citations
#1073

Progressive Artwork Outpainting via Latent Diffusion Models

Dae-Young Song, Jung-Jae Yu, Donghyeon Cho

ICCV 2025poster
1
citations
#1074

MosaicDiff: Training-free Structural Pruning for Diffusion Model Acceleration Reflecting Pretraining Dynamics

Bowei Guo, Shengkun Tang, Cong Zeng et al.

ICCV 2025posterarXiv:2510.11962
1
citations
#1075

GT-Mean Loss: A Simple Yet Effective Solution for Brightness Mismatch in Low-Light Image Enhancement

Jingxi Liao, Shijie Hao, Richang Hong et al.

ICCV 2025posterarXiv:2507.20148
1
citations
#1076

TITAN-Guide: Taming Inference-Time Alignment for Guided Text-to-Video Diffusion Models

Christian Simon, Masato Ishii, Akio Hayakawa et al.

ICCV 2025posterarXiv:2508.00289
1
citations
#1077

Video Color Grading via Look-Up Table Generation

Seunghyun Shin, Dongmin Shin, Jisu Shin et al.

ICCV 2025posterarXiv:2508.00548
1
citations
#1078

S$^3$E: Self-Supervised State Estimation for Radar-Inertial System

Shengpeng Wang, Yulong Xie, Qing Liao et al.

ICCV 2025posterarXiv:2509.25984
1
citations
#1079

METEOR: Multi-Encoder Collaborative Token Pruning for Efficient Vision Language Models

Yuchen Liu, Yaoming Wang, Bowen Shi et al.

ICCV 2025posterarXiv:2507.20842
1
citations
#1080

Robust Low-light Scene Restoration via Illumination Transition

Ze Li, Feng Zhang, Xiatian Zhu et al.

ICCV 2025posterarXiv:2507.03976
1
citations
#1081

GSV3D: Gaussian Splatting-based Geometric Distillation with Stable Video Diffusion for Single-Image 3D Object Generation

Ye Tao, jiawei zhang, Yahao Shi et al.

ICCV 2025posterarXiv:2503.06136
1
citations
#1082

Visual Relation Diffusion for Human-Object Interaction Detection

Ping Cao, Yepeng Tang, Chunjie Zhang et al.

ICCV 2025poster
1
citations
#1083

TrafficLoc: Localizing Traffic Surveillance Cameras in 3D Scenes

Yan Xia, Yunxiang Lu, Rui Song et al.

ICCV 2025posterarXiv:2412.10308
1
citations
#1084

Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction

Giuseppe Cartella, Vittorio Cuculo, Alessandro D'Amelio et al.

ICCV 2025posterarXiv:2507.23021
1
citations
#1085

BASIC: Boosting Visual Alignment with Intrinsic Refined Embeddings in Multimodal Large Language Models

Jianting Tang, Yubo Wang, Haoyu Cao et al.

ICCV 2025posterarXiv:2508.06895
1
citations
#1086

Outlier-Aware Post-Training Quantization for Image Super-Resolution

Hailing Wang, Jianglin Lu, Yitian Zhang et al.

ICCV 2025highlightarXiv:2511.00682
1
citations
#1087

MeshPad: Interactive Sketch-Conditioned Artist-Reminiscent Mesh Generation and Editing

Haoxuan Li, Ziya Erkoç, Lei Li et al.

ICCV 2025posterarXiv:2503.01425
1
citations
#1088

LiON-LoRA: Rethinking LoRA Fusion to Unify Controllable Spatial and Temporal Generation for Video Diffusion

Yisu Zhang, Chenjie Cao, Chaohui Yu et al.

ICCV 2025posterarXiv:2507.05678
1
citations
#1089

Latent Expression Generation for Referring Image Segmentation and Grounding

Seonghoon Yu, Junbeom Hong, Joonseok Lee et al.

ICCV 2025posterarXiv:2508.05123
1
citations
#1090

Toward Material-Agnostic System Identification from Videos

Yizhou Zhao, Haoyu Chen, Chunjiang Liu et al.

ICCV 2025posterarXiv:2508.01112
1
citations
#1091

MoSiC: Optimal-Transport Motion Trajectory for Dense Self-Supervised Learning

Mohammadreza Salehi, Shashanka Venkataramanan, Ioana Simion et al.

ICCV 2025posterarXiv:2506.08694
1
citations
#1092

Generating, Fast and Slow: Scalable Parallel Video Generation with Video Interface Networks

Bhishma Dedhia, David Bourgin, Krishna Kumar Singh et al.

ICCV 2025posterarXiv:2503.17539
1
citations
#1093

Task-Specific Zero-shot Quantization-Aware Training for Object Detection

Changhao Li, Xinrui Chen, Ji Wang et al.

ICCV 2025posterarXiv:2507.16782
1
citations
#1094

CoST: Efficient Collaborative Perception From Unified Spatiotemporal Perspective

Zongheng Tang, Yi Liu, Yifan Sun et al.

ICCV 2025highlightarXiv:2508.00359
1
citations
#1095

PoseAnchor: Robust Root Position Estimation for 3D Human Pose Estimation

Jun-Hee Kim, Jumin Han, Seong-Whan Lee

ICCV 2025poster
1
citations
#1096

VQ-SGen: A Vector Quantized Stroke Representation for Creative Sketch Generation

Jiawei Wang, Zhiming Cui, Changjian Li

ICCV 2025posterarXiv:2411.16446
1
citations
#1097

SAFER: Sharpness Aware layer-selective Finetuning for Enhanced Robustness in vision transformers

Bhavna Gopal, Huanrui Yang, Mark Horton et al.

ICCV 2025posterarXiv:2501.01529
1
citations
#1098

AstroLoc: Robust Space to Ground Image Localizer

Gabriele Berton, Alex Stoken, Carlo Masone

ICCV 2025posterarXiv:2502.07003
1
citations
#1099

Domain Generalizable Portrait Style Transfer

Xinbo Wang, Wenju Xu, Qing Zhang et al.

ICCV 2025posterarXiv:2507.04243
1
citations
#1100

Learning Pixel-adaptive Multi-layer Perceptrons for Real-time Image Enhancement

Junyu Lou, Xiaorui Zhao, Kexuan Shi et al.

ICCV 2025posterarXiv:2507.12135
1
citations
#1101

MCOP: Multi-UAV Collaborative Occupancy Prediction

Zefu Lin, Wenbo Chen, Xiaojuan Jin et al.

ICCV 2025posterarXiv:2510.12679
1
citations
#1102

CapeLLM: Support-Free Category-Agnostic Pose Estimation with Multimodal Large Language Models

Junho Kim, Hyungjin Chung, Byung-Hoon Kim

ICCV 2025posterarXiv:2411.06869
1
citations
#1103

Stylized-Face: A Million-level Stylized Face Dataset for Face Recognition

Zhengyuan Peng, Jianqing Xu, Yuge Huang et al.

ICCV 2025poster
1
citations
#1104

ForCenNet: Foreground-Centric Network for Document Image Rectification

Peng Cai, liqiang liqiang, Kaicheng Yang et al.

ICCV 2025posterarXiv:2507.19804
1
citations
#1105

Enhancing Numerical Prediction of MLLMs with Soft Labeling

Pei Wang, Zhaowei Cai, Hao Yang et al.

ICCV 2025poster
1
citations
#1106

DAViD: Data-efficient and Accurate Vision Models from Synthetic Data

Fatemeh Saleh, Sadegh Aliakbarian, Charlie Hewitt et al.

ICCV 2025posterarXiv:2507.15365
1
citations
#1107

Guiding Noisy Label Conditional Diffusion Models with Score-based Discriminator Correction

Dat Cong, Hieu Tran, Hoang Thanh-Tung

ICCV 2025posterarXiv:2508.19581
1
citations
#1108

MotionDiff: Training-free Zero-shot Interactive Motion Editing via Flow-assisted Multi-view Diffusion

Yikun Ma, Yiqing Li, Jiawei Wu et al.

ICCV 2025posterarXiv:2503.17695
1
citations
#1109

MeshMamba: State Space Models for Articulated 3D Mesh Generation and Reconstruction

Yusuke Yoshiyasu, Leyuan Sun, Ryusuke Sagawa

ICCV 2025posterarXiv:2507.15212
1
citations
#1110

Decoding Correlation-Induced Misalignment in the Stable Diffusion Workflow for Text-to-Image Generation

Yunze Tong, Fengda Zhang, Didi Zhu et al.

ICCV 2025poster
1
citations
#1111

Learning Robust Image Watermarking with Lossless Cover Recovery

jiale chen, Wei Wang, Chongyang Shi et al.

ICCV 2025poster
1
citations
#1112

DISTA-Net: Dynamic Closely-Spaced Infrared Small Target Unmixing

Shengdong Han, Shangdong Yang, Yuxuan Li et al.

ICCV 2025posterarXiv:2505.19148
1
citations
#1113

FPEM: Face Prior Enhanced Facial Attractiveness Prediction for Live Videos with Face Retouching

Hui Li, Xiaoyu Ren, Hongjiu Yu et al.

ICCV 2025highlight
1
citations
#1114

OmniVTON: Training-Free Universal Virtual Try-On

Zhaotong Yang, Yuhui Li, Shengfeng He et al.

ICCV 2025posterarXiv:2507.15037
1
citations
#1115

Federated Prompt-Tuning with Heterogeneous and Incomplete Multimodal Client Data

Hang Phung, Manh Nguyen, Thanh Huynh et al.

ICCV 2025poster
1
citations
#1116

Membership Inference Attacks with False Discovery Rate Control

Chenxu Zhao, Wei Qian, Aobo Chen et al.

ICCV 2025posterarXiv:2508.07066
1
citations
#1117

Fuse Before Transfer: Knowledge Fusion for Heterogeneous Distillation

Guopeng Li, Qiang Wang, Ke Yan et al.

ICCV 2025posterarXiv:2410.12342
1
citations
#1118

PUMPS: Skeleton-Agnostic Point-based Universal Motion Pre-Training for Synthesis in Human Motion Tasks

Clinton A Mo, Kun Hu, Chengjiang Long et al.

ICCV 2025posterarXiv:2507.20170
1
citations
#1119

IAP: Invisible Adversarial Patch Attack through Perceptibility-Aware Localization and Perturbation Optimization

Subrat Kishore Dutta, Xiao Zhang

ICCV 2025posterarXiv:2507.06856
1
citations
#1120

DeepShield: Fortifying Deepfake Video Detection with Local and Global Forgery Analysis

Yinqi Cai, Jichang Li, Zhaolun Li et al.

ICCV 2025posterarXiv:2510.25237
1
citations
#1121

Depth AnyEvent: A Cross-Modal Distillation Paradigm for Event-Based Monocular Depth Estimation

Luca Bartolomei, Enrico Mannocci, Fabio Tosi et al.

ICCV 2025posterarXiv:2509.15224
1
citations
#1122

SpiLiFormer: Enhancing Spiking Transformers with Lateral Inhibition

Zeqi Zheng, Yanchen Huang, Yingchao Yu et al.

ICCV 2025posterarXiv:2503.15986
1
citations
#1123

Towards Efficient General Feature Prediction in Masked Skeleton Modeling

Shengkai Sun, Zefan Zhang, Jianfeng Dong et al.

ICCV 2025posterarXiv:2509.03609
1
citations
#1124

MMAT-1M: A Large Reasoning Dataset for Multimodal Agent Tuning

Tianhong Gao, Yannian Fu, Weiqun Wu et al.

ICCV 2025posterarXiv:2507.21924
1
citations
#1125

Skeleton Motion Words for Unsupervised Skeleton-based Temporal Action Segmentation

Uzay Gökay, Federico Spurio, Dominik Bach et al.

ICCV 2025posterarXiv:2508.04513
1
citations
#1126

FED-PsyAU: Privacy-Preserving Micro-Expression Recognition via Psychological AU Coordination and Dynamic Facial Motion Modeling

Jingting Li, Yu Qian, Lin Zhao et al.

ICCV 2025posterarXiv:2507.20557
1
citations
#1127

Ask and Remember: A Questions-Only Replay Strategy for Continual Visual Question Answering

Imad Eddine MAROUF, Enzo Tartaglione, Stéphane Lathuilière et al.

ICCV 2025posterarXiv:2502.04469
1
citations
#1128

NoiseController: Towards Consistent Multi-view Video Generation via Noise Decomposition and Collaboration

Haotian Dong, Xin WANG, Di Lin et al.

ICCV 2025posterarXiv:2504.18448
1
citations
#1129

G2PDiffusion: Cross-species Genotype-to-Phenotype Prediction via Evolutionary Diffusion

Mengdi Liu, Zhangyang Gao, Hong Chang et al.

ICCV 2025posterarXiv:2502.04684
1
citations
#1130

Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework

Yi-Ting Chen, Ting-Hsuan Liao, Pengsheng Guo et al.

ICCV 2025posterarXiv:2508.04090
1
citations
#1131

Easy3D: A Simple Yet Effective Method for 3D Interactive Segmentation

Andrea Simonelli, Norman Müller, Peter Kontschieder

ICCV 2025posterarXiv:2504.11024
1
citations
#1132

Balancing Task-invariant Interaction and Task-specific Adaptation for Unified Image Fusion

Xingyu Hu, Junjun Jiang, Chenyang Wang et al.

ICCV 2025posterarXiv:2504.05164
1
citations
#1133

IM-LUT: Interpolation Mixing Look-Up Tables for Image Super-Resolution

Sejin Park, Sangmin Lee, Kyong Hwan Jin et al.

ICCV 2025posterarXiv:2507.09923
1
citations
#1134

GeoProg3D: Compositional Visual Reasoning for City-Scale 3D Language Fields

Shunsuke Yasuki, Taiki Miyanishi, Nakamasa Inoue et al.

ICCV 2025posterarXiv:2506.23352
1
citations
#1135

LLM-enhanced Action-aware Multi-modal Prompt Tuning for Image-Text Matching

Meng Tian, Shuo Yang, Xinxiao Wu

ICCV 2025posterarXiv:2506.23502
1
citations
#1136

Hierarchical Visual Prompt Learning for Continual Video Instance Segmentation

Jiahua Dong, Hui Yin, Wenqi Liang et al.

ICCV 2025posterarXiv:2508.08612
1
citations
#1137

A Plug-and-Play Physical Motion Restoration Approach for In-the-Wild High-Difficulty Motions

Youliang Zhang, Ronghui Li, Yachao Zhang et al.

ICCV 2025highlightarXiv:2412.17377
1
citations
#1138

Reverse Convolution and Its Applications to Image Restoration

Xuhong Huang, Shiqi Liu, Kai Zhang et al.

ICCV 2025posterarXiv:2508.09824
1
citations
#1139

DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior

Junzhe Lu, Jing Lin, Hongkun Dou et al.

ICCV 2025posterarXiv:2508.00599
1
citations
#1140

MoMaps: Semantics-Aware Scene Motion Generation with Motion Maps

Jiahui Lei, Kyle Genova, George Kopanas et al.

ICCV 2025posterarXiv:2510.11107
1
citations
#1141

InstantEdit: Text-Guided Few-Step Image Editing with Piecewise Rectified Flow

Yiming Gong, Zhen Zhu, Minjia Zhang

ICCV 2025posterarXiv:2508.06033
1
citations
#1142

2D Gaussian Splatting-based Sparse-view Transparent Object Depth Reconstruction via Physics Simulation for Scene Update

Jeongyun Kim, Seunghoon Jeong, Giseop Kim et al.

ICCV 2025posterarXiv:2507.11069
1
citations
#1143

Unsupervised Imaging Inverse Problems with Diffusion Distribution Matching

Giacomo Meanti, Thomas Ryckeboer, Michael Arbel et al.

ICCV 2025posterarXiv:2506.14605
1
citations
#1144

Tiling artifacts and trade-offs of feature normalization in the segmentation of large biological images

Elena Buglakova, Anwai Archit, Edoardo D'Imprima et al.

ICCV 2025highlightarXiv:2503.19545
1
citations
#1145

TeethGenerator: A two-stage framework for paired pre- and post-orthodontic 3D dental data generation

Changsong Lei, Yaqian Liang, Shaofeng Wang et al.

ICCV 2025posterarXiv:2507.04685
1
citations
#1146

Hybrid-TTA: Continual Test-time Adaptation via Dynamic Domain Shift Detection

Hyewon Park, Hyejin Park, Jueun Ko et al.

ICCV 2025posterarXiv:2409.08566
1
citations
#1147

Kaputt: A Large-Scale Dataset for Visual Defect Detection

Sebastian Höfer, Dorian Henning, Artemij Amiranashvili et al.

ICCV 2025posterarXiv:2510.05903
1
citations
#1148

TrackAny3D: Transferring Pretrained 3D Models for Category-unified 3D Point Cloud Tracking

Mengmeng Wang, Haonan Wang, Yulong Li et al.

ICCV 2025posterarXiv:2507.19908
1
citations
#1149

DynImg: Key Frames with Visual Prompts are Good Representation for Multi-Modal Video Understanding

Xiaoyi Bao, Chen-Wei Xie, Hao Tang et al.

ICCV 2025posterarXiv:2507.15569
1
citations
#1150

Seeing the Unseen: A Semantic Alignment and Context-Aware Prompt Framework for Open-Vocabulary Camouflaged Object Segmentation

Peng Ren, Tian Bai, Jing Sun et al.

ICCV 2025poster
1
citations
#1151

HoliTracer: Holistic Vectorization of Geographic Objects from Large-Size Remote Sensing Imagery

Yu Wang, Bo Dang, Wanchun Li et al.

ICCV 2025posterarXiv:2507.16251
1
citations
#1152

Fish2Mesh Transformer: 3D Human Mesh Recovery from Egocentric Vision

Tianma Shen, Aditya Shrish Puranik, James Vong et al.

ICCV 2025posterarXiv:2503.06089
1
citations
#1153

Towards Foundational Models for Single-Chip Radar

Tianshu Huang, Akarsh Prabhakara, Chuhan Chen et al.

ICCV 2025posterarXiv:2509.12482
1
citations
#1154

Breaking Rectangular Shackles: Cross-View Object Segmentation for Fine-Grained Object Geo-Localization

Qingwang Zhang, Yingying Zhu

ICCV 2025poster
1
citations
#1155

Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval

Dohwan Ko, Ji Soo Lee, Minhyuk Choi et al.

ICCV 2025highlightarXiv:2507.23284
1
citations
#1156

MIORe & VAR-MIORe: Benchmarks to Push the Boundaries of Restoration

George Ciubotariu, Zhuyun Zhou, Zongwei Wu et al.

ICCV 2025posterarXiv:2509.06803
1
citations
#1157

Aligning Effective Tokens with Video Anomaly in Large Language Models

YINGXIAN Chen, Jiahui Liu, Ruidi Fan et al.

ICCV 2025posterarXiv:2508.06350
1
citations
#1158

Certifiably Optimal Anisotropic Rotation Averaging

Carl Olsson, Yaroslava Lochman, Johan Malmport et al.

ICCV 2025posterarXiv:2503.07353
1
citations
#1159

Forecasting Continuous Non-Conservative Dynamical Systems in SO(3)

Lennart Bastian, Mohammad Rashed, Nassir Navab et al.

ICCV 2025posterarXiv:2508.07775
1
citations
#1160

Learning Yourself: Class-Incremental Semantic Segmentation with Language-Inspired Bootstrapped Disentanglement

Ruitao Wu, Yifan Zhao, Jia Li

ICCV 2025posterarXiv:2509.00527
1
citations
#1161

Learning 3D Scene Analogies with Neural Contextual Scene Maps

Junho Kim, Gwangtak Bae, Eun Sun Lee et al.

ICCV 2025posterarXiv:2503.15897
1
citations
#1162

Perceiving and Acting in First-Person: A Dataset and Benchmark for Egocentric Human-Object-Human Interactions

Liang Xu, Chengqun Yang, Zili Lin et al.

ICCV 2025posterarXiv:2508.04681
1
citations
#1163

Verbalized Representation Learning for Interpretable Few-Shot Generalization

Cheng-Fu Yang, Da Yin, Wenbo Hu et al.

ICCV 2025posterarXiv:2411.18651
1
citations
#1164

Aligning Information Capacity Between Vision and Language via Dense-to-Sparse Feature Distillation for Image-Text matching

Yang Liu, Wentao Feng, Zhuoyao Liu et al.

ICCV 2025posterarXiv:2503.14953
1
citations
#1165

PS3: A Multimodal Transformer Integrating Pathology Reports with Histology Images and Biological Pathways for Cancer Survival Prediction

Manahil Raza, Ayesha Azam, Talha Qaiser et al.

ICCV 2025posterarXiv:2509.20022
1
citations
#1166

PERSONA: Personalized Whole-Body 3D Avatar with Pose-Driven Deformations from a Single Image

Geonhee Sim, Gyeongsik Moon

ICCV 2025posterarXiv:2508.09973
1
citations
#1167

From Prompt to Progression: Taming Video Diffusion Models for Seamless Attribute Transition

Ling Lo, Kelvin Chan, Wen-Huang Cheng et al.

ICCV 2025posterarXiv:2509.19690
1
citations
#1168

DiSCO-3D : Discovering and Segmenting Sub-Concepts from Open-vocabulary Queries in NeRF

Doriand Petit, Steve Bourgeois, Vincent Gay-Bellile et al.

ICCV 2025posterarXiv:2507.14596
1
citations
#1169

GlassWizard: Harvesting Diffusion Priors for Glass Surface Detection

Wenxue Li, Tian Ye, Xinyu Xiong et al.

ICCV 2025poster
1
citations
#1170

DONUT: A Decoder-Only Model for Trajectory Prediction

Markus Knoche, Daan de Geus, Bastian Leibe

ICCV 2025posterarXiv:2506.06854
1
citations
#1171

Equipping Vision Foundation Model with Mixture of Experts for Out-of-Distribution Detection

Shizhen Zhao, Jiahui Liu, Xin Wen et al.

ICCV 2025posterarXiv:2510.10584
1
citations
#1172

PanoSplatt3R: Leveraging Perspective Pretraining for Generalized Unposed Wide-Baseline Panorama Reconstruction

Jiahui Ren, Mochu Xiang, Jiajun Zhu et al.

ICCV 2025posterarXiv:2507.21960
1
citations
#1173

Epipolar Consistent Attention Aggregation Network for Unsupervised Light Field Disparity Estimation

Chen Gao, Shuo Zhang, Youfang Lin

ICCV 2025poster
#1174

Prior-aware Dynamic Temporal Modeling Framework for Sequential 3D Hand Pose Estimation

Pengfei Ren, Jingyu Wang, Haifeng Sun et al.

ICCV 2025poster
#1175

DRaM-LHM: A Quaternion Framework for Iterative Camera Pose Estimation

Chen Lin, Weizhi Du, Zhixiang Min et al.

ICCV 2025poster
#1176

Arti-PG: A Toolbox for Procedurally Synthesizing Large-Scale and Diverse Articulated Objects with Rich Annotations

Jianhua Sun, Yuxuan Li, Jiude Wei et al.

ICCV 2025posterarXiv:2412.14974
#1177

Manual-PA: Learning 3D Part Assembly from Instruction Diagrams

Jiahao Zhang, Anoop Cherian, Cristian Rodriguez-Opazo et al.

ICCV 2025posterarXiv:2411.18011
#1178

Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension

Xiyao Wang, Zhengyuan Yang, Linjie Li et al.

ICCV 2025posterarXiv:2412.03704
#1179

MGSfM: Multi-Camera Geometry Driven Global Structure-from-Motion

peilin Tao, Hainan Cui, Diantao Tu et al.

ICCV 2025posterarXiv:2507.03306
#1180

Learning Large Motion Estimation from Intermediate Representations with a High-Resolution Optical Flow Dataset Featuring Long-Range Dynamic Motion

Hoonhee Cho, Yuhwan Jeong, Kuk-Jin Yoon

ICCV 2025highlight
#1181

GeoExplorer: Active Geo-localization with Curiosity-Driven Exploration

Li Mi, Manon Béchaz, Zeming Chen et al.

ICCV 2025posterarXiv:2508.00152
#1182

NAPPure: Adversarial Purification for Robust Image Classification under Non-Additive Perturbations

Junjie Nan, Jianing Li, Wei Chen et al.

ICCV 2025posterarXiv:2510.14025
#1183

PEFTDiff: Diffusion-Guided Transferability Estimation for Parameter-Efficient Fine-Tuning

PRAFFUL KHOBA, Zijian Wang, Chetan Arora et al.

ICCV 2025poster
#1184

Is Tracking really more challenging in First Person Egocentric Vision?

Matteo Dunnhofer, Zaira Manigrasso, Christian Micheloni

ICCV 2025highlightarXiv:2507.16015
#1185

Stochastic Interpolants for Revealing Stylistic Flows across the History of Art

Pingchuan Ma, Ming Gui, Johannes Schusterbauer et al.

ICCV 2025poster
#1186

POMATO: Marrying Pointmap Matching with Temporal Motions for Dynamic 3D Reconstruction

Songyan Zhang, Yongtao Ge, Jinyuan Tian et al.

ICCV 2025posterarXiv:2504.05692
#1187

Multispectral Demosaicing via Dual Cameras

SaiKiran Tedla, Junyong Lee, Beixuan Yang et al.

ICCV 2025highlightarXiv:2503.22026
#1188

Flexi-FSCIL: Adaptive Knowledge Retention for Breaking the Stability-Plasticity Dilemma in Few-Shot Class-Incremental Learning

Wufei Xie, Yalin Wang, Chenliang Liu et al.

ICCV 2025poster
#1189

Staining and Locking Computer Vision Models Without Retraining

Oliver Sutton, Qinghua Zhou, George Leete et al.

ICCV 2025posterarXiv:2507.22000
#1190

AVAM: a Universal Training-free Adaptive Visual Anchoring Embedded into Multimodal Large Language Model for Multi-image Question Answering

Kang Zeng, Guojin Zhong, Jintao Cheng et al.

ICCV 2025poster
#1191

Analyzing Finetuning Representation Shift for Multimodal LLMs Steering

Pegah KHAYATAN, Mustafa Shukor, Jayneel Parekh et al.

ICCV 2025posterarXiv:2501.03012
#1192

RIPE: Reinforcement Learning on Unlabeled Image Pairs for Robust Keypoint Extraction

Johannes Künzel, Anna Hilsmann, Peter Eisert

ICCV 2025posterarXiv:2507.04839
#1193

Prototype Guided Backdoor Defense via Activation Space Manipulation

Venkat Adithya Amula, Sunayana Samavedam, Saurabh Saini et al.

ICCV 2025poster
#1194

Dynamic Multi-Layer Null Space Projection for Vision-Language Continual Learning

Borui Kang, Lei Wang, Zhiping Wu et al.

ICCV 2025poster
#1195

Robust 3D Object Detection using Probabilistic Point Clouds from Single-Photon LiDARs

Bhavya Goyal, Felipe Gutierrez-Barragan, Wei Lin et al.

ICCV 2025posterarXiv:2508.00169
#1196

CE-FAM: Concept-Based Explanation via Fusion of Activation Maps

Michihiro Kuroki, Toshihiko Yamasaki

ICCV 2025posterarXiv:2509.23849
#1197

Large Learning Rates Simultaneously Achieve Robustness to Spurious Correlations and Compressibility

Melih Barsbey, Lucas Prieto, Stefanos Zafeiriou et al.

ICCV 2025posterarXiv:2507.17748
#1198

Instruction-Oriented Preference Alignment for Enhancing Multi-Modal Comprehension Capability of MLLMs

Zitian Wang, Yue Liao, RONG KANG et al.

ICCV 2025posterarXiv:2503.20309
#1199

SynCity: Training-Free Generation of 3D Cities

Paul Engstler, Aleksandar Shtedritski, Iro Laina et al.

ICCV 2025poster
#1200

Multi-view Gaze Target Estimation

Qiaomu Miao, Vivek Golani, Jingyi Xu et al.

ICCV 2025posterarXiv:2508.05857