Most Cited ICCV "human-activity recognition" Papers

2,701 papers found • Page 8 of 14

#1401

Learnable Fractional Reaction-Diffusion Dynamics for Under-Display ToF Imaging and Beyond

Xin Qiao, Matteo Poggi, Xing Wei et al.

ICCV 2025posterarXiv:2511.01704
#1402

Discretized Gaussian Representation for Tomographic Reconstruction

Shaokai Wu, Yuxiang Lu, Yapan Guo et al.

ICCV 2025posterarXiv:2411.04844
#1403

3D Test-time Adaptation via Graph Spectral Driven Point Shift

Xin Wei, Qin Yang, Yijie Fang et al.

ICCV 2025posterarXiv:2507.18225
#1404

EMoTive: Event-guided Trajectory Modeling for 3D Motion Estimation

Zengyu Wan, Wei Zhai, Yang Cao et al.

ICCV 2025posterarXiv:2503.11371
#1405

KDA: Knowledge Diffusion Alignment with Enhanced Context for Video Temporal Grounding

Ran Ran, Jiwei Wei, Shiyuan He et al.

ICCV 2025poster
#1406

VisNumBench: Evaluating Number Sense of Multimodal Large Language Models

Tengjin Weng, Jingyi Wang, Wenhao Jiang et al.

ICCV 2025posterarXiv:2503.14939
#1407

STEP-DETR: Advancing DETR-based Semi-Supervised Object Detection with Super Teacher and Pseudo-Label Guided Text Queries

Tahira Shehzadi, Khurram Azeem Hashmi, Shalini Sarode et al.

ICCV 2025poster
#1408

Completing 3D Partial Assemblies with View-Consistent 2D-3D Correspondence

Weihao Wang, Yu Lan, Mingyu You et al.

ICCV 2025poster
#1409

Aligning Global Semantics and Local Textures in Generative Video Enhancement

Zhikai Chen, Fuchen Long, Zhaofan Qiu et al.

ICCV 2025poster
#1410

Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling

Hayeon Kim, Ji Ha Jang, Se Young Chun

ICCV 2025posterarXiv:2507.11061
#1411

Structure Matters: Revisiting Boundary Refinement in Video Object Segmentation

Guanyi Qin, Ziyue Wang, Daiyun Shen et al.

ICCV 2025highlightarXiv:2507.18944
#1412

AIM: Amending Inherent Interpretability via Self-Supervised Masking

Eyad Alshami, Shashank Agnihotri, Bernt Schiele et al.

ICCV 2025highlightarXiv:2508.11502
#1413

One Last Attention for Your Vision-Language Model

Liang Chen, Ghazi Shazan Ahmad, Tianjun Yao et al.

ICCV 2025posterarXiv:2507.15480
#1414

RobustSplat: Decoupling Densification and Dynamics for Transient-Free 3DGS

Chuanyu Fu, Yuqi Zhang, Kunbin Yao et al.

ICCV 2025posterarXiv:2506.02751
#1415

High-Resolution Spatiotemporal Modeling with Global-Local State Space Models for Video-Based Human Pose Estimation

Runyang Feng, Hyung Jin Chang, Tze Ho Elden Tse et al.

ICCV 2025posterarXiv:2510.11017
#1416

Pi-GPS: Enhancing Geometry Problem Solving by Unleashing the Power of Diagrammatic Information

Junbo Zhao, Ting Zhang, Jiayu Sun et al.

ICCV 2025posterarXiv:2503.05543
#1417

Mitigating Catastrophic Overfitting in Fast Adversarial Training via Label Information Elimination

Chao Pan, Ke Tang, Li Qing et al.

ICCV 2025poster
#1418

Consistency Trajectory Matching for One-Step Generative Super-Resolution

Weiyi You, Mingyang Zhang, Leheng Zhang et al.

ICCV 2025posterarXiv:2503.20349
#1419

Amodal Depth Anything: Amodal Depth Estimation in the Wild

Zhenyu Li, Mykola Lavreniuk, Jian Shi et al.

ICCV 2025posterarXiv:2412.02336
#1420

One Perturbation is Enough: On Generating Universal Adversarial Perturbations against Vision-Language Pre-training Models

Hao Fang, Jiawei Kong, Wenbo Yu et al.

ICCV 2025posterarXiv:2406.05491
#1421

CVPT: Cross Visual Prompt Tuning

Lingyun Huang, Jianxu Mao, Junfei YI et al.

ICCV 2025posterarXiv:2408.14961
#1422

DDB: Diffusion Driven Balancing to Address Spurious Correlations

Aryan Yazdan Parast, Basim Azam, Naveed Akhtar

ICCV 2025posterarXiv:2503.17226
#1423

Geometric Alignment and Prior Modulation for View-Guided Point Cloud Completion on Unseen Categories

Jingqiao Xiu, Yicong Li, Na Zhao et al.

ICCV 2025poster
#1424

AnyBimanual: Transferring Unimanual Policy for General Bimanual Manipulation

Guanxing Lu, Tengbo Yu, Haoyuan Deng et al.

ICCV 2025posterarXiv:2412.06779
#1425

FVGen: Accelerating Novel-View Synthesis with Adversarial Video Diffusion Distillation

Wenbin Teng, Gonglin Chen, Haiwei Chen et al.

ICCV 2025posterarXiv:2508.06392
#1426

CoralSRT: Revisiting Coral Reef Semantic Segmentation by Feature Rectifying via Self-supervised Guidance

Zheng Ziqiang, Wong Kwan, Binh-Son Hua et al.

ICCV 2025poster
#1427

Learning Dense Feature Matching via Lifting Single 2D Image to 3D Space

Yingping Liang, Yutao Hu, Wenqi Shao et al.

ICCV 2025posterarXiv:2507.00392
#1428

Diagnosing Pretrained Models for Out-of-distribution Detection

Haipeng Xiong, Kai Xu, Angela Yao

ICCV 2025poster
#1429

What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?

Jinhong Ni, Chang-Bin Zhang, Qiang Zhang et al.

ICCV 2025posterarXiv:2505.22129
#1430

Learning Normals of Noisy Points by Local Gradient-Aware Surface Filtering

Qing Li, Huifang Feng, Xun Gong et al.

ICCV 2025posterarXiv:2507.03394
#1431

Bayesian-Inspired Space-Time Superpixels

Kent Gauen, Stanley Chan

ICCV 2025poster
#1432

INSTINCT: Instance-Level Interaction Architecture for Query-Based Collaborative Perception

yunjiang xu, Yupeng Ouyang, Lingzhi Li et al.

ICCV 2025posterarXiv:2509.23700
#1433

Debiased Curriculum Adaptation for Safe Transfer Learning in Chest X-ray Classification

Mingyang Liu, Xinyang Chen, Yang Shu et al.

ICCV 2025poster
#1434

PHATNet: A Physics-guided Haze Transfer Network for Domain-adaptive Real-world Image Dehazing

Fu-Jen Tsai, Yan-Tsung Peng, Yen-Yu Lin et al.

ICCV 2025posterarXiv:2507.14826
#1435

Forensic-MoE: Exploring Comprehensive Synthetic Image Detection Traces with Mixture of Experts

Mingqi Fang, Ziguang Li, Lingyun Yu et al.

ICCV 2025poster
#1436

Information-Bottleneck Driven Binary Neural Network for Change Detection

Kaijie Yin, Zhiyuan Zhang, Shu Kong et al.

ICCV 2025posterarXiv:2507.03504
#1437

Entropy-Adaptive Diffusion Policy Optimization with Dynamic Step Alignment

Renye Yan, Jikang Cheng, Yaozhong Gan et al.

ICCV 2025poster
#1438

Time-Aware Auto White Balance in Mobile Photography

Mahmoud Afifi, Luxi Zhao, Abhijith Punnappurath et al.

ICCV 2025posterarXiv:2504.05623
#1439

Leveraging Panoptic Scene Graph for Evaluating Fine-Grained Text-to-Image Generation

Xueqing Deng, Linjie Yang, Qihang Yu et al.

ICCV 2025poster
#1440

ViewSRD: 3D Visual Grounding via Structured Multi-View Decomposition

Ronggang Huang, Haoxin Yang, Yan Cai et al.

ICCV 2025posterarXiv:2507.11261
#1441

Physical Degradation Model-Guided Interferometric Hyperspectral Reconstruction with Unfolding Transformer

Yuansheng Li, Yunhao Zou, Linwei Chen et al.

ICCV 2025posterarXiv:2506.21880
#1442

VPR-Cloak: A First Look at Privacy Cloak Against Visual Place Recognition

Shuting Dong, Mingzhi Chen, Feng Lu et al.

ICCV 2025poster
#1443

GUAVA: Generalizable Upper Body 3D Gaussian Avatar

Dongbin Zhang, Yunfei Liu, Lijian Lin et al.

ICCV 2025posterarXiv:2505.03351
#1444

HOMO-Feature: Cross-Arbitrary-Modal Image Matching with Homomorphism of Organized Major Orientation

Chenzhong Gao, Wei Li, Desheng Weng

ICCV 2025poster
#1445

GSOT3D: Towards Generic 3D Single Object Tracking in the Wild

Yifan Jiao, Yunhao Li, Junhua Ding et al.

ICCV 2025posterarXiv:2412.02129
#1446

Dynamic-DINO: Fine-Grained Mixture of Experts Tuning for Real-time Open-Vocabulary Object Detection

Yehao Lu, Minghe Weng, Zekang Xiao et al.

ICCV 2025posterarXiv:2507.17436
#1447

WAVE: Warp-Based View Guidance for Consistent Novel View Synthesis Using a Single Image

Jiwoo Park, Tae Choi, Youngjun Jun et al.

ICCV 2025posterarXiv:2506.23518
#1448

Lightweight and Fast Real-time Image Enhancement via Decomposition of the Spatial-aware Lookup Tables

Wontae Kim, Keuntek Lee, Nam Ik Cho

ICCV 2025posterarXiv:2508.16121
#1449

Neural Multi-View Self-Calibrated Photometric Stereo without Photometric Stereo Cues

Xu Cao, Takafumi Taketomi

ICCV 2025posterarXiv:2507.23162
#1450

EMatch: A Unified Framework for Event-based Optical Flow and Stereo Matching

Pengjie Zhang, Lin Zhu, Xiao Wang et al.

ICCV 2025posterarXiv:2407.21735
#1451

CounterPC: Counterfactual Feature Realignment for Unsupervised Domain Adaptation on Point Clouds

Feng Yang, Yichao Cao, Xiu Su et al.

ICCV 2025highlight
#1452

Liberated-GS: 3D Gaussian Splatting Independent from SfM Point Clouds

Weihong Pan, Xiaoyu Zhang, Hongjia Zhai et al.

ICCV 2025poster
#1453

Unlocking the Potential of Diffusion Priors in Blind Face Restoration

Yunqi Miao, Zhiyu Qu, Mingqi Gao et al.

ICCV 2025posterarXiv:2508.08556
#1454

Implicit Counterfactual Learning for Audio-Visual Segmentation

Mingfeng Zha, Tianyu Li, Guoqing Wang et al.

ICCV 2025posterarXiv:2507.20740
#1455

STaR: Seamless Spatial-Temporal Aware Motion Retargeting with Penetration and Consistency Constraints

Xiaohang Yang, Qing Wang, Jiahao Yang et al.

ICCV 2025posterarXiv:2504.06504
#1456

MRGen: Segmentation Data Engine For Underrepresented MRI Modalities

Haoning Wu, Ziheng Zhao, Ya Zhang et al.

ICCV 2025posterarXiv:2412.04106
#1457

Rethink Sparse Signals for Pose-guided Text-to-image Generation

Wenjie Xuan, Jing Zhang, Juhua Liu et al.

ICCV 2025posterarXiv:2506.20983
#1458

Single-Scanline Relative Pose Estimation for Rolling Shutter Cameras

Petr Hruby, Marc Pollefeys

ICCV 2025posterarXiv:2506.22069
#1459

Enhancing Transferability of Targeted Adversarial Examples via Inverse Target Gradient Competition and Spatial Distance Stretching

Zhankai Li, Weiping Wang, jie li et al.

ICCV 2025poster
#1460

LDPose: Towards Inclusive Human Pose Estimation for Limb-Deficient Individuals in the Wild

Jiaying Ying, Heming Du, Kaihao Zhang et al.

ICCV 2025poster
#1461

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Ahmed Nassar, Matteo Omenetti, Maksym Lysak et al.

ICCV 2025posterarXiv:2503.11576
#1462

Images as Noisy Labels: Unleashing the Potential of the Diffusion Model for Open-Vocabulary Semantic Segmentation

Fan Li, Xuanbin Wang, Xuan Wang et al.

ICCV 2025highlight
#1463

ContextFace: Generating Facial Expressions from Emotional Contexts

minjung kim, Minsang Kim, Seung Jun Baek

ICCV 2025poster
#1464

SMP-Attack: Boosting the Transferability of Feature Importance-based Adversarial Attack with Semantics-aware Multi-granularity Patchout

Wen Yang, Guodong Liu, Di Ming

ICCV 2025poster
#1465

Spatial-Temporal Forgery Trace based Forgery Image Identification

Yilin Wang, Zunlei Feng, Jiachi Wang et al.

ICCV 2025poster
#1466

Towards Annotation-Free Evaluation: KPAScore for Human Keypoint Detection

Xiaoxiao Wang, Chunxiao Li, Peng Sun et al.

ICCV 2025poster
#1467

Ultra High-Resolution Image Inpainting with Patch-Based Content Consistency Adapter

JianHui Zhang, Shen Cheng, Qirui Sun et al.

ICCV 2025posterarXiv:2510.13419
#1468

Agreement aware and dissimilarity oriented GLOM

Ru Zeng, Yan Song, Yang ZHANG et al.

ICCV 2025poster
#1469

MeasureXpert: Automatic Anthropometric Measurement Extraction from Two Unregistered, Partial, Posed, and Dressed Body Scans

Ran Zhao, Xinxin Dai, Pengpeng Hu et al.

ICCV 2025poster
#1470

DiMPLe - Disentangled Multi-Modal Prompt Learning: Enhancing Out-Of-Distribution Alignment with Invariant and Spurious Feature Separation

Umaima Rahman, Mohammad Yaqub, Dwarikanath Mahapatra

ICCV 2025posterarXiv:2506.21237
#1471

ResidualViT for Efficient Temporally Dense Video Encoding

Mattia Soldan, Fabian Caba Heilbron, Bernard Ghanem et al.

ICCV 2025highlightarXiv:2509.13255
#1472

Puppet-Master: Scaling Interactive Video Generation as a Motion Prior for Part-Level Dynamics

Ruining Li, Chuanxia Zheng, Christian Rupprecht et al.

ICCV 2025posterarXiv:2408.04631
#1473

Randomized Autoregressive Visual Generation

Qihang Yu, Ju He, Xueqing Deng et al.

ICCV 2025posterarXiv:2411.00776
#1474

Unsupervised RGB-D Point Cloud Registration for Scenes with Low Overlap and Photometric Inconsistency

yejun Shou, Haocheng Wang, Lingfeng Shen et al.

ICCV 2025poster
#1475

TOGA: Temporally Grounded Open-Ended Video QA with Weak Supervision

Ayush Gupta, Anirban Roy, Rama Chellappa et al.

ICCV 2025posterarXiv:2506.09445
#1476

Training-free Geometric Image Editing on Diffusion Models

Hanshen Zhu, Zhen Zhu, Kaile Zhang et al.

ICCV 2025posterarXiv:2507.23300
#1477

Monocular Facial Appearance Capture in the Wild

Yingyan Xu, Kate Gadola, Prashanth Chandran et al.

ICCV 2025posterarXiv:2412.12765
#1478

Growing a Twig to Accelerate Large Vision-Language Models

Zhenwei Shao, Mingyang Wang, Zhou Yu et al.

ICCV 2025posterarXiv:2503.14075
#1479

SignRep: Enhancing Self-Supervised Sign Representations

Ryan Wong, Necati Cihan Camgoz, Richard Bowden

ICCV 2025posterarXiv:2503.08529
#1480

MixA: A Mixed Attention approach with Stable Lightweight Linear Attention to enhance Efficiency of Vision Transformers at the Edge

Sabbir Ahmed, Jingtao Li, Weiming Zhuang et al.

ICCV 2025poster
#1481

OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation

Junyuan Zhang, Qintong Zhang, Bin Wang et al.

ICCV 2025posterarXiv:2412.02592
#1482

Efficient Event Camera Data Pretraining with Adaptive Prompt Fusion

Quanmin Liang, Qiang Li, Shuai Liu et al.

ICCV 2025poster
#1483

Head2Body: Body Pose Generation from Multi-sensory Head-mounted Inputs

Minh Tran, Hongda Mao, Qingshuang Chen et al.

ICCV 2025poster
#1484

Looking in the Mirror: A Faithful Counterfactual Explanation Method for Interpreting Deep Image Classification Models

Townim Chowdhury, Vu Phan, Kewen Liao et al.

ICCV 2025posterarXiv:2509.16822
#1485

FLSeg: Enhancing Privacy and Robustness in Federated Learning under Heterogeneous Data via Model Segmentation

Zichun Su, Zhi Lu, Yutong Wu et al.

ICCV 2025poster
#1486

Self-Calibrating Gaussian Splatting for Large Field-of-View Reconstruction

Youming Deng, Wenqi Xian, Guandao Yang et al.

ICCV 2025highlight
#1487

Gradient Decomposition and Alignment for Incremental Object Detection

Wenlong Luo, Shizhou Zhang, De Cheng et al.

ICCV 2025poster
#1488

MSQ: Memory-Efficient Bit Sparsification Quantization

Seokho Han, Seoyeon Yoon, Jinhee Kim et al.

ICCV 2025posterarXiv:2507.22349
#1489

When and Where do Data Poisons Attack Textual Inversion?

Jeremy Styborski, Mingzhi Lyu, Jiayou Lu et al.

ICCV 2025posterarXiv:2507.10578
#1490

SRefiner: Soft-Braid Attention for Multi-Agent Trajectory Refinement

Liwen Xiao, Zhiyu Pan, Zhicheng Wang et al.

ICCV 2025highlightarXiv:2507.04263
#1491

Rethinking Few Shot CLIP Benchmarks: A Critical Analysis in the Inductive Setting

Alexey Kravets, Da Chen, Vinay Namboodiri

ICCV 2025posterarXiv:2507.20834
#1492

AU-Blendshape for Fine-grained Stylized 3D Facial Expression Manipulation

Hao Li, Ju Dai, Feng Zhou et al.

ICCV 2025posterarXiv:2507.12001
#1493

BokehDiff: Neural Lens Blur with One-Step Diffusion

Chengxuan Zhu, Qingnan Fan, Qi Zhang et al.

ICCV 2025posterarXiv:2507.18060
#1494

Trial-Oriented Visual Rearrangement

Yuyi Liu, Xinhang Song, Tianliang Qi et al.

ICCV 2025poster
#1495

Debiased Teacher for Day-to-Night Domain Adaptive Object Detection

Yiming Cui, Liang Li, Haibing YIN et al.

ICCV 2025poster
#1496

SpikePack: Enhanced Information Flow in Spiking Neural Networks with High Hardware Compatibility

Guobin Shen, Jindong Li, Tenglong Li et al.

ICCV 2025posterarXiv:2501.14484
#1497

Social Debiasing for Fair Multi-modal LLMs

Harry Cheng, Yangyang Guo, Qingpei Guo et al.

ICCV 2025posterarXiv:2408.06569
#1498

Hierarchy-Aware Pseudo Word Learning with Text Adaptation for Zero-Shot Composed Image Retrieval

Zhe Li, Lei Zhang, Zheren Fu et al.

ICCV 2025poster
#1499

UPP: Unified Point-Level Prompting for Robust Point Cloud Analysis

Zixiang Ai, Zhenyu Cui, Yuxin Peng et al.

ICCV 2025posterarXiv:2507.18997
#1500

AV-Flow: Transforming Text to Audio-Visual Human-like Interactions

Aggelina Chatziagapi, Louis-Philippe Morency, Hongyu Gong et al.

ICCV 2025posterarXiv:2502.13133
#1501

Probabilistic Inertial Poser (ProbIP): Uncertainty-aware Human Motion Modeling from Sparse Inertial Sensors

Min Kim, Younho Jeon, Sungho Jo

ICCV 2025poster
#1502

SFUOD: Source-Free Unknown Object Detection

Keon-Hee Park, Seun-An Choe, Gyeong-Moon Park

ICCV 2025posterarXiv:2507.17373
#1503

Compression-Aware One-Step Diffusion Model for JPEG Artifact Removal

Jinpei Guo, Zheng Chen, Wenbo Li et al.

ICCV 2025posterarXiv:2502.09873
#1504

ConstStyle: Robust Domain Generalization with Unified Style Transformation

Nam Duong Tran, Nam Nguyen Phuong, Hieu Pham et al.

ICCV 2025posterarXiv:2509.05975
#1505

Golden Noise for Diffusion Models: A Learning Framework

zikai zhou, Shitong Shao, Lichen Bai et al.

ICCV 2025posterarXiv:2411.09502
#1506

Vision-Language Interactive Relation Mining for Open-Vocabulary Scene Graph Generation

Yukuan Min, Muli Yang, Jinhao Zhang et al.

ICCV 2025poster
#1507

OrderChain: Towards General Instruct-Tuning for Stimulating the Ordinal Understanding Ability of MLLM

Jinhong Wang, Shuo Tong, Jintai CHEN et al.

ICCV 2025posterarXiv:2504.04801
#1508

Unified Open-World Segmentation with Multi-Modal Prompts

Yang Liu, Yufei Yin, Chenchen Jing et al.

ICCV 2025posterarXiv:2510.10524
#1509

LayerAnimate: Layer-level Control for Animation

Yuxue Yang, Lue Fan, Zuzeng Lin et al.

ICCV 2025posterarXiv:2501.08295
#1510

Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion

shengyuan zhang, An Zhao, Ling Yang et al.

ICCV 2025posterarXiv:2412.03515
#1511

SuperEvent: Cross-Modal Learning of Event-based Keypoint Detection for SLAM

Yannick Burkhardt, Simon Schaefer, Stefan Leutenegger

ICCV 2025highlightarXiv:2504.00139
#1512

FedAGC: Federated Continual Learning with Asymmetric Gradient Correction

Chengchao Zhang, Fanhua Shang, Hongying Liu et al.

ICCV 2025poster
#1513

Joint Learning of Pose Regression and Denoising Diffusion with Score Scaling Sampling for Category-level 6D Pose Estimation

Seunghyun Lee, Tae-Kyun Kim

ICCV 2025posterarXiv:2510.04125
#1514

Intra-modal and Cross-modal Synchronization for Audio-visual Deepfake Detection and Temporal Localization

Ashutosh Anshul, Shreyas Gopal, Deepu Rajan et al.

ICCV 2025poster
#1515

CoTMR: Chain-of-Thought Multi-Scale Reasoning for Training-Free Zero-Shot Composed Image Retrieval

Zelong Sun, Dong Jing, Zhiwu Lu

ICCV 2025posterarXiv:2502.20826
#1516

The Curse of Conditions: Analyzing and Improving Optimal Transport for Conditional Flow-Based Generation

Ho Kei Cheng, Alex Schwing

ICCV 2025posterarXiv:2503.10636
#1517

DepthSync: Diffusion Guidance-Based Depth Synchronization for Scale- and Geometry-Consistent Video Depth Estimation

Yue-Jiang Dong, Wang Zhao, Jiale Xu et al.

ICCV 2025posterarXiv:2507.01603
#1518

InfiniDreamer: Arbitrarily Long Human Motion Generation via Segment Score Distillation

Wenjie Zhuo, Fan Ma, Hehe Fan

ICCV 2025posterarXiv:2411.18303
#1519

Client2Vec: Improving Federated Learning by Distribution Shifts Aware Client Indexing

Yongxin Guo, Lin Wang, Xiaoying Tang et al.

ICCV 2025posterarXiv:2405.16233
#1520

Instance-Level Video Depth in Groups Beyond Occlusions

Yuan Liang, Yang Zhou, Ziming Sun et al.

ICCV 2025poster
#1521

Future-Aware Interaction Network For Motion Forecasting

Shijie Li, Chunyu Liu, Xun Xu et al.

ICCV 2025posterarXiv:2503.06565
#1522

DreamCube: RGB-D Panorama Generation via Multi-plane Synchronization

Yukun Huang, Yanning Zhou, Jianan Wang et al.

ICCV 2025poster
#1523

Optical Model-Driven Sharpness Mapping for Autofocus in Small Depth-of-Field and Severe Defocus Scenarios

Chen-Liang Fan, Mingpei Cao, Chih-Chien Hung et al.

ICCV 2025poster
#1524

HyPiDecoder: Hybrid Pixel Decoder for Efficient Segmentation and Detection

Fengzhe Zhou, Humphrey Shi

ICCV 2025poster
#1525

MMAD: Multi-label Micro-Action Detection in Videos

Kun Li, pengyu Liu, Dan Guo et al.

ICCV 2025posterarXiv:2407.05311
#1526

Omni-scene Perception-oriented Point Cloud Geometry Enhancement for Coordinate Quantization

Wang Liu, Wei Gao

ICCV 2025poster
#1527

Auto-Regressive Transformation for Image Alignment

Kanggeon Lee, Soochahn Lee, Kyoung Mu Lee

ICCV 2025posterarXiv:2505.04864
#1528

Training-Free Industrial Defect Generation with Diffusion Models

Ruyi Xu, Yen-Tzu Chiu, Tai-I Chen et al.

ICCV 2025poster
#1529

Principles of Visual Tokens for Efficient Video Understanding

Xinyue Hao, Li, Shreyank Gowda et al.

ICCV 2025posterarXiv:2411.13626
#1530

TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion Models

Ruidong Chen, honglin guo, Lanjun Wang et al.

ICCV 2025posterarXiv:2503.07389
#1531

Task-Aware Prompt Gradient Projection for Parameter-Efficient Tuning Federated Class-Incremental Learning

Hualong Ke, Yachao Zhang, Jiangming Shi et al.

ICCV 2025poster
#1532

Safeguarding Vision-Language Models: Mitigating Vulnerabilities to Gaussian Noise in Perturbation-based Attacks

Jiawei Wang, Yushen Zuo, Yuanjun Chai et al.

ICCV 2025posterarXiv:2504.01308
#1533

Diff2I2P: Differentiable Image-to-Point Cloud Registration with Diffusion Prior

Juncheng Mu, Chengwei REN, Weixiang Zhang et al.

ICCV 2025poster
#1534

KOEnsAttack: Towards Efficient Data-Free Black-Box Adversarial Attacks via Knowledge-Orthogonalized Substitute Ensembles

Chaoyong Yang, Jia-Li Yin, Bin Chen et al.

ICCV 2025poster
#1535

CLIP-Adapted Region-to-Text Learning for Generative Open-Vocabulary Semantic Segmentation

Jiannan Ge, Lingxi Xie, Hongtao Xie et al.

ICCV 2025poster
#1536

PersonalVideo: High ID-Fidelity Video Customization without Dynamic and Semantic Degradation

Hengjia Li, Haonan Qiu, Shiwei Zhang et al.

ICCV 2025posterarXiv:2411.17048
#1537

FuXi-RTM: A Physics-Guided Prediction Framework with Radiative Transfer Modeling

qiusheng huang, Xiaohui Zhong, Xu Fan et al.

ICCV 2025highlightarXiv:2503.19940
#1538

DMQ: Dissecting Outliers of Diffusion Models for Post-Training Quantization

Dongyeun Lee, jiwan hur, Hyounguk Shon et al.

ICCV 2025posterarXiv:2507.12933
#1539

ImHead: A Large-scale Implicit Morphable Model for Localized Head Modeling

Rolandos Alexandros Potamias, Stathis Galanakis, Jiankang Deng et al.

ICCV 2025posterarXiv:2510.10793
#1540

Discovering Divergent Representations between Text-to-Image Models

Lisa Dunlap, Trevor Darrell, Joseph Gonzalez et al.

ICCV 2025posterarXiv:2509.08940
#1541

Clink! Chop! Thud! - Learning Object Sounds from Real-World Interactions

Mengyu Yang, Yiming Chen, Haozheng Pei et al.

ICCV 2025posterarXiv:2510.02313
#1542

Learning Visual Proxy for Compositional Zero-Shot Learning

Shiyu Zhang, Cheng Yan, Yang Liu et al.

ICCV 2025posterarXiv:2501.13859
#1543

LGA-Net: Learning Local and Global Affinities for Sparse Scribble based Image Colorization

Hongjin Lyu, Bo Li, Paul Rosin et al.

ICCV 2025poster
#1544

DISTIL: Data-Free Inversion of Suspicious Trojan Inputs via Latent Diffusion

Hossein Mirzaei, Zeinab Taghavi, Sepehr Rezaee et al.

ICCV 2025posterarXiv:2507.22813
#1545

Factorized Learning for Temporally Grounded Video-Language Models

Wenzheng Zeng, Difei Gao, Mike Zheng Shou et al.

ICCV 2025posterarXiv:2512.24097
#1546

MissRAG: Addressing the Missing Modality Challenge in Multimodal Large Language Models

Vittorio Pipoli, Alessia Saporita, Federico Bolelli et al.

ICCV 2025poster
#1547

Unknown Text Learning for CLIP-based Few-Shot Open-set Recognition

Rui Ma, Qilong Wang, Bing Cao et al.

ICCV 2025poster
#1548

Learning Separable Fine-Grained Representation via Dendrogram Construction from Coarse Labels for Fine-grained Visual Recognition

Guanghui Shi, Xuefeng liang, Wenjie Li et al.

ICCV 2025poster
#1549

Rethinking the Upsampling Process in Light Field Super-Resolution with Spatial-Epipolar Implicit Image Function

Ruixuan Cong, Yu Wang, Mingyuan Zhao et al.

ICCV 2025poster
#1550

Harnessing Input-Adaptive Inference for Efficient VLN

Dongwoo Kang, Akhil Perincherry, Zachary Coalson et al.

ICCV 2025posterarXiv:2508.09262
#1551

LA-MOTR: End-to-End Multi-Object Tracking by Learnable Association

Peng Wang, Yongcai Wang, Hualong Cao et al.

ICCV 2025poster
#1552

Mind the Gap: Preserving and Compensating for the Modality Gap in CLIP-Based Continual Learning

Linlan Huang, Xusheng Cao, Haori Lu et al.

ICCV 2025highlightarXiv:2507.09118
#1553

Uncalibrated Structure from Motion on a Sphere

Jonathan Ventura, Viktor Larsson, Fredrik Kahl

ICCV 2025poster
#1554

Attention to the Burtiness in Visual Prompt Tuning!

Yuzhu Wang, Manni Duan, Shu Kong

ICCV 2025poster
#1555

A Linear N-Point Solver for Structure and Motion from Asynchronous Tracks

Hang Su, Yunlong Feng, Daniel Gehrig et al.

ICCV 2025highlightarXiv:2507.22733
#1556

Multimodal Large Language Model-Guided ISP Hyperparameter Optimization with Dynamic Preference Learning

Xinyu Sun, Zhikun Zhao, congyan lang et al.

ICCV 2025poster
#1557

Deciphering Cross-Modal Alignment in Large Vision-Language Models via Modality Integration Rate

Qidong Huang, Xiaoyi Dong, Pan Zhang et al.

ICCV 2025poster
#1558

Doodle Your Keypoints: Sketch-Based Few-Shot Keypoint Detection

Subhajit Maity, Ayan Bhunia, Subhadeep Koley et al.

ICCV 2025posterarXiv:2507.07994
#1559

ShortV: Efficient Multimodal Large Language Models by Freezing Visual Tokens in Ineffective Layers

Qianhao Yuan, Qingyu Zhang, yanjiang liu et al.

ICCV 2025posterarXiv:2504.00502
#1560

Improving Noise Efficiency in Privacy-preserving Dataset Distillation

Runkai Zheng, Vishnu Dasu, Yinong Wang et al.

ICCV 2025posterarXiv:2508.01749
#1561

HAMSt3R: Human-Aware Multi-view Stereo 3D Reconstruction

Sara Rojas Martinez, Matthieu Armando, Bernard Ghanem et al.

ICCV 2025posterarXiv:2508.16433
#1562

NegRefine: Refining Negative Label-Based Zero-Shot OOD Detection

Amirhossein Ansari, Ke Wang, Pulei Xiong

ICCV 2025posterarXiv:2507.09795
#1563

GCAV: A Global Concept Activation Vector Framework for Cross-Layer Consistency in Interpretability

Zhenghao He, Sanchit Sinha, Guangzhi Xiong et al.

ICCV 2025posterarXiv:2508.21197
#1564

Semi-ViM: Bidirectional State Space Model for Mitigating Label Imbalance in Semi-Supervised Learning

Hongyang He, Hongyang Xie, Haochen You et al.

ICCV 2025poster
#1565

Beyond the Limits: Overcoming Negative Correlation of Activation-Based Training-Free NAS

Haidong Kang, Lianbo Ma, Pengjun Chen et al.

ICCV 2025poster
#1566

Semi-supervised Deep Transfer for Regression without Domain Alignment

Mainak Biswas, Ambedkar Dukkipati, Devarajan Sridharan

ICCV 2025posterarXiv:2509.05092
#1567

From Easy to Hard: The MIR Benchmark for Progressive Interleaved Multi-Image Reasoning

Hang Du, Jiayang Zhang, Guoshun Nan et al.

ICCV 2025posterarXiv:2509.17040
#1568

Diffusion Guided Adaptive Augmentation for Generalization in Visual Reinforcement Learning

Jeong Woon Lee, Hyoseok Hwang

ICCV 2025poster
#1569

RainbowPrompt: Diversity-Enhanced Prompt-Evolving for Continual Learning

Kiseong Hong, Gyeong-Hyeon Kim, Eunwoo Kim

ICCV 2025posterarXiv:2507.22553
#1570

A Good Teacher Adapts Their Knowledge for Distillation

Chengyao Qian, Trung Le, Mehrtash Harandi

ICCV 2025poster
#1571

SIGMAN: Scaling 3D Human Gaussian Generation with Millions of Assets

Yuhang Yang, Fengqi Liu, Yixing Lu et al.

ICCV 2025poster
#1572

Extending Foundational Monocular Depth Estimators to Fisheye Cameras with Calibration Tokens

Suchisrit Gangopadhyay, Jung Hee Kim, Xien Chen et al.

ICCV 2025posterarXiv:2508.04928
#1573

TWIST & SCOUT: Grounding Multimodal LLM-Experts by Forget-Free Tuning

Aritra Bhowmik, Mohammad Mahdi Derakhshani, Dennis Koelma et al.

ICCV 2025posterarXiv:2410.10491
#1574

CE-FAM: Concept-Based Explanation via Fusion of Activation Maps

Michihiro Kuroki, Toshihiko Yamasaki

ICCV 2025posterarXiv:2509.23849
#1575

PEFTDiff: Diffusion-Guided Transferability Estimation for Parameter-Efficient Fine-Tuning

PRAFFUL KHOBA, Zijian Wang, Chetan Arora et al.

ICCV 2025poster
#1576

FedMeNF: Privacy-Preserving Federated Meta-Learning for Neural Fields

Junhyeog Yun, Minui Hong, Gunhee Kim

ICCV 2025posterarXiv:2508.06301
#1577

Boosting Class Representation via Semantically Related Instances for Robust Long-Tailed Learning with Noisy Labels

Yuhang Li, Zhuying Li, Yuheng Jia

ICCV 2025poster
#1578

CAT: A Unified Click-and-Track Framework for Realistic Tracking

Yongsheng Yuan, Jie Zhao, Dong Wang et al.

ICCV 2025poster
#1579

Diffusion-Based Extreme High-speed Scenes Reconstruction with the Complementary Vision Sensor

Yapeng Meng, Yihan Lin, Taoyi Wang et al.

ICCV 2025poster
#1580

DiffuMatch: Category-Agnostic Spectral Diffusion Priors for Robust Non-rigid Shape Matching

Emery Pierson, Lei Li, Angela Dai et al.

ICCV 2025posterarXiv:2507.23715
#1581

SAC-GNC: SAmple Consensus for adaptive Graduated Non-Convexity

Valter Piedade, Chitturi Sidhartha, José Gaspar et al.

ICCV 2025highlight
#1582

Towards Real Unsupervised Anomaly Detection Via Confident Meta-Learning

Muhammad Aqeel, Shakiba Sharifi, Marco Cristani et al.

ICCV 2025posterarXiv:2508.02293
#1583

NavQ: Learning a Q-Model for Foresighted Vision-and-Language Navigation

Peiran Xu, Xicheng Gong, Yadong Mu

ICCV 2025posterarXiv:2510.16457
#1584

GeoDiffusion: A Training-Free Framework for Accurate 3D Geometric Conditioning in Image Generation

Phillip Mueller, Talip Ünlü, Sebastian Schmidt et al.

ICCV 2025posterarXiv:2510.22337
#1585

Scaling 3D Compositional Models for Robust Classification and Pose Estimation

Xiaoding Yuan, Prakhar Kaushik, Guofeng Zhang et al.

ICCV 2025poster
#1586

Zero-AVSR: Zero-Shot Audio-Visual Speech Recognition with LLMs by Learning Language-Agnostic Speech Representations

Jeong Hun Yeo, Minsu Kim, Chae Won Kim et al.

ICCV 2025posterarXiv:2503.06273
#1587

StableDepth: Scene-Consistent and Scale-Invariant Monocular Depth

Zheng Zhang, Lihe Yang, Tianyu Yang et al.

ICCV 2025highlight
#1588

4DSegStreamer: Streaming 4D Panoptic Segmentation via Dual Threads

Ling Liu, Jun Tian, Li Yi

ICCV 2025posterarXiv:2510.17664
#1589

Dynamic Point Maps: A Versatile Representation for Dynamic 3D Reconstruction

Edgar Sucar, Zihang Lai, Eldar Insafutdinov et al.

ICCV 2025highlightarXiv:2503.16318
#1590

VOVTrack: Exploring the Potentiality in Raw Videos for Open-Vocabulary Multi-Object Tracking

Zekun Qian, Ruize Han, Junhui Hou et al.

ICCV 2025poster
#1591

C4D: 4D Made from 3D through Dual Correspondences

Shizun Wang, Zhenxiang Jiang, Xingyi Yang et al.

ICCV 2025posterarXiv:2510.14960
#1592

Princeton365: A Diverse Dataset with Accurate Camera Pose

Karhan Kayan, Stamatis Alexandropoulos, Rishabh Jain et al.

ICCV 2025posterarXiv:2506.09035
#1593

From Abyssal Darkness to Blinding Glare: A Benchmark on Extreme Exposure Correction in Real World

Bo Wang, Huiyuan Fu, Zhiye Huang et al.

ICCV 2025poster
#1594

Voyaging into Perpetual Dynamic Scenes from a Single View

Fengrui Tian, Tianjiao Ding, Jinqi Luo et al.

ICCV 2025posterarXiv:2507.04183
#1595

AJAHR: Amputated Joint Aware 3D Human Mesh Recovery

hyunjin cho, Giyun choi, Jongwon Choi

ICCV 2025posterarXiv:2509.19939
#1596

One Look is Enough: Seamless Patchwise Refinement for Zero-Shot Monocular Depth Estimation on High-Resolution Images

Byeongjun Kwon, Munchurl Kim

ICCV 2025posterarXiv:2503.22351
#1597

DialNav: Multi-turn Dialog Navigation with a Remote Guide

Leekyeung Han, Hyunji Min, Gyeom Hwangbo et al.

ICCV 2025posterarXiv:2509.12894
#1598

PS-Mamba: Spatial-Temporal Graph Mamba for Pose Sequence Refinement

Haoye Dong, Gim Hee Lee

ICCV 2025poster
#1599

RnGCam: High-speed video from rolling & global shutter measurements

Kevin Tandi, Xiang Dai, Chinmay Talegaonkar et al.

ICCV 2025posterarXiv:2509.18087
#1600

Easi3R: Estimating Disentangled Motion from DUSt3R Without Training

Xingyu Chen, Yue Chen, Yuliang Xiu et al.

ICCV 2025posterarXiv:2503.24391