Most Cited ICCV "temporal redundancy reduction" Papers

2,701 papers found • Page 12 of 14

#2201

LDIP: Long Distance Information Propagation for Video Super-Resolution

Michael Bernasconi, Abdelaziz Djelouah, Yang Zhang et al.

ICCV 2025
#2202

ISP2HRNet: Learning to Reconstruct High Resolution Image from Irregularly Sampled Pixels via Hierarchical Gradient Learning

Yuanlin Wang, Ruiqin Xiong, Rui Zhao et al.

ICCV 2025 • highlight
#2203

Sparsity Outperforms Low-Rank Projections in Few-Shot Adaptation

Nairouz Mrabah, Nicolas Richet, Ismail Ayed et al.

ICCV 2025 • arXiv:2504.12436
#2204

Continuous-Time Human Motion Field from Event Cameras

Ziyun Wang, Ruijun Zhang, Zi-Yan Liu et al.

ICCV 2025
#2205

Neural Architecture Search Driven by Locally Guided Diffusion for Personalized Federated Learning

PENG LIAO, Xilu Wang, Yaochu Jin et al.

ICCV 2025
#2206

Hierarchical 3D Scene Graphs Construction Outdoors

Jon Nyffeler, Federico Tombari, Daniel Barath

ICCV 2025
#2207

Cycle-Consistent Learning for Joint Layout-to-Image Generation and Object Detection

Xinhao Cai, Qiuxia Lai, Gensheng Pei et al.

ICCV 2025
#2208

WarpHE4D: Dense 4D Head Map toward Full Head Reconstruction

Jongseob Yun, Yong-Hoon Kwon, Min-Gyu Park et al.

ICCV 2025
#2209

Federated Representation Angle Learning

Liping Yi, Han Yu, Gang Wang et al.

ICCV 2025
#2210

Bridging Local Inductive Bias and Long-Range Dependencies with Pixel-Mamba for End-to-end Whole Slide Image Analysis

Zhongwei Qiu, Hanqing Chao, Tiancheng Lin et al.

ICCV 2025
#2211

Neuroverse3D: Developing In-Context Learning Universal Model for Neuroimaging in 3D

Jiesi Hu, Hanyang Peng, Yanwu Yang et al.

ICCV 2025 • arXiv:2503.02410
#2212

Laboring on less labors: RPCA Paradigm for Pan-sharpening

honghui xu, Chuangjie Fang, Yibin Wang et al.

ICCV 2025
#2213

Punching Bag vs. Punching Person: Motion Transferability in Videos

Raiyaan Abdullah, Jared Claypoole, Michael Cogswell et al.

ICCV 2025 • arXiv:2508.00085
#2214

Robust Test-Time Adaptation for Single Image Denoising Using Deep Gaussian Prior

Qing Ma, Pengwei Liang, Xiong Zhou et al.

ICCV 2025
#2215

Causal-Entity Reflected Egocentric Traffic Accident Video Synthesis

Lei-lei Li, Jianwu Fang, Junbin Xiao et al.

ICCV 2025 • arXiv:2506.23263
#2216

Incremental Few-Shot Semantic Segmentation via Multi-Level Switchable Visual Prompts

Maoxian Wan, Kaige Li, Qichuan Geng et al.

ICCV 2025
#2217

TrackVerse: A Large-Scale Object-Centric Video Dataset for Image-Level Representation Learning

Yibing Wei, Samuel Church, Victor Suciu et al.

ICCV 2025
#2218

DGTalker: Disentangled Generative Latent Space Learning for Audio-Driven Gaussian Talking Heads

Xiaoxi Liang, Yanbo Fan, Qiya Yang et al.

ICCV 2025
#2219

StyleSRN: Scene Text Image Super-Resolution with Text Style Embedding

Shengrong Yuan, Runmin Wang, Ke Hao et al.

ICCV 2025
#2220

Frequency-Guided Diffusion for Training-Free Text-Driven Image Translation

Zheng Gao, Jifei Song, Zhensong Zhang et al.

ICCV 2025
#2221

Less is More: Improving Motion Diffusion Models with Sparse Keyframes

Jinseok Bae, Inwoo Hwang, Young-Yoon Lee et al.

ICCV 2025 • arXiv:2503.13859
#2222

Drawing Developmental Trajectory from Cortical Surface Reconstruction

WENXUAN WU, ruowen qu, Zhongliang Liu et al.

ICCV 2025
#2223

Frequency-Semantic Enhanced Variational Autoencoder for Zero-Shot Skeleton-based Action Recognition

Wenhan Wu, Zhishuai Guo, Chen Chen et al.

ICCV 2025 • arXiv:2506.22179
#2224

How Do Multimodal Large Language Models Handle Complex Multimodal Reasoning? Placing Them in An Extensible Escape Game

Ziyue Wang, Yurui Dong, Fuwen Luo et al.

ICCV 2025
#2225

Towards Human-like Virtual Beings: Simulating Human Behavior in 3D Scenes

CHEN LIANG, Wenguan Wang, Yi Yang

ICCV 2025
#2226

Cross-Category Subjectivity Generalization for Style-Adaptive Sketch Re-ID

Zechao Hu, Zhengwei Yang, Hao Li et al.

ICCV 2025
#2227

S3R-GS: Streamlining the Pipeline for Large-Scale Street Scene Reconstruction

Guangting Zheng, Jiajun Deng, Xiaomeng Chu et al.

ICCV 2025 • arXiv:2503.08217
#2228

The Source Image is the Best Attention for Infrared and Visible Image Fusion

Song Wang, Xie Han, Liqun Kuang et al.

ICCV 2025
#2229

Blind Noisy Image Deblurring Using Residual Guidance Strategy

Heyan Liu, Jianing Sun, Jun Liu et al.

ICCV 2025
#2230

MonSTeR: a Unified Model for Motion, Scene, Text Retrieval

Luca Collorone, Matteo Gioia, Massimiliano Pappa et al.

ICCV 2025 • arXiv:2510.03200
#2231

E-NeMF: Event-based Neural Motion Field for Novel Space-time View Synthesis of Dynamic Scenes

Yan Liu, Zehao Chen, Haojie Yan et al.

ICCV 2025
#2232

Learnable Fractional Reaction-Diffusion Dynamics for Under-Display ToF Imaging and Beyond

Xin Qiao, Matteo Poggi, Xing Wei et al.

ICCV 2025 • arXiv:2511.01704
#2233

Multi-modal Identity Extraction

Ryan Webster, Teddy Furon

ICCV 2025
#2234

Reference-based Super-Resolution via Image-based Retrieval-Augmented Generation Diffusion

Byeonghun Lee, Hyunmin Cho, Honggyu Choi et al.

ICCV 2025
#2235

Deep Adaptive Unfolded Network via Spatial Morphology Stripping and Spectral Filtration for Pan-sharpening

Hebaixu Wang, Jiayi Ma

ICCV 2025
#2236

Open-World Skill Discovery from Unsegmented Demonstration Videos

Jingwen Deng, Zihao Wang, Shaofei Cai et al.

ICCV 2025
#2237

InteractAvatar: Modeling Hand-Face Interaction in Photorealistic Avatars with Deformable Gaussians

Kefan Chen, Sergiu Oprea, Justin Theiss et al.

ICCV 2025 • arXiv:2504.07949
#2238

Local Scale Equivariance with Latent Deep Equilibrium Canonicalizer

Md Ashiqur Rahman, Chiao-An Yang, Michael N Cheng et al.

ICCV 2025 • arXiv:2508.14187
#2239

Task-Oriented Human Grasp Synthesis via Context- and Task-Aware Diffusers

An Lun Liu, Yu-Wei Chao, Yi-Ting Chen

ICCV 2025 • arXiv:2507.11287
#2240

Wave-MambaAD: Wavelet-driven State Space Model for Multi-class Unsupervised Anomaly Detection

Qiao Zhang, Mingwen Shao, Xinyuan Chen et al.

ICCV 2025
#2241

3D Test-time Adaptation via Graph Spectral Driven Point Shift

Xin Wei, Qin Yang, Yijie Fang et al.

ICCV 2025 • arXiv:2507.18225
#2242

Task-Decoupled Bézier Surface Constraint for Uneven Low-Light Image Enhancement

Xingxiang Zhou, Xiangdong Su, Haoran Zhang et al.

ICCV 2025
#2243

EMoTive: Event-guided Trajectory Modeling for 3D Motion Estimation

Zengyu Wan, Wei Zhai, Yang Cao et al.

ICCV 2025 • arXiv:2503.11371
#2244

RoboAnnotatorX: A Comprehensive and Universal Annotation Framework for Accurate Understanding of Long-horizon Robot Demonstration

Longxin Kou, Fei Ni, Jianye HAO et al.

ICCV 2025
#2245

What If: Understanding Motion Through Sparse Interactions

Stefan A. Baumann, Nick Stracke, Timy Phan et al.

ICCV 2025
#2246

GeoDistill: Geometry-Guided Self-Distillation for Weakly Supervised Cross-View Localization

Shaowen Tong, Zimin Xia, Alexandre Alahi et al.

ICCV 2025 • arXiv:2507.10935
#2247

KDA: Knowledge Diffusion Alignment with Enhanced Context for Video Temporal Grounding

Ran Ran, Jiwei Wei, Shiyuan He et al.

ICCV 2025
#2248

Error Recognition in Procedural Videos using Generalized Task Graph

Shih-Po Lee, Ehsan Elhamifar

ICCV 2025
#2249

STEP-DETR: Advancing DETR-based Semi-Supervised Object Detection with Super Teacher and Pseudo-Label Guided Text Queries

Tahira Shehzadi, Khurram Azeem Hashmi, Shalini Sarode et al.

ICCV 2025
#2250

Text-to-Any-Skeleton Motion Generation Without Retargeting

Qingyuan Liu, Ke Lv, Kun Dong et al.

ICCV 2025
#2251

Completing 3D Partial Assemblies with View-Consistent 2D-3D Correspondence

Weihao Wang, Yu Lan, Mingyu You et al.

ICCV 2025
#2252

Aligning Global Semantics and Local Textures in Generative Video Enhancement

Zhikai Chen, Fuchen Long, Zhaofan Qiu et al.

ICCV 2025
#2253

Simulating Dual-Pixel Images From Ray Tracing For Depth Estimation

Fengchen He, Dayang Zhao, Hao Xu et al.

ICCV 2025 • arXiv:2503.11213
#2254

WalkVLM: Aid Visually Impaired People Walking by Vision Language Model

Zhiqiang Yuan, Ting Zhang, Yeshuang Zhu et al.

ICCV 2025
#2255

Unified Category-Level Object Detection and Pose Estimation from RGB Images using 3D Prototypes

Tom Fischer, Xiaojie Zhang, Eddy Ilg

ICCV 2025 • arXiv:2508.02157
#2256

Proactive Scene Decomposition and Reconstruction

Baicheng Li, Zike Yan, Dong Wu et al.

ICCV 2025 • arXiv:2510.16272
#2257

PASD: A Pixel-Adaptive Swarm Dynamics Approach for Unsupervised Low-Light Image Enhancement

Shuai Jin, Yuhua Qian, Feijiang Li et al.

ICCV 2025
#2258

Bridging the Sky and Ground: Towards View-Invariant Feature Learning for Aerial-Ground Person Re-Identification

Wajahat Khalid, Bin Liu, Xulin Li et al.

ICCV 2025
#2259

Combinative Matching for Geometric Shape Assembly

Nahyuk Lee, Juhong Min, Junhong Lee et al.

ICCV 2025 • highlight • arXiv:2508.09780
#2260

Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images

Philipp Wulff, Felix Wimbauer, Dominik Muhle et al.

ICCV 2025 • arXiv:2508.02323
#2261

Auto-Regressive Transformation for Image Alignment

Kanggeon Lee, Soochahn Lee, Kyoung Mu Lee

ICCV 2025 • arXiv:2505.04864
#2262

Training-Free Industrial Defect Generation with Diffusion Models

Ruyi Xu, Yen-Tzu Chiu, Tai-I Chen et al.

ICCV 2025
#2263

GECO: Geometrically Consistent Embedding with Lightspeed Inference

Regine Hartwig, Dominik Muhle, Riccardo Marin et al.

ICCV 2025 • arXiv:2508.00746
#2264

GenHaze: Pioneering Controllable One-Step Realistic Haze Generation for Real-World Dehazing

Sixiang Chen, Tian Ye, Yunlong Lin et al.

ICCV 2025
#2265

WINS: Winograd Structured Pruning for Fast Winograd Convolution

Cheonjun Park, Hyunjae Oh, Mincheol Park et al.

ICCV 2025 • highlight
#2266

DuET: Dual Incremental Object Detection via Exemplar-Free Task Arithmetic

Munish Monga, Vishal Chudasama, Pankaj Wasnik et al.

ICCV 2025 • arXiv:2506.21260
#2267

ReCoT: Reflective Self-Correction Training for Mitigating Confirmation Bias in Large Vision-Language Models

Mengxue Qu, Yibo Hu, Kunyang Han et al.

ICCV 2025
#2268

A View-consistent Sampling Method for Regularized Training of Neural Radiance Fields

Aoxiang Fan, Corentin Dumery, Nicolas Talabot et al.

ICCV 2025 • arXiv:2507.04408
#2269

MixRI: Mixing Features of Reference Images for Novel Object Pose Estimation

Xinhang Liu, Jiawei Shi, Zheng Dang et al.

ICCV 2025 • arXiv:2601.06883
#2270

RnGCam: High-speed video from rolling & global shutter measurements

Kevin Tandi, Xiang Dai, Chinmay Talegaonkar et al.

ICCV 2025 • arXiv:2509.18087
#2271

Test-Time Retrieval-Augmented Adaptation for Vision-Language Models

Xinqi Fan, Xueli CHEN, Luoxiao Yang et al.

ICCV 2025
#2272

SAM Encoder Breach by Adversarial Simplicial Complex Triggers Downstream Model Failures

Yi Qin, Rui Wang, Tao Huang et al.

ICCV 2025 • arXiv:2508.06127
#2273

CHARM3R: Towards Unseen Camera Height Robust Monocular 3D Detector

Abhinav Kumar, Yuliang Guo, Zhihao Zhang et al.

ICCV 2025 • arXiv:2508.11185
#2274

Environment-Agnostic Pose: Generating Environment-independent Object Representations for 6D Pose Estimation

Shaobo Zhang, Yuhang Huang, Wanqing Zhao et al.

ICCV 2025
#2275

Spatial Alignment and Temporal Matching Adapter for Video-Radar Remote Physiological Measurement

Qian Liang, Ruixu Geng, Jinbo Chen et al.

ICCV 2025
#2276

Gaze-Language Alignment for Zero-Shot Prediction of Visual Search Targets from Human Gaze Scanpaths

Sounak Mondal, Naveen Sendhilnathan, Ting Zhang et al.

ICCV 2025
#2277

PS-Mamba: Spatial-Temporal Graph Mamba for Pose Sequence Refinement

Haoye Dong, Gim Hee Lee

ICCV 2025
#2278

Event-guided Unified Framework for Low-light Video Enhancement, Frame Interpolation, and Deblurring

Taewoo Kim, Kuk-Jin Yoon

ICCV 2025
#2279

Beyond Pixel Uncertainty: Bounding the OoD Objects in Road Scenes

Huachao Zhu, Zelong Liu, Zhichao Sun et al.

ICCV 2025
#2280

A Constrained Optimization Approach for Gaussian Splatting from Coarsely-posed Images and Noisy Lidar Point Clouds

Jizong Peng, Tze Ho Elden Tse, Kai Xu et al.

ICCV 2025 • highlight • arXiv:2504.09129
#2281

Conditional Visual Autoregressive Modeling for Pathological Image Restoration

Ziyi Liu, Zhe Xu, Jiabo MA et al.

ICCV 2025
#2282

LGA-Net: Learning Local and Global Affinities for Sparse Scribble based Image Colorization

Hongjin Lyu, Bo Li, Paul Rosin et al.

ICCV 2025
#2283

Medical World Model

Yijun Yang, Zhao-Yang Wang, Qiuping Liu et al.

ICCV 2025
#2284

ImageGem: In-the-wild Generative Image Interaction Dataset for Generative Model Personalization

Yuanhe Guo, Linxi Xie, Zhuoran Chen et al.

ICCV 2025 • arXiv:2510.18433
#2285

EYE3: Turn Anything into Naked-eye 3D

Yingde Song, Zongyuan Yang, Baolin Liu et al.

ICCV 2025
#2286

C2MIL: Synchronizing Semantic and Topological Causalities in Multiple Instance Learning for Robust and Interpretable Survival Analysis

Min Cen, Zhenfeng Zhuang, Yuzhe Zhang et al.

ICCV 2025
#2287

Partially Matching Submap Helps: Uncertainty Modeling and Propagation for Text to Point Cloud Localization

Mingtao Feng, Longlong Mei, Zijie Wu et al.

ICCV 2025
#2288

TopicGeo: An Efficient Unified Framework for Geolocation

Xin Wang, Xinlin Wang, Shuiping Gou

ICCV 2025
#2289

High-Resolution Spatiotemporal Modeling with Global-Local State Space Models for Video-Based Human Pose Estimation

Runyang Feng, Hyung Jin Chang, Tze Ho Elden Tse et al.

ICCV 2025 • arXiv:2510.11017
#2290

Learning Visual Proxy for Compositional Zero-Shot Learning

Shiyu Zhang, Cheng Yan, Yang Liu et al.

ICCV 2025 • arXiv:2501.13859
#2291

CObL: Toward Zero-Shot Ordinal Layering without User Prompting

Aneel Damaraju, Dean Hazineh, Todd Zickler

ICCV 2025 • highlight • arXiv:2508.08498
#2292

Adapting Vehicle Detectors for Aerial Imagery to Unseen Domains with Weak Supervision

Xiao Fang, Minhyek Jeon, Zheyang Qin et al.

ICCV 2025 • arXiv:2507.20976
#2293

One Look is Enough: Seamless Patchwise Refinement for Zero-Shot Monocular Depth Estimation on High-Resolution Images

Byeongjun Kwon, Munchurl Kim

ICCV 2025 • arXiv:2503.22351
#2294

Background Invariance Testing According to Semantic Proximity

Zukang Liao, Min Chen

ICCV 2025 • arXiv:2208.09286
#2295

Clink! Chop! Thud! - Learning Object Sounds from Real-World Interactions

Mengyu Yang, Yiming Chen, Haozheng Pei et al.

ICCV 2025 • arXiv:2510.02313
#2296

TryOn-Refiner: Conditional Rectified-flow-based TryOn Refiner for More Accurate Detail Reconstruction

Wen Qian

ICCV 2025
#2297

EquiCaps: Predictor-Free Pose-Aware Pre-Trained Capsule Networks

Athinoulla Konstantinou, Georgios Leontidis, Mamatha Thota et al.

ICCV 2025 • arXiv:2506.09895
#2298

Mitigating Catastrophic Overfitting in Fast Adversarial Training via Label Information Elimination

Chao Pan, Ke Tang, Li Qing et al.

ICCV 2025
#2299

AJAHR: Amputated Joint Aware 3D Human Mesh Recovery

hyunjin cho, Giyun choi, Jongwon Choi

ICCV 2025 • arXiv:2509.19939
#2300

SpikeDiff: Zero-shot High-Quality Video Reconstruction from Chromatic Spike Camera and Sub-millisecond Spike Streams

Siqi Yang, Jinxiu Liang, Zhaojun Huang et al.

ICCV 2025
#2301

Scoring, Remember, and Reference: Catching Camouflaged Objects in Videos

Yuang Feng, Shuyong Gao, Fuzhen Yan et al.

ICCV 2025 • arXiv:2503.17050
#2302

Global Motion Corresponder for 3D Point-Based Scene Interpolation under Large Motion

Junru Lin, Chirag Vashist, Mikaela Uy et al.

ICCV 2025 • arXiv:2508.20136
#2303

TESPEC: Temporally-Enhanced Self-Supervised Pretraining for Event Cameras

Mohammad Mohammadi, Ziyi Wu, Igor Gilitschenski

ICCV 2025 • arXiv:2508.00913
#2304

From Abyssal Darkness to Blinding Glare: A Benchmark on Extreme Exposure Correction in Real World

Bo Wang, Huiyuan Fu, Zhiye Huang et al.

ICCV 2025
#2305

Diffusion-based Source-biased Model for Single Domain Generalized Object Detection

Han Jiang, Wenfei Yang, Tianzhu Zhang et al.

ICCV 2025
#2306

H3R: Hybrid Multi-view Correspondence for Generalizable 3D Reconstruction

Heng Jia, Na Zhao, Linchao Zhu

ICCV 2025 • arXiv:2508.03118
#2307

Estimating 2D Camera Motion with Hybrid Motion Basis

Haipeng Li, Tianhao Zhou, Zhanglei Yang et al.

ICCV 2025 • arXiv:2507.22480
#2308

MonoSOWA: Scalable monocular 3D Object detector Without human Annotations

Jan Skvrna, Lukas Neumann

ICCV 2025 • arXiv:2501.09481
#2309

Discovering Divergent Representations between Text-to-Image Models

Lisa Dunlap, Trevor Darrell, Joseph Gonzalez et al.

ICCV 2025 • arXiv:2509.08940
#2310

ScoreHOI: Physically Plausible Reconstruction of Human-Object Interaction via Score-Guided Diffusion

AO LI, Jinpeng Liu, Yixuan Zhu et al.

ICCV 2025 • arXiv:2509.07920
#2311

PHD: Personalized 3D Human Body Fitting with Point Diffusion

Hsuan-I Ho, Chen Guo, Po-Chen Wu et al.

ICCV 2025 • arXiv:2508.21257
#2312

Understanding Personal Concept in Open-Vocabulary Semantic Segmentation

Sunghyun Park, Jungsoo Lee, Shubhankar Borse et al.

ICCV 2025 • arXiv:2507.11030
#2313

FuXi-RTM: A Physics-Guided Prediction Framework with Radiative Transfer Modeling

qiusheng huang, Xiaohui Zhong, Xu Fan et al.

ICCV 2025 • highlight • arXiv:2503.19940
#2314

Geometric Alignment and Prior Modulation for View-Guided Point Cloud Completion on Unseen Categories

Jingqiao Xiu, Yicong Li, Na Zhao et al.

ICCV 2025
#2315

Visual Interestingness Decoded: How GPT-4o Mirrors Human Interests

Fitim Abdullahu, Helmut Grabner

ICCV 2025 • arXiv:2510.13316
#2316

Motion-2-to-3: Leveraging 2D Motion Data for 3D Motion Generations

Ruoxi Guo, Huaijin Pi, Zehong Shen et al.

ICCV 2025
#2317

Image-Guided Shape-from-Template Using Mesh Inextensibility Constraints

Dinh-Vinh-Thuy Tran, Ruochen Chen, Shaifali Parashar

ICCV 2025 • arXiv:2507.22699
#2318

Towards Performance Consistency in Multi-Level Model Collaboration

Qi Li, Runpeng Yu, Xinchao Wang

ICCV 2025
#2319

AutoComPose: Automatic Generation of Pose Transition Descriptions for Composed Pose Retrieval Using Multimodal LLMs

Yi-Ting Shen, Sungmin Eum, Doheon Lee et al.

ICCV 2025 • arXiv:2503.22884
#2320

Tracking Tiny Drones against Clutter: Large-Scale Infrared Benchmark with Motion-Centric Adaptive Algorithm

Jiahao Zhang, Zongli Jiang, Gang Wang et al.

ICCV 2025
#2321

FVGen: Accelerating Novel-View Synthesis with Adversarial Video Diffusion Distillation

Wenbin Teng, Gonglin Chen, Haiwei Chen et al.

ICCV 2025 • arXiv:2508.06392
#2322

PVMamba: Parallelizing Vision Mamba via Dynamic State Aggregation

Fei Xie, Zhongdao Wang, Weijia Zhang et al.

ICCV 2025
#2323

SummDiff: Generative Modeling of Video Summarization with Diffusion

Kwanseok Kim, Jaehoon Hahm, Sumin Kim et al.

ICCV 2025 • highlight • arXiv:2510.08458
#2324

CoralSRT: Revisiting Coral Reef Semantic Segmentation by Feature Rectifying via Self-supervised Guidance

Zheng Ziqiang, Wong Kwan, Binh-Son Hua et al.

ICCV 2025
#2325

Feature Decomposition-Recomposition in Large Vision-Language Model for Few-Shot Class-Incremental Learning

Zongyao Xue, Meina Kan, Shiguang Shan et al.

ICCV 2025
#2326

Diagnosing Pretrained Models for Out-of-distribution Detection

Haipeng Xiong, Kai Xu, Angela Yao

ICCV 2025
#2327

RALoc: Enhancing Outdoor LiDAR Localization via Rotation Awareness

Yuyang Yang, Wen Li, Sheng Ao et al.

ICCV 2025 • highlight
#2328

CHORDS: Diffusion Sampling Accelerator with Multi-core Hierarchical ODE Solvers

Jiaqi Han, Haotian Ye, Puheng Li et al.

ICCV 2025 • arXiv:2507.15260
#2329

Enhanced Pansharpening via Quaternion Spatial-Spectral Interactions

Dong Li, Chunhui Luo, Yuanfei Bao et al.

ICCV 2025
#2330

Adversarial Training for Probabilistic Robustness

YI ZHANG, Yuhang Chen, Zhen Chen et al.

ICCV 2025
#2331

Learning to See Inside Opaque Liquid Containers using Speckle Vibrometry

Matan Kichler, Shai Bagon, Mark Sheinin

ICCV 2025 • arXiv:2507.20757
#2332

Scaling Omni-modal Pretraining with Multimodal Context: Advancing Universal Representation Learning Across Modalities

Yiyuan Zhang, Handong Li, Jing Liu et al.

ICCV 2025
#2333

LightBSR: Towards Lightweight Blind Super-Resolution via Discriminative Implicit Degradation Representation Learning

Jiang Yuan, ji ma, Bo Wang et al.

ICCV 2025 • arXiv:2506.22710
#2334

When Pixel Difference Patterns Meet ViT: PiDiViT for Few-Shot Object Detection

Hongliang Zhou, Yongxiang Liu, Canyu Mo et al.

ICCV 2025
#2335

VOccl3D: A Video Benchmark Dataset for 3D Human Pose and Shape Estimation under real Occlusions

Yash Garg, Saketh Bachu, Arindam Dutta et al.

ICCV 2025 • arXiv:2508.06757
#2336

Exploring View Consistency for Scene-Adaptive Low-Light Light Field Image Enhancement

Shuo Zhang, Chen Gao, Youfang Lin

ICCV 2025 • highlight
#2337

Learning Normals of Noisy Points by Local Gradient-Aware Surface Filtering

Qing Li, Huifang Feng, Xun Gong et al.

ICCV 2025 • arXiv:2507.03394
#2338

HccePose (BF): Predicting Front & Back Surfaces to Construct Ultra-Dense 2D-3D Correspondences for Pose Estimation

Yulin Wang, Mengting Hu, Hongli Li et al.

ICCV 2025 • highlight • arXiv:2510.10177
#2339

Bayesian-Inspired Space-Time Superpixels

Kent Gauen, Stanley Chan

ICCV 2025
#2340

Progressive Distribution Bridging: Unsupervised Adaptation for Large-scale Pre-trained Models via Adaptive Auxiliary Data

Weinan He, Yixin Zhang, Zilei Wang

ICCV 2025
#2341

Kaleidoscopic Background Attack: Disrupting Pose Estimation with Multi-Fold Radial Symmetry Textures

Xinlong Ding, Hongwei Yu, Jiawei Li et al.

ICCV 2025 • highlight • arXiv:2507.10265
#2342

Client2Vec: Improving Federated Learning by Distribution Shifts Aware Client Indexing

Yongxin Guo, Lin Wang, Xiaoying Tang et al.

ICCV 2025 • arXiv:2405.16233
#2343

4DSegStreamer: Streaming 4D Panoptic Segmentation via Dual Threads

Ling Liu, Jun Tian, Li Yi

ICCV 2025 • arXiv:2510.17664
#2344

INSTINCT: Instance-Level Interaction Architecture for Query-Based Collaborative Perception

yunjiang xu, Yupeng Ouyang, Lingzhi Li et al.

ICCV 2025 • arXiv:2509.23700
#2345

SPD: Shallow Backdoor Protecting Deep Backdoor Against Backdoor Detection

Shunjie Yuan, Xinghua Li, Xuelin Cao et al.

ICCV 2025
#2346

StableDepth: Scene-Consistent and Scale-Invariant Monocular Depth

Zheng Zhang, Lihe Yang, Tianyu Yang et al.

ICCV 2025 • highlight
#2347

Layer-wise Vision Injection with Disentangled Attention for Efficient LVLMs

Xuange Zhang, Dengjie Li, Bo Liu et al.

ICCV 2025
#2348

Rethinking DPO-style Diffusion Aligning Frameworks

XUN WU, Shaohan Huang, Lingjie Jiang et al.

ICCV 2025 • highlight
#2349

Debiased Curriculum Adaptation for Safe Transfer Learning in Chest X-ray Classification

Mingyang Liu, Xinyang Chen, Yang Shu et al.

ICCV 2025
#2350

Weakly-Supervised Learning of Dense Functional Correspondences

Stefan Stojanov, Linan Zhao, Yunzhi Zhang et al.

ICCV 2025 • arXiv:2509.03893
#2351

End-to-End Entity-Predicate Association Reasoning for Dynamic Scene Graph Generation

LiWei Wang, YanDuo Zhang, Tao Lu et al.

ICCV 2025
#2352

PlaneRAS: Learning Planar Primitives for 3D Plane Recovery

Fang Zhang, Wenzhao Zheng, Linqing Zhao et al.

ICCV 2025
#2353

Unleashing the Temporal Potential of Stereo Event Cameras for Continuous-Time 3D Object Detection

Jae Young Kang, Hoonhee Cho, Kuk-Jin Yoon

ICCV 2025 • arXiv:2508.02288
#2354

Ensemble Foreground Management for Unsupervised Object Discovery

Ziling Wu, Armaghan Moemeni, Praminda Caleb-Solly

ICCV 2025 • highlight • arXiv:2507.20860
#2355

Forensic-MoE: Exploring Comprehensive Synthetic Image Detection Traces with Mixture of Experts

Mingqi Fang, Ziguang Li, Lingyun Yu et al.

ICCV 2025
#2356

AR-VRM: Imitating Human Motions for Visual Robot Manipulation with Analogical Reasoning

Dejie Yang, Zijing Zhao, Yang Liu

ICCV 2025 • arXiv:2508.07626
#2357

Imbalance in Balance: Online Concept Balancing in Generation Models

Yukai Shi, Jiarong Ou, Rui Chen et al.

ICCV 2025 • arXiv:2507.13345
#2358

Entropy-Adaptive Diffusion Policy Optimization with Dynamic Step Alignment

Renye Yan, Jikang Cheng, Yaozhong Gan et al.

ICCV 2025
#2359

A Simple yet Mighty Hartley Diffusion Versatilist for Generalizable Dense Vision Tasks

Qi Bi, Jingjun Yi, Huimin Huang et al.

ICCV 2025
#2360

SpatialTrackerV2: Advancing 3D Point Tracking with Explicit Camera Motion

Yuxi Xiao, Jianyuan Wang, Nan Xue et al.

ICCV 2025
#2361

Focal Plane Visual Feature Generation and Matching on a Pixel Processor Array

Hongyi Zhang, Laurie Bose, Jianing Chen et al.

ICCV 2025
#2362

Leveraging Panoptic Scene Graph for Evaluating Fine-Grained Text-to-Image Generation

Xueqing Deng, Linjie Yang, Qihang Yu et al.

ICCV 2025
#2363

PASG: A Closed-Loop Framework for Automated Geometric Primitive Extraction and Semantic Anchoring in Robotic Manipulation

Zhihao ZHU, Yifan Zheng, Siyu Pan et al.

ICCV 2025 • arXiv:2508.05976
#2364

GloPER: Unsupervised Animal Pattern Extraction from Local Reconstruction

Bowen Chen, Yun Sing Koh, Gillian Dobbie

ICCV 2025
#2365

Epipolar Consistent Attention Aggregation Network for Unsupervised Light Field Disparity Estimation

Chen Gao, Shuo Zhang, Youfang Lin

ICCV 2025
#2366

Physical Degradation Model-Guided Interferometric Hyperspectral Reconstruction with Unfolding Transformer

Yuansheng Li, Yunhao Zou, Linwei Chen et al.

ICCV 2025 • arXiv:2506.21880
#2367

VPR-Cloak: A First Look at Privacy Cloak Against Visual Place Recognition

Shuting Dong, Mingzhi Chen, Feng Lu et al.

ICCV 2025
#2368

Multi-Modal Multi-Task Unified Embedding Model (M3T-UEM): A Task-Adaptive Representation Learning Framework

Rohan Sharma, Changyou Chen, Feng-Ju Chang et al.

ICCV 2025
#2369

Instance-Level Video Depth in Groups Beyond Occlusions

Yuan Liang, Yang Zhou, Ziming Sun et al.

ICCV 2025
#2370

Hierarchical Variational Test-Time Prompt Generation for Zero-Shot Generalization

Zhaoyang Wu, Fang Liu, Licheng Jiao et al.

ICCV 2025
#2371

Prior-aware Dynamic Temporal Modeling Framework for Sequential 3D Hand Pose Estimation

Pengfei Ren, Jingyu Wang, Haifeng Sun et al.

ICCV 2025
#2372

DRaM-LHM: A Quaternion Framework for Iterative Camera Pose Estimation

Chen Lin, Weizhi Du, Zhixiang Min et al.

ICCV 2025
#2373

CO2-Net: A Physics-Informed Spatio-Temporal Model for Global Surface CO2 Reconstruction

Hao Zheng, Yuting Zheng, Hanbo Huang et al.

ICCV 2025
#2374

Scaling 3D Compositional Models for Robust Classification and Pose Estimation

Xiaoding Yuan, Prakhar Kaushik, Guofeng Zhang et al.

ICCV 2025
#2375

HOMO-Feature: Cross-Arbitrary-Modal Image Matching with Homomorphism of Organized Major Orientation

Chenzhong Gao, Wei Li, Desheng Weng

ICCV 2025
#2376

OCSplats: Observation Completeness Quantification and Label Noise Separation in 3DGS

Han Ling, Yinghui Sun, Xian Xu et al.

ICCV 2025 • arXiv:2508.01239
#2377

GSOT3D: Towards Generic 3D Single Object Tracking in the Wild

Yifan Jiao, Yunhao Li, Junhua Ding et al.

ICCV 2025 • arXiv:2412.02129
#2378

OVA-Fields: Weakly Supervised Open-Vocabulary Affordance Fields for Robot Operational Part Detection

Heng Su, Mengying Xie, Nieqing Cao et al.

ICCV 2025
#2379

Semantic Alignment and Reinforcement for Data-Free Quantization of Vision Transformers

Yunshan Zhong, Yuyao Zhou, Yuxin Zhang et al.

ICCV 2025 • arXiv:2412.16553
#2380

GeoDiffusion: A Training-Free Framework for Accurate 3D Geometric Conditioning in Image Generation

Phillip Mueller, Talip Ünlü, Sebastian Schmidt et al.

ICCV 2025 • arXiv:2510.22337
#2381

NavQ: Learning a Q-Model for Foresighted Vision-and-Language Navigation

Peiran Xu, Xicheng Gong, Yadong Mu

ICCV 2025 • arXiv:2510.16457
#2382

Motal: Unsupervised 3D Object Detection by Modality and Task-specific Knowledge Transfer

Hai Wu, Hongwei Lin, Xusheng Guo et al.

ICCV 2025
#2383

Zero-shot Inexact CAD Model Alignment from a Single Image

Pattaramanee Arsomngern, Sasikarn Khwanmuang, Matthias Nießner et al.

ICCV 2025 • arXiv:2507.03292
#2384

MGSfM: Multi-Camera Geometry Driven Global Structure-from-Motion

peilin Tao, Hainan Cui, Diantao Tu et al.

ICCV 2025 • arXiv:2507.03306
#2385

Learning Large Motion Estimation from Intermediate Representations with a High-Resolution Optical Flow Dataset Featuring Long-Range Dynamic Motion

Hoonhee Cho, Yuhwan Jeong, Kuk-Jin Yoon

ICCV 2025 • highlight
#2386

WAVE: Warp-Based View Guidance for Consistent Novel View Synthesis Using a Single Image

Jiwoo Park, Tae Choi, Youngjun Jun et al.

ICCV 2025 • arXiv:2506.23518
#2387

Flow Stochastic Segmentation Networks

Fabio De Sousa Ribeiro, Omar Todd, Charles Jones et al.

ICCV 2025 • arXiv:2507.18838
#2388

MDD: A Dataset for Text-and-Music Conditioned Duet Dance Generation

Prerit Gupta, Jason Alexander Fotso-Puepi, Zhengyuan Li et al.

ICCV 2025 • arXiv:2508.16911
#2389

GeoExplorer: Active Geo-localization with Curiosity-Driven Exploration

Li Mi, Manon Béchaz, Zeming Chen et al.

ICCV 2025 • arXiv:2508.00152
#2390

Is Tracking really more challenging in First Person Egocentric Vision?

Matteo Dunnhofer, Zaira Manigrasso, Christian Micheloni

ICCV 2025 • highlight • arXiv:2507.16015
#2391

Stochastic Interpolants for Revealing Stylistic Flows across the History of Art

Pingchuan Ma, Ming Gui, Johannes Schusterbauer et al.

ICCV 2025
#2392

Real3D: Towards Scaling Large Reconstruction Models with Real Images

Hanwen Jiang, Qixing Huang, Georgios Pavlakos

ICCV 2025
#2393

CaliMatch: Adaptive Calibration for Improving Safe Semi-supervised Learning

Jinsoo Bae, Seoung Bum Kim, Hyungrok Do

ICCV 2025 • arXiv:2508.00922
#2394

SAC-GNC: SAmple Consensus for adaptive Graduated Non-Convexity

Valter Piedade, Chitturi Sidhartha, José Gaspar et al.

ICCV 2025 • highlight
#2395

DiffuMatch: Category-Agnostic Spectral Diffusion Priors for Robust Non-rigid Shape Matching

Emery Pierson, Lei Li, Angela Dai et al.

ICCV 2025 • arXiv:2507.23715
#2396

Invisible Watermarks, Visible Gains: Steering Machine Unlearning with Bi-Level Watermarking Design

Yuhao Sun, Yihua Zhang, Gaowen Liu et al.

ICCV 2025 • arXiv:2508.10065
#2397

Diffusion-Based Extreme High-speed Scenes Reconstruction with the Complementary Vision Sensor

Yapeng Meng, Yihan Lin, Taoyi Wang et al.

ICCV 2025
#2398

Lightweight and Fast Real-time Image Enhancement via Decomposition of the Spatial-aware Lookup Tables

Wontae Kim, Keuntek Lee, Nam Ik Cho

ICCV 2025 • arXiv:2508.16121
#2399

Future-Aware Interaction Network For Motion Forecasting

Shijie Li, Chunyu Liu, Xun Xu et al.

ICCV 2025 • arXiv:2503.06565
#2400

CAT: A Unified Click-and-Track Framework for Realistic Tracking

Yongsheng Yuan, Jie Zhao, Dong Wang et al.

ICCV 2025