Most Cited ICCV "large language models training" Papers

2,701 papers found • Page 10 of 14

Filters:Most Cited ICCV large language models training Clear all

Conference

AAAI 2025 (3,028)COLM 2025 (418)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NEURIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,558)oral (1,594)spotlight (1,421)highlight (975)

#1801

Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation

Yuseung Lee, Jihyeon Je, Chanho Park et al.

ICCV 2025posterarXiv:2504.17207

#1802

GauUpdate: New Object Insertion in 3D Gaussian Fields with Consistent Global Illumination

Chengwei REN, Fan Zhang, Liangchao Xu et al.

ICCV 2025poster

#1803

Any-SSR: How Recursive Least Squares Works in Continual Learning of Large Language Model

Kai Tong, Kang Pan, Xiao Zhang et al.

ICCV 2025poster

#1804

Erasing More Than Intended? How Concept Erasure Degrades the Generation of Non-Target Concepts

Ibtihel Amara, Ahmed Imtiaz Humayun, Ivana Kajic et al.

ICCV 2025posterarXiv:2501.09833

#1805

MDD: A Dataset for Text-and-Music Conditioned Duet Dance Generation

Prerit Gupta, Jason Alexander Fotso-Puepi, Zhengyuan Li et al.

ICCV 2025posterarXiv:2508.16911

#1806

Removing Cost Volumes from Optical Flow Estimators

Simon Kiefhaber, Stefan Roth, Simone Schaub-Meyer

ICCV 2025posterarXiv:2510.13317

#1807

PASG: A Closed-Loop Framework for Automated Geometric Primitive Extraction and Semantic Anchoring in Robotic Manipulation

Zhihao ZHU, Yifan Zheng, Siyu Pan et al.

ICCV 2025posterarXiv:2508.05976

#1808

PanSt3R: Multi-view Consistent Panoptic Segmentation

Lojze Zust, Yohann Cabon, Juliette Marrie et al.

ICCV 2025posterarXiv:2506.21348

#1809

GARF: Learning Generalizable 3D Reassembly for Real-World Fractures

Sihang Li, Zeyu Jiang, Grace Chen et al.

ICCV 2025posterarXiv:2504.05400

#1810

Progressive Distribution Bridging: Unsupervised Adaptation for Large-scale Pre-trained Models via Adaptive Auxiliary Data

Weinan He, Yixin Zhang, Zilei Wang

ICCV 2025poster

#1811

AdaDCP: Learning an Adapter with Discrete Cosine Prior for Clear-to-Adverse Domain Generalization

Qi Bi, Yixian Shen, Jingjun Yi et al.

ICCV 2025poster

#1812

SummDiff: Generative Modeling of Video Summarization with Diffusion

Kwanseok Kim, Jaehoon Hahm, Sumin Kim et al.

ICCV 2025highlightarXiv:2510.08458

#1813

Towards Performance Consistency in Multi-Level Model Collaboration

Qi Li, Runpeng Yu, Xinchao Wang

ICCV 2025poster

#1814

Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens

Dongwon Kim, Ju He, Qihang Yu et al.

ICCV 2025posterarXiv:2501.07730

#1815

VRM: Knowledge Distillation via Virtual Relation Matching

Weijia Zhang, Fei Xie, Weidong Cai et al.

ICCV 2025highlightarXiv:2502.20760

#1816

ImageGem: In-the-wild Generative Image Interaction Dataset for Generative Model Personalization

Yuanhe Guo, Linxi Xie, Zhuoran Chen et al.

ICCV 2025posterarXiv:2510.18433

#1817

One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory

Chenhao Zheng, Jieyu Zhang, Mohammadreza Salehi et al.

ICCV 2025highlightarXiv:2505.23617

#1818

Gaze-Language Alignment for Zero-Shot Prediction of Visual Search Targets from Human Gaze Scanpaths

Sounak Mondal, Naveen Sendhilnathan, Ting Zhang et al.

ICCV 2025poster

#1819

DuET: Dual Incremental Object Detection via Exemplar-Free Task Arithmetic

Munish Monga, Vishal Chudasama, Pankaj Wasnik et al.

ICCV 2025posterarXiv:2506.21260

#1820

FedPall: Prototype-based Adversarial and Collaborative Learning for Federated Learning with Feature Drift

yong zhang, Feng Liang, Guanghu Yuan et al.

ICCV 2025posterarXiv:2507.04781

#1821

Iris: Breaking GUI Complexity with Adaptive Focus and Self-Refining

Zhiqi Ge, Juncheng Li, Xinglei Pang et al.

ICCV 2025posterarXiv:2412.10342

#1822

External Knowledge Injection for CLIP-Based Class-Incremental Learning

Da-Wei Zhou, Kai-Wen Li, Jingyi Ning et al.

ICCV 2025posterarXiv:2503.08510

#1823

Continual Adaptation: Environment-Conditional Parameter Generation for Object Detection in Dynamic Scenarios

Deng Li, Aming WU, Yang Li et al.

ICCV 2025posterarXiv:2506.24063

#1824

Hyper-Depth: Hypergraph-based Multi-Scale Representation Fusion for Monocular Depth Estimation

Lin Bie, Siqi Li, Yifan Feng et al.

ICCV 2025poster

#1825

Instruction-Grounded Visual Projectors for Continual Learning of Generative Vision-Language Models

Hyundong Jin, Hyung Jin Chang, Eunwoo Kim

ICCV 2025posterarXiv:2508.00260

#1826

PRO-VPT: Distribution-Adaptive Visual Prompt Tuning via Prompt Relocation

Chikai Shang, Mengke Li, Yiqun Zhang et al.

ICCV 2025posterarXiv:2503.06901

#1827

Generalized Deep Multi-view Clustering via Causal Learning with Partially Aligned Cross-view Correspondence

Xihong Yang, Siwei Wang, Jiaqi Jin et al.

ICCV 2025posterarXiv:2509.16022

#1828

Less is More: Empowering GUI Agent with Context-Aware Simplification

Gongwei Chen, Xurui Zhou, Rui Shao et al.

ICCV 2025highlightarXiv:2507.03730

#1829

EventUPS: Uncalibrated Photometric Stereo Using an Event Camera

Jinxiu Liang, Bohan Yu, Siqi Yang et al.

ICCV 2025highlight

#1830

When Lighting Deceives: Exposing Vision-Language Models' Illumination Vulnerability Through Illumination Transformation Attack

Hanqing Liu, Shouwei Ruan, Yao Huang et al.

ICCV 2025posterarXiv:2503.06903

#1831

SemTalk: Holistic Co-speech Motion Generation with Frame-level Semantic Emphasis

Xiangyue Zhang, Jianfang Li, Jiaxu Zhang et al.

ICCV 2025posterarXiv:2412.16563

#1832

Guiding Diffusion Models with Adaptive Negative Sampling Without External Resources

Alakh Desai, Nuno Vasconcelos

ICCV 2025poster

#1833

DiffDoctor: Diagnosing Image Diffusion Models Before Treating

Yiyang Wang, Xi Chen, Xiaogang Xu et al.

ICCV 2025posterarXiv:2501.12382

#1834

Training-Free Class Purification for Open-Vocabulary Semantic Segmentation

Qi Chen, Lingxiao Yang, Yun Chen et al.

ICCV 2025posterarXiv:2508.00557

#1835

Keep Your Friends Close, and Your Enemies Farther: Distance-aware Voxel-wise Contrastive Learning for Semi-supervised Multi-organ Segmentation

Haochen Zhao, Jianwei Niu, Xuefeng Liu et al.

ICCV 2025poster

#1836

Learning to Inference Adaptively for Multimodal Large Language Models

Zhuoyan Xu, Khoi Nguyen, Preeti Mukherjee et al.

ICCV 2025posterarXiv:2503.10905

#1837

Hierarchical Divide-and-Conquer Grouping for Classification Adaptation of Pre-Trained Models

Ziqian Lu, Yunlong Yu, Qinyue Tong et al.

ICCV 2025poster

#1838

Lark: Low-Rank Updates After Knowledge Localization for Few-shot Class-Incremental Learning

Jinxin Shi, Jiabao Zhao, Yifan Yang et al.

ICCV 2025poster

#1839

Large Multi-modal Models Can Interpret Features in Large Multi-modal Models

Kaichen Zhang, Yifei Shen, Bo Li et al.

ICCV 2025posterarXiv:2411.14982

#1840

p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay

Jun Zhang, Desen Meng, Zhengming Zhang et al.

ICCV 2025posterarXiv:2412.04449

#1841

ZIUM: Zero-Shot Intent-Aware Adversarial Attack on Unlearned Models

Hyun Jun Yook, Ga San Jhun, Cho Hyun et al.

ICCV 2025posterarXiv:2507.21985

#1842

Noise-Modeled Diffusion Models for Low-Light Spike Image Restoration

Ruonan Liu, Lin Zhu, Xijie Xiang et al.

ICCV 2025highlight

#1843

Prototype-based Contrastive Learning with Stage-wise Progressive Augmentation for Self-Supervised Fine-Grained Learning

BaoFeng Tan, Xiu-Shen Wei, Lin Zhao

ICCV 2025poster

#1844

LMM-Det: Make Large Multimodal Models Excel in Object Detection

Jincheng Li, Chunyu Xie, Ji Ao et al.

ICCV 2025posterarXiv:2507.18300

#1845

ReTracker: Exploring Image Matching for Robust Online Any Point Tracking

Dongli Tan, Xingyi He, Sida Peng et al.

ICCV 2025highlight

#1846

FedWSQ: Efficient Federated Learning with Weight Standardization and Distribution-Aware Non-Uniform Quantization

Seung-Wook Kim, Seongyeol Kim, Jiah Kim et al.

ICCV 2025posterarXiv:2506.23516

#1847

CMAD: Correlation-Aware and Modalities-Aware Distillation for Multimodal Sentiment Analysis with Missing Modalities

Yan Zhuang, Minhao Liu, Wei Bai et al.

ICCV 2025poster

#1848

Revelio: Interpreting and leveraging semantic information in diffusion models

Dahye Kim, Xavier Thomas, Deepti Ghadiyaram

ICCV 2025posterarXiv:2411.16725

#1849

CLIP-GS: Unifying Vision-Language Representation with 3D Gaussian Splatting

Siyu Jiao, Haoye Dong, Yuyang Yin et al.

ICCV 2025posterarXiv:2412.19142

#1850

SplatTalk: 3D VQA with Gaussian Splatting

Anh Thai, Kyle Genova, Songyou Peng et al.

ICCV 2025posterarXiv:2503.06271

#1851

Improved Noise Schedule for Diffusion Training

Tiankai Hang, Shuyang Gu, Jianmin Bao et al.

ICCV 2025posterarXiv:2407.03297

#1852

Test-Time Prompt Tuning for Zero-Shot Depth Completion

Chanhwi Jeong, Inhwan Bae, Jin-Hwi Park et al.

ICCV 2025highlight

#1853

One Encoder to Rule them All: Representation Learning for Model-free Visual Reinforcement Learning using Fourier Neural Operators

Parag Dutta, Mohd Ayyoob, Shalabh Bhatnagar et al.

ICCV 2025poster

#1854

TITAN: Query-Token based Domain Adaptive Adversarial Learning

Tajamul Ashraf, Janibul Bashir

ICCV 2025posterarXiv:2506.21484

#1855

StolenLoRA: Exploring LoRA Extraction Attacks via Synthetic Data

Yixu Wang, Yan Teng, Yingchun Wang et al.

ICCV 2025highlightarXiv:2509.23594

#1856

LIFT: Latent Implicit Functions for Task- and Data-Agnostic Encoding

Amirhossein Kazerouni, Soroush Mehraban, Michael Brudno et al.

ICCV 2025posterarXiv:2503.15420

#1857

MOBIUS: Big-to-Mobile Universal Instance Segmentation via Multi-modal Bottleneck Fusion and Calibrated Decoder Pruning

Mattia Segu, Marta Tintore Gazulla, Yongqin Xian et al.

ICCV 2025posterarXiv:2510.15026

#1858

Moderating the Generalization of Score-based Generative Model

Wan Jiang, He Wang, Xin Zhang et al.

ICCV 2025posterarXiv:2412.07229

#1859

LLM-assisted Entropy-based Adaptive Distillation for Unsupervised Fine-grained Visual Representation Learning

Jianfeng Dong, Danfeng Luo, Daizong Liu et al.

ICCV 2025poster

#1860

Boundary Probing for Input Privacy Protection When Using LMM Services

Xiaofei Hui, Haoxuan Qu, Ping Hu et al.

ICCV 2025poster

#1861

Intrepretable Zero-Shot Learning with Locally-Aligned Vision-Language Model

Shiming Chen, Bowen Duan, Salman Khan et al.

ICCV 2025poster

#1862

UPRE: Zero-Shot Domain Adaptation for Object Detection via Unified Prompt and Representation Enhancement

Xiao Zhang, Fei Wei, Yong Wang et al.

ICCV 2025posterarXiv:2507.00721

#1863

Dataset Distillation as Data Compression: A Rate-Utility Perspective

Youneng Bao, Yiping Liu, Zhuo Chen et al.

ICCV 2025posterarXiv:2507.17221

#1864

Open-set Cross Modal Generalization via Multimodal Unified Representation

Hai Huang, Yan Xia, Shulei Wang et al.

ICCV 2025posterarXiv:2507.14935

#1865

Adversarial Data Augmentation for Single Domain Generalization via Lyapunov Exponent-Guided Optimization

ZUYU ZHANG, Ning Chen, Yongshan Liu et al.

ICCV 2025posterarXiv:2507.04302

#1866

A Unified Framework to BRIDGE Complete and Incomplete Deep Multi-View Clustering under Non-IID Missing Patterns

Xiaorui Jiang, Buyun He, Peng Yuan Zhou et al.

ICCV 2025poster

#1867

Active Membership Inference Test (aMINT): Enhancing Model Auditability with Multi-Task Learning.

Daniel DeAlcala, Aythami Morales, Julian Fierrez et al.

ICCV 2025posterarXiv:2509.07879

#1868

One-Shot Knowledge Transfer for Scalable Person Re-Identification

Longhua Li, Lei Qi, Xin Geng

ICCV 2025posterarXiv:2511.06016

#1869

EA-Vit: Efficient Adaptation for Elastic Vision Transformer

Chen Zhu, Wangbo Zhao, Huiwen Zhang et al.

ICCV 2025posterarXiv:2507.19360

#1870

Feature Coding in the Era of Large Models: Dataset, Test Conditions, and Benchmark

Changsheng Gao, Yifan Ma, Qiaoxi Chen et al.

ICCV 2025posterarXiv:2412.04307

#1871

MM-IFEngine: Towards Multimodal Instruction Following

Shengyuan Ding, Wu Shenxi, Xiangyu Zhao et al.

ICCV 2025posterarXiv:2504.07957

#1872

Dataset Distillation via the Wasserstein Metric

Haoyang Liu, Peiran Wang, Yijiang Li et al.

ICCV 2025posterarXiv:2311.18531

#1873

AdaDrive: Self-Adaptive Slow-Fast System for Language-Grounded Autonomous Driving

Ruifei Zhang, Junlin Xie, Wei Zhang et al.

ICCV 2025posterarXiv:2511.06253

#1874

Depth Any Event Stream: Enhancing Event-based Monocular Depth Estimation via Dense-to-Sparse Distillation

Jinjing Zhu, Tianbo Pan, Zidong Cao et al.

ICCV 2025poster

#1875

Consistent Time-of-Flight Depth Denoising via Graph-Informed Geometric Attention

Weida Wang, Changyong He, Jin Zeng et al.

ICCV 2025posterarXiv:2506.23542

#1876

MPBR: Multimodal Progressive Bidirectional Reasoning for Open-Set Fine-Grained Recognition

Junfu Tan, Peiguang Jing, Yu Zhu et al.

ICCV 2025poster

#1877

MAVias: Mitigate any Visual Bias

Ioannis Sarridis, Christos Koutlis, Symeon Papadopoulos et al.

ICCV 2025posterarXiv:2412.06632

#1878

AnnofreeOD: Detecting All Classes at Low Frame Rates Without Human Annotations

Boyi Sun, Yuhang Liu, Houxin He et al.

ICCV 2025poster

#1879

Controlling Multimodal LLMs via Reward-guided Decoding

Oscar Mañas, Pierluca D'Oro, Koustuv Sinha et al.

ICCV 2025posterarXiv:2508.11616

#1880

Class-Wise Federated Averaging for Efficient Personalization

Gyuejeong Lee, Daeyoung Choi

ICCV 2025posterarXiv:2406.07800

#1881

Towards Privacy-preserved Pre-training of Remote Sensing Foundation Models with Federated Mutual-guidance Learning

Jieyi Tan, Chengwei Zhang, Bo Dang et al.

ICCV 2025posterarXiv:2503.11051

#1882

Multi-view Gaze Target Estimation

Qiaomu Miao, Vivek Golani, Jingyi Xu et al.

ICCV 2025posterarXiv:2508.05857

#1883

Instruction-Oriented Preference Alignment for Enhancing Multi-Modal Comprehension Capability of MLLMs

Zitian Wang, Yue Liao, RONG KANG et al.

ICCV 2025posterarXiv:2503.20309

#1884

Large Learning Rates Simultaneously Achieve Robustness to Spurious Correlations and Compressibility

Melih Barsbey, Lucas Prieto, Stefanos Zafeiriou et al.

ICCV 2025posterarXiv:2507.17748

#1885

Dynamic Multi-Layer Null Space Projection for Vision-Language Continual Learning

Borui Kang, Lei Wang, Zhiping Wu et al.

ICCV 2025poster

#1886

Prototype Guided Backdoor Defense via Activation Space Manipulation

Venkat Adithya Amula, Sunayana Samavedam, Saurabh Saini et al.

ICCV 2025poster

#1887

RIPE: Reinforcement Learning on Unlabeled Image Pairs for Robust Keypoint Extraction

Johannes Künzel, Anna Hilsmann, Peter Eisert

ICCV 2025posterarXiv:2507.04839

#1888

Analyzing Finetuning Representation Shift for Multimodal LLMs Steering

Pegah KHAYATAN, Mustafa Shukor, Jayneel Parekh et al.

ICCV 2025posterarXiv:2501.03012

#1889

AVAM: a Universal Training-free Adaptive Visual Anchoring Embedded into Multimodal Large Language Model for Multi-image Question Answering

Kang Zeng, Guojin Zhong, Jintao Cheng et al.

ICCV 2025poster

#1890

Staining and Locking Computer Vision Models Without Retraining

Oliver Sutton, Qinghua Zhou, George Leete et al.

ICCV 2025posterarXiv:2507.22000

#1891

Flexi-FSCIL: Adaptive Knowledge Retention for Breaking the Stability-Plasticity Dilemma in Few-Shot Class-Incremental Learning

Wufei Xie, Yalin Wang, Chenliang Liu et al.

ICCV 2025poster

#1892

Multispectral Demosaicing via Dual Cameras

SaiKiran Tedla, Junyong Lee, Beixuan Yang et al.

ICCV 2025highlightarXiv:2503.22026

#1893

POMATO: Marrying Pointmap Matching with Temporal Motions for Dynamic 3D Reconstruction

Songyan Zhang, Yongtao Ge, Jinyuan Tian et al.

ICCV 2025posterarXiv:2504.05692

#1894

Stochastic Interpolants for Revealing Stylistic Flows across the History of Art

Pingchuan Ma, Ming Gui, Johannes Schusterbauer et al.

ICCV 2025poster

#1895

Is Tracking really more challenging in First Person Egocentric Vision?

Matteo Dunnhofer, Zaira Manigrasso, Christian Micheloni

ICCV 2025highlightarXiv:2507.16015

#1896

GeoExplorer: Active Geo-localization with Curiosity-Driven Exploration

Li Mi, Manon Béchaz, Zeming Chen et al.

ICCV 2025posterarXiv:2508.00152

#1897

Learning Large Motion Estimation from Intermediate Representations with a High-Resolution Optical Flow Dataset Featuring Long-Range Dynamic Motion

Hoonhee Cho, Yuhwan Jeong, Kuk-Jin Yoon

ICCV 2025highlight

#1898

MGSfM: Multi-Camera Geometry Driven Global Structure-from-Motion

peilin Tao, Hainan Cui, Diantao Tu et al.

ICCV 2025posterarXiv:2507.03306

#1899

Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension

Xiyao Wang, Zhengyuan Yang, Linjie Li et al.

ICCV 2025posterarXiv:2412.03704

#1900

Manual-PA: Learning 3D Part Assembly from Instruction Diagrams

Jiahao Zhang, Anoop Cherian, Cristian Rodriguez-Opazo et al.

ICCV 2025posterarXiv:2411.18011

#1901

Arti-PG: A Toolbox for Procedurally Synthesizing Large-Scale and Diverse Articulated Objects with Rich Annotations

Jianhua Sun, Yuxuan Li, Jiude Wei et al.

ICCV 2025posterarXiv:2412.14974

#1902

DRaM-LHM: A Quaternion Framework for Iterative Camera Pose Estimation

Chen Lin, Weizhi Du, Zhixiang Min et al.

ICCV 2025poster

#1903

Prior-aware Dynamic Temporal Modeling Framework for Sequential 3D Hand Pose Estimation

Pengfei Ren, Jingyu Wang, Haifeng Sun et al.

ICCV 2025poster

#1904

Epipolar Consistent Attention Aggregation Network for Unsupervised Light Field Disparity Estimation

Chen Gao, Shuo Zhang, Youfang Lin

ICCV 2025poster

#1905

SpatialTrackerV2: Advancing 3D Point Tracking with Explicit Camera Motion

Yuxi Xiao, Jianyuan Wang, Nan Xue et al.

ICCV 2025poster

#1906

A Simple yet Mighty Hartley Diffusion Versatilist for Generalizable Dense Vision Tasks

Qi Bi, Jingjun Yi, Huimin Huang et al.

ICCV 2025poster

#1907

IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation

Wenxuan Guo, Xiuwei Xu, Hang Yin et al.

ICCV 2025posterarXiv:2508.00823

#1908

Exploring the Adversarial Vulnerabilities of Vision-Language-Action Models in Robotics

Taowen Wang, Cheng Han, James Liang et al.

ICCV 2025posterarXiv:2411.13587

#1909

Simultaneous Motion And Noise Estimation with Event Cameras

Shintaro Shiba, Yoshimitsu Aoki, Guillermo Gallego

ICCV 2025posterarXiv:2504.04029

#1910

Weakly-Supervised Learning of Dense Functional Correspondences

Stefan Stojanov, Linan Zhao, Yunzhi Zhang et al.

ICCV 2025posterarXiv:2509.03893

#1911

GaussianVideo: Efficient Video Representation via Hierarchical Gaussian Splatting

Andrew Bond, Jui-Hsien Wang, Long Mai et al.

ICCV 2025posterarXiv:2501.04782

#1912

CAD-Assistant: Tool-Augmented VLLMs as Generic CAD Task Solvers

Dimitrios Mallis, Ahmet Karadeniz, Sebastian Cavada et al.

ICCV 2025posterarXiv:2412.13810

#1913

Exploring View Consistency for Scene-Adaptive Low-Light Light Field Image Enhancement

Shuo Zhang, Chen Gao, Youfang Lin

ICCV 2025highlight

#1914

VOccl3D: A Video Benchmark Dataset for 3D Human Pose and Shape Estimation under real Occlusions

Yash Garg, Saketh Bachu, Arindam Dutta et al.

ICCV 2025posterarXiv:2508.06757

#1915

Tracking Tiny Drones against Clutter: Large-Scale Infrared Benchmark with Motion-Centric Adaptive Algorithm

Jiahao Zhang, Zongli Jiang, Gang Wang et al.

ICCV 2025poster

#1916

AutoComPose: Automatic Generation of Pose Transition Descriptions for Composed Pose Retrieval Using Multimodal LLMs

Yi-Ting Shen, Sungmin Eum, Doheon Lee et al.

ICCV 2025posterarXiv:2503.22884

#1917

Frequency Domain-Based Diffusion Model for Unpaired Image Dehazing

Chengxu Liu, Lu Qi, Jinshan Pan et al.

ICCV 2025posterarXiv:2507.01275

#1918

H3R: Hybrid Multi-view Correspondence for Generalizable 3D Reconstruction

Heng Jia, Na Zhao, Linchao Zhu

ICCV 2025posterarXiv:2508.03118

#1919

Find Any Part in 3D

Ziqi Ma, Yisong Yue, Georgia Gkioxari

ICCV 2025highlightarXiv:2411.13550

#1920

Global Motion Corresponder for 3D Point-Based Scene Interpolation under Large Motion

Junru Lin, Chirag Vashist, Mikaela Uy et al.

ICCV 2025posterarXiv:2508.20136

#1921

SpikeDiff: Zero-shot High-Quality Video Reconstruction from Chromatic Spike Camera and Sub-millisecond Spike Streams

Siqi Yang, Jinxiu Liang, Zhaojun Huang et al.

ICCV 2025poster

#1922

EquiCaps: Predictor-Free Pose-Aware Pre-Trained Capsule Networks

Athinoulla Konstantinou, Georgios Leontidis, Mamatha Thota et al.

ICCV 2025posterarXiv:2506.09895

#1923

Unsupervised Joint Learning of Optical Flow and Intensity with Event Cameras

Shuang Guo, Friedhelm Hamann, Guillermo Gallego

ICCV 2025highlightarXiv:2503.17262

#1924

6DOPE-GS: Online 6D Object Pose Estimation using Gaussian Splatting

Yufeng Jin, Vignesh Prasad, Snehal Jauhri et al.

ICCV 2025posterarXiv:2412.01543

#1925

Background Invariance Testing According to Semantic Proximity

Zukang Liao, Min Chen

ICCV 2025posterarXiv:2208.09286

#1926

RegGS: Unposed Sparse Views Gaussian Splatting with 3DGS Registration

Chong Cheng, Yu Hu, Sicheng Yu et al.

ICCV 2025posterarXiv:2507.08136

#1927

CObL: Toward Zero-Shot Ordinal Layering without User Prompting

Aneel Damaraju, Dean Hazineh, Todd Zickler

ICCV 2025highlightarXiv:2508.08498

#1928

Hierarchical Material Recognition from Local Appearance

Matthew Beveridge, Shree Nayar

ICCV 2025highlightarXiv:2505.22911

#1929

TopicGeo: An Efficient Unified Framework for Geolocation

Xin Wang, Xinlin Wang, Shuiping Gou

ICCV 2025poster

#1930

Partially Matching Submap Helps: Uncetainty Modeling and Propagation for Text to Point Cloud Localization

Mingtao Feng, Longlong Mei, Zijie Wu et al.

ICCV 2025poster

#1931

Beyond Pixel Uncertainty: Bounding the OoD Objects in Road Scenes

Huachao Zhu, Zelong Liu, Zhichao Sun et al.

ICCV 2025poster

#1932

AGO: Adaptive Grounding for Open World 3D Occupancy Prediction

Peizheng Li, Shuxiao Ding, You Zhou et al.

ICCV 2025posterarXiv:2504.10117

#1933

Environment-Agnostic Pose: Generating Environment-independent Object Representations for 6D Pose Estimation

Shaobo Zhang, Yuhang Huang, Wanqing Zhao et al.

ICCV 2025poster

#1934

Online Dense Point Tracking with Streaming Memory

Qiaole Dong, Yanwei Fu

ICCV 2025posterarXiv:2503.06471

#1935

Test-Time Retrieval-Augmented Adaptation for Vision-Language Models

Xinqi Fan, Xueli CHEN, Luoxiao Yang et al.

ICCV 2025poster

#1936

Self-Supervised Monocular 4D Scene Reconstruction for Egocentric Videos

Chengbo Yuan, Geng Chen, Li Yi et al.

ICCV 2025posterarXiv:2411.09145

#1937

MixRI: Mixing Features of Reference Images for Novel Object Pose Estimation

Xinhang Liu, Jiawei Shi, Zheng Dang et al.

ICCV 2025posterarXiv:2601.06883

#1938

ReassembleNet: Learnable Keypoints and Diffusion for 2D Fresco Reconstruction

ADEELA ISLAM, Stefano Fiorini, Stuart James et al.

ICCV 2025posterarXiv:2505.21117

#1939

Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images

Philipp Wulff, Felix Wimbauer, Dominik Muhle et al.

ICCV 2025posterarXiv:2508.02323

#1940

LocalDyGS: Multi-view Global Dynamic Scene Modeling via Adaptive Local Implicit Feature Decoupling

Jiahao Wu, Rui Peng, Jianbo Jiao et al.

ICCV 2025posterarXiv:2507.02363

#1941

Combinative Matching for Geometric Shape Assembly

Nahyuk Lee, Juhong Min, Junhong Lee et al.

ICCV 2025highlightarXiv:2508.09780

#1942

TAPNext: Tracking Any Point (TAP) as Next Token Prediction

Artem Zholus, Carl Doersch, Yi Yang et al.

ICCV 2025posterarXiv:2504.05579

#1943

Unified Category-Level Object Detection and Pose Estimation from RGB Images using 3D Prototypes

Tom Fischer, Xiaojie Zhang, Eddy Ilg

ICCV 2025posterarXiv:2508.02157

#1944

A Hyperdimensional One Place Signature to Represent Them All: Stackable Descriptors For Visual Place Recognition

Connor Malone, Somayeh Hussaini, Tobias Fischer et al.

ICCV 2025posterarXiv:2412.06153

#1945

Error Recognition in Procedural Videos using Generalized Task Graph

Shih-Po Lee, Ehsan Elhamifar

ICCV 2025poster

#1946

FaceShield: Defending Facial Image against Deepfake Threats

Jaehwan Jeong, Sumin In, Sieun Kim et al.

ICCV 2025posterarXiv:2412.09921

#1947

Task-Oriented Human Grasp Synthesis via Context- and Task-Aware Diffusers

An Lun Liu, Yu-Wei Chao, Yi-Ting Chen

ICCV 2025posterarXiv:2507.11287

#1948

Ouroboros: Single-step Diffusion Models for Cycle-consistent Forward and Inverse Rendering

shanlin sun, Yifan Wang, Hanwen Zhang et al.

ICCV 2025posterarXiv:2508.14461

#1949

Im2Haircut: Single-view Strand-based Hair Reconstruction for Human Avatars

Vanessa Sklyarova, Egor Zakharov, Malte Prinzler et al.

ICCV 2025posterarXiv:2509.01469

#1950

TeRA: Rethinking Text-guided Realistic 3D Avatar Generation

Yanwen Wang, Yiyu Zhuang, Jiawei Zhang et al.

ICCV 2025posterarXiv:2509.02466

#1951

Open-World Skill Discovery from Unsegmented Demonstration Videos

Jingwen Deng, Zihao Wang, Shaofei Cai et al.

ICCV 2025poster

#1952

E-NeMF: Event-based Neural Motion Field for Novel Space-time View Synthesis of Dynamic Scenes

Yan Liu, Zehao Chen, Haojie Yan et al.

ICCV 2025poster

#1953

MonSTeR: a Unified Model for Motion, Scene, Text Retrieval

Luca Collorone, Matteo Gioia, Massimiliano Pappa et al.

ICCV 2025posterarXiv:2510.03200

#1954

VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks

shiduo zhang, Zhe Xu, Peiju Liu et al.

ICCV 2025posterarXiv:2412.18194

#1955

TrackVerse: A Large-Scale Object-Centric Video Dataset for Image-Level Representation Learning

Yibing Wei, Samuel Church, Victor Suciu et al.

ICCV 2025poster

#1956

Robust Test-Time Adaptation for Single Image Denoising Using Deep Gaussian Prior

Qing Ma, Pengwei Liang, Xiong Zhou et al.

ICCV 2025poster

#1957

Augmented Mass-Spring Model for Real-Time Dense Hair Simulation

Jorge Herrera, Yi Zhou, Xin Sun et al.

ICCV 2025posterarXiv:2412.17144

#1958

Punching Bag vs. Punching Person: Motion Transferability in Videos

Raiyaan Abdullah, Jared Claypoole, Michael Cogswell et al.

ICCV 2025posterarXiv:2508.00085

#1959

Laboring on less labors: RPCA Paradigm for Pan-sharpening

honghui xu, Chuangjie Fang, Yibin Wang et al.

ICCV 2025poster

#1960

WarpHE4D: Dense 4D Head Map toward Full Head Reconstruction

Jongseob Yun, Yong-Hoon Kwon, Min-Gyu Park et al.

ICCV 2025poster

#1961

MBTI: Masked Blending Transformers with Implicit Positional Encoding for Frame-rate Agnostic Motion Estimation

Jungwoo Huh, Yeseung Park, Seongjean Kim et al.

ICCV 2025poster

#1962

GENMO: A GENeralist Model for Human MOtion

Jiefeng Li, Jinkun Cao, Haotian Zhang et al.

ICCV 2025highlightarXiv:2505.01425

#1963

Learning Efficient and Generalizable Human Representation with Human Gaussian Model

Yifan Liu, Shengjun Zhang, Chensheng Dai et al.

ICCV 2025posterarXiv:2507.18758

#1964

Switch-a-View: View Selection Learned from Unlabeled In-the-wild Videos

Sagnik Majumder, Tushar Nagarajan, Ziad Al-Halah et al.

ICCV 2025posterarXiv:2412.18386

#1965

Avat3r: Large Animatable Gaussian Reconstruction Model for High-fidelity 3D Head Avatars

Tobias Kirschstein, Javier Romero, Artem Sevastopolsky et al.

ICCV 2025posterarXiv:2502.20220

#1966

TimeBooth: Disentangled Facial Invariant Representation for Diverse and Personalized Face Aging

Zepeng Su, zhulin liu, Zongyan Zhang et al.

ICCV 2025poster

#1967

GDKVM: Echocardiography Video Segmentation via Spatiotemporal Key-Value Memory with Gated Delta Rule

Rui Wang, Yimu Sun, Jingxing Guo et al.

ICCV 2025posterarXiv:2512.10252

#1968

Scaling Action Detection: AdaTAD++ with Transformer-Enhanced Temporal-Spatial Adaptation

Tanay Agrawal, Abid Ali, Antitza Dantcheva et al.

ICCV 2025poster

#1969

FlowDPS : Flow-Driven Posterior Sampling for Inverse Problems

Jeongsol Kim, Bryan Sangwoo Kim, Jong Ye

ICCV 2025poster

#1970

ZFusion: Efficient Deep Compositional Zero-shot Learning for Blind Image Super-Resolution with Generative Diffusion Prior

Alireza Esmaeilzehi, Hossein Zaredar, Yapeng Tian et al.

ICCV 2025poster

#1971

Learning A Unified Template for Gait Recognition

Panjian Huang, Saihui Hou, Junzhou Huang et al.

ICCV 2025poster

#1972

GestureHYDRA: Semantic Co-speech Gesture Synthesis via Hybrid Modality Diffusion Transformer and Cascaded-Synchronized Retrieval-Augmented Generation

Quanwei Yang, Luying Huang, Kaisiyuan Wang et al.

ICCV 2025posterarXiv:2507.22731

#1973

Latent-Reframe: Enabling Camera Control for Video Diffusion Models without Training

Zhenghong Zhou, Jie An, Jiebo Luo

ICCV 2025posterarXiv:2412.06029

#1974

MorphoGen: Efficient Unconditional Generation of Long-Range Projection Neuronal Morphology via a Global-to-Local Framework

Tianfang Zhu, Hongyang Zhou, Anan LI

ICCV 2025poster

#1975

GaussianSpeech: Audio-Driven Personalized 3D Gaussian Avatars

Shivangi Aneja, Artem Sevastopolsky, Tobias Kirschstein et al.

ICCV 2025poster

#1976

A Quality-Guided Mixture of Score-Fusion Experts Framework for Human Recognition

Jie Zhu, Yiyang Su, Minchul Kim et al.

ICCV 2025posterarXiv:2508.00053

#1977

Capturing head avatar with hand contacts from a monocular video

Haonan He, Yufeng Zheng, Jie Song

ICCV 2025posterarXiv:2510.17181

#1978

CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction

Zhefei Gong, Pengxiang Ding, Shangke Lyu et al.

ICCV 2025posterarXiv:2412.06782

#1979

AdaHuman: Animatable Detailed 3D Human Generation with Compositional Multiview Diffusion

Yangyi Huang, Ye Yuan, Xueting Li et al.

ICCV 2025posterarXiv:2505.24877

#1980

Controllable Weather Synthesis and Removal with Video Diffusion Models

Chih-Hao Lin, Zian Wang, Ruofan Liang et al.

ICCV 2025posterarXiv:2505.00704

#1981

Unfolding-Associative Encoder-Decoder Network with Progressive Alignment for Pansharpening

Shijie Fang, Hongping Gan

ICCV 2025poster

#1982

MOERL: When Mixture-of-Experts Meet Reinforcement Learning for Adverse Weather Image Restoration

Tao Wang, Peiwen Xia, Bo Li et al.

ICCV 2025poster

#1983

DuoCLR: Dual-Surrogate Contrastive Learning for Skeleton-based Human Action Segmentation

Haitao Tian

ICCV 2025posterarXiv:2509.05543

#1984

EVDM: Event-based Real-world Video Deblurring with Mamba

Zhijing Sun, Senyan Xu, Kean Liu et al.

ICCV 2025poster

#1985

Q-Norm: Robust Representation Learning via Quality-Adaptive Normalization

Lanning Zhang, Ying Zhou, Fei Gao et al.

ICCV 2025poster

#1986

Proxy-Bridged Game Transformer for Interactive Extreme Motion Prediction

Yanwen Fang, Wenqi Jia, Xu Cao et al.

ICCV 2025poster

#1987

Metric Convolutions: A Unifying Theory to Adaptive Image Convolutions

Thomas Dagès, Michael Lindenbaum, Alfred Bruckstein

ICCV 2025posterarXiv:2406.05400

#1988

RobAVA: A Large-scale Dataset and Baseline Towards Video based Robotic Arm Action Understanding

Baoli Sun, Ning Wang, Xinzhu Ma et al.

ICCV 2025poster

#1989

IDFace: Face Template Protection for Efficient and Secure Identification

Sunpill Kim, Seunghun Paik, Chanwoo Hwang et al.

ICCV 2025posterarXiv:2507.12050

#1990

On-Device Diffusion Transformer Policy for Efficient Robot Manipulation

Yiming Wu, Huan Wang, Zhenghao Chen et al.

ICCV 2025posterarXiv:2508.00697

#1991

Generic Event Boundary Detection via Denoising Diffusion

Jaejun Hwang, Dayoung Gong, Manjin Kim et al.

ICCV 2025posterarXiv:2508.12084

#1992

Not All Degradations Are Equal: A Targeted Feature Denoising Framework for Generalizable Image Super-Resolution

hongjun wang, Jiyuan Chen, Zhengwei Yin et al.

ICCV 2025posterarXiv:2509.14841

#1993

Fine-Grained 3D Gaussian Head Avatars Modeling from Static Captures via Joint Reconstruction and Registration

Yuan Sun, Xuan Wang, Cong Wang et al.

ICCV 2025poster

#1994

SEREP: Semantic Facial Expression Representation for Robust In-the-Wild Capture and Retargeting

Arthur Josi, Luiz Gustavo Hafemann, Abdallah Dib et al.

ICCV 2025posterarXiv:2412.14371

#1995

Morph: A Motion-free Physics Optimization Framework for Human Motion Generation

Zhuo Li, Mingshuang Luo, RuiBing Hou et al.

ICCV 2025posterarXiv:2411.14951

#1996

DeSPITE: Exploring Contrastive Deep Skeleton-Pointcloud-IMU-Text Embeddings for Advanced Point Cloud Human Activity Understanding

Thomas Kreutz, Max Mühlhäuser, Alejandro Sanchez Guinea

ICCV 2025posterarXiv:2506.13897

#1997

Efficient Concertormer for Image Deblurring and Beyond

Pin-Hung Kuo, Jinshan Pan, Shao-Yi Chien et al.

ICCV 2025posterarXiv:2404.06135

#1998

TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation

Wenhao Wang, Yi Yang

ICCV 2025posterarXiv:2411.04709

#1999

SCFlow: Implicitly Learning Style and Content Disentanglement with Flow Models

Pingchuan Ma, Xiaopei Yang, Ming Gui et al.

ICCV 2025posterarXiv:2508.03402

#2000

Dual-Expert Consistency Model for Efficient and High-Quality Video Generation

Zhengyao Lyu, Chenyang Si, Tianlin Pan et al.

ICCV 2025poster

← Previous

1...8 9 10 11 12...14