Most Cited ICCV "skeletal coordinates" Papers

2,701 papers found • Page 5 of 14

#801

DALIP: Distribution Alignment-based Language-Image Pre-Training for Domain-Specific Data

Junjie Wu, Jiangtao Xie, Zhaolin Zhang et al.

ICCV 2025posterarXiv:2504.01386
2
citations
#802

Self-Reinforcing Prototype Evolution with Dual-Knowledge Cooperation for Semi-Supervised Lifelong Person Re-Identification

Kunlun Xu, Fan Zhuo, Jiangmeng Li et al.

ICCV 2025posterarXiv:2507.01884
2
citations
#803

What's in a Latent? Leveraging Diffusion Latent Space for Domain Generalization

Xavier Thomas, Deepti Ghadiyaram

ICCV 2025posterarXiv:2503.06698
2
citations
#804

RePoseD: Efficient Relative Pose Estimation With Known Depth Information

Yaqing Ding, Viktor Kocur, VACLAV VAVRA et al.

ICCV 2025posterarXiv:2501.07742
2
citations
#805

Frequency-Guided Posterior Sampling for Diffusion-Based Image Restoration

Darshan Thaker, Abhishek Goyal, Rene Vidal

ICCV 2025posterarXiv:2411.15295
2
citations
#806

Understanding Co-speech Gestures in-the-wild

Sindhu Hegde, K R Prajwal, Taein Kwon et al.

ICCV 2025posterarXiv:2503.22668
2
citations
#807

Bridging the Skeleton-Text Modality Gap: Diffusion-Powered Modality Alignment for Zero-shot Skeleton-based Action Recognition

Jeonghyeok Do, Munchurl Kim

ICCV 2025posterarXiv:2411.10745
2
citations
#808

Aligning Constraint Generation with Design Intent in Parametric CAD

Evan Casey, Tianyu Zhang, Shu Ishida et al.

ICCV 2025posterarXiv:2504.13178
2
citations
#809

FG-OrIU: Towards Better Forgetting via Feature-Gradient Orthogonality for Incremental Unlearning

qian feng, Jiahang Tu, Mintong Kang et al.

ICCV 2025posterarXiv:2601.13578
2
citations
#810

Quantifying and Narrowing the Unknown: Interactive Text-to-Video Retrieval via Uncertainty Minimization

Bingqing Zhang, Zhuo Cao, Heming Du et al.

ICCV 2025posterarXiv:2507.15504
2
citations
#811

Synchronization of Multiple Videos

Avihai Naaman, Ron Shapira Weber, Oren Freifeld

ICCV 2025posterarXiv:2510.14051
2
citations
#812

Timestep-Aware Diffusion Model for Extreme Image Rescaling

Ce Wang, Zhenyu Hu, Wanjie Sun et al.

ICCV 2025posterarXiv:2408.09151
2
citations
#813

ODP-Bench: Benchmarking Out-of-Distribution Performance Prediction

Han Yu, Kehan Li, Dongbai Li et al.

ICCV 2025posterarXiv:2510.27263
2
citations
#814

Demeter: A Parametric Model of Crop Plant Morphology from the Real World

Tianhang Cheng, Albert Zhai, Evan Chen et al.

ICCV 2025posterarXiv:2510.16377
2
citations
#815

What Changed and What Could Have Changed? State-Change Counterfactuals for Procedure-Aware Video Representation Learning

Chi-Hsi Kung, Frangil Ramirez, Juhyung Ha et al.

ICCV 2025posterarXiv:2503.21055
2
citations
#816

Contact-Aware Amodal Completion for Human-Object Interaction via Multi-Regional Inpainting

Seunggeun Chi, Pin-Hao Huang, Enna Sachdeva et al.

ICCV 2025highlightarXiv:2508.00427
2
citations
#817

Towards More Diverse and Challenging Pre-training for Point Cloud Learning: Self-Supervised Cross Reconstruction with Decoupled Views

Xiangdong Zhang, Shaofeng Zhang, Junchi Yan

ICCV 2025posterarXiv:2509.01250
2
citations
#818

ScenePainter: Semantically Consistent Perpetual 3D Scene Generation with Concept Relation Alignment

Chong Xia, Shengjun Zhang, Fangfu Liu et al.

ICCV 2025posterarXiv:2507.19058
2
citations
#819

Learning Visual Hierarchies in Hyperbolic Space for Image Retrieval

Ziwei Wang, Sameera Ramasinghe, Chenchen Xu et al.

ICCV 2025posterarXiv:2411.17490
2
citations
#820

LINR-PCGC: Lossless Implicit Neural Representations for Point Cloud Geometry Compression

Wenjie Huang, Qi Yang, Shuting Xia et al.

ICCV 2025posterarXiv:2507.15686
2
citations
#821

D2ST-Adapter: Disentangled-and-Deformable Spatio-Temporal Adapter for Few-shot Action Recognition

Wenjie Pei, Qizhong Tan, Guangming Lu et al.

ICCV 2025poster
2
citations
#822

HUG: Hierarchical Urban Gaussian Splatting with Block-Based Reconstruction for Large-Scale Aerial Scenes

Mai Su, Zhongtao Wang, Huishan Au et al.

ICCV 2025posterarXiv:2504.16606
2
citations
#823

Who is a Better Talker: Subjective and Objective Quality Assessment for AI-Generated Talking Heads

Yingjie Zhou, Jiezhang Cao, Zicheng Zhang et al.

ICCV 2025posterarXiv:2507.23343
2
citations
#824

Robust 3D Object Detection using Probabilistic Point Clouds from Single-Photon LiDARs

Bhavya Goyal, Felipe Gutierrez-Barragan, Wei Lin et al.

ICCV 2025posterarXiv:2508.00169
2
citations
#825

IDF: Iterative Dynamic Filtering Networks for Generalizable Image Denoising

Dongjin Kim, Jaekyun Ko, Muhammad Kashif Ali et al.

ICCV 2025posterarXiv:2508.19649
2
citations
#826

Identity Preserving 3D Head Stylization with Multiview Score Distillation

Bahri Batuhan Bilecen, Ahmet Berke Gokmen, Furkan Güzelant et al.

ICCV 2025posterarXiv:2411.13536
2
citations
#827

Adversarial Attention Perturbations for Large Object Detection Transformers

Zachary Yahn, Selim Tekin, Fatih Ilhan et al.

ICCV 2025posterarXiv:2508.02987
2
citations
#828

CCL-LGS: Contrastive Codebook Learning for 3D Language Gaussian Splatting

Lei Tian, Xiaomin Li, Liqian Ma et al.

ICCV 2025posterarXiv:2505.20469
2
citations
#829

LaRender: Training-Free Occlusion Control in Image Generation via Latent Rendering

Xiaohang Zhan, Dingming Liu

ICCV 2025posterarXiv:2508.07647
2
citations
#830

Ponimator: Unfolding Interactive Pose for Versatile Human-human Interaction Animation

Shaowei Liu, chuan guo, Bing Zhou et al.

ICCV 2025posterarXiv:2510.14976
2
citations
#831

Generalized Few-Shot Point Cloud Segmentation via LLM-Assisted Hyper-Relation Matching

Zhaoyang Li, Yuan Wang, Guoxin Xiong et al.

ICCV 2025poster
2
citations
#832

CleanPose: Category-Level Object Pose Estimation via Causal Learning and Knowledge Distillation

Xiao Lin, Yun Peng, Liuyi Wang et al.

ICCV 2025posterarXiv:2502.01312
2
citations
#833

SMGDiff: Soccer Motion Generation using Diffusion Probabilistic Models

Hongdi Yang, Chengyang Li, Zhenxuan Wu et al.

ICCV 2025posterarXiv:2411.16216
2
citations
#834

Generative Active Learning for Long-tail Trajectory Prediction via Controllable Diffusion Model

Daehee Park, Monu Surana, Pranav Desai et al.

ICCV 2025posterarXiv:2507.22615
2
citations
#835

Generate, Transduct, Adapt: Iterative Transduction with VLMs

Oindrila Saha, Logan Lawrence, Grant Horn et al.

ICCV 2025posterarXiv:2501.06031
2
citations
#836

Generative Adversarial Diffusion

U-Chae Jun, Jaeeun Ko, Jiwoo Kang

ICCV 2025poster
2
citations
#837

MixA-Q: Revisiting Activation Sparsity for Vision Transformers from a Mixed-Precision Quantization Perspective

Weitian Wang, Shubham rai, Cecilia De la Parra et al.

ICCV 2025posterarXiv:2507.19131
2
citations
#838

Efficient Autoregressive Shape Generation via Octree-Based Adaptive Tokenization

Kangle Deng, Hsueh-Ti Derek Liu, Yiheng Zhu et al.

ICCV 2025posterarXiv:2504.02817
2
citations
#839

EvolvingGrasp: Evolutionary Grasp Generation via Efficient Preference Alignment

Yufei Zhu, Yiming Zhong, Zemin Yang et al.

ICCV 2025posterarXiv:2503.14329
2
citations
#840

FaceCraft4D: Animated 3D Facial Avatar Generation from a Single Image

Fei Yin, Mallikarjun Reddy, Chun-Han Yao et al.

ICCV 2025posterarXiv:2504.15179
2
citations
#841

Sequential keypoint density estimator: an overlooked baseline of skeleton-based video anomaly detection

Anja Delić, Matej Grcic, Siniša Šegvić

ICCV 2025highlightarXiv:2506.18368
2
citations
#842

MGSR: 2D/3D Mutual-boosted Gaussian Splatting for High-fidelity Surface Reconstruction under Various Light Conditions

Qingyuan Zhou, Yuehu Gong, Weidong Yang et al.

ICCV 2025posterarXiv:2503.05182
2
citations
#843

PseudoMapTrainer: Learning Online Mapping without HD Maps

Christian Löwens, Thorben Funke, Jingchao Xie et al.

ICCV 2025posterarXiv:2508.18788
2
citations
#844

MOSAIC: Generating Consistent, Privacy-Preserving Scenes from Multiple Depth Views in Multi-Room Environments

Zhixuan Liu, Haokun Zhu, Rui Chen et al.

ICCV 2025posterarXiv:2503.13816
2
citations
#845

Multi-Object Sketch Animation by Scene Decomposition and Motion Planning

Jingyu Liu, Zijie Xin, Yuhan Fu et al.

ICCV 2025posterarXiv:2503.19351
2
citations
#846

Leveraging Local Patch Alignment to Seam-cutting for Large Parallax Image Stitching

Tianli Liao, Chenyang Zhao, Lei Li et al.

ICCV 2025posterarXiv:2311.18564
2
citations
#847

DIP: Unsupervised Dense In-Context Post-training of Visual Representations

Sophia Sirko-Galouchenko, Spyros Gidaris, Antonin Vobecky et al.

ICCV 2025posterarXiv:2506.18463
2
citations
#848

Towards Open-World Generation of Stereo Images and Unsupervised Matching

Feng Qiao, Zhexiao Xiong, Eric Xing et al.

ICCV 2025posterarXiv:2503.12720
2
citations
#849

Extending Foundational Monocular Depth Estimators to Fisheye Cameras with Calibration Tokens

Suchisrit Gangopadhyay, Jung Hee Kim, Xien Chen et al.

ICCV 2025posterarXiv:2508.04928
2
citations
#850

Stylized-Face: A Million-level Stylized Face Dataset for Face Recognition

Zhengyuan Peng, Jianqing Xu, Yuge Huang et al.

ICCV 2025poster
2
citations
#851

Beyond Spatial Frequency: Pixel-wise Temporal Frequency-based Deepfake Video Detection

Taehoon Kim, Jongwook Choi, Yonghyun Jeong et al.

ICCV 2025highlightarXiv:2507.02398
2
citations
#852

CoMoGaussian: Continuous Motion-Aware Gaussian Splatting from Motion-Blurred Images

Jungho Lee, DongHyeong Kim, Dogyoon Lee et al.

ICCV 2025posterarXiv:2503.05332
2
citations
#853

Transformed Low-rank Adaptation via Tensor Decomposition and Its Applications to Text-to-image Models

Zerui Tao, Yuhta Takida, Naoki Murata et al.

ICCV 2025posterarXiv:2501.08727
2
citations
#854

Joint Asymmetric Loss for Learning with Noisy Labels

Jialiang Wang, Xianming Liu, Xiong Zhou et al.

ICCV 2025posterarXiv:2507.17692
2
citations
#855

F-Bench: Rethinking Human Preference Evaluation Metrics for Benchmarking Face Generation, Customization, and Restoration

Lu Liu, Huiyu Duan, Qiang Hu et al.

ICCV 2025highlightarXiv:2412.13155
2
citations
#856

Degradation-Modeled Multipath Diffusion for Tunable Metalens Photography

Jianing Zhang, Jiayi Zhu, Feiyu Ji et al.

ICCV 2025highlightarXiv:2506.22753
2
citations
#857

Noise2Score3D: Tweedie's Approach for Unsupervised Point Cloud Denoising

Xiangbin Wei, Yuanfeng Wang, Ao XU et al.

ICCV 2025posterarXiv:2503.09283
2
citations
#858

SketchSplat: 3D Edge Reconstruction via Differentiable Multi-view Sketch Splatting

Haiyang Ying, Matthias Zwicker

ICCV 2025posterarXiv:2503.14786
2
citations
#859

DNF-Intrinsic: Deterministic Noise-Free Diffusion for Indoor Inverse Rendering

Rongjia Zheng, Qing Zhang, Chengjiang Long et al.

ICCV 2025posterarXiv:2507.03924
2
citations
#860

EgoAdapt: Adaptive Multisensory Distillation and Policy Learning for Efficient Egocentric Perception

Sanjoy Chowdhury, Subrata Biswas, Sayan Nag et al.

ICCV 2025posterarXiv:2506.21080
2
citations
#861

GCRayDiffusion: Pose-Free Surface Reconstruction via Geometric Consistent Ray Diffusion

Li-Heng Chen, Zi-Xin Zou, Chang Liu et al.

ICCV 2025posterarXiv:2503.22349
2
citations
#862

Web Artifact Attacks Disrupt Vision Language Models

Maan Qraitem, Piotr Teterwak, Kate Saenko et al.

ICCV 2025posterarXiv:2503.13652
2
citations
#863

A Unified Framework for Motion Reasoning and Generation in Human Interaction

Jeongeun Park, Sungjoon Choi, Sangdoo Yun

ICCV 2025posterarXiv:2410.05628
2
citations
#864

Online Generic Event Boundary Detection

Hyung Rok Jung, Daneul Kim, Seunggyun Lim et al.

ICCV 2025posterarXiv:2510.06855
2
citations
#865

StruMamba3D: Exploring Structural Mamba for Self-supervised Point Cloud Representation Learning

Chuxin Wang, Yixin Zha, Wenfei Yang et al.

ICCV 2025posterarXiv:2506.21541
2
citations
#866

Global Regulation and Excitation via Attention Tuning for Stereo Matching

Jiahao LI, Xinhong Chen, Zhengmin JIANG et al.

ICCV 2025posterarXiv:2509.15891
2
citations
#867

Enhancing Transformers Through Conditioned Embedded Tokens

Hemanth Saratchandran, Simon Lucey

ICCV 2025posterarXiv:2505.12789
2
citations
#868

Few-Shot Image Quality Assessment via Adaptation of Vision-Language Models

Xudong Li, Zihao Huang, Yan Zhang et al.

ICCV 2025posterarXiv:2409.05381
2
citations
#869

LookOut: Real-World Humanoid Egocentric Navigation

Boxiao Pan, Adam Harley, Francis Engelmann et al.

ICCV 2025posterarXiv:2508.14466
2
citations
#870

SAMO: A Lightweight Sharpness-Aware Approach for Multi-Task Optimization with Joint Global-Local Perturbation

Hao Ban, Gokul Ram Subramani, Kaiyi Ji

ICCV 2025posterarXiv:2507.07883
2
citations
#871

PROGRESSOR: A Perceptually Guided Reward Estimator with Self-Supervised Online Refinement

Tewodros W. Ayalew, Xiao Zhang, Kevin Y Wu et al.

ICCV 2025posterarXiv:2411.17764
2
citations
#872

VTimeCoT: Thinking by Drawing for Video Temporal Grounding and Reasoning

Jinglei Zhang, Yuanfan Guo, Rolandos Alexandros Potamias et al.

ICCV 2025posterarXiv:2510.14672
2
citations
#873

Multi-modal Multi-platform Person Re-Identification: Benchmark and Method

Ruiyang Ha, Songyi Jiang, Bin Li et al.

ICCV 2025posterarXiv:2503.17096
2
citations
#874

GraspCoT: Integrating Physical Property Reasoning for 6-DoF Grasping under Flexible Language Instructions

Xiaomeng Chu, Jiajun Deng, Guoliang You et al.

ICCV 2025posterarXiv:2503.16013
2
citations
#875

RoboPearls: Editable Video Simulation for Robot Manipulation

Tao Tang, Likui Zhang, Youpeng Wen et al.

ICCV 2025posterarXiv:2506.22756
2
citations
#876

TAViS: Text-bridged Audio-Visual Segmentation with Foundation Models

Ziyang Luo, Nian Liu, Xuguang Yang et al.

ICCV 2025posterarXiv:2506.11436
2
citations
#877

PRISM: Reducing Spurious Implicit Biases in Vision-Language Models with LLM-Guided Embedding Projection

Mahdiyar Molahasani, Azadeh Motamedi, Michael Greenspan et al.

ICCV 2025posterarXiv:2507.08979
2
citations
#878

GroundFlow: A Plug-in Module for Temporal Reasoning on 3D Point Cloud Sequential Grounding

Zijun Lin, Shuting He, Cheston Tan et al.

ICCV 2025posterarXiv:2506.21188
2
citations
#879

Object-level Correlation for Few-Shot Segmentation

chunlin wen, Yu Zhang, Jie Fan et al.

ICCV 2025posterarXiv:2509.07917
2
citations
#880

Prior2Former - Evidential Modeling of Mask Transformers for Assumption-Free Open-World Panoptic Segmentation

Sebastian Schmidt, Julius Koerner, Dominik Fuchsgruber et al.

ICCV 2025highlightarXiv:2504.04841
2
citations
#881

Efficient Multi-Person Motion Prediction by Lightweight Spatial and Temporal Interactions

Yuanhong Zheng, Ruixuan Yu, Jian Sun

ICCV 2025posterarXiv:2507.09446
2
citations
#882

CoST: Efficient Collaborative Perception From Unified Spatiotemporal Perspective

Zongheng Tang, Yi Liu, Yifan Sun et al.

ICCV 2025highlightarXiv:2508.00359
2
citations
#883

Training-Free Personalization via Retrieval and Reasoning on Fingerprints

Deepayan Das, Davide Talon, Yiming Wang et al.

ICCV 2025posterarXiv:2503.18623
2
citations
#884

Teaching VLMs to Localize Specific Objects from In-context Examples

Sivan Doveh, Nimrod Shabtay, Eli Schwartz et al.

ICCV 2025posterarXiv:2411.13317
2
citations
#885

Cross-Architecture Distillation Made Simple with Redundancy Suppression

Weijia Zhang, Yuehao Liu, Wu Ran et al.

ICCV 2025highlightarXiv:2507.21844
2
citations
#886

AMD: Adaptive Momentum and Decoupled Contrastive Learning Framework for Robust Long-Tail Trajectory Prediction

Bin Rao, Haicheng Liao, Yanchen Guan et al.

ICCV 2025posterarXiv:2507.01801
2
citations
#887

ViCTr: Vital Consistency Transfer for Pathology Aware Image Synthesis

Onkar Susladkar, Gayatri Deshmukh, Yalcin Tur et al.

ICCV 2025posterarXiv:2505.04963
2
citations
#888

Can3Tok: Canonical 3D Tokenization and Latent Modeling of Scene-Level 3D Gaussians

Quankai Gao, Iliyan Georgiev, Tuanfeng Wang et al.

ICCV 2025posterarXiv:2508.01464
2
citations
#889

Sim-DETR: Unlock DETR for Temporal Sentence Grounding

Jiajin Tang, Zhengxuan Wei, Yuchen Zhu et al.

ICCV 2025posterarXiv:2509.23867
2
citations
#890

Alleviating Textual Reliance in Medical Language-guided Segmentation via Prototype-driven Semantic Approximation

Shuchang Ye, Usman Naseem, Mingyuan Meng et al.

ICCV 2025posterarXiv:2507.11055
2
citations
#891

Seeing the Trees for the Forest: Rethinking Weakly-Supervised Medical Visual Grounding

Huy Ta, Duy Anh Huynh, Yutong Xie et al.

ICCV 2025highlightarXiv:2505.15123
2
citations
#892

Unlocking Constraints: Source-Free Occlusion-Aware Seamless Segmentation

Yihong Cao, Jiaming Zhang, Xu Zheng et al.

ICCV 2025posterarXiv:2506.21198
2
citations
#893

CAFA: a Controllable Automatic Foley Artist

Roi Benita, Michael Finkelson, Tavi Halperin et al.

ICCV 2025posterarXiv:2504.06778
2
citations
#894

Everything is a Video: Unifying Modalities through Next-Frame Prediction

G Thomas Hudson, Dean Slack, Thomas Winterbottom et al.

ICCV 2025posterarXiv:2411.10503
2
citations
#895

Diorama: Unleashing Zero-shot Single-view 3D Indoor Scene Modeling

Qirui Wu, Denys Iliash, Daniel Ritchie et al.

ICCV 2025highlightarXiv:2411.19492
2
citations
#896

Intrepretable Zero-Shot Learning with Locally-Aligned Vision-Language Model

Shiming Chen, Bowen Duan, Salman Khan et al.

ICCV 2025poster
2
citations
#897

Boosting Vision Semantic Density with Anatomy Normality Modeling for Medical Vision-language Pre-training

Weiwei Cao, Jianpeng Zhang, Zhongyi Shui et al.

ICCV 2025posterarXiv:2508.03742
2
citations
#898

Refer to Any Segmentation Mask Group With Vision-Language Prompts

Shengcao Cao, Zijun Wei, Jason Kuen et al.

ICCV 2025posterarXiv:2506.05342
2
citations
#899

A Hidden Stumbling Block in Generalized Category Discovery: Distracted Attention

Qiyu Xu, Zhanxuan Hu, Yu Duan et al.

ICCV 2025posterarXiv:2507.14315
2
citations
#900

Towards a Universal 3D Medical Multi-modality Generalization via Learning Personalized Invariant Representation

Zhaorui Tan, Xi Yang, Tan Pan et al.

ICCV 2025posterarXiv:2411.06106
2
citations
#901

TRACE: Learning 3D Gaussian Physical Dynamics from Multi-view Videos

Jinxi Li, Ziyang Song, Bo Yang

ICCV 2025posterarXiv:2508.09811
2
citations
#902

VideoAds for Fast-Paced Video Understanding

Zheyuan Zhang, Wanying Dou, Linkai Peng et al.

ICCV 2025posterarXiv:2504.09282
2
citations
#903

Triad: Empowering LMM-based Anomaly Detection with Expert-guided Region-of-Interest Tokenizer and Manufacturing Process

Yuanze Li, Shihao Yuan, Haolin Wang et al.

ICCV 2025poster
2
citations
#904

Adaptive Articulated Object Manipulation On The Fly with Foundation Model Reasoning and Part Grounding

Xiaojie Zhang, Yuanfei Wang, Ruihai Wu et al.

ICCV 2025posterarXiv:2507.18276
2
citations
#905

MR-FIQA: Face Image Quality Assessment with Multi-Reference Representations from Synthetic Data Generation

Fu-Zhao Ou, Chongyi Li, Shiqi Wang et al.

ICCV 2025poster
2
citations
#906

Uncertainty-Driven Expert Control: Enhancing the Reliability of Medical Vision-Language Models

Xiao Liang, Di Wang, Zhicheng Jiao et al.

ICCV 2025posterarXiv:2507.09209
2
citations
#907

Preacher: Paper-to-Video Agentic System

Jingwei Liu, Ling Yang, Hao Luo et al.

ICCV 2025posterarXiv:2508.09632
2
citations
#908

Gait-X: Exploring X modality for Generalized Gait Recognition

Zengbin Wang, Saihui Hou, Junjie Li et al.

ICCV 2025poster
2
citations
#909

From Objects to Events: Unlocking Complex Visual Understanding in Object Detectors via LLM-guided Symbolic Reasoning

Yuhui Zeng, Haoxiang Wu, Wenjie Nie et al.

ICCV 2025posterarXiv:2502.05843
2
citations
#910

MaTVLM: Hybrid Mamba-Transformer for Efficient Vision-Language Modeling

Yingyue Li, Bencheng Liao, Wenyu Liu et al.

ICCV 2025posterarXiv:2503.13440
2
citations
#911

Diffusion-Based Imaginative Coordination for Bimanual Manipulation

Huilin Xu, Jian Ding, Jiakun Xu et al.

ICCV 2025posterarXiv:2507.11296
2
citations
#912

Improving Noise Efficiency in Privacy-preserving Dataset Distillation

Runkai Zheng, Vishnu Dasu, Yinong Wang et al.

ICCV 2025posterarXiv:2508.01749
2
citations
#913

UniConvNet: Expanding Effective Receptive Field while Maintaining Asymptotically Gaussian Distribution for ConvNets of Any Scale

Yuhao Wang, Wei Xi

ICCV 2025posterarXiv:2508.09000
2
citations
#914

G2SF: Geometry-Guided Score Fusion for Multimodal Industrial Anomaly Detection

Chengyu Tao, Xuanming Cao, Juan Du

ICCV 2025poster
2
citations
#915

Learn2Synth: Learning Optimal Data Synthesis Using Hypergradients for Brain Image Segmentation

Xiaoling Hu, Xiangrui Zeng, Oula Puonti et al.

ICCV 2025posterarXiv:2411.16719
2
citations
#916

Structure-aware Semantic Discrepancy and Consistency for 3D Medical Image Self-supervised Learning

Tan Pan, Zhaorui Tan, Kaiyu Guo et al.

ICCV 2025posterarXiv:2507.02581
2
citations
#917

COIN: Confidence Score-Guided Distillation for Annotation-Free Cell Segmentation

Sanghyun Jo, Seo Lee, Seungwoo Lee et al.

ICCV 2025posterarXiv:2503.11439
2
citations
#918

ViT-Linearizer: Distilling Quadratic Knowledge into Linear-Time Vision Models

Guoyizhe Wei, Rama Chellappa

ICCV 2025posterarXiv:2504.00037
2
citations
#919

Exploiting Domain Properties in Language-Driven Domain Generalization for Semantic Segmentation

Seogkyu Jeon, Kibeom Hong, Hyeran Byun

ICCV 2025posterarXiv:2512.03508
2
citations
#920

Towards Adversarial Robustness via Debiased High-Confidence Logit Alignment

Kejia Zhang, Juanjuan Weng, Zhiming Luo et al.

ICCV 2025posterarXiv:2408.06079
2
citations
#921

MEMFOF: High-Resolution Training for Memory-Efficient Multi-Frame Optical Flow Estimation

Vladislav Bargatin, Egor Chistov, Alexander Yakovenko et al.

ICCV 2025highlightarXiv:2506.23151
2
citations
#922

Training-Free Generation of Temporally Consistent Rewards from VLMs

Yinuo Zhao, Jiale Yuan, Zhiyuan Xu et al.

ICCV 2025posterarXiv:2507.04789
2
citations
#923

Guiding Diffusion-Based Articulated Object Generation by Partial Point Cloud Alignment and Physical Plausibility Constraints

Jens U. Kreber, Joerg Stueckler

ICCV 2025highlightarXiv:2508.00558
2
citations
#924

Adaptive Routing of Text-to-Image Generation Requests Between Large Cloud Model and Light-Weight Edge Model

Zewei Xin, Qinya Li, Chaoyue Niu et al.

ICCV 2025posterarXiv:2411.13787
2
citations
#925

Towards Cross-modal Backward-compatible Representation Learning for Vision-Language Models

Young Kyun Jang, Ser-Nam Lim

ICCV 2025posterarXiv:2405.14715
2
citations
#926

VSC: Visual Search Compositional Text-to-Image Diffusion Model

Do Dat, Nam Hyeon-Woo, Po-Yuan Mao et al.

ICCV 2025posterarXiv:2505.01104
2
citations
#927

A Structure-aware and Motion-adaptive Framework for 3D Human Pose Estimation with Mamba

Ye Lu, Jie Wang, Jianjun Gao et al.

ICCV 2025posterarXiv:2507.19852
2
citations
#928

DCT-Shield: A Robust Frequency Domain Defense against Malicious Image Editing

Aniruddha Bala, Rohit Chowdhury, Rohan Jaiswal et al.

ICCV 2025highlightarXiv:2504.17894
2
citations
#929

AdvDreamer Unveils: Are Vision-Language Models Truly Ready for Real-World 3D Variations?

Shouwei Ruan, Hanqing Liu, Yao Huang et al.

ICCV 2025highlightarXiv:2412.03002
2
citations
#930

GausSim: Foreseeing Reality by Gaussian Simulator for Elastic Objects

Yidi Shao, Mu Huang, Chen Change Loy et al.

ICCV 2025posterarXiv:2412.17804
2
citations
#931

Learnable Feature Patches and Vectors for Boosting Low-light Image Enhancement without External Knowledge

Xiaogang Xu, Jiafei Wu, Qingsen Yan et al.

ICCV 2025poster
2
citations
#932

PASTA: Part-Aware Sketch-to-3D Shape Generation with Text-Aligned Prior

Seunggwan Lee, Hwanhee Jung, ByoungSoo Koh et al.

ICCV 2025posterarXiv:2503.12834
2
citations
#933

SciVid: Cross-Domain Evaluation of Video Models in Scientific Applications

Yana Hasson, Pauline Luc, Liliane Momeni et al.

ICCV 2025posterarXiv:2507.03578
2
citations
#934

Acknowledging Focus Ambiguity in Visual Questions

Chongyan Chen, Yu-Yun Tseng, Zhuoheng Li et al.

ICCV 2025posterarXiv:2501.02201
2
citations
#935

Trust but Verify: Programmatic VLM Evaluation in the Wild

Viraj Prabhu, Senthil Purushwalkam, An Yan et al.

ICCV 2025posterarXiv:2410.13121
2
citations
#936

FinMMR: Make Financial Numerical Reasoning More Multimodal, Comprehensive, and Challenging

Zichen Tang, Haihong E, Jiacheng Liu et al.

ICCV 2025posterarXiv:2508.04625
2
citations
#937

Supercharged One-step Text-to-Image Diffusion Models with Negative Prompts

Viet Nguyen, Anh Nguyen, Trung Dao et al.

ICCV 2025posterarXiv:2412.02687
2
citations
#938

InfGen: A Resolution-Agnostic Paradigm for Scalable Image Synthesis

Tao Han, Wanghan Xu, Junchao Gong et al.

ICCV 2025posterarXiv:2509.10441
2
citations
#939

Denoising Token Prediction in Masked Autoregressive Models

Ting Yao, Yehao Li, Yingwei Pan et al.

ICCV 2025poster
2
citations
#940

Continual Multiple Instance Learning with Enhanced Localization for Histopathological Whole Slide Image Analysis

Byung Hyun Lee, Wongi Jeong, Woojae Han et al.

ICCV 2025posterarXiv:2507.02395
2
citations
#941

EC-Flow: Enabling Versatile Robotic Manipulation from Action-Unlabeled Videos via Embodiment-Centric Flow

Yixiang Chen, Peiyan Li, Yan Huang et al.

ICCV 2025posterarXiv:2507.06224
2
citations
#942

DAViD: Data-efficient and Accurate Vision Models from Synthetic Data

Fatemeh Saleh, Sadegh Aliakbarian, Charlie Hewitt et al.

ICCV 2025posterarXiv:2507.15365
2
citations
#943

ViLU: Learning Vision-Language Uncertainties for Failure Prediction

Marc Lafon, Yannis Karmim, Julio Silva-Rodríguez et al.

ICCV 2025posterarXiv:2507.07620
2
citations
#944

Physics Context Builders: A Modular Framework for Physical Reasoning in Vision-Language Models

Vahid Balazadeh, Mohammadmehdi Ataei, Hyunmin Cheong et al.

ICCV 2025posterarXiv:2412.08619
2
citations
#945

Subjective Camera 1.0: Bridging Human Cognition and Visual Reconstruction through Sequence-Aware Sketch-Guided Diffusion

Haoyang Chen, Dongfang Sun, Caoyuan Ma et al.

ICCV 2025posterarXiv:2506.23711
2
citations
#946

Trade-offs in Image Generation: How Do Different Dimensions Interact?

Sicheng Zhang, Binzhu Xie, Zhonghao Yan et al.

ICCV 2025posterarXiv:2507.22100
2
citations
#947

Color Matching Using Hypernetwork-Based Kolmogorov-Arnold Networks

Artem Nikonorov, Georgy Perevozchikov, Andrei Korepanov et al.

ICCV 2025posterarXiv:2503.11781
2
citations
#948

CMT: A Cascade MAR with Topology Predictor for Multimodal Conditional CAD Generation

Jianyu Wu, Yizhou Wang, Xiangyu Yue et al.

ICCV 2025posterarXiv:2504.20830
2
citations
#949

FedMVP: Federated Multimodal Visual Prompt Tuning for Vision-Language Models

Mainak Singha, Subhankar Roy, Sarthak Mehrotra et al.

ICCV 2025posterarXiv:2504.20860
2
citations
#950

Deep Incomplete Multi-view Clustering with Distribution Dual-Consistency Recovery Guidance

Jiaqi Jin, Siwei Wang, Zhibin Dong et al.

ICCV 2025posterarXiv:2503.11017
2
citations
#951

Learning Robust Image Watermarking with Lossless Cover Recovery

jiale chen, Wei Wang, Chongyang Shi et al.

ICCV 2025poster
1
citations
#952

Seeing Through Deepfakes: A Human-Inspired Framework for Multi-Face Detection

Juan Hu, Shaojing Fan, Terence Sim

ICCV 2025posterarXiv:2507.14807
1
citations
#953

M-SpecGene: Generalized Foundation Model for RGBT Multispectral Vision

Kailai Zhou, Fuqiang Yang, Shixian Wang et al.

ICCV 2025posterarXiv:2507.16318
1
citations
#954

MeshMamba: State Space Models for Articulated 3D Mesh Generation and Reconstruction

Yusuke Yoshiyasu, Leyuan Sun, Ryusuke Sagawa

ICCV 2025posterarXiv:2507.15212
1
citations
#955

MosaicDiff: Training-free Structural Pruning for Diffusion Model Acceleration Reflecting Pretraining Dynamics

Bowei Guo, Shengkun Tang, Cong Zeng et al.

ICCV 2025posterarXiv:2510.11962
1
citations
#956

Decoding Correlation-Induced Misalignment in the Stable Diffusion Workflow for Text-to-Image Generation

Yunze Tong, Fengda Zhang, Didi Zhu et al.

ICCV 2025poster
1
citations
#957

SparseVILA: Decoupling Visual Sparsity for Efficient VLM Inference

Samir Khaki, Junxian Guo, Jiaming Tang et al.

ICCV 2025posterarXiv:2510.17777
1
citations
#958

CompSlider: Compositional Slider for Disentangled Multiple-Attribute Image Generation

Zixin Zhu, Kevin Duarte, Mamshad Nayeem Rizve et al.

ICCV 2025posterarXiv:2509.01028
1
citations
#959

PLMP - Point-Line Minimal Problems for Projective SfM

Kim Kiehn, Albin Ahlbäck, Kathlén Kohn

ICCV 2025highlightarXiv:2503.04351
1
citations
#960

TITAN-Guide: Taming Inference-Time Alignment for Guided Text-to-Video Diffusion Models

Christian Simon, Masato Ishii, Akio Hayakawa et al.

ICCV 2025posterarXiv:2508.00289
1
citations
#961

PlaceIt3D: Language-Guided Object Placement in Real 3D Scenes

Ahmed Abdelreheem, Filippo Aleotti, Jamie Watson et al.

ICCV 2025posterarXiv:2505.05288
1
citations
#962

VITAL: More Understandable Feature Visualization through Distribution Alignment and Relevant Information Flow

Ada Görgün, Bernt Schiele, Jonas Fischer

ICCV 2025posterarXiv:2503.22399
1
citations
#963

SpiLiFormer: Enhancing Spiking Transformers with Lateral Inhibition

Zeqi Zheng, Yanchen Huang, Yingchao Yu et al.

ICCV 2025posterarXiv:2503.15986
1
citations
#964

PCR-GS: COLMAP-Free 3D Gaussian Splatting via Pose Co-Regularizations

YU WEI, Jiahui Zhang, Xiaoqin Zhang et al.

ICCV 2025posterarXiv:2507.13891
1
citations
#965

UMDATrack: Unified Multi-Domain Adaptive Tracking Under Adverse Weather Conditions

Siyuan Yao, Rui Zhu, Ziqi Wang et al.

ICCV 2025posterarXiv:2507.00648
1
citations
#966

Membership Inference Attacks with False Discovery Rate Control

Chenxu Zhao, Wei Qian, Aobo Chen et al.

ICCV 2025posterarXiv:2508.07066
1
citations
#967

D3QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection

Yanran Zhang, Bingyao Yu, Yu Zheng et al.

ICCV 2025poster
1
citations
#968

FICGen: Frequency-Inspired Contextual Disentanglement for Layout-driven Degraded Image Generation

Wenzhuang Wang, Yifan Zhao, Mingcan Ma et al.

ICCV 2025posterarXiv:2509.01107
1
citations
#969

PLADIS: Pushing the Limits of Attention in Diffusion Models at Inference Time by Leveraging Sparsity

Kwanyoung Kim, Byeongsu Sim

ICCV 2025posterarXiv:2503.07677
1
citations
#970

MeshPad: Interactive Sketch-Conditioned Artist-Reminiscent Mesh Generation and Editing

Haoxuan Li, Ziya Erkoç, Lei Li et al.

ICCV 2025posterarXiv:2503.01425
1
citations
#971

Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction

Giuseppe Cartella, Vittorio Cuculo, Alessandro D'Amelio et al.

ICCV 2025posterarXiv:2507.23021
1
citations
#972

Outlier-Aware Post-Training Quantization for Image Super-Resolution

Hailing Wang, Jianglin Lu, Yitian Zhang et al.

ICCV 2025highlightarXiv:2511.00682
1
citations
#973

A Linear N-Point Solver for Structure and Motion from Asynchronous Tracks

Hang Su, Yunlong Feng, Daniel Gehrig et al.

ICCV 2025highlightarXiv:2507.22733
1
citations
#974

Normal and Abnormal Pathology Knowledge-Augmented Vision-Language Model for Anomaly Detection in Pathology Images

Jinsol Song, Jiamu Wang, Anh Nguyen et al.

ICCV 2025posterarXiv:2508.15256
1
citations
#975

FreeCus: Free Lunch Subject-driven Customization in Diffusion Transformers

Yanbing Zhang, Zhe Wang, Qin Zhou et al.

ICCV 2025posterarXiv:2507.15249
1
citations
#976

OmniVTON: Training-Free Universal Virtual Try-On

Zhaotong Yang, Yuhui Li, Shengfeng He et al.

ICCV 2025posterarXiv:2507.15037
1
citations
#977

DAMap: Distance-aware MapNet for High Quality HD Map Construction

JINPENG DONG, Chen Li, Yutong Lin et al.

ICCV 2025posterarXiv:2510.22675
1
citations
#978

DAP-MAE: Domain-Adaptive Point Cloud Masked Autoencoder for Effective Cross-Domain Learning

Ziqi Gao, Qiufu Li, Linlin Shen

ICCV 2025highlightarXiv:2510.21635
1
citations
#979

Dataset Ownership Verification for Pre-trained Masked Models

Yuechen Xie, Jie Song, Yicheng Shan et al.

ICCV 2025posterarXiv:2507.12022
1
citations
#980

Neural Compression for 3D Geometry Sets

Siyu Ren, Junhui Hou, Weiyao Lin et al.

ICCV 2025posterarXiv:2405.15034
1
citations
#981

IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance

Jiayi Guo, Chuanhao Yan, Xingqian Xu et al.

ICCV 2025posterarXiv:2509.26231
1
citations
#982

From Holistic to Localized: Local Enhanced Adapters for Efficient Visual Instruction Fine-Tuning

Pengkun Jiao, Bin Zhu, Jingjing Chen et al.

ICCV 2025posterarXiv:2411.12787
1
citations
#983

Beyond Simple Edits: Composed Video Retrieval with Dense Modifications

Omkar Thawakar, Dmitry Demidov, Ritesh Thawkar et al.

ICCV 2025posterarXiv:2508.14039
1
citations
#984

SegmentDreamer: Towards High-fidelity Text-to-3D Synthesis with Segmented Consistency Trajectory Distillation

Jiahao Zhu, Zixuan Chen, Guangcong Wang et al.

ICCV 2025posterarXiv:2507.05256
1
citations
#985

Balanced Sharpness-Aware Minimization for Imbalanced Regression

Yahao Liu, Qin Wang, Lixin Duan et al.

ICCV 2025posterarXiv:2508.16973
1
citations
#986

You Share Beliefs, I Adapt: Progressive Heterogeneous Collaborative Perception

hao si, Ehsan Javanmardi, Manabu Tsukada

ICCV 2025posterarXiv:2509.09310
1
citations
#987

Robust Unfolding Network for HDR Imaging with Modulo Cameras

Zhile Chen, Hui Ji

ICCV 2025poster
1
citations
#988

IAP: Invisible Adversarial Patch Attack through Perceptibility-Aware Localization and Perturbation Optimization

Subrat Kishore Dutta, Xiao Zhang

ICCV 2025posterarXiv:2507.06856
1
citations
#989

Bridging the Gap Between Ideal and Real-world Evaluation: Benchmarking AI-Generated Image Detection in Challenging Scenarios

Chunxiao Li, Xiaoxiao Wang, Meiling Li et al.

ICCV 2025posterarXiv:2509.09172
1
citations
#990

Ask and Remember: A Questions-Only Replay Strategy for Continual Visual Question Answering

Imad Eddine MAROUF, Enzo Tartaglione, Stéphane Lathuilière et al.

ICCV 2025posterarXiv:2502.04469
1
citations
#991

Benefit From Seen: Enhancing Open-Vocabulary Object Detection by Bridging Visual and Textual Co-Occurrence Knowledge

Yanqi Li, Jianwei Niu, Tao Ren

ICCV 2025poster
1
citations
#992

HazeFlow: Revisit Haze Physical Model as ODE and Non-Homogeneous Haze Generation for Real-World Dehazing

Junseong Shin, Seungwoo Chung, Yunjeong Yang et al.

ICCV 2025posterarXiv:2509.18190
1
citations
#993

Dual Reciprocal Learning of Language-based Human Motion Understanding and Generation

CHEN LIANG, Zhicheng Shi, Wenguan Wang et al.

ICCV 2025poster
1
citations
#994

Text Embedding Knows How to Quantize Text-Guided Diffusion Models

Hongjae Lee, Myungjun Son, Dongjea Kang et al.

ICCV 2025posterarXiv:2507.10340
1
citations
#995

Generating, Fast and Slow: Scalable Parallel Video Generation with Video Interface Networks

Bhishma Dedhia, David Bourgin, Krishna Kumar Singh et al.

ICCV 2025posterarXiv:2503.17539
1
citations
#996

Video Color Grading via Look-Up Table Generation

Seunghyun Shin, Dongmin Shin, Jisu Shin et al.

ICCV 2025posterarXiv:2508.00548
1
citations
#997

S$^3$E: Self-Supervised State Estimation for Radar-Inertial System

Shengpeng Wang, Yulong Xie, Qing Liao et al.

ICCV 2025posterarXiv:2509.25984
1
citations
#998

Robust Low-light Scene Restoration via Illumination Transition

Ze Li, Feng Zhang, Xiatian Zhu et al.

ICCV 2025posterarXiv:2507.03976
1
citations
#999

ForCenNet: Foreground-Centric Network for Document Image Rectification

Peng Cai, liqiang liqiang, Kaicheng Yang et al.

ICCV 2025posterarXiv:2507.19804
1
citations
#1000

METEOR: Multi-Encoder Collaborative Token Pruning for Efficient Vision Language Models

Yuchen Liu, Yaoming Wang, Bowen Shi et al.

ICCV 2025posterarXiv:2507.20842
1
citations