Most Cited 2025 "hardware robotic control" Papers

22,274 papers found • Page 83 of 112

#16401

DiffVSR: Revealing an Effective Recipe for Taming Robust Video Super-Resolution Against Complex Degradations

Xiaohui Li, Yihao Liu, Shuo Cao et al.

ICCV 2025 • arXiv:2501.10110

#16402

Test-time Adaptation for Foundation Medical Segmentation Model Without Parametric Updates

Kecheng Chen, Xinyu Luo, Tiexin Qin et al.

ICCV 2025 • highlight • arXiv:2504.02008

#16403

ESCNet: Edge-Semantic Collaborative Network for Camouflaged Object Detection

Sheng Ye, Xin Chen, Yan Zhang et al.

ICCV 2025

#16404

Power of Cooperative Supervision: Multiple Teachers Framework for Advanced 3D Semi-Supervised Object Detection

Jin-Hee Lee, Jae-keun Lee, Jeseok Kim et al.

ICCV 2025

#16405

Adapting In-Domain Few-Shot Segmentation to New Domains without Source Domain Retraining

Qi Fan, Kaiqi Liu, Nian Liu et al.

ICCV 2025 • arXiv:2504.21414

#16406

ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations

Tianming Liang, Kun-Yu Lin, Chaolei Tan et al.

ICCV 2025 • arXiv:2501.14607

#16407

ASGS: Single-Domain Generalizable Open-Set Object Detection via Adaptive Subgraph Searching

Yuxuan Yuan, Luyao Tang, Chaoqi Chen et al.

ICCV 2025

#16408

DADet: Safeguarding Image Conditional Diffusion Models against Adversarial and Backdoor Attacks via Diffusion Anomaly Detection

Hongwei Yu, Xinlong Ding, Jiawei Li et al.

ICCV 2025 • highlight

#16409

Multi-Schema Proximity Network for Composed Image Retrieval

Jiangming Shi, Xiangbo Yin, yeyunchen yeyunchen et al.

ICCV 2025

#16410

CNS-Bench: Benchmarking Image Classifier Robustness Under Continuous Nuisance Shifts

Olaf Dünkel, Artur Jesslen, Jiahao Xie et al.

ICCV 2025 • arXiv:2507.17651

#16411

LEGO-Maker: A Semantic-Driven Algorithm for Text-to-3D Generation

Yifei Zhang, Lei Chen

ICCV 2025

#16412

Graph Domain Adaptation with Dual-branch Encoder and Two-level Alignment for Whole Slide Image-based Survival Prediction

Yuntao Shou, Xiangyong Cao, PeiqiangYan PeiqiangYan et al.

ICCV 2025 • arXiv:2411.14001

#16413

COVTrack: Continuous Open-Vocabulary Tracking via Adaptive Multi-Cue Fusion

Zekun Qian, Ruize Han, Zhixiang Wang et al.

ICCV 2025

#16414

Dense Policy: Bidirectional Autoregressive Learning of Actions

Yue Su, Xinyu Zhan, Hongjie Fang et al.

ICCV 2025 • arXiv:2503.13217

#16415

monoVLN: Bridging the Observation Gap between Monocular and Panoramic Vision and Language Navigation

Ren-Jie Lu, Yu Zhou, hao cheng et al.

ICCV 2025

#16416

An Efficient Hybrid Vision Transformer for TinyML Applications

Fanhong Zeng, Huanan LI, Juntao Guan et al.

ICCV 2025

#16417

DOGR: Towards Versatile Visual Document Grounding and Referring

Yinan Zhou, Yuxin Chen, Haokun Lin et al.

ICCV 2025 • arXiv:2411.17125

#16418

ReME: A Data-Centric Framework for Training-Free Open-Vocabulary Segmentation

Xiwei Xuan, Ziquan Deng, Kwan-Liu Ma

ICCV 2025 • highlight • arXiv:2506.21233

#16419

MonoMobility: Zero-Shot 3D Mobility Analysis from Monocular Videos

Hongyi Zhou, Xiaogang Wang, Yulan Guo et al.

ICCV 2025 • arXiv:2505.11868

#16420

The Burden of Interactive Alignment with Inconsistent Preferences

Ali Shirali

NEURIPS 2025 • arXiv:2510.16368

#16421

Performing Defocus Deblurring by Modeling its Formation Process

Zhengbo Zhang, Lin Geng Foo, Hossein Rahmani et al.

ICCV 2025

#16422

CasP: Improving Semi-Dense Feature Matching Pipeline Leveraging Cascaded Correspondence Priors for Guidance

Peiqi Chen, Lei Yu, Yi Wan et al.

ICCV 2025 • highlight • arXiv:2507.17312

#16423

Supervised Exploratory Learning for Long-Tailed Visual Recognition

Zhongquan Jian, Yanhao Chen, Wangyancheng Wangyancheng et al.

ICCV 2025

#16424

Collective Counterfactual Explanations: Balancing Individual Goals and Collective Dynamics

Ahmad-Reza Ehyaei, Ali Shirali, Samira Samadi

NEURIPS 2025 • arXiv:2402.04579

#16425

CaptionSmiths: Flexibly Controlling Language Pattern in Image Captioning

Kuniaki Saito, Donghyun Kim, Kwanyong Park et al.

ICCV 2025 • highlight • arXiv:2507.01409

#16426

MMAIF: Multi-task and Multi-degradation All-in-One for Image Fusion with Language Guidance

Zihan Cao, Yu Zhong, Ziqi Wang et al.

ICCV 2025 • arXiv:2503.14944

#16427

Blind Video Super-Resolution based on Implicit Kernels

Qiang Zhu, Yuxuan Jiang, Shuyuan Zhu et al.

ICCV 2025 • arXiv:2503.07856

#16428

OmniDiff: A Comprehensive Benchmark for Fine-grained Image Difference Captioning

Yuan Liu, Saihui Hou, Saijie Hou et al.

ICCV 2025 • arXiv:2503.11093

#16429

Toward Long-Tailed Online Anomaly Detection through Class-Agnostic Concepts

Chiao-An Yang, Kuan-Chuan Peng, Raymond A. Yeh

ICCV 2025 • arXiv:2507.16946

#16430

GauCho: Gaussian Distributions with Cholesky Decomposition for Oriented Object Detection

Jeffri Erwin Murrugarra Llerena, José Henrique Marques, Claudio Jung

CVPR 2025 • arXiv:2502.01565

#16431

More Reliable Pseudo-labels, Better Performance: A Generalized Approach to Single Positive Multi-label Learning

Luong Tran, Thieu Vo, Anh Nguyen et al.

ICCV 2025 • arXiv:2508.20381

#16432

Self supervised learning for in vivo localization of microelectrode arrays using raw local field potential

Tianxiao He, Malhar Patel, Chenyi Li et al.

NEURIPS 2025

#16433

TimeExpert: An Expert-Guided Video LLM for Video Temporal Grounding

Zuhao Yang, Yingchen Yu, Yunqing Zhao et al.

ICCV 2025 • arXiv:2508.01699

#16434

HiERO: Understanding the Hierarchy of Human Behavior Enhances Reasoning on Egocentric Videos

Simone Alberto Peirone, Francesca Pistilli, Giuseppe Averta

ICCV 2025 • arXiv:2505.12911

#16435

DCHM: Depth-Consistent Human Modeling for Multiview Detection

Jiahao Ma, Tianyu Wang, Miaomiao Liu et al.

ICCV 2025 • arXiv:2507.14505

#16436

Adversarial Robustness of Discriminative Self-Supervised Learning in Vision

Ömer Veysel Çağatan, Ömer TAL, M. Emre Gursoy

ICCV 2025 • arXiv:2503.06361

#16437

HPSv3: Towards Wide-Spectrum Human Preference Score

Yuhang Ma, Keqiang Sun, Xiaoshi Wu et al.

ICCV 2025 • arXiv:2508.03789

#16438

GUI-Rise: Structured Reasoning and History Summarization for GUI Navigation

Tao Liu, Chongyu Wang, Rongjie Li et al.

NEURIPS 2025 • arXiv:2510.27210

#16439

Active Perception Meets Rule-Guided RL: A Two-Phase Approach for Precise Object Navigation in Complex Environments

Liang Qin, Min Wang, Peiwei Li et al.

ICCV 2025

#16440

Inter2Former: Dynamic Hybrid Attention for Efficient High-Precision Interactive Segmentation

You Huang, Lichao Chen, Jiayi Ji et al.

ICCV 2025

#16441

UNIS: A Unified Framework for Achieving Unbiased Neural Implicit Surfaces in Volume Rendering

Junkai Deng, Hanting Niu, Jiaze Li et al.

ICCV 2025

#16442

On the Provable Importance of Gradients for Autonomous Language-Assisted Image Clustering

Bo Peng, Jie Lu, Guangquan Zhang et al.

ICCV 2025 • highlight

#16443

MH-LVC: Multi-Hypothesis Temporal Prediction for Learned Conditional Residual Video Coding

Gao Zong lin, Huu-Tai Phung, Yi-Chen Yao et al.

ICCV 2025 • arXiv:2510.12479

#16444

IntrinsicControlNet: Cross-distribution Image Generation with Real and Unreal

Jiayuan Lu, Rengan Xie, Zixuan Xie et al.

ICCV 2025

#16445

DC-AE 1.5: Accelerating Diffusion Model Convergence with Structured Latent Space

Junyu Chen, Dongyun Zou, Wenkun He et al.

ICCV 2025 • arXiv:2508.00413

#16446

Hate in Plain Sight: On the Risks of Moderating AI-Generated Hateful Illusions

Yiting Qu, Ziqing Yang, Yihan Ma et al.

ICCV 2025 • arXiv:2507.22617

#16447

Function-centric Bayesian Network for Zero-Shot Object Goal Navigation

Sixian Zhang, Xinyao Yu, Xinhang Song et al.

ICCV 2025

#16448

Loss Functions for Predictor-based Neural Architecture Search

Han Ji, Yuqi Feng, Jiahao Fan et al.

ICCV 2025 • arXiv:2506.05869

#16449

Advancing Text-to-3D Generation with Linearized Lookahead Variational Score Distillation

Yu Lei, Bingde Liu, Qingsong Xie et al.

ICCV 2025 • arXiv:2507.09748

#16450

Structural Information-based Hierarchical Diffusion for Offline Reinforcement Learning

Xianghua Zeng, Hao Peng, Yicheng Pan et al.

NEURIPS 2025 • oral • arXiv:2509.21942

#16451

Steering Guidance for Personalized Text-to-Image Diffusion Models

Sunghyun Park, Seokeon Choi, Hyoungwoo Park et al.

ICCV 2025 • arXiv:2508.00319

#16452

ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models

Zifu Wan, Ce Zhang, Silong Yong et al.

ICCV 2025 • arXiv:2507.00898

#16453

Single GPU Task Adaptation of Pathology Foundation Models for Whole Slide Image Analysis

Neeraj Kumar, Chad Vanderbilt

NEURIPS 2025 • arXiv:2506.05184

#16454

Precise Diffusion Inversion: Towards Novel Samples and Few-Step Models

Jing Zuo, Luoping Cui, Chuang Zhu et al.

NEURIPS 2025

#16455

Domain-aware Category-level Geometry Learning Segmentation for 3D Point Clouds

Pei He, Lingling Li, Licheng Jiao et al.

ICCV 2025 • arXiv:2508.11265

#16456

All Parts Matter: A Unified Mask-Free Virtual Try-On Framework

Chenghu Du, Shengwu Xiong, Yi Rong

ICCV 2025

#16457

GaussianReg: Rapid 2D/3D Registration for Emergency Surgery via Explicit 3D Modeling with Gaussian Primitives

Weihao Yu, Xiaoqing Guo, Xinyu Liu et al.

ICCV 2025

#16458

Consistently Simulating Human Personas with Multi-Turn Reinforcement Learning

Marwa Abdulhai, Ryan Cheng, Donovan Clay et al.

NEURIPS 2025 • arXiv:2511.00222

#16459

ArgoTweak: Towards Self-Updating HD Maps through Structured Priors

Lena Wild, Rafael Valencia, Patric Jensfelt

ICCV 2025 • arXiv:2509.08764

#16460

Event-aided Dense and Continuous Point Tracking: Everywhere and Anytime

Zhexiong Wan, Jianqin Luo, Yuchao Dai et al.

ICCV 2025

#16461

Context-Aware Academic Emotion Dataset and Benchmark

Luming Zhao, Jingwen Xuan, Jiamin Lou et al.

ICCV 2025 • arXiv:2507.00586

#16462

FlowSeek: Optical Flow Made Easier with Depth Foundation Models and Motion Bases

Matteo Poggi, Fabio Tosi

ICCV 2025 • arXiv:2509.05297

#16463

TPG-INR: Target Prior-Guided Implicit 3D CT Reconstruction for Enhanced Sparse-view Imaging

QingleiCao QingleiCao, Ziyao Tang, Xiaoqin Tang

ICCV 2025 • highlight

#16464

SpatialCrafter: Unleashing the Imagination of Video Diffusion Models for Scene Reconstruction from Limited Observations

Songchun Zhang, Huiyao Xu, Sitong Guo et al.

ICCV 2025 • arXiv:2505.11992

#16465

Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration

JUNSEONG KIM, GeonU Kim, Kim Yu-Ji et al.

CVPR 2025 • highlight • arXiv:2502.16652

#16466

JPEG Processing Neural Operator for Backward-Compatible Coding

Woo Kyoung Han, Yongjun Lee, Byeonghun Lee et al.

ICCV 2025 • arXiv:2507.23521

#16467

Efficient Visual Place Recognition Through Multimodal Semantic Knowledge Integration

Sitao Zhang, Hongda Mao, Qingshuang Chen et al.

ICCV 2025

#16468

COME: Dual Structure-Semantic Learning with Collaborative MoE for Universal Lesion Detection Across Heterogeneous Ultrasound Datasets

Lingyu Chen, Yawen Zeng, Yue Wang et al.

ICCV 2025 • arXiv:2508.09886

#16469

NATRA: Noise-Agnostic Framework for Trajectory Prediction with Noisy Observations

Rongqing Li, Changsheng Li, Ruilin Lv et al.

ICCV 2025

#16470

MS3D: High-Quality 3D Generation via Multi-Scale Representation Modeling

Guan Luo, Jianfeng Zhang

ICCV 2025

#16471

LayerLock: Non-collapsing Representation Learning with Progressive Freezing

Goker Erdogan, Nikhil Parthasarathy, Catalin Ionescu et al.

ICCV 2025 • arXiv:2509.10156

#16472

UniDxMD: Towards Unified Representation for Cross-Modal Unsupervised Domain Adaptation in 3D Semantic Segmentation

Zhengyin Liang, Hui Yin, Min Liang et al.

ICCV 2025 • highlight

#16473

Hybrid Layout Control for Diffusion Transformer: Fewer Annotations, Superior Aesthetics

Keming Wu, Junwen Chen, Zhanhao Liang et al.

ICCV 2025

#16474

PLAN: Proactive Low-Rank Allocation for Continual Learning

XIEQUN WANG, Zhan Zhuang, Yu Zhang

ICCV 2025 • arXiv:2510.21188

#16475

Leveraging Spatial Invariance to Boost Adversarial Transferability

Zihan Zhou, LI LI, Yanli Ren et al.

ICCV 2025

#16476

T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation

Chieh-Yun Chen, Min Shi, Gong Zhang et al.

ICCV 2025 • arXiv:2507.20536

#16477

AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation

Moayed Haji-Ali, Willi Menapace, Aliaksandr Siarohin et al.

ICCV 2025 • arXiv:2412.15191

#16478

FedXDS: Leveraging Model Attribution Methods to counteract Data Heterogeneity in Federated Learning

Maximilian Hoefler, Karsten Mueller, Wojciech Samek

ICCV 2025

#16479

Visual Textualization for Image Prompted Object Detection

Yongjian Wu, Yang Zhou, Jiya Saiyin et al.

ICCV 2025 • arXiv:2506.23785

#16480

TerraMind: Large-Scale Generative Multimodality for Earth Observation

Johannes Jakubik, Felix Yang, Benedikt Blumenstiel et al.

ICCV 2025 • arXiv:2504.11171

#16481

LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs

Haoran Lou, Chunxiao Fan, Ziyan Liu et al.

ICCV 2025 • arXiv:2507.00505

#16482

EgoPressure: A Dataset for Hand Pressure and Pose Estimation in Egocentric Vision

Yiming Zhao, Taein Kwon, Paul Streli et al.

CVPR 2025 • highlight • arXiv:2409.02224

#16483

Generative Video Bi-flow

Chen Liu, Tobias Ritschel

ICCV 2025 • arXiv:2503.06364

#16484

FHGS: Feature-Homogenized Gaussian Splatting

qigeng duan, Benyun ZHAO, Mingqiao Han et al.

NEURIPS 2025 • arXiv:2505.19154

#16485

A Unified Framework for Industrial Cel-Animation Colorization with Temporal-Structural Awareness

Xiaoyi Feng, Tao Huang, Peng Wang et al.

ICCV 2025

#16486

Transformer-based Tooth Alignment Prediction with Occlusion and Collision Constraints

DongZhenXing DongZhenXing, Jiazhou Chen

ICCV 2025 • arXiv:2410.20806

#16487

Towards Robust Defense against Customization via Protective Perturbation Resistant to Diffusion-based Purification

Wenkui Yang, Jie Cao, Junxian Duan et al.

ICCV 2025 • highlight • arXiv:2509.13922

#16488

Memory-Integrated Reconfigurable Adapters: A Unified Framework for Settings with Multiple Tasks

Susmit Agrawal, Krishn Vishwas Kher, Saksham Mittal et al.

NEURIPS 2025 • arXiv:2512.00940

#16489

ADCD-Net: Robust Document Image Forgery Localization via Adaptive DCT Feature and Hierarchical Content Disentanglement

KA WONG, Jicheng Zhou, Haiwei Wu et al.

ICCV 2025 • arXiv:2507.16397

#16490

SD2Actor: Continuous State Decomposition via Diffusion Embeddings for Robotic Manipulation

lijiayi jiayi

ICCV 2025

#16491

PixTalk: Controlling Photorealistic Image Processing and Editing with Language

Marcos Conde, Zihao Lu, Radu Timofte

ICCV 2025

#16492

Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis

Xinyu Hou, Zongsheng Yue, Xiaoming Li et al.

ICCV 2025 • arXiv:2411.17769

#16493

Beyond Brain Decoding: Visual-Semantic Reconstructions to Mental Creation Extension Based on fMRI

Haodong Jing, Dongyao Jiang, Yongqiang Ma et al.

ICCV 2025

#16494

Scene Graph Guided Generation: Enable Accurate Relations Generation in Text-to-Image Models via Textural Rectification

Guibao SHEN, Luozhou Wang, Jiantao Lin et al.

ICCV 2025

#16495

ReMP-AD: Retrieval-enhanced Multi-modal Prompt Fusion for Few-Shot Industrial Visual Anomaly Detection

Hongchi Ma, Guanglei Yang, Debin Zhao et al.

ICCV 2025

#16496

GMMamba: Group Masking Mamba for Whole Slide Image Classification

Tingting Zheng, Hongxun Yao, Kui Jiang et al.

ICCV 2025

#16497

TimeFormer: Capturing Temporal Relationships of Deformable 3D Gaussians for Robust Reconstruction

Dadong Jiang, Zhi Hou, Zhihui Ke et al.

ICCV 2025 • arXiv:2411.11941

#16498

Tracing Copied Pixels and Regularizing Patch Affinity in Copy Detection

Yichen Lu, Siwei Nie, Minlong Lu et al.

ICCV 2025

#16499

RareCLIP: Rarity-aware Online Zero-shot Industrial Anomaly Detection

Jianfang He, Min Cao, Silong Peng et al.

ICCV 2025

#16500

Pretrained Reversible Generation as Unsupervised Visual Representation Learning

Rongkun Xue, Jinouwen Zhang, Yazhe Niu et al.

ICCV 2025 • arXiv:2412.01787

#16501

BadVideo: Stealthy Backdoor Attack against Text-to-Video Generation

Ruotong Wang, Mingli Zhu, Jiarong Ou et al.

ICCV 2025 • arXiv:2504.16907

#16502

Beyond Isolated Words: Diffusion Brush for Handwritten Text-Line Generation

Gang Dai, Yifan Zhang, Yutao Qin et al.

ICCV 2025 • arXiv:2508.03256

#16503

Temporal Rate Reduction Clustering for Human Motion Segmentation

Xianghan Meng, Zhengyu Tong, Zhiyuan Huang et al.

ICCV 2025 • arXiv:2506.21249

#16504

Hierarchy UGP: Hierarchy Unified Gaussian Primitive for Large-Scale Dynamic Scene Reconstruction

Hongyang Sun, Qinglin Yang, Jiawei Wang et al.

ICCV 2025

#16505

QK-Edit: Revisiting Attention-based Injection in MM-DiT for Image and Video Editing

Tiancheng SHEN, Jun Hao Liew, Zilong Huang et al.

ICCV 2025

#16506

Backdoor Mitigation by Distance-Driven Detoxification

Shaokui Wei, Jiayin Liu, Hongyuan Zha

ICCV 2025 • highlight • arXiv:2411.09585

#16507

Democratizing High-Fidelity Co-Speech Gesture Video Generation

Xu Yang, Shaoli Huang, Shenbo Xie et al.

ICCV 2025 • arXiv:2507.06812

#16508

UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI

Fangwei Zhong, Kui Wu, Churan Wang et al.

ICCV 2025 • highlight • arXiv:2412.20977

#16509

HFD-Teacher: High-Frequency Depth Distillation from Depth Foundation Models for Enhanced Depth Completion

Zhiyuan Yang, Anqi Cheng, Haiyue Zhu et al.

ICCV 2025

#16510

FastVAR: Linear Visual Autoregressive Modeling via Cached Token Pruning

Hang Guo, Yawei Li, Taolin Zhang et al.

ICCV 2025 • arXiv:2503.23367

#16511

DiT4SR: Taming Diffusion Transformer for Real-World Image Super-Resolution

Zheng-Peng Duan, jiawei zhang, Xin Jin et al.

ICCV 2025 • arXiv:2503.23580

#16512

Separation for Better Integration: Disentangling Edge and Motion in Event-based Deblurring

Yufei Zhu, Hao Chen, Yongjian Deng et al.

ICCV 2025

#16513

Teleportraits: Training-Free People Insertion into Any Scene

Jialu Gao, Joseph K J, Fernando De la Torre

ICCV 2025 • arXiv:2510.05660

#16514

LOCATEdit: Graph Laplacian Optimized Cross Attention for Localized Text-Guided Image Editing

Achint Soni, Meet Soni, Sirisha Rambhatla

ICCV 2025 • arXiv:2503.21541

#16515

Diversity-Enhanced Distribution Alignment for Dataset Distillation

Hongcheng Li, Yucan Zhou, Xiaoyan Gu et al.

ICCV 2025

#16516

Height-Fidelity Dense Global Fusion for Multi-modal 3D Object Detection

Hanshi Wang, Jin Gao, Weiming Hu et al.

ICCV 2025 • highlight • arXiv:2507.04369

#16517

SMSTracker: Tri-path Score Mask Sigma Fusion for Multi-Modal Tracking

Sixian Chan, Zedong Li, Xiaoqin Zhang et al.

ICCV 2025 • highlight

#16518

Two Losses, One Goal: Balancing Conflict Gradients for Semi-supervised Semantic Segmentation

Rui Sun, Huayu Mai, Wangkai Li et al.

ICCV 2025 • highlight

#16519

Region-based Cluster Discrimination for Visual Representation Learning

Yin Xie, Kaicheng Yang, Xiang An et al.

ICCV 2025 • highlight • arXiv:2507.20025

#16520

CMB-ML: A Cosmic Microwave Background Dataset for the Oldest Possible Computer Vision Task

James Amato, Yunan Xie, Leonel Medina-Varela et al.

ICCV 2025

#16521

Adapt Foundational Segmentation Models with Heterogeneous Searching Space

Li Yi, Jie Hu, Songan Zhang et al.

ICCV 2025

#16522

Think Twice: Test-Time Reasoning for Robust CLIP Zero-Shot Classification

Shenyu Lu, Zhaoying Pan, Xiaoqian Wang

ICCV 2025

#16523

Sample and Map from a Single Convex Potential: Generation using Conjugate Moment Measures

Nina Vesseron, Louis Bethune, Marco Cuturi

NEURIPS 2025 • arXiv:2503.10576

#16524

FlexGen: Flexible Multi-View Generation from Text and Image Inputs

Xinli Xu, Wenhang Ge, Jiantao Lin et al.

ICCV 2025 • arXiv:2410.10745

#16525

Dropout Regularization Versus l2-Penalization in the Linear Model

Gabriel Clara, Sophie Langer, Johannes Schmidt-Hieber

NEURIPS 2025

#16526

Shape of Motion: 4D Reconstruction from a Single Video

Qianqian Wang, Vickie Ye, Hang Gao et al.

ICCV 2025 • highlight • arXiv:2407.13764

#16527

Less-to-More Generalization: Unlocking More Controllability by In-Context Generation

shaojin wu, Mengqi Huang, wenxu wu et al.

ICCV 2025 • arXiv:2504.02160

#16528

EditCLIP: Representation Learning for Image Editing

Qian Wang, Aleksandar Cvejic, Abdelrahman Eldesokey et al.

ICCV 2025 • arXiv:2503.20318

#16529

Counting Stacked Objects

Corentin Dumery, Noa Ette, Aoxiang Fan et al.

ICCV 2025 • arXiv:2411.19149

#16530

Gain-MLP: Improving HDR Gain Map Encoding via a Lightweight MLP

Trevor Canham, SaiKiran Tedla, Michael Murdoch et al.

ICCV 2025 • arXiv:2503.11883

#16531

Dyn-HaMR: Recovering 4D Interacting Hand Motion from a Dynamic Camera

Zhengdi Yu, Stefanos Zafeiriou, Tolga Birdal

CVPR 2025 • highlight • arXiv:2412.12861

#16532

Allowing Oscillation Quantization: Overcoming Solution Space Limitation in Low Bit-Width Quantization

Weiying Xie, Zihan Meng, Jitao Ma et al.

ICCV 2025

#16533

MOVE: Motion-Guided Few-Shot Video Object Segmentation

Kaining Ying, Hengrui Hu, Henghui Ding

ICCV 2025 • arXiv:2507.22061

#16534

CorrCLIP: Reconstructing Patch Correlations in CLIP for Open-Vocabulary Semantic Segmentation

Dengke Zhang, Fagui Liu, Quan Tang

ICCV 2025 • arXiv:2411.10086

#16535

mmCooper: A Multi-agent Multi-stage Communication-efficient and Collaboration-robust Cooperative Perception Framework

Bingyi Liu, Jian Teng, Hongfei Xue et al.

ICCV 2025 • arXiv:2501.12263

#16536

FreqPDE: Rethinking Positional Depth Embedding for Multi-View 3D Object Detection Transformers

Junjie Zhang, Haisheng Su, Feixiang Song et al.

ICCV 2025 • arXiv:2510.15385

#16537

GestureLSM: Latent Shortcut based Co-Speech Gesture Generation with Spatial-Temporal Modeling

Pinxin Liu, Luchuan Song, Junhua Huang et al.

ICCV 2025 • arXiv:2501.18898

#16538

SDFormer: Vision-based 3D Semantic Scene Completion via SAM-assisted Dual-channel Voxel Transformer

Yujie Xue, Huilong Pi, Jiapeng Zhang et al.

ICCV 2025

#16539

ShapeWords: Guiding Text-to-Image Synthesis with 3D Shape-Aware Prompts

Dmitrii M Petrov, Pradyumn Goyal, Divyansh Shivashok et al.

CVPR 2025 • arXiv:2412.02912

#16540

TopoTTA: Topology-Enhanced Test-Time Adaptation for Tubular Structure Segmentation

Jiale Zhou, Wenhan Wang, Shikun Li et al.

ICCV 2025 • arXiv:2508.00442

#16541

RealCam-I2V: Real-World Image-to-Video Generation with Interactive Complex Camera Control

Teng Li, Guangcong Zheng, Rui Jiang et al.

ICCV 2025 • arXiv:2502.10059

#16542

CoMatch: Dynamic Covisibility-Aware Transformer for Bilateral Subpixel-Level Semi-Dense Image Matching

Zizhuo Li, Yifan Lu, Linfeng Tang et al.

ICCV 2025 • highlight • arXiv:2503.23925

#16543

MagShield: Towards Better Robustness in Sparse Inertial Motion Capture Under Magnetic Disturbances

Yunzhe Shao, Xinyu Yi, Lu Yin et al.

ICCV 2025 • arXiv:2506.22907

#16544

Semantic Discrepancy-aware Detector for Image Forgery Identification

Wang Ziye, Minghang Yu, Chunyan Xu et al.

ICCV 2025 • arXiv:2508.12341

#16545

UniGlyph: Unified Segmentation-Conditioned Diffusion for Precise Visual Text Synthesis

Yuanrui Wang, Cong Han, Yafei Li et al.

ICCV 2025 • arXiv:2507.00992

#16546

DeFSS: Image-to-Mask Denoising Learning for Few-shot Segmentation

Zishu Qin, Junhao Xu, Weifeng Ge

ICCV 2025

#16547

UniversalBooth: Model-Agnostic Personalized Text-to-Image Generation

Songhua Liu, Ruonan Yu, Xinchao Wang

ICCV 2025

#16548

SA-LUT: Spatial Adaptive 4D Look-Up Table for Photorealistic Style Transfer

Zerui Gong, Zhonghua Wu, Qingyi Tao et al.

ICCV 2025 • arXiv:2506.13465

#16549

TAD-E2E: A Large-scale End-to-end Autonomous Driving Dataset

Chang Liu, mingxuzhu mingxuzhu, Zheyuan Zhang et al.

ICCV 2025

#16550

Put CASH on Bandits: A Max K-Armed Problem for Automated Machine Learning

Amir Rezaei Balef, Claire Vernade, Katharina Eggensperger

NEURIPS 2025 • arXiv:2505.05226

#16551

ProAPO: Progressively Automatic Prompt Optimization for Visual Classification

Xiangyan Qu, Gaopeng Gou, Jiamin Zhuang et al.

CVPR 2025 • arXiv:2502.19844

#16552

VAGUE: Visual Contexts Clarify Ambiguous Expressions

Heejeong Nam, Jinwoo Ahn, Keummin Ka et al.

ICCV 2025 • arXiv:2411.14137

#16553

Photolithography Overlay Map Generation with Implicit Knowledge Distillation Diffusion Transformer

YuanFu Yang, Hsiu-Hui Hsiao

ICCV 2025

#16554

Accelerating Diffusion Sampling via Exploiting Local Transition Coherence

shangwen zhu, Han Zhang, Zhantao Yang et al.

ICCV 2025 • arXiv:2503.09675

#16555

What's Making That Sound Right Now? Video-centric Audio-Visual Localization

hahyeon choi, Junhoo Lee, Nojun Kwak

ICCV 2025 • arXiv:2507.04667

#16556

Task-Specific Gradient Adaptation for Few-Shot One-Class Classification

Yunlong Li, Xiabi Liu, Liyuan Pan et al.

CVPR 2025

#16557

EEGMirror: Leveraging EEG data in the wild via Montage-Agnostic Self-Supervision for EEG to Video Decoding

Xuan-Hao Liu, Bao-liang Lu, Wei-Long Zheng

ICCV 2025

#16558

VehicleMAE: View-asymmetry Mutual Learning for Vehicle Re-identification Pre-training via Masked AutoEncoders

Qi Wang, Zeyu Zhang, Dong Wang et al.

ICCV 2025

#16559

Parametric Shadow Control for Portrait Generation in Text-to-Image Diffusion Models

Haoming Cai, Tsung-Wei Huang, Shiv Gehlot et al.

ICCV 2025 • arXiv:2503.21943

#16560

MagicCity: Geometry-Aware 3D City Generation from Satellite Imagery with Multi-View Consistency

Xingbo YAO, xuanmin Wang, Hao WU et al.

ICCV 2025

#16561

RARE: Refine Any Registration of Pairwise Point Clouds via Zero-Shot Learning

Chengyu Zheng, Honghua Chen, Jin Huang et al.

ICCV 2025 • arXiv:2507.19950

#16562

Multi-scenario Overlapping Text Segmentation with Depth Awareness

Yang Liu, Xudong Xie, Yuliang Liu et al.

ICCV 2025

#16563

Zero-Shot Vision Encoder Grafting via LLM Surrogates

Kaiyu Yue, Vasu Singla, Menglin Jia et al.

ICCV 2025 • arXiv:2505.22664

#16564

OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection

Adrian Chow, Evelien Riddell, Yimu Wang et al.

ICCV 2025 • arXiv:2503.06435

#16565

FullDiT: Video Generative Foundation Models with Multimodal Control via Full Attention

Xuan Ju, Weicai Ye, Quande Liu et al.

ICCV 2025

#16566

SC-Lane: Slope-aware and Consistent Road Height Estimation Framework for 3D Lane Detection

Chaesong Park, Eunbin Seo, JihyeonHwang JihyeonHwang et al.

ICCV 2025 • arXiv:2508.10411

#16567

Exploring the Visual Feature Space for Multimodal Neural Decoding

Weihao Xia, Cengiz Oztireli

ICCV 2025 • arXiv:2505.15755

#16568

Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing

Taihang Hu, Linxuan Li, Kai Wang et al.

ICCV 2025 • arXiv:2504.10434

#16569

ConceptSplit: Decoupled Multi-Concept Personalization of Diffusion Models via Token-wise Adaptation and Attention Disentanglement

Habin Lim, Youngseob Won, Juwon Seo et al.

ICCV 2025 • arXiv:2510.04668

#16570

ADMN: A Layer-Wise Adaptive Multimodal Network for Dynamic Input Noise and Compute Resources

Jason Wu, Yuyang Yuan, Kang Yang et al.

NEURIPS 2025 • arXiv:2502.07862

#16571

Backdoor Defense via Enhanced Splitting and Trap Isolation

Hongrui Yu, Lu Qi, Wanyu Lin et al.

ICCV 2025

#16572

Learning Hierarchical Line Buffer for Image Processing

Jiacheng Li, Feiran Li, Daisuke Iso

ICCV 2025

#16573

ATAS: Any-to-Any Self-Distillation for Enhanced Open-Vocabulary Dense Prediction

Soonwoo Cha, Jiwoo Song, Juan Yeo et al.

ICCV 2025 • arXiv:2506.08678

#16574

Preserve Anything: Controllable Image Synthesis with Object Preservation

Prasen Kumar Sharma, Neeraj Matiyali, Siddharth Srivastava et al.

ICCV 2025 • arXiv:2506.22531

#16575

Information Retrieval Induced Safety Degradation in AI Agents

Cheng Yu, Benedikt Stroebl, Diyi Yang et al.

NEURIPS 2025 • arXiv:2505.14215

#16576

Humans as Checkerboards: Calibrating Camera Motion Scale for World-Coordinate Human Mesh Recovery

Fengyuan Yang, Kerui Gu, Ha Linh Nguyen et al.

ICCV 2025 • arXiv:2407.00574

#16577

D3: Training-Free AI-Generated Video Detection Using Second-Order Features

Chende Zheng, Ruiqi suo, Chenhao Lin et al.

ICCV 2025 • arXiv:2508.00701

#16578

Understanding the Statistical Accuracy-Communication Trade-off in Personalized Federated Learning with Minimax Guarantees

Xin Yu, Zelin He, Ying Sun et al.

ICML 2025 • arXiv:2410.08934

#16579

Soup-of-Experts: Pretraining Specialist Models via Parameters Averaging

Pierre Ablin, Angelos Katharopoulos, Skyler Seto et al.

ICML 2025 • arXiv:2502.01804

#16580

Core Context Aware Transformers for Long Context Language Modeling

Yaofo Chen, Zeng You, Shuhai Zhang et al.

ICML 2025 • arXiv:2412.12465

#16581

Navigating Semantic Drift in Task-Agnostic Class-Incremental Learning

Fangwen Wu, Lechao Cheng, Shengeng Tang et al.

ICML 2025 • oral • arXiv:2502.07560

#16582

Does Data Scaling Lead to Visual Compositional Generalization?

Arnas Uselis, Andrea Dittadi, Seong Joon Oh

ICML 2025 • arXiv:2507.07102

#16583

An Asymptotically Optimal Approximation Algorithm for Multiobjective Submodular Maximization at Scale

Fabian Spaeh, Atsushi Miyauchi

ICML 2025 • arXiv:2505.09525

#16584

Learning curves theory for hierarchically compositional data with power-law distributed features

Francesco Cagnetta, Hyunmo Kang, Matthieu Wyart

ICML 2025 • arXiv:2505.07067

#16585

Neurosymbolic World Models for Sequential Decision Making

Leonardo Hernandez Cano, Maxine Perroni-Scharf, Neil Dhir et al.

ICML 2025

#16586

Self-Disentanglement and Re-Composition for Cross-Domain Few-Shot Segmentation

Jintao Tong, Yixiong Zou, Guangyao Chen et al.

ICML 2025 • arXiv:2506.02677

#16587

Model-Based Exploration in Monitored Markov Decision Processes

Alireza Kazemipour, Matthew Taylor, Michael Bowling

ICML 2025 • arXiv:2502.16772

#16588

Deep Bayesian Filter for Bayes-Faithful Data Assimilation

Yuta Tarumi, Keisuke Fukuda, Shin-ichi Maeda

ICML 2025 • arXiv:2405.18674

#16589

KAN-AD: Time Series Anomaly Detection with Kolmogorov–Arnold Networks

Quan Zhou, Changhua Pei, Fei Sun et al.

ICML 2025 • arXiv:2411.00278

#16590

Oscillation-Reduced MXFP4 Training for Vision Transformers

Yuxiang Chen, Haocheng Xi, Jun Zhu et al.

ICML 2025 • arXiv:2502.20853

#16591

Policy Filtration for RLHF to Mitigate Noise in Reward Models

Chuheng Zhang, Wei Shen, Li Zhao et al.

ICML 2025 • arXiv:2409.06957

#16592

On the Importance of Embedding Norms in Self-Supervised Learning

Andrew Draganov, Sharvaree Vadgama, Sebastian Damrich et al.

ICML 2025 • arXiv:2502.09252

#16593

Robust Multi-bit Text Watermark with LLM-based Paraphrasers

Xiaojun Xu, jinghan jia, Yuanshun Yao et al.

ICML 2025 • arXiv:2412.03123

#16594

Should Decision-Makers Reveal Classifiers in Online Strategic Classification?

Han Shao, Shuo Xie, Kunhe Yang

ICML 2025 • arXiv:2506.01936

#16595

Invariance Makes LLM Unlearning Resilient Even to Unanticipated Downstream Fine-Tuning

Changsheng Wang, Yihua Zhang, jinghan jia et al.

ICML 2025 • arXiv:2506.01339

#16596

Measuring Representational Shifts in Continual Learning: A Linear Transformation Perspective

Joonkyu Kim, Yejin Kim, Jy-yong Sohn

ICML 2025 • arXiv:2505.20970

#16597

Online Conformal Prediction via Online Optimization

Felipe Areces, Christopher Mohri, Tatsunori Hashimoto et al.

ICML 2025

#16598

Integer Programming for Generalized Causal Bootstrap Designs

Jennifer Brennan, Sébastien Lahaie, Adel Javanmard et al.

ICML 2025 • arXiv:2410.21464

#16599

Retraining with Predicted Hard Labels Provably Increases Model Accuracy

Rudrajit Das, Inderjit Dhillon, Alessandro Epasto et al.

ICML 2025 • arXiv:2406.11206

#16600

Self-Supervised Learning of Intertwined Content and Positional Features for Object Detection

Kang-Jun Liu, Masanori Suganuma, Takayuki Okatani

ICML 2025
