Most Cited 2025 "microtransactions" Papers

22,274 papers found

#19801

DIMCIM: A Quantitative Evaluation Framework for Default-mode Diversity and Generalization in Text-to-Image Generative Models

Revant Teotia, Candace Ross, Karen Ullrich et al.

ICCV 2025 • poster • arXiv:2506.05108
#19802

Cross-Granularity Online Optimization with Masked Compensated Information for Learned Image Compression

Haowei Kuang, Wenhan Yang, Zongming Guo et al.

ICCV 2025 • poster
#19803

TeEFusion: Blending Text Embeddings to Distill Classifier-Free Guidance

Minghao Fu, Guo-Hua Wang, Xiaohao Chen et al.

ICCV 2025 • poster • arXiv:2507.18192
#19804

CompSlider: Compositional Slider for Disentangled Multiple-Attribute Image Generation

Zixin Zhu, Kevin Duarte, Mamshad Nayeem Rizve et al.

ICCV 2025 • poster • arXiv:2509.01028
#19805

PLA: Prompt Learning Attack against Text-to-Image Generative Models

XINQI LYU, Yihao LIU, Yanjie Li et al.

ICCV 2025 • poster • arXiv:2508.03696
#19806

Holistic Tokenizer for Autoregressive Image Generation

Anlin Zheng, Haochen Wang, Yucheng Zhao et al.

ICCV 2025 • poster • arXiv:2507.02358
#19807

DanceEditor: Towards Iterative Editable Music-driven Dance Generation with Open-Vocabulary Descriptions

Hengyuan Zhang, Zhe Li, Xingqun Qi et al.

ICCV 2025 • poster • arXiv:2508.17342
#19808

Toward Better Out-painting: Improving the Image Composition with Initialization Policy Model

Xuan Han, Yihao Zhao, Yanhao Ge et al.

ICCV 2025 • poster
#19809

Versatile Transition Generation with Image-to-Video Diffusion

Zuhao Yang, Jiahui Zhang, Yingchen Yu et al.

ICCV 2025 • poster • arXiv:2508.01698
#19810

MetaMorph: Multimodal Understanding and Generation via Instruction Tuning

Shengbang Tong, David Fan, Jiachen Zhu et al.

ICCV 2025 • poster • arXiv:2412.14164
#19811

DiffIP: Representation Fingerprints for Robust IP Protection of Diffusion Models

Zhuoling Li, Haoxuan Qu, Jason Kuen et al.

ICCV 2025 • poster
#19812

Processing and acquisition traces in visual encoders: What does CLIP know about your camera?

Ryan Ramos, Vladan Stojnić, Giorgos Kordopatis-Zilos et al.

ICCV 2025 • highlight • arXiv:2508.10637
#19813

AM-Adapter: Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis in-the-Wild

Siyoon Jin, Jisu Nam, Jiyoung Kim et al.

ICCV 2025 • poster
#19814

Diffusion Epistemic Uncertainty with Asymmetric Learning for Diffusion-Generated Image Detection

Yingsong Huang, Hui Guo, Jing Huang et al.

ICCV 2025 • poster • arXiv:2601.14625
#19815

Draw Your Mind: Personalized Generation via Condition-Level Modeling in Text-to-Image Diffusion Models

Hyungjin Kim, Seokho Ahn, Young-Duk Seo

ICCV 2025 • poster • arXiv:2508.03481
#19816

V.I.P.: Iterative Online Preference Distillation for Efficient Video Diffusion Models

Jisoo Kim, Wooseok Seo, Junwan Kim et al.

ICCV 2025 • poster • arXiv:2508.03254
#19817

X-Prompt: Generalizable Auto-Regressive Visual Learning with In-Context Prompting

Zeyi Sun, Ziyang Chu, Pan Zhang et al.

ICCV 2025 • poster
#19818

AnyI2V: Animating Any Conditional Image with Motion Control

Ziye Li, Xincheng Shuai, Hao Luo et al.

ICCV 2025 • poster • arXiv:2507.02857
#19819

EEdit: Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing

Zexuan Yan, Yue Ma, Chang Zou et al.

ICCV 2025 • poster • arXiv:2503.10270
#19820

RAGDiffusion: Faithful Cloth Generation via External Knowledge Assimilation

Yuhan Li, Xianfeng Tan, Wenxiang Shang et al.

ICCV 2025 • highlight • arXiv:2411.19528
#19821

Instruction-based Image Editing with Planning, Reasoning, and Generation

Liya Ji, Chenyang Qi, Qifeng Chen

ICCV 2025 • poster
#19822

HDR Image Generation via Gain Map Decomposed Diffusion

Yuanshen Guan, Ruikang Xu, Yinuo Liao et al.

ICCV 2025 • poster
#19823

ESSENTIAL: Episodic and Semantic Memory Integration for Video Class-Incremental Learning

Jongseo Lee, Kyungho Bae, Kyle Min et al.

ICCV 2025 • highlight • arXiv:2508.10896
#19824

Accelerating Diffusion Transformer via Gradient-Optimized Cache

Junxiang Qiu, Lin Liu, Shuo Wang et al.

ICCV 2025 • poster • arXiv:2503.05156
#19825

The Silent Assistant: NoiseQuery as Implicit Guidance for Goal-Driven Image Generation

Ruoyu Wang, Huayang Huang, Ye Zhu et al.

ICCV 2025 • highlight • arXiv:2412.05101
#19826

Progressive Growing of Video Tokenizers for Temporally Compact Latent Spaces

Aniruddha Mahapatra, Long Mai, David Bourgin et al.

ICCV 2025 • poster • arXiv:2501.05442
#19827

MC-Bench: A Benchmark for Multi-Context Visual Grounding in the Era of MLLMs

Yunqiu Xu, Linchao Zhu, Yi Yang

ICCV 2025 • poster • arXiv:2410.12332
#19828

HyTIP: Hybrid Temporal Information Propagation for Masked Conditional Residual Video Coding

Yi-Hsin Chen, Yi-Chen Yao, Kuan-Wei Ho et al.

ICCV 2025 • poster • arXiv:2508.02072
#19829

DACoN: DINO for Anime Paint Bucket Colorization with Any Number of Reference Images

Kazuma Nagata, Naoshi Kaneko

ICCV 2025 • poster • arXiv:2509.14685
#19830

Parametric Shadow Control for Portrait Generation in Text-to-Image Diffusion Models

Haoming Cai, Tsung-Wei Huang, Shiv Gehlot et al.

ICCV 2025 • poster • arXiv:2503.21943
#19831

UniversalBooth: Model-Agnostic Personalized Text-to-Image Generation

Songhua Liu, Ruonan Yu, Xinchao Wang

ICCV 2025 • poster
#19832

CoMatch: Dynamic Covisibility-Aware Transformer for Bilateral Subpixel-Level Semi-Dense Image Matching

Zizhuo Li, Yifan Lu, Linfeng Tang et al.

ICCV 2025 • highlight • arXiv:2503.23925
#19833

LOCATEdit: Graph Laplacian Optimized Cross Attention for Localized Text-Guided Image Editing

Achint Soni, Meet Soni, Sirisha Rambhatla

ICCV 2025 • poster • arXiv:2503.21541
#19834

FastVAR: Linear Visual Autoregressive Modeling via Cached Token Pruning

Hang Guo, Yawei Li, Taolin Zhang et al.

ICCV 2025 • poster • arXiv:2503.23367
#19835

Beyond Isolated Words: Diffusion Brush for Handwritten Text-Line Generation

Gang Dai, Yifan Zhang, Yutao Qin et al.

ICCV 2025 • poster • arXiv:2508.03256
#19836

BadVideo: Stealthy Backdoor Attack against Text-to-Video Generation

Ruotong Wang, Mingli Zhu, Jiarong Ou et al.

ICCV 2025 • poster • arXiv:2504.16907
#19837

Tracing Copied Pixels and Regularizing Patch Affinity in Copy Detection

Yichen Lu, Siwei Nie, Minlong Lu et al.

ICCV 2025 • poster
#19838

PixTalk: Controlling Photorealistic Image Processing and Editing with Language

Marcos Conde, Zihao Lu, Radu Timofte

ICCV 2025 • poster
#19839

A Unified Framework for Industrial Cel-Animation Colorization with Temporal-Structural Awareness

Xiaoyi Feng, Tao Huang, Peng Wang et al.

ICCV 2025 • poster
#19840

T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation

Chieh-Yun Chen, Min Shi, Gong Zhang et al.

ICCV 2025 • poster • arXiv:2507.20536
#19841

LayerLock: Non-collapsing Representation Learning with Progressive Freezing

Goker Erdogan, Nikhil Parthasarathy, Catalin Ionescu et al.

ICCV 2025 • poster • arXiv:2509.10156
#19842

Function-centric Bayesian Network for Zero-Shot Object Goal Navigation

Sixian Zhang, Xinyao Yu, Xinhang Song et al.

ICCV 2025 • poster
#19843

Inter2Former: Dynamic Hybrid Attention for Efficient High-Precision Interactive Segmentation

You Huang, Lichao Chen, Jiayi Ji et al.

ICCV 2025 • poster
#19844

CaptionSmiths: Flexibly Controlling Language Pattern in Image Captioning

Kuniaki Saito, Donghyun Kim, Kwanyong Park et al.

ICCV 2025 • highlight • arXiv:2507.01409
#19845

ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations

Tianming Liang, Kun-Yu Lin, Chaolei Tan et al.

ICCV 2025 • poster • arXiv:2501.14607
#19846

Test-time Adaptation for Foundation Medical Segmentation Model Without Parametric Updates

Kecheng Chen, Xinyu Luo, Tiexin Qin et al.

ICCV 2025 • highlight • arXiv:2504.02008
#19847

Learn2Synth: Learning Optimal Data Synthesis Using Hypergradients for Brain Image Segmentation

Xiaoling Hu, Xiangrui Zeng, Oula Puonti et al.

ICCV 2025 • poster • arXiv:2411.16719
#19848

Representation Shift: Unifying Token Compression with FlashAttention

Joonmyung Choi, Sanghyeok Lee, Byungoh Ko et al.

ICCV 2025 • poster • arXiv:2508.00367
#19849

ZipVL: Accelerating Vision-Language Models through Dynamic Token Sparsity

Yefei He, Feng Chen, Jing Liu et al.

ICCV 2025 • poster
#19850

FastJSMA: Accelerating Jacobian-based Saliency Map Attacks through Gradient Decoupling

Zhenghao Gao, Shengjie Xu, Zijing Li et al.

ICCV 2025 • poster
#19851

Federated Continuous Category Discovery and Learning

Lixu Wang, Chenxi Liu, Junfeng Guo et al.

ICCV 2025 • poster
#19852

ProSAM: Enhancing the Robustness of SAM-based Visual Reference Segmentation with Probabilistic Prompts

Xiaoqi Wang, Clint Sebastian, Wenbin He et al.

ICCV 2025 • poster • arXiv:2506.21835
#19853

Zero-Shot Compositional Video Learning with Coding Rate Reduction

Heeseok Jung, Jun-Hyeon Bak, Yujin Jeong et al.

ICCV 2025 • poster
#19854

Fuzzy Contrastive Decoding to Alleviate Object Hallucination in Large Vision-Language Models

Jieun Kim, Jinmyeong Kim, Yoonji Kim et al.

ICCV 2025 • poster
#19855

Superpowering Open-Vocabulary Object Detectors for X-ray Vision

Pablo Garcia-Fernandez, Lorenzo Vaquero, Mingxuan Liu et al.

ICCV 2025 • poster • arXiv:2503.17071
#19856

RhythmGuassian: Repurposing Generalizable Gaussian Model For Remote Physiological Measurement

Hao LU, Yuting Zhang, Jiaqi Tang et al.

ICCV 2025 • highlight
#19857

CABLD: Contrast-Agnostic Brain Landmark Detection with Consistency-Based Regularization

Soorena Salari, Arash Harirpoush, Hassan Rivaz et al.

ICCV 2025 • poster • arXiv:2411.17845
#19858

Robustifying Zero-Shot Vision Language Models by Subspaces Alignment

Junhao Dong, Piotr Koniusz, Liaoyuan Feng et al.

ICCV 2025 • poster
#19859

Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers

Weiming Ren, Wentao Ma, Huan Yang et al.

ICCV 2025 • poster • arXiv:2503.11579
#19860

FE-CLIP: Frequency Enhanced CLIP Model for Zero-Shot Anomaly Detection and Segmentation

Tao Gong, Qi Chu, Bin Liu et al.

ICCV 2025 • poster
#19861

Bias-Resilient Weakly Supervised Semantic Segmentation Using Normalizing Flows

Xianglin Qiu, Xiaoyang Wang, Zhen Zhang et al.

ICCV 2025 • poster
#19862

Cracking Instance Jigsaw Puzzles: A Superior Alternative to Multiple Instance Learning for Whole Slide Image Analysis

Xiwen Chen, Peijie Qiu, Wenhui Zhu et al.

ICCV 2025 • poster
#19863

DecAD: Decoupling Anomalies in Latent Space for Multi-Class Unsupervised Anomaly Detection

Xiaolei Wang, Xiaoyang Wang, Huihui Bai et al.

ICCV 2025 • poster
#19864

Hierarchical Event Memory for Accurate and Low-latency Online Video Temporal Grounding

Minghang Zheng, Yuxin Peng, Benyuan Sun et al.

ICCV 2025 • poster • arXiv:2508.04546
#19865

RA-BUSSeg: Relation-aware Semi-supervised Breast Ultrasound Image Segmentation via Adjacent Propagation and Cross-layer Alignment

Wanting ZHANG, Zhenhui Ding, Guilian Chen et al.

ICCV 2025 • poster
#19866

Prototypes are Balanced Units for Efficient and Effective Partially Relevant Video Retrieval

WonJun Moon, Cheol-Ho Cho, Woojin Jun et al.

ICCV 2025 • poster • arXiv:2504.13035
#19867

Auto-Controlled Image Perception in MLLMs via Visual Perception Tokens

Runpeng Yu, Xinyin Ma, Xinchao Wang

ICCV 2025 • poster
#19868

SurfaceSplat: Connecting Surface Reconstruction and Gaussian Splatting

Zihui Gao, Jia-Wang Bian, Guosheng Lin et al.

ICCV 2025 • poster • arXiv:2507.15602
#19869

Beyond Single Images: Retrieval Self-Augmented Unsupervised Camouflaged Object Detection

Ji Du, Xin WANG, Fangwei Hao et al.

ICCV 2025 • poster • arXiv:2510.18437
#19870

Revisiting Efficient Semantic Segmentation: Learning Offsets for Better Spatial and Class Feature Alignment

Shi-Chen Zhang, Yunheng Li, Yu-Huan Wu et al.

ICCV 2025 • poster • arXiv:2508.08811
#19871

Pseudo-SD: Pseudo Controlled Stable Diffusion for Semi-Supervised and Cross-Domain Semantic Segmentation

Dong Zhao, Qi Zang, Shuang Wang et al.

ICCV 2025 • poster
#19872

Is CLIP ideal? No. Can we fix it? Yes!

Raphaela Kang, Yue Song, Georgia Gkioxari et al.

ICCV 2025 • poster • arXiv:2503.08723
#19873

HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasets and CLIP Models

ZHIXIANG WEI, Guangting Wang, Xiaoxiao Ma et al.

ICCV 2025 • poster • arXiv:2507.22431
#19874

Dynamic Dictionary Learning for Remote Sensing Image Segmentation

Xuechao Zou, Yue Li, Shun Zhang et al.

ICCV 2025 • poster • arXiv:2503.06683
#19875

Temporal-aware Query Routing for Real-time Video Instance Segmentation

Zesen Cheng, Kehan Li, Yian Zhao et al.

ICCV 2025 • poster
#19876

Learnable Retrieval Enhanced Visual-Text Alignment and Fusion for Radiology Report Generation

Qin Zhou, Guoyan Liang, Xindi Li et al.

ICCV 2025 • poster • arXiv:2507.07568
#19877

Anomaly Detection of Integrated Circuits Package Substrates Using the Large Vision Model SAIC: Dataset Construction, Methodology, and Application

Ruiyun Yu, Bingyang Guo, Haoyuan Li

ICCV 2025 • poster
#19878

Memory-Efficient 4-bit Preconditioned Stochastic Optimization

Jingyang Li, Kuangyu Ding, Kim-chuan Toh et al.

ICCV 2025 • poster • arXiv:2412.10663
#19879

No More Sibling Rivalry: Debiasing Human-Object Interaction Detection

Bin Yang, Yulin Zhang, Hong-Yu Zhou et al.

ICCV 2025 • poster • arXiv:2509.00760
#19880

DASH: Detection and Assessment of Systematic Hallucinations of VLMs

Maximilian Augustin, Yannic Neuhaus, Matthias Hein

ICCV 2025 • poster • arXiv:2503.23573
#19881

HERMES: temporal-coHERent long-forM understanding with Episodes and Semantics

Gueter Josmy Faure, Jia-Fong Yeh, Min-Hung Chen et al.

ICCV 2025 • poster • arXiv:2408.17443
#19882

Debiasing Trace Guidance: Top-down Trace Distillation and Bottom-up Velocity Alignment for Unsupervised Anomaly Detection

Xingjian Wang, Li Chai, Jiming Chen

ICCV 2025
#19883

ODDR: Outlier Detection & Dimension Reduction Based Defense Against Adversarial Patches

Nandish Chattopadhyay, Amira Guesmi, Muhammad Abdullah Hanif et al.

ICCV 2025 • poster • arXiv:2311.12084
#19884

FIND: Few-Shot Anomaly Inspection with Normal-Only Multi-Modal Data

YITING LI, Fayao Liu, Jingyi Liao et al.

ICCV 2025 • poster
#19885

Unsupervised Histopathological Image Semantic Segmentation with Overlapping Patches Consistency Constraint

Wentian Cai, Weizhao Weng, Zihao Huang et al.

ICCV 2025 • poster
#19886

How Do Optical Flow and Textual Prompts Collaborate to Assist in Audio-Visual Semantic Segmentation?

Yujian Lee, Peng Gao, Yongqi Xu et al.

ICCV 2025 • poster • arXiv:2601.08133
#19887

UINavBench: A Framework for Comprehensive Evaluation of Interactive Digital Agents

Harsh Agrawal, Eldon Schoop, Xinlei Pan et al.

ICCV 2025 • poster
#19888

LawDIS: Language-Window-based Controllable Dichotomous Image Segmentation

Xinyu Yan, Meijun Sun, Ge-Peng Ji et al.

ICCV 2025 • poster • arXiv:2508.01152
#19889

Open-Vocabulary HOI Detection with Interaction-aware Prompt and Concept Calibration

Ting Lei, Shaofeng Yin, Qingchao Chen et al.

ICCV 2025 • poster • arXiv:2508.03207
#19890

Teaching AI the Anatomy Behind the Scan: Addressing Anatomical Flaws in Medical Image Segmentation with Learnable Prior

Young Seok Jeon, Hongfei Yang, Huazhu Fu et al.

ICCV 2025 • poster • arXiv:2403.18878
#19891

Enrich and Detect: Video Temporal Grounding with Multimodal LLMs

Shraman Pramanick, Effrosyni Mavroudi, Yale Song et al.

ICCV 2025 • highlight • arXiv:2510.17023
#19892

Player-Centric Multimodal Prompt Generation for Large Language Model Based Identity-Aware Basketball Video Captioning

Zeyu Xi, Haoying Sun, Yaofei Wu et al.

ICCV 2025 • poster • arXiv:2507.20163
#19893

Conditional Latent Diffusion Models for Zero-Shot Instance Segmentation

Maximilian Ulmer, Wout Boerdijk, Rudolph Triebel et al.

ICCV 2025 • poster • arXiv:2508.04122
#19894

Breaking Grid Constraints: Dynamic Graph Reconstruction Network for Multi-organ Segmentation

Junhao Xiao, Yang Wei, Jingyu Wang et al.

ICCV 2025 • poster
#19895

Hybrid-Tower: Fine-grained Pseudo-query Interaction and Generation for Text-to-Video Retrieval

Bangxiang Lan, Ruobing Xie, Ruixiang Zhao et al.

ICCV 2025 • poster • arXiv:2509.04773
#19896

ViM-VQ: Efficient Post-Training Vector Quantization for Visual Mamba

Juncan Deng, Shuaiting Li, Zeyu Wang et al.

ICCV 2025 • poster • arXiv:2503.09509
#19897

Mixture-of-Scores: Robust Image-Text Data Valuation via Three Lines of Code

WU Sitong, Haoru Tan, Yukang Chen et al.

ICCV 2025 • poster
#19898

Axis-level Symmetry Detection with Group-Equivariant Representation

Wongyun Yu, Ahyun Seo, Minsu Cho

ICCV 2025 • poster • arXiv:2508.10740
#19899

U-ViLAR: Uncertainty-Aware Visual Localization for Autonomous Driving via Differentiable Association and Registration

Xiaofan Li, Zhihao Xu, Chenming Wu et al.

ICCV 2025 • poster • arXiv:2507.04503
#19900

Statistical Confidence Rescoring for Robust 3D Scene Graph Generation from Multi-View Images

Qi Xun Yeo, Yanyan Li, Gim Hee Lee

ICCV 2025 • poster • arXiv:2508.06546
#19901

Hi3DGen: High-fidelity 3D Geometry Generation from Images via Normal Bridging

Chongjie Ye, Yushuang Wu, Ziteng Lu et al.

ICCV 2025 • poster • arXiv:2503.22236
#19902

Dual-S3D: Hierarchical Dual-Path Selective SSM-CNN for High-Fidelity Implicit Reconstruction

Luoxi Zhang, Pragyan Shrestha, Yu Zhou et al.

ICCV 2025 • poster
#19903

MMGeo: Multimodal Compositional Geo-Localization for UAVs

Yuxiang Ji, Boyong He, Zhuoyue Tan et al.

ICCV 2025 • poster
#19904

Large Scene Generation with Cube-Absorb Discrete Diffusion

Qianjiang Hu, Wei Hu

ICCV 2025 • poster
#19905

SynAD: Enhancing Real-World End-to-End Autonomous Driving Models through Synthetic Data Integration

Jongsuk Kim, Jae Young Lee, Gyojin Han et al.

ICCV 2025 • poster • arXiv:2510.24052
#19906

Benchmarking Egocentric Visual-Inertial SLAM at City Scale

Anusha Krishnan, Shaohui Liu, Paul-Edouard Sarlin et al.

ICCV 2025 • highlight • arXiv:2509.26639
#19907

DAA*: Deep Angular A Star for Image-based Path Planning

Zhiwei Xu

ICCV 2025 • poster • arXiv:2507.09305
#19908

RCTDistill: Cross-Modal Knowledge Distillation Framework for Radar-Camera 3D Object Detection with Temporal Fusion

Geonho Bang, Minjae Seong, Jisong Kim et al.

ICCV 2025 • poster • arXiv:2509.17712
#19909

EmbodiedSplat: Personalized Real-to-Sim-to-Real Navigation with Gaussian Splats from a Mobile Device

Gunjan Chhablani, Xiaomeng Ye, Muhammad Zubair Irshad et al.

ICCV 2025 • poster • arXiv:2509.17430
#19910

NGD: Neural Gradient Based Deformation for Monocular Garment Reconstruction

Soham Dasgupta, Shanthika Naik, Preet Savalia et al.

ICCV 2025 • poster • arXiv:2508.17712
#19911

RayletDF: Raylet Distance Fields for Generalizable 3D Surface Reconstruction from Point Clouds or Gaussians

Shenxing Wei, Jinxi Li, Yafei YANG et al.

ICCV 2025 • highlight • arXiv:2508.09830
#19912

Semantic-guided Camera Ray Regression for Visual Localization

Yesheng Zhang, Xu Zhao

ICCV 2025 • poster
#19913

Polarimetric Neural Field via Unified Complex-Valued Wave Representation

Chu Zhou, Yixin Yang, Junda Liao et al.

ICCV 2025 • poster
#19914

From Gallery to Wrist: Realistic 3D Bracelet Insertion in Videos

Chenjian Gao, Lihe Ding, Rui Han et al.

ICCV 2025 • poster • arXiv:2507.20331
#19915

Street Gaussians without 3D Object Tracker

Ruida Zhang, Chengxi Li, Chenyangguang Zhang et al.

ICCV 2025 • poster • arXiv:2412.05548
#19916

HiNeuS: High-fidelity Neural Surface Mitigating Low-texture and Reflective Ambiguity

Yida Wang, Xueyang Zhang, Kun Zhan et al.

ICCV 2025 • highlight • arXiv:2506.23854
#19917

I2-World: Intra-Inter Tokenization for Efficient Dynamic 4D Scene Forecasting

Zhimin Liao, Ping Wei, Ruijie Zhang et al.

ICCV 2025 • poster
#19918

MetaScope: Optics-Driven Neural Network for Ultra-Micro Metalens Endoscopy

Wuyang Li, Wentao Pan, Xiaoyuan Liu et al.

ICCV 2025 • highlight • arXiv:2508.03596
#19919

Free-running vs Synchronous: Single-Photon Lidar for High-flux 3D Imaging

Ruangrawee Kitichotkul, Shashwath Bharadwaj, Joshua Rapp et al.

ICCV 2025 • poster • arXiv:2507.09386
#19920

Leaps and Bounds: An Improved Point Cloud Winding Number Formulation for Fast Normal Estimation and Surface Reconstruction

Chamin Hewa Koneputugodage, Dylan Campbell, Stephen Gould

ICCV 2025 • poster
#19921

Harnessing Text-to-Image Diffusion Models for Point Cloud Self-Supervised Learning

Yiyang Chen, Shanshan Zhao, Lunhao Duan et al.

ICCV 2025 • poster • arXiv:2507.09102
#19922

OD-RASE: Ontology-Driven Risk Assessment and Safety Enhancement for Autonomous Driving

Kota Shimomura, Masaki Nambata, Atsuya Ishikawa et al.

ICCV 2025 • poster
#19923

UniGS: Modeling Unitary 3D Gaussians for Novel View Synthesis from Sparse-view Images

Jiamin WU, Kenkun Liu, Xiaoke Jiang et al.

ICCV 2025 • poster • arXiv:2410.13195
#19924

TOTP: Transferable Online Pedestrian Trajectory Prediction with Temporal-Adaptive Mamba Latent Diffusion

Ziyang Ren, Ping Wei, Shangqi Deng et al.

ICCV 2025 • poster
#19925

UnMix-NeRF: Spectral Unmixing Meets Neural Radiance Fields

Fabian Perez, Sara Rojas Martinez, Carlos Hinojosa et al.

ICCV 2025 • poster • arXiv:2506.21884
#19926

MaterialMVP: Illumination-Invariant Material Generation via Multi-view PBR Diffusion

Zebin He, Mx Yang, Shuhui Yang et al.

ICCV 2025 • highlight • arXiv:2503.10289
#19927

Visual Surface Wave Elastography: Revealing Subsurface Physical Properties via Visible Surface Waves

Alexander Ogren, Berthy Feng, Jihoon Ahn et al.

ICCV 2025 • poster • arXiv:2507.09207
#19928

LANGTRAJ: Diffusion Model and Dataset for Language-Conditioned Trajectory Simulation

WEI-JER Chang, Masayoshi Tomizuka, Wei Zhan et al.

ICCV 2025 • poster • arXiv:2504.11521
#19929

Temporal Overlapping Prediction: A Self-supervised Pre-training Method for LiDAR Moving Object Segmentation

Ziliang Miao, Runjian Chen, Yixi Cai et al.

ICCV 2025 • poster • arXiv:2503.07167
#19930

GeoFormer: Geometry Point Encoder for 3D Object Detection with Graph-based Transformer

Xin Jin, Haisheng Su, Cong Ma et al.

ICCV 2025 • poster
#19931

AlignDiff: Learning Physically-Grounded Camera Alignment via Diffusion

Liuyue Xie, Jiancong Guo, Ozan Cakmakci et al.

ICCV 2025 • poster • arXiv:2503.21581
#19932

Tile-wise vs. Image-wise: Random-Tile Loss and Training Paradigm for Gaussian Splatting

Xiaoyu Zhang, Weihong Pan, Xiaojun Xiang et al.

ICCV 2025 • poster
#19933

RoCo-Sim: Enhancing Roadside Collaborative Perception through Foreground Simulation

Yuwen Du, Anning Hu, Zichen Chao et al.

ICCV 2025 • poster • arXiv:2503.10410
#19934

LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation

Zijie Wang, Weiming Zhang, Wei Zhang et al.

ICCV 2025 • poster • arXiv:2511.06272
#19935

Planar Affine Rectification from Local Change of Scale and Orientation

Yuval Nissan, Marc Pollefeys, Daniel Barath

ICCV 2025 • highlight
#19936

ERNet: Efficient Non-Rigid Registration Network for Point Sequences

Guangzhao He, Yuxi Xiao, Zhen Xu et al.

ICCV 2025 • poster • arXiv:2510.15800
#19937

Doppler-Aware LiDAR-RADAR Fusion for Weather-Robust 3D Detection

Yujeong Chae, Heejun Park, Hyeonseong Kim et al.

ICCV 2025 • poster
#19938

Egocentric Action-aware Inertial Localization in Point Clouds with Vision-Language Guidance

Mingfang Zhang, Ryo Yonetani, Yifei Huang et al.

ICCV 2025 • poster • arXiv:2505.14346
#19939

InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models

Yifan Lu, Xuanchi Ren, Jiawei Yang et al.

ICCV 2025 • poster • arXiv:2412.03934
#19940

GenFlow3D: Generative Scene Flow Estimation and Prediction on Point Cloud Sequences

Hanlin Li, Wenming Weng, Yueyi Zhang et al.

ICCV 2025 • poster
#19941

Splat-LOAM: Gaussian Splatting LiDAR Odometry and Mapping

Emanuele Giacomini, Luca Di Giammarino, Lorenzo De Rebotti et al.

ICCV 2025 • poster • arXiv:2503.17491
#19942

AAA-Gaussians: Anti-Aliased and Artifact-Free 3D Gaussian Rendering

Michael Steiner, Thomas Köhler, Lukas Radl et al.

ICCV 2025 • highlight • arXiv:2504.12811
#19943

SAFT: Shape and Appearance of Fabrics from Template via Differentiable Physical Simulations from Monocular Video

David Stotko, Reinhard Klein

ICCV 2025 • highlight • arXiv:2509.08828
#19944

BridgeDepth: Bridging Monocular and Stereo Reasoning with Latent Alignment

Tongfan Guan, Jiaxin Guo, Chen Wang et al.

ICCV 2025 • highlight • arXiv:2508.04611
#19945

Decoupled Diffusion Sparks Adaptive Scene Generation

Yunsong Zhou, Naisheng Ye, William Ljungbergh et al.

ICCV 2025 • poster • arXiv:2504.10485
#19946

Recover Biological Structure from Sparse-View Diffraction Images with Neural Volumetric Prior

Renzhi He, Haowen Zhou, Yubei Chen et al.

ICCV 2025 • poster • arXiv:2510.16391
#19947

HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation

Xin Zhou, DINGKANG LIANG, Sifan Tu et al.

ICCV 2025 • poster • arXiv:2501.14729
#19948

Instant GaussianImage: A Generalizable and Self-Adaptive Image Representation via 2D Gaussian Splatting

Zhaojie Zeng, Yuesong Wang, Chao Yang et al.

ICCV 2025 • poster • arXiv:2506.23479
#19949

NeuraLeaf: Neural Parametric Leaf Models with Shape and Deformation Disentanglement

Yang Yang, Dongni Mao, Hiroaki Santo et al.

ICCV 2025 • highlight • arXiv:2507.12714
#19950

Stochastic Gradient Estimation for Higher-Order Differentiable Rendering

Zican Wang, Michael Fischer, Tobias Ritschel

ICCV 2025 • highlight • arXiv:2412.03489
#19951

Uncertainty-Aware Diffusion-Guided Refinement of 3D Scenes

Sarosij Bose, Arindam Dutta, Sayak Nag et al.

ICCV 2025 • poster • arXiv:2503.15742
#19952

HouseCrafter: Lifting Floorplans to 3D Scenes with 2D Diffusion Models

YIWEN CHEN, Hieu Nguyen, Vikram Voleti et al.

ICCV 2025 • highlight • arXiv:2406.20077
#19953

Hi-Gaussian: Hierarchical Gaussians under Normalized Spherical Projection for Single-View 3D Reconstruction

Binjian Xie, Pengju Zhang, Hao Wei et al.

ICCV 2025 • poster
#19954

Exploiting Vision Language Model for Training-Free 3D Point Cloud OOD Detection via Graph Score Propagation

Tiankai Chen, Yushu Li, Adam Goodge et al.

ICCV 2025 • poster • arXiv:2506.22375
#19955

Unraveling the Effects of Synthetic Data on End-to-End Autonomous Driving

Junhao Ge, Zuhong Liu, Longteng Fan et al.

ICCV 2025 • poster • arXiv:2503.18108
#19956

Lidar Waveforms are Worth 40x128x33 Words

Dominik Scheuble, Hanno Holzhüter, Steven Peters et al.

ICCV 2025 • highlight
#19957

Wide2Long: Learning Lens Compression and Perspective Adjustment for Wide-Angle to Telephoto Translation

Soumyadipta Banerjee, Jiaul Paik, Debashis Sen

ICCV 2025 • poster
#19958

SparseLaneSTP: Leveraging Spatio-Temporal Priors with Sparse Transformers for 3D Lane Detection

Maximilian Pittner, Joel Janai, Mario Faigle et al.

ICCV 2025 • poster • arXiv:2601.04968
#19959

Relative Illumination Fields: Learning Medium and Light Independent Underwater Scenes

Mengkun She, Felix Seegräber, David Nakath et al.

ICCV 2025 • poster • arXiv:2504.10024
#19960

HVPUNet: Hybrid-Voxel Point-cloud Upsampling Network

Juhyung Ha, Vibhas Vats, Alimoor Reza et al.

ICCV 2025 • poster
#19961

Stealthy Backdoor Attack in Federated Learning via Adaptive Layer-wise Gradient Alignment

Qingqian Yang, Peishen Yan, Xiaoyu Wu et al.

ICCV 2025 • poster
#19962

RS-vHeat: Heat Conduction Guided Efficient Remote Sensing Foundation Model

Huiyang Hu, Peijin Wang, Hanbo Bi et al.

ICCV 2025 • poster • arXiv:2411.17984
#19963

SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling

Xianglong He, Zi-Xin Zou, Chia Hao Chen et al.

ICCV 2025 • poster • arXiv:2503.21732
#19964

Diffusion Transformer meets Multi-level Wavelet Spectrum for Single Image Super-Resolution

Peng Du, Hui Li, Han Xu et al.

ICCV 2025 • poster • arXiv:2511.01175
#19965

Spatially-Varying Autofocus

Yingsi Qin, Aswin Sankaranarayanan, Matthew O'Toole

ICCV 2025 • poster
#19966

M2SFormer: Multi-Spectral and Multi-Scale Attention with Edge-Aware Difficulty Guidance for Image Forgery Localization

Ju-Hyeon Nam, Dong-Hyun Moon, Sang-Chul Lee

ICCV 2025 • highlight • arXiv:2506.20922
#19967

Articulate3D: Holistic Understanding of 3D Scenes as Universal Scene Description

Anna-Maria Halacheva, Yang Miao, Jan-Nico Zaech et al.

ICCV 2025 • poster • arXiv:2412.01398
#19968

SU-RGS: Relightable 3D Gaussian Splatting from Sparse Views under Unconstrained Illuminations

Qi Zhang, Chi Huang, Qian Zhang et al.

ICCV 2025 • poster
#19969

Gradient Extrapolation for Debiased Representation Learning

Ihab Asaad, Maha Shadaydeh, Joachim Denzler

ICCV 2025 • poster • arXiv:2503.13236
#19970

World4Drive: End-to-End Autonomous Driving via Intention-aware Physical Latent World Model

Yupeng Zheng, Pengxuan Yang, Zebin Xing et al.

ICCV 2025 • poster • arXiv:2507.00603
#19971

Customizing Domain Adapters for Domain Generalization

Yuyang Ji, Zeyi Huang, Haohan Wang et al.

ICCV 2025 • poster
#19972

Soft Separation and Distillation: Toward Global Uniformity in Federated Unsupervised Learning

Hung-Chieh Fang, Hsuan-Tien Lin, Irwin King et al.

ICCV 2025 • poster • arXiv:2508.01251
#19973

Image as an IMU: Estimating Camera Motion from a Single Motion-Blurred Image

Jerred Chen, Ronald Clark

ICCV 2025 • poster • arXiv:2503.17358
#19974

Adaptive Dual Uncertainty Optimization: Boosting Monocular 3D Object Detection under Test-Time Shifts

Zixuan Hu, Dongxiao Li, Xinzhu Ma et al.

ICCV 2025 • highlight • arXiv:2508.20488
#19975

Beyond Losses Reweighting: Empowering Multi-Task Learning via the Generalization Perspective

Hoang Phan, Tung Lam Tran, Quyen Tran et al.

ICCV 2025 • highlight • arXiv:2211.13723
#19976

Learning Null Geodesics for Gravitational Lensing Rendering in General Relativity

Mingyuan Sun, Zheng Fang, Jiaxu Wang et al.

ICCV 2025 • poster • arXiv:2507.15775
#19977

Object-centric Video Question Answering with Visual Grounding and Referring

Haochen Wang, Qirui Chen, Cilin Yan et al.

ICCV 2025 • poster • arXiv:2507.19599
#19978

Exploiting Frequency Dynamics for Enhanced Multimodal Event-based Action Recognition

Meiqi Cao, Xiangbo Shu, Xin Jiang et al.

ICCV 2025 • poster
#19979

How Far are AI-generated Videos from Simulating the 3D Visual World: A Learned 3D Evaluation Approach

Chirui CHANG, Jiahui Liu, Zhengzhe Liu et al.

ICCV 2025 • poster • arXiv:2406.19568
#19980

WIPES: Wavelet-based Visual Primitives

Wenhao Zhang, Hao Zhu, Delong Wu et al.

ICCV 2025 • poster • arXiv:2508.12615
#19981

CoSMIC: Continual Self-supervised Learning for Multi-Domain Medical Imaging via Conditional Mutual Information Maximization

Yihang Liu, Ying Wen, Longzhen Yang et al.

ICCV 2025 • poster
#19982

Diffusion Curriculum: Synthetic-to-Real Data Curriculum via Image-Guided Diffusion

Yijun Liang, Shweta Bhardwaj, Tianyi Zhou

ICCV 2025 • poster • arXiv:2410.13674
#19983

Advancing Textual Prompt Learning with Anchored Attributes

Zheng Li, Yibing Song, Ming-Ming Cheng et al.

ICCV 2025 • poster • arXiv:2412.09442
#19984

Dual-Rate Dynamic Teacher for Source-Free Domain Adaptive Object Detection

Qi He, Xiao Wu, Jun-Yan He et al.

ICCV 2025 • poster
#19985

OV3D-CG: Open-vocabulary 3D Instance Segmentation with Contextual Guidance

Mingquan Zhou, Chen He, Ruiping Wang et al.

ICCV 2025 • poster
#19986

Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

Zhisheng Zhong, Chengyao Wang, Yuqi Liu et al.

ICCV 2025 • poster • arXiv:2412.09501
#19987

Enhancing Mamba Decoder with Bidirectional Interaction in Multi-Task Dense Prediction

Mang Cao, Sanping Zhou, Yizhe Li et al.

ICCV 2025 • poster • arXiv:2508.20376
#19988

SITE: towards Spatial Intelligence Thorough Evaluation

Wenqi Wang, Reuben Tan, Pengyue Zhu et al.

ICCV 2025 • poster • arXiv:2505.05456
#19989

SHIFT: Smoothing Hallucinations by Information Flow Tuning for Multimodal Large Language Models

Sudong Wang, Yunjian Zhang, Yao Zhu et al.

ICCV 2025 • poster
#19990

Text2VDM: Text to Vector Displacement Maps for Expressive and Interactive 3D Sculpting

Hengyu Meng, Duotun Wang, Zhijing Shao et al.

ICCV 2025 • poster • arXiv:2502.20045
#19991

Mamba-3VL: Taming State Space Model for 3D Vision Language Learning

Yuan Wang, Yuxin Chen, Zhongang Qi et al.

ICCV 2025 • poster
#19992

MobileViCLIP: An Efficient Video-Text Model for Mobile Devices

Min Yang, Zihan Jia, Zhilin Dai et al.

ICCV 2025 • poster • arXiv:2508.07312
#19993

MATE: Motion-Augmented Temporal Consistency for Event-based Point Tracking

Han Han, Wei Zhai, Yang Cao et al.

ICCV 2025 • poster • arXiv:2412.01300
#19994

Asynchronous Event Error-Minimizing Noise for Safeguarding Event Dataset

Ruofei WANG, Peiqi Duan, Boxin Shi et al.

ICCV 2025 • highlight • arXiv:2507.05728
#19995

Vector Contrastive Learning For Pixel-Wise Pretraining In Medical Vision

Yuting He, Shuo Li

ICCV 2025 • poster • arXiv:2506.20850
#19996

Efficient Fine-Tuning of Large Models via Nested Low-Rank Adaptation

Lujun Li, Cheng Lin, Dezhi Li et al.

ICCV 2025 • poster
#19997

Dual-level Prototype Learning for Composite Degraded Image Restoration

Zhongze Wang, Haitao Zhao, Lujian Yao et al.

ICCV 2025 • poster
#19998

Efficient Input-level Backdoor Defense on Text-to-Image Synthesis via Neuron Activation Variation

Shengfang ZHAI, Jiajun Li, Yue Liu et al.

ICCV 2025 • highlight • arXiv:2503.06453
#19999

GReg: Geometry-Aware Region Refinement for Sign Language Video Generation

Tongkai Shi, Lianyu Hu, Fanhua Shang et al.

ICCV 2025 • poster
#20000

FineMotion: A Dataset and Benchmark with both Spatial and Temporal Annotation for Fine-grained Motion Generation and Editing

Bizhu Wu, Jinheng Xie, Meidan Ding et al.

ICCV 2025 • poster • arXiv:2507.19850