Most Cited ICCV "recall maximization" Papers

2,701 papers found • Page 12 of 14

Filters:Most Cited ICCV recall maximization Clear all

Conference

AAAI 2025 (3,028)COLM 2025 (418)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NEURIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,558)oral (1,594)spotlight (1,421)highlight (975)

#2201

Street Gaussians without 3D Object Tracker

Ruida Zhang, Chengxi Li, Chenyangguang Zhang et al.

ICCV 2025arXiv:2412.05548

#2202

HiNeuS: High-fidelity Neural Surface Mitigating Low-texture and Reflective Ambiguity

Yida Wang, Xueyang Zhang, Kun Zhan et al.

ICCV 2025highlightarXiv:2506.23854

#2203

Resonance: Learning to Predict Social-Aware Pedestrian Trajectories as Co-Vibrations

Conghao Wong, Ziqian Zou, Beihao Xia

ICCV 2025arXiv:2412.02447

#2204

I2-World: Intra-Inter Tokenization for Efficient Dynamic 4D Scene Forecasting

Zhimin Liao, Ping Wei, Ruijie Zhang et al.

ICCV 2025

#2205

InsideOut: Integrated RGB-Radiative Gaussian Splatting for Comprehensive 3D Object Representation

Jungmin Lee, Seonghyuk Hong, Juyong Lee et al.

ICCV 2025arXiv:2510.17864

#2206

RIOcc: Efficient Cross-Modal Fusion Transformer with Collaborative Feature Refinement for 3D Semantic Occupancy Prediction

Baojie Fan, Xiaotian Li, Yuhan Zhou et al.

ICCV 2025

#2207

MetaScope: Optics-Driven Neural Network for Ultra-Micro Metalens Endoscopy

Wuyang Li, Wentao Pan, Xiaoyuan Liu et al.

ICCV 2025highlightarXiv:2508.03596

#2208

CoLMDriver: LLM-based Negotiation Benefits Cooperative Autonomous Driving

Changxing Liu, Genjia Liu, Zijun Wang et al.

ICCV 2025arXiv:2503.08683

#2209

Free-running vs Synchronous: Single-Photon Lidar for High-flux 3D Imaging

Ruangrawee Kitichotkul, Shashwath Bharadwaj, Joshua Rapp et al.

ICCV 2025arXiv:2507.09386

#2210

Mitigating Geometric Degradation in Fast DownSampling via FastAdapter for Point Cloud Segmentation

Shuofeng Sun, Haibin Yan

ICCV 2025

#2211

SEHDR: Single-Exposure HDR Novel View Synthesis via 3D Gaussian Bracketing

Yiyu Li, Haoyuan Wang, Ke Xu et al.

ICCV 2025arXiv:2509.20400

#2212

TARS: Traffic-Aware Radar Scene Flow Estimation

Jialong Wu, Marco Braun, Dominic Spata et al.

ICCV 2025arXiv:2503.10210

#2213

DoppDrive: Doppler-Driven Temporal Aggregation for Improved Radar Object Detection

Yuval Haitman, Oded Bialer

ICCV 2025arXiv:2508.12330

#2214

Leaps and Bounds: An Improved Point Cloud Winding Number Formulation for Fast Normal Estimation and Surface Reconstruction

Chamin Hewa Koneputugodage, Dylan Campbell, Stephen Gould

ICCV 2025

#2215

Harnessing Text-to-Image Diffusion Models for Point Cloud Self-Supervised Learning

Yiyang Chen, Shanshan Zhao, Lunhao Duan et al.

ICCV 2025arXiv:2507.09102

#2216

OD-RASE: Ontology-Driven Risk Assessment and Safety Enhancement for Autonomous Driving

Kota Shimomura, Masaki Nambata, Atsuya Ishikawa et al.

ICCV 2025

#2217

MDP-Omni: Parameter-free Multimodal Depth Prior-based Sampling for Omnidirectional Stereo Matching

Eunjin Son, HyungGi Jo, Wookyong Kwon et al.

ICCV 2025

#2218

EDM: Efficient Deep Feature Matching

Xi Li, Tong Rao, Cihui Pan

ICCV 2025highlightarXiv:2503.05122

#2219

UniGS: Modeling Unitary 3D Gaussians for Novel View Synthesis from Sparse-view Images

Jiamin WU, Kenkun Liu, Xiaoke Jiang et al.

ICCV 2025arXiv:2410.13195

#2220

TOTP: Transferable Online Pedestrian Trajectory Prediction with Temporal-Adaptive Mamba Latent Diffusion

Ziyang Ren, Ping Wei, Shangqi Deng et al.

ICCV 2025

#2221

UnMix-NeRF: Spectral Unmixing Meets Neural Radiance Fields

Fabian Perez, Sara Rojas Martinez, Carlos Hinojosa et al.

ICCV 2025arXiv:2506.21884

#2222

MaterialMVP: Illumination-Invariant Material Generation via Multi-view PBR Diffusion

Zebin He, Mx Yang, Shuhui Yang et al.

ICCV 2025highlightarXiv:2503.10289

#2223

Visual Surface Wave Elastography: Revealing Subsurface Physical Properties via Visible Surface Waves

Alexander Ogren, Berthy Feng, Jihoon Ahn et al.

ICCV 2025arXiv:2507.09207

#2224

LightCity: An Urban Dataset for Outdoor Inverse Rendering and Reconstruction under Multi-illumination Conditions

Jingjing Wang, Qirui Hu, Chong Bao et al.

ICCV 2025arXiv:2602.01118

#2225

Feature Extraction and Representation of Pre-training Point Cloud Based on Diffusion Models

Chang Qiu, Feipeng Da, Zilei Zhang

ICCV 2025

#2226

Occupancy Learning with Spatiotemporal Memory

Ziyang Leng, Jiawei Yang, Wenlong Yi et al.

ICCV 2025arXiv:2508.04705

#2227

LANGTRAJ: Diffusion Model and Dataset for Language-Conditioned Trajectory Simulation

WEI-JER Chang, Masayoshi Tomizuka, Wei Zhan et al.

ICCV 2025arXiv:2504.11521

#2228

Temporal Overlapping Prediction: A Self-supervised Pre-training Method for LiDAR Moving Object Segmentation

Ziliang Miao, Runjian Chen, Yixi Cai et al.

ICCV 2025arXiv:2503.07167

#2229

S²M²: Scalable Stereo Matching Model for Reliable Depth Estimation

JUNHONG MIN, YOUNGPIL JEON, Jimin Kim et al.

ICCV 2025

#2230

ACE-G: Improving Generalization of Scene Coordinate Regression Through Query Pre-Training

Leonard Bruns, Axel Barroso-Laguna, Tommaso Cavallari et al.

ICCV 2025arXiv:2510.11605

#2231

Towards Visual Localization Interoperability: Cross-Feature for Collaborative Visual Localization and Mapping

Alberto Jaenal, Paula Carbó Cubero, Jose Araujo et al.

ICCV 2025

#2232

MiDSummer: Multi-Guidance Diffusion for Controllable Zero-Shot Immersive Gaussian Splatting Scene Generation

Anjun Hu, Richard Tomsett, Valentin Gourmet et al.

ICCV 2025

#2233

Spatio-Spectral Pattern Illumination for Direct and Indirect Separation from a Single Hyperspectral Image

Shin Ishihara, Imari Sato

ICCV 2025highlight

#2234

GeoFormer: Geometry Point Encoder for 3D Object Detection with Graph-based Transformer

Xin Jin, Haisheng Su, Cong Ma et al.

ICCV 2025

#2235

AlignDiff: Learning Physically-Grounded Camera Alignment via Diffusion

Liuyue Xie, Jiancong Guo, Ozan Cakmakci et al.

ICCV 2025arXiv:2503.21581

#2236

Tile-wise vs. Image-wise: Random-Tile Loss and Training Paradigm for Gaussian Splatting

Xiaoyu Zhang, Weihong Pan, Xiaojun Xiang et al.

ICCV 2025

#2237

Explaining Human Preferences via Metrics for Structured 3D Reconstruction

Jack Langerman, Denis Rozumny, Yuzhong Huang et al.

ICCV 2025highlightarXiv:2503.08208

#2238

RoCo-Sim: Enhancing Roadside Collaborative Perception through Foreground Simulation

Yuwen Du, Anning Hu, Zichen Chao et al.

ICCV 2025arXiv:2503.10410

#2239

Inverse 3D Microscopy Rendering for Cell Shape Inference with Active Mesh

Sacha Ichbiah, Anshuman Sinha, Fabrice Delbary et al.

ICCV 2025highlightarXiv:2303.10440

#2240

UniVerse: Unleashing the Scene Prior of Video Diffusion Models for Robust Radiance Field Reconstruction

Jin Cao, Hongrui Wu, Ziyong Feng et al.

ICCV 2025arXiv:2510.01669

#2241

ExploreGS: Explorable 3D Scene Reconstruction with Virtual Camera Samplings and Diffusion Priors

Minsu Kim, Subin Jeon, In Cho et al.

ICCV 2025arXiv:2508.06014

#2242

LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation

Zijie Wang, Weiming Zhang, Wei Zhang et al.

ICCV 2025arXiv:2511.06272

#2243

Bridging 3D Anomaly Localization and Repair via High-Quality Continuous Geometric Representation

Bozhong Zheng, Jinye Gan, Xiaohao Xu et al.

ICCV 2025arXiv:2505.24431

#2244

SGAD: Semantic and Geometric-aware Descriptor for Local Feature Matching

Xiangzeng Liu, CHI WANG, Guanglu Shi et al.

ICCV 2025highlightarXiv:2508.02278

#2245

Planar Affine Rectification from Local Change of Scale and Orientation

Yuval Nissan, Marc Pollefeys, Daniel Barath

ICCV 2025highlight

#2246

ERNet: Efficient Non-Rigid Registration Network for Point Sequences

Guangzhao He, Yuxi Xiao, Zhen Xu et al.

ICCV 2025arXiv:2510.15800

#2247

Doppler-Aware LiDAR-RADAR Fusion for Weather-Robust 3D Detection

Yujeong Chae, Heejun Park, Hyeonseong Kim et al.

ICCV 2025

#2248

Egocentric Action-aware Inertial Localization in Point Clouds with Vision-Language Guidance

Mingfang Zhang, Ryo Yonetani, Yifei Huang et al.

ICCV 2025arXiv:2505.14346

#2249

InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models

Yifan Lu, Xuanchi Ren, Jiawei Yang et al.

ICCV 2025arXiv:2412.03934

#2250

ArgMatch: Adaptive Refinement Gathering for Efficient Dense Matching

Yuxin Deng, Kaining Zhang, Linfeng Tang et al.

ICCV 2025

#2251

Thermal Polarimetric Multi-view Stereo

Takahiro Kushida, Kenichiro Tanaka

ICCV 2025highlightarXiv:2510.20972

#2252

GenFlow3D: Generative Scene Flow Estimation and Prediction on Point Cloud Sequences

Hanlin Li, Wenming Weng, Yueyi Zhang et al.

ICCV 2025

#2253

Generative Gaussian Splatting: Generating 3D Scenes with Video Diffusion Priors

Katja Schwarz, Norman Müller, Peter Kontschieder

ICCV 2025arXiv:2503.13272

#2254

Curve-Aware Gaussian Splatting for 3D Parametric Curve Reconstruction

Zhirui Gao, Renjiao Yi, YaQiao Dai et al.

ICCV 2025arXiv:2506.21401

#2255

Tree Skeletonization from 3D Point Clouds by Denoising Diffusion

Elias Marks, Lucas Nunes, Federico Magistri et al.

ICCV 2025

#2256

Splat-LOAM: Gaussian Splatting LiDAR Odometry and Mapping

Emanuele Giacomini, Luca Di Giammarino, Lorenzo De Rebotti et al.

ICCV 2025arXiv:2503.17491

#2257

AAA-Gaussians: Anti-Aliased and Artifact-Free 3D Gaussian Rendering

Michael Steiner, Thomas Köhler, Lukas Radl et al.

ICCV 2025highlightarXiv:2504.12811

#2258

SAFT: Shape and Appearance of Fabrics from Template via Differentiable Physical Simulations from Monocular Video

David Stotko, Reinhard Klein

ICCV 2025highlightarXiv:2509.08828

#2259

BridgeDepth: Bridging Monocular and Stereo Reasoning with Latent Alignment

Tongfan Guan, Jiaxin Guo, Chen Wang et al.

ICCV 2025highlightarXiv:2508.04611

#2260

Neural Inverse Rendering for High-Accuracy 3D Measurement of Moving Objects with Fewer Phase-Shifting Patterns

Yuki Urakawa, Yoshihiro Watanabe

ICCV 2025

#2261

Decoupled Diffusion Sparks Adaptive Scene Generation

Yunsong Zhou, Naisheng Ye, William Ljungbergh et al.

ICCV 2025arXiv:2504.10485

#2262

Recover Biological Structure from Sparse-View Diffraction Images with Neural Volumetric Prior

Renzhi He, Haowen Zhou, Yubei Chen et al.

ICCV 2025arXiv:2510.16391

#2263

HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation

Xin Zhou, DINGKANG LIANG, Sifan Tu et al.

ICCV 2025arXiv:2501.14729

#2264

Instant GaussianImage: A Generalizable and Self-Adaptive Image Representation via 2D Gaussian Splatting

Zhaojie Zeng, Yuesong Wang, Chao Yang et al.

ICCV 2025arXiv:2506.23479

#2265

When Anchors Meet Cold Diffusion: A Multi-Stage Approach to Lane Detection

Bo-Lun Huang, Tzu-Hsiang Ni, Feng-Kai Huang et al.

ICCV 2025

#2266

Sat2City: 3D City Generation from A Single Satellite Image with Cascaded Latent Diffusion

Tongyan Hua, Lutao Jiang, Ying-Cong Chen et al.

ICCV 2025arXiv:2507.04403

#2267

NeuFrameQ: Neural Frame Fields for Scalable and Generalizable Anisotropic Quadrangulation

Ying-Tian Liu, Jiajun Li, Yu-Tao Liu et al.

ICCV 2025highlight

#2268

Controllable 3D Outdoor Scene Generation via Scene Graphs

Yuheng Liu, Xinke Li, Yuning Zhang et al.

ICCV 2025arXiv:2503.07152

#2269

PolGS: Polarimetric Gaussian Splatting for Fast Reflective Surface Reconstruction

Yufei Han, Bowen Tie, Heng Guo et al.

ICCV 2025arXiv:2509.19726

#2270

Driving View Synthesis on Free-form Trajectories with Generative Prior

Zeyu Yang, Zijie Pan, Yuankun Yang et al.

ICCV 2025arXiv:2412.01717

#2271

NeuraLeaf: Neural Parametric Leaf Models with Shape and Deformation Disentanglement

Yang Yang, Dongni Mao, Hiroaki Santo et al.

ICCV 2025highlightarXiv:2507.12714

#2272

CVFusion: Cross-View Fusion of 4D Radar and Camera for 3D Object Detection

Hanzhi Zhong, Zhiyu Xiang, Ruoyu Xu et al.

ICCV 2025arXiv:2507.04587

#2273

Stochastic Gradient Estimation for Higher-Order Differentiable Rendering

Zican Wang, Michael Fischer, Tobias Ritschel

ICCV 2025highlightarXiv:2412.03489

#2274

Uncertainty-Aware Diffusion-Guided Refinement of 3D Scenes

Sarosij Bose, Arindam Dutta, Sayak Nag et al.

ICCV 2025arXiv:2503.15742

#2275

MAESTRO: Task-Relevant Optimization via Adaptive Feature Enhancement and Suppression for Multi-task 3D Perception

ChangWon Kang, Jisong Kim, Hongjae Shin et al.

ICCV 2025arXiv:2509.17462

#2276

Joint Semantic and Rendering Enhancements in 3D Gaussian Modeling with Anisotropic Local Encoding

Jingming He, Chongyi Li, Shiqi Wang et al.

ICCV 2025arXiv:2601.02339

#2277

V2XScenes: A Multiple Challenging Traffic Conditions Dataset for Large-Range Vehicle-Infrastructure Collaborative Perception

Bowen Wang, Yafei Wang, Wei Gong et al.

ICCV 2025

#2278

HouseCrafter: Lifting Floorplans to 3D Scenes with 2D Diffusion Models

YIWEN CHEN, Hieu Nguyen, Vikram Voleti et al.

ICCV 2025highlightarXiv:2406.20077

#2279

Leveraging BEV Paradigm for Ground-to-Aerial Image Synthesis

Junyan Ye, Jun He, Weijia Li et al.

ICCV 2025arXiv:2408.01812

#2280

EMD: Explicit Motion Modeling for High-Quality Street Gaussian Splatting

Xiaobao Wei, Qingpo Wuwu, Zhongyu Zhao et al.

ICCV 2025arXiv:2411.15582

#2281

Interaction-Merged Motion Planning: Effectively Leveraging Diverse Motion Datasets for Robust Planning

Giwon Lee, Wooseong Jeong, Daehee Park et al.

ICCV 2025highlightarXiv:2507.04790

#2282

Communication-Efficient Multi-Vehicle Collaborative Semantic Segmentation via Sparse 3D Gaussian Sharing

Tianyu Hong, Xiaobo Zhou, Wenkai Hu et al.

ICCV 2025

#2283

DATA: Domain-And-Time Alignment for High-Quality Feature Fusion in Collaborative Perception

Chengchang Tian, Jianwei Ma, Yan Huang et al.

ICCV 2025arXiv:2507.18237

#2284

Hi-Gaussian: Hierarchical Gaussians under Normalized Spherical Projection for Single-View 3D Reconstruction

Binjian Xie, Pengju Zhang, Hao Wei et al.

ICCV 2025

#2285

Heatmap Regression without Soft-Argmax for Facial Landmark Detection

Chiao-An Yang, Raymond A. Yeh

ICCV 2025arXiv:2508.14929

#2286

Mixed Signals: A Diverse Point Cloud Dataset for Heterogeneous LiDAR V2X Collaboration

Katie Luo, Minh-Quan Dao, Zhenzhen Liu et al.

ICCV 2025arXiv:2502.14156

#2287

Exploiting Vision Language Model for Training-Free 3D Point Cloud OOD Detection via Graph Score Propagation

Tiankai Chen, Yushu Li, Adam Goodge et al.

ICCV 2025arXiv:2506.22375

#2288

Unraveling the Effects of Synthetic Data on End-to-End Autonomous Driving

Junhao Ge, Zuhong Liu, Longteng Fan et al.

ICCV 2025arXiv:2503.18108

#2289

Puzzle Similarity: A Perceptually-guided Cross-Reference Metric for Artifact Detection in 3D Scene Reconstructions

Nicolai Hermann, Jorge Condor, Piotr Didyk

ICCV 2025arXiv:2411.17489

#2290

Authentic 4D Driving Simulation with a Video Generation Model

Lening Wang, Wenzhao Zheng, Dalong Du et al.

ICCV 2025

#2291

Lidar Waveforms are Worth 40x128x33 Words

Dominik Scheuble, Hanno Holzhüter, Steven Peters et al.

ICCV 2025highlight

#2292

Spherical Epipolar Rectification for Deep Two-View Absolute Depth Estimation

Pierre-André Brousseau, Sébastien Roy

ICCV 2025

#2293

Wide2Long: Learning Lens Compression and Perspective Adjustment for Wide-Angle to Telephoto Translation

Soumyadipta Banerjee, Jiaul Paik, Debashis Sen

ICCV 2025

#2294

Leveraging 2D Priors and SDF Guidance for Urban Scene Rendering

Siddharth Tourani, Jayaram Reddy, Akash Kumbar et al.

ICCV 2025

#2295

SparseLaneSTP: Leveraging Spatio-Temporal Priors with Sparse Transformers for 3D Lane Detection

Maximilian Pittner, Joel Janai, Mario Faigle et al.

ICCV 2025arXiv:2601.04968

#2296

Relative Illumination Fields: Learning Medium and Light Independent Underwater Scenes

Mengkun She, Felix Seegräber, David Nakath et al.

ICCV 2025arXiv:2504.10024

#2297

Super Resolved Imaging with Adaptive Optics

Robin Swanson, Esther Y. H. Lin, Masen Lamb et al.

ICCV 2025highlightarXiv:2508.04648

#2298

HVPUNet: Hybrid-Voxel Point-cloud Upsampling Network

Juhyung Ha, Vibhas Vats, Alimoor Reza et al.

ICCV 2025

#2299

Stealthy Backdoor Attack in Federated Learning via Adaptive Layer-wise Gradient Alignment

Qingqian Yang, Peishen Yan, Xiaoyu Wu et al.

ICCV 2025

#2300

Knowledge Distillation for Learned Image Compression

Yunuo Chen, Zezheng Lyu, Bing He et al.

ICCV 2025

#2301

RS-vHeat: Heat Conduction Guided Efficient Remote Sensing Foundation Model

Huiyang Hu, Peijin Wang, Hanbo Bi et al.

ICCV 2025arXiv:2411.17984

#2302

Teeth Reconstruction and Performance Capture Using a Phone Camera

Weixi Zheng, Jingwang Ling, Zhibo Wang et al.

ICCV 2025

#2303

ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Jianhong Bai, Menghan Xia, Xiao Fu et al.

ICCV 2025arXiv:2503.11647

#2304

SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling

Xianglong He, Zi-Xin Zou, Chia Hao Chen et al.

ICCV 2025arXiv:2503.21732

#2305

Diving into the Fusion of Monocular Priors for Generalized Stereo Matching

Chengtang Yao, Lidong Yu, Zhidan Liu et al.

ICCV 2025arXiv:2505.14414

#2306

ROAR: Reducing Inversion Error in Generative Image Watermarking

Hanyi Wang, Han Fang, Shi-Lin Wang et al.

ICCV 2025

#2307

Diffusion Transformer meets Multi-level Wavelet Spectrum for Single Image Super-Resolution

Peng Du, Hui Li, Han Xu et al.

ICCV 2025arXiv:2511.01175

#2308

Automated Model Evaluation for Object Detection via Prediction Consistency and Reliability

Seungju Yoo, Hyuk Kwon, Joong-Won Hwang et al.

ICCV 2025arXiv:2508.12082

#2309

LOTS of Fashion! Multi-Conditioning for Image Generation via Sketch-Text Pairing

Federico Girella, Davide Talon, Ziyue Liu et al.

ICCV 2025arXiv:2507.22627

#2310

FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models

Vladimir Kulikov, Matan Kleiner, Inbar Huberman-Spiegelglas et al.

ICCV 2025arXiv:2412.08629

#2311

Spatially-Varying Autofocus

Yingsi Qin, Aswin Sankaranarayanan, Matthew O'Toole

ICCV 2025

#2312

Event-based Visual Vibrometry

Xinyu Zhou, Peiqi Duan, Yeliduosi Xiaokaiti et al.

ICCV 2025

#2313

M2SFormer: Multi-Spectral and Multi-Scale Attention with Edge-Aware Difficulty Guidance for Image Forgery Localization

Ju-Hyeon Nam, Dong-Hyun Moon, Sang-Chul Lee

ICCV 2025highlightarXiv:2506.20922

#2314

Articulate3D: Holistic Understanding of 3D Scenes as Universal Scene Description

Anna-Maria Halacheva, Yang Miao, Jan-Nico Zaech et al.

ICCV 2025arXiv:2412.01398

#2315

ObjectRelator: Enabling Cross-View Object Relation Understanding Across Ego-Centric and Exo-Centric Perspectives

Yuqian Fu, Runze Wang, Bin Ren et al.

ICCV 2025highlightarXiv:2411.19083

#2316

SU-RGS: Relightable 3D Gaussian Splatting from Sparse Views under Unconstrained Illuminations

Qi Zhang, Chi Huang, Qian Zhang et al.

ICCV 2025

#2317

Sibai: A Few-Shot Meta-Classifier for Poisoning Detection in Federated Learning

Melanie Götz, Torsten Krauß, Alexandra Dmitrienko

ICCV 2025

#2318

Gradient Extrapolation for Debiased Representation Learning

Ihab Asaad, Maha Shadaydeh, Joachim Denzler

ICCV 2025arXiv:2503.13236

#2319

World4Drive: End-to-End Autonomous Driving via Intention-aware Physical Latent World Model

Yupeng Zheng, Pengxuan Yang, Zebin Xing et al.

ICCV 2025arXiv:2507.00603

#2320

Scaling Transformer-Based Novel View Synthesis with Models Token Disentanglement and Synthetic Data

Nithin Gopalakrishnan Nair, Srinivas Kaza, Xuan Luo et al.

ICCV 2025

#2321

Customizing Domain Adapters for Domain Generalization

Yuyang Ji, Zeyi Huang, Haohan Wang et al.

ICCV 2025

#2322

DriveX: Omni Scene Modeling for Learning Generalizable World Knowledge in Autonomous Driving

Chen Shi, Shaoshuai Shi, Kehua Sheng et al.

ICCV 2025arXiv:2505.19239

#2323

MamV2XCalib: V2X-based Target-less Infrastructure Camera Calibration with State Space Model

Yaoye Zhu, Zhe Wang, Yan Wang

ICCV 2025arXiv:2507.23595

#2324

Soft Separation and Distillation: Toward Global Uniformity in Federated Unsupervised Learning

Hung-Chieh Fang, Hsuan-Tien Lin, Irwin King et al.

ICCV 2025arXiv:2508.01251

#2325

Image as an IMU: Estimating Camera Motion from a Single Motion-Blurred Image

Jerred Chen, Ronald Clark

ICCV 2025arXiv:2503.17358

#2326

PARTE: Part-Guided Texturing for 3D Human Reconstruction from a Single Image

Hyeongjin Nam, Donghwan Kim, Gyeongsik Moon et al.

ICCV 2025arXiv:2507.17332

#2327

Boosting MLLM Reasoning with Text-Debiased Hint-GRPO

Qihan Huang, Weilong Dai, Jinlong Liu et al.

ICCV 2025arXiv:2503.23905

#2328

Adaptive Dual Uncertainty Optimization: Boosting Monocular 3D Object Detection under Test-Time Shifts

Zixuan Hu, Dongxiao Li, Xinzhu Ma et al.

ICCV 2025highlightarXiv:2508.20488

#2329

AirCache: Activating Inter-modal Relevancy KV Cache Compression for Efficient Large Vision-Language Model Inference

Kai Huang, hao zou, Bochen Wang et al.

ICCV 2025arXiv:2503.23956

#2330

FlowStyler: Artistic Video Stylization via Transformation Fields Transports

YuNing Gong, Jiaming Chen, Xiaohua Ren et al.

ICCV 2025

#2331

ShadowHack: Hacking Shadows via Luminance-Color Divide and Conquer

Jin Hu, Mingjia Li, Xiaojie Guo

ICCV 2025arXiv:2412.02545

#2332

Beyond Losses Reweighting: Empowering Multi-Task Learning via the Generalization Perspective

Hoang Phan, Tung Lam Tran, Quyen Tran et al.

ICCV 2025highlightarXiv:2211.13723

#2333

FastJSMA: Accelerating Jacobian-based Saliency Map Attacks through Gradient Decoupling

Zhenghao Gao, Shengjie Xu, Zijing Li et al.

ICCV 2025

#2334

Toward Fair and Accurate Cross-Domain Medical Image Segmentation: A VLM-Driven Active Domain Adaptation Paradigm

Hongqiu Wang, Wu Chen, Xiangde Luo et al.

ICCV 2025

#2335

Decouple to Reconstruct: High Quality UHD Restoration via Active Feature Disentanglement and Reversible Fusion

Yidi Liu, Dong Li, Yuxin Ma et al.

ICCV 2025arXiv:2503.12764

#2336

Federated Continuous Category Discovery and Learning

Lixu Wang, Chenxi Liu, Junfeng Guo et al.

ICCV 2025

#2337

BlueNeg: A 35mm Negative Film Dataset for Restoring Channel-Heterogeneous Deterioration

Hanyuan Liu, Chengze Li, Minshan Xie et al.

ICCV 2025

#2338

Rethinking Key-frame-based Micro-expression Recognition: A Robust and Accurate Framework Against Key-frame Errors

Zheyuan Zhang, Weihao Tang, Hong Chen

ICCV 2025highlightarXiv:2508.06640

#2339

Pretend Benign: A Stealthy Adversarial Attack by Exploiting Vulnerabilities in Cooperative Perception

Hongwei Lin, Dongyu Pan, Qiming Xia et al.

ICCV 2025

#2340

What we need is explicit controllability: Training 3D gaze estimator using only facial images

Tingwei Li, Jun Bao, Zhenzhong Kuang et al.

ICCV 2025

#2341

SemiVisBooster: Boosting Semi-Supervised Learning for Fine-Grained Classification through Pseudo-Label Semantic Guidance

Wenjin Zhang, Xinyu Li, Chenyang Gao et al.

ICCV 2025

#2342

Enhancing Prompt Generation with Adaptive Refinement for Camouflaged Object Detection

Xuehan Chen, Guangyu Ren, Tianhong Dai et al.

ICCV 2025

#2343

Hypergraph Clustering Network with Partial Attribute Imputation

Qianqian Wang, Bowen Zhao, Zhengming Ding et al.

ICCV 2025

#2344

SAMPLE: Semantic Alignment through Temporal-Adaptive Multimodal Prompt Learning for Event-Based Open-Vocabulary Action Recognition

Jing Wang, Rui Zhao, Ruiqin Xiong et al.

ICCV 2025

#2345

Learning Null Geodesics for Gravitational Lensing Rendering in General Relativity

Mingyuan Sun, Zheng Fang, Jiaxu Wang et al.

ICCV 2025arXiv:2507.15775

#2346

Object-centric Video Question Answering with Visual Grounding and Referring

Haochen Wang, Qirui Chen, Cilin Yan et al.

ICCV 2025arXiv:2507.19599

#2347

DSO: Aligning 3D Generators with Simulation Feedback for Physical Soundness

Ruining Li, Chuanxia Zheng, Christian Rupprecht et al.

ICCV 2025highlightarXiv:2503.22677

#2348

EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds

Lu Chen, Yizhou Wang, SHIXIANG TANG et al.

ICCV 2025arXiv:2502.05857

#2349

LIRA: Reasoning Reconstruction via Multimodal Large Language Models

Zhen Zhou, Tong Wang, Yunkai Ma et al.

ICCV 2025

#2350

Learning an Implicit Physics Model for Image-based Fluid Simulation

Emily Jia, Jiageng Mao, Zhiyuan Gao et al.

ICCV 2025arXiv:2508.08254

#2351

Exploiting Frequency Dynamics for Enhanced Multimodal Event-based Action Recognition

Meiqi Cao, Xiangbo Shu, Xin Jiang et al.

ICCV 2025

#2352

How Far are AI-generated Videos from Simulating the 3D Visual World: A Learned 3D Evaluation Approach

Chirui CHANG, Jiahui Liu, Zhengzhe Liu et al.

ICCV 2025arXiv:2406.19568

#2353

SIC: Similarity-Based Interpretable Image Classification with Neural Networks

Tom Nuno Wolf, Emre Kavak, Fabian Bongratz et al.

ICCV 2025arXiv:2501.17328

#2354

WIPES: Wavelet-based Visual Primitives

Wenhao Zhang, Hao Zhu, Delong Wu et al.

ICCV 2025arXiv:2508.12615

#2355

MambaML: Exploring State Space Models for Multi-Label Image Classification

Xuelin Zhu, Jian liu, Jiuxin Cao et al.

ICCV 2025

#2356

CoSMIC: Continual Self-supervised Learning for Multi-Domain Medical Imaging via Conditional Mutual Information Maximization

Yihang Liu, Ying Wen, Longzhen Yang et al.

ICCV 2025

#2357

SEAL: Semantic Aware Image Watermarking

Kasra Arabi, R. Teal Witter, Chinmay Hegde et al.

ICCV 2025arXiv:2503.12172

#2358

ArchiSet: Benchmarking Editable and Consistent Single-View 3D Reconstruction of Buildings with Specific Window-to-Wall Ratios

Jun Yin, Pengyu Zeng, Licheng Shen et al.

ICCV 2025

#2359

Unsupervised Identification of Protein Compositions and Conformations via Implicit Content-Transformation Disentanglement

Mostofa Rafid Uddin, Jana Armouti, Min Xu

ICCV 2025

#2360

Splat-based 3D Scene Reconstruction with Extreme Motion-blur

Hyeonjoong Jang, Dongyoung Choi, Donggun Kim et al.

ICCV 2025

#2361

Diffusion Curriculum: Synthetic-to-Real Data Curriculum via Image-Guided Diffusion

Yijun Liang, Shweta Bhardwaj, Tianyi Zhou

ICCV 2025arXiv:2410.13674

#2362

Advancing Textual Prompt Learning with Anchored Attributes

Zheng Li, Yibing Song, Ming-Ming Cheng et al.

ICCV 2025arXiv:2412.09442

#2363

AR-1-to-3: Single Image to Consistent 3D Object via Next-View Prediction

Xuying Zhang, Yupeng Zhou, Kai Wang et al.

ICCV 2025

#2364

Dual-Rate Dynamic Teacher for Source-Free Domain Adaptive Object Detection

Qi He, Xiao Wu, Jun-Yan He et al.

ICCV 2025

#2365

OV3D-CG: Open-vocabulary 3D Instance Segmentation with Contextual Guidance

Mingquan Zhou, Chen He, Ruiping Wang et al.

ICCV 2025

#2366

Rethinking Discrete Tokens: Treating Them as Conditions for Continuous Autoregressive Image Synthesis

Peng Zheng, Junke Wang, Yi Chang et al.

ICCV 2025arXiv:2507.01756

#2367

CogCM: Cognition-Inspired Contextual Modeling for Audio-Visual Speech Enhancement

Feixiang Wang, Shuang Yang, Shiguang Shan et al.

ICCV 2025

#2368

Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

Zhisheng Zhong, Chengyao Wang, Yuqi Liu et al.

ICCV 2025arXiv:2412.09501

#2369

EDFFDNet: Towards Accurate and Efficient Unsupervised Multi-Grid Image Registration

Haokai Zhu, Bo Qu, Si-Yuan Cao et al.

ICCV 2025arXiv:2509.07662

#2370

Enhancing Mamba Decoder with Bidirectional Interaction in Multi-Task Dense Prediction

Mang Cao, Sanping Zhou, Yizhe Li et al.

ICCV 2025arXiv:2508.20376

#2371

Leveraging Debiased Cross-modal Attention Maps and Code-based Reasoning for Zero-shot Referring Expression Comprehension

Juntao Chen, Wen Shen, Zhihua Wei et al.

ICCV 2025

#2372

UST-SSM: Unified Spatio-Temporal State Space Models for Point Cloud Video Modeling

Peiming Li, Ziyi Wang, Yulin Yuan et al.

ICCV 2025arXiv:2508.14604

#2373

SITE: towards Spatial Intelligence Thorough Evaluation

Wenqi Wang, Reuben Tan, Pengyue Zhu et al.

ICCV 2025arXiv:2505.05456

#2374

SHIFT: Smoothing Hallucinations by Information Flow Tuning for Multimodal Large Language Models

Sudong Wang, Yunjian Zhang, Yao Zhu et al.

ICCV 2025

#2375

Automated Red Teaming for Text-to-Image Models through Feedback-Guided Prompt Iteration with Vision-Language Models

Wei Xu, Kangjie Chen, Jiawei Qiu et al.

ICCV 2025

#2376

Enhancing Spatial Reasoning in Multimodal Large Language Models through Reasoning-based Segmentation

Zhenhua Ning, Zhuotao Tian, Shaoshuai Shi et al.

ICCV 2025arXiv:2506.23120

#2377

OVG-HQ: Online Video Grounding with Hybrid-modal Queries

Runhao Zeng, Jiaqi Mao, Minghao Lai et al.

ICCV 2025arXiv:2508.11903

#2378

BézierGS: Dynamic Urban Scene Reconstruction with Bézier Curve Gaussian Splatting

Zipei Ma, Junzhe Jiang, Yurui Chen et al.

ICCV 2025arXiv:2506.22099

#2379

CLIPSym: Delving into Symmetry Detection with CLIP

Tinghan Yang, Md Ashiqur Rahman, Raymond A. Yeh

ICCV 2025arXiv:2508.14197

#2380

Text2VDM: Text to Vector Displacement Maps for Expressive and Interactive 3D Sculpting

Hengyu Meng, Duotun Wang, Zhijing Shao et al.

ICCV 2025arXiv:2502.20045

#2381

Towards Comprehensive Lecture Slides Understanding: Large-scale Dataset and Effective Method

Enming Zhang, Yuzhe Li, Yuliang Liu et al.

ICCV 2025

#2382

A Unified Interpretation of Training-Time Out-of-Distribution Detection

Xu Cheng, Xin Jiang, Zechao Li

ICCV 2025highlight

#2383

Federated Domain Generalization with Domain-specific Soft Prompts Generation

Jianhan Wu, Xiaoyang Qu, Zhangcheng Huang et al.

ICCV 2025arXiv:2509.20807

#2384

Removing Out-of-Focus Reflective Flares via Color Alignment

Fengbo Lan, Chang Wen Chen

ICCV 2025

#2385

ForgeLens: Data-Efficient Forgery Focus for Generalizable Forgery Image Detection

Yingjian Chen, Lei Zhang, Yakun Niu

ICCV 2025arXiv:2408.13697

#2386

Mamba-3VL: Taming State Space Model for 3D Vision Language Learning

Yuan Wang, Yuxin Chen, Zhongang Qi et al.

ICCV 2025

#2387

Embodied Representation Alignment with Mirror Neurons

Wentao Zhu, Zhining Zhang, Yuwei Ren et al.

ICCV 2025arXiv:2509.21136

#2388

Selective Contrastive Learning for Weakly Supervised Affordance Grounding

WonJun Moon, Hyun Seok Seong, Jae-Pil Heo

ICCV 2025arXiv:2508.07877

#2389

M2EIT: Multi-Domain Mixture of Experts for Robust Neural Inertial Tracking

Yan Li, Yang Xu, Changhao Chen et al.

ICCV 2025

#2390

MobileViCLIP: An Efficient Video-Text Model for Mobile Devices

Min Yang, Zihan Jia, Zhilin Dai et al.

ICCV 2025arXiv:2508.07312

#2391

EVOLVE: Event-Guided Deformable Feature Transfer and Dual-Memory Refinement for Low-Light Video Object Segmentation

Jong Hyeon Baek, Jiwon oh, Yeong Jun Koh

ICCV 2025

#2392

MATE: Motion-Augmented Temporal Consistency for Event-based Point Tracking

Han Han, Wei Zhai, Yang Cao et al.

ICCV 2025arXiv:2412.01300

#2393

Asynchronous Event Error-Minimizing Noise for Safeguarding Event Dataset

Ruofei WANG, Peiqi Duan, Boxin Shi et al.

ICCV 2025highlightarXiv:2507.05728

#2394

AG2aussian: Anchor-Graph Structured Gaussian Splatting for Instance-Level 3D Scene Understanding and Editing

Zhaonan Wang, Manyi Li, Changhe Tu

ICCV 2025

#2395

Vector Contrastive Learning For Pixel-Wise Pretraining In Medical Vision

Yuting He, Shuo Li

ICCV 2025arXiv:2506.20850

#2396

InterGSEdit: Interactive 3D Gaussian Splatting Editing with 3D Geometry-Consistent Attention Prior

Minghao Wen, Shengjie Wu, Kangkan Wang et al.

ICCV 2025arXiv:2507.04961

#2397

Benchmarking Multimodal Large Language Models Against Image Corruptions

Xinkuan Qiu, Meina Kan, Yongbin Zhou et al.

ICCV 2025

#2398

Efficient Fine-Tuning of Large Models via Nested Low-Rank Adaptation

Lujun Li, Cheng Lin, Dezhi Li et al.

ICCV 2025

#2399

Dual-level Prototype Learning for Composite Degraded Image Restoration

Zhongze Wang, Haitao Zhao, Lujian Yao et al.

ICCV 2025

#2400

Deterministic Object Pose Confidence Region Estimation

Jinghao Wang, Zhang Li, Zi Wang et al.

ICCV 2025arXiv:2506.22720

← Previous

1...10 11 12 13 14