Most Cited CVPR "blind iqa" Papers

5,589 papers found • Page 15 of 28

Filters:Most Cited CVPR blind iqa Clear all

Conference

AAAI 2025 (3,028)COLM 2025 (418)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NEURIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,558)oral (1,594)spotlight (1,421)highlight (975)

#2801

Insightful Instance Features for 3D Instance Segmentation

Wonseok Roh, Hwanhee Jung, Giljoo Nam et al.

CVPR 2025poster

citations

#2802

Hybrid Concept Bottleneck Models

Yang Liu, Tianwei Zhang, Shi Gu

CVPR 2025poster

citations

#2803

Track Any Anomalous Object:A Granular Video Anomaly Detection Pipeline

Yuzhi Huang, Chenxin Li, Haitao Zhang et al.

CVPR 2025posterarXiv:2506.05175

citations

#2804

HiFi-Portrait: Zero-shot Identity-preserved Portrait Generation with High-fidelity Multi-face Fusion

Yifang Xu, BenXiang Zhai, Yunzhuo Sun et al.

CVPR 2025posterarXiv:2512.14542

citations

#2805

GBC-Splat: Generalizable Gaussian-Based Clothed Human Digitalization under Sparse RGB Cameras

Hanzhang Tu, Zhanfeng Liao, Boyao Zhou et al.

CVPR 2025poster

citations

#2806

Improving Visual and Downstream Performance of Low-Light Enhancer with Vision Foundation Models Collaboration

yuxuan Gu, Huaian Chen, Yi Jin et al.

CVPR 2025poster

citations

#2807

OpticalNet: An Optical Imaging Dataset and Benchmark Beyond the Diffraction Limit

Benquan Wang, Ruyi An, Jin-Kyu So et al.

CVPR 2025highlight

citations

#2808

ShapeShifter: 3D Variations Using Multiscale and Sparse Point-Voxel Diffusion

Nissim Maruani, Wang Yifan, Matthew Fisher et al.

CVPR 2025posterarXiv:2502.02187

citations

#2809

UWAV: Uncertainty-weighted Weakly-supervised Audio-Visual Video Parsing

Yung-Hsuan Lai, Janek Ebbers, Yu-Chiang Frank Wang et al.

CVPR 2025posterarXiv:2505.09615

citations

#2810

SVLTA: Benchmarking Vision-Language Temporal Alignment via Synthetic Video Situation

Hao Du, Bo Wu, Yan Lu et al.

CVPR 2025posterarXiv:2504.05925

citations

#2811

Non-Rigid Structure-from-Motion: Temporally-Smooth Procrustean Alignment and Spatially-Variant Deformation Modeling

Jiawei Shi, Hui Deng, Yuchao Dai

CVPR 2024posterarXiv:2405.04309

citations

#2812

MetaWriter: Personalized Handwritten Text Recognition Using Meta-Learned Prompt Tuning

Wenhao Gu, Li Gu, Ching Suen et al.

CVPR 2025posterarXiv:2505.20513

citations

#2813

Attraction Diminishing and Distributing for Few-Shot Class-Incremental Learning

Li-Jun Zhao, Zhen-Duo Chen, Yongxin Wang et al.

CVPR 2025poster

citations

#2814

DiSRT-In-Bed: Diffusion-Based Sim-to-Real Transfer Framework for In-Bed Human Mesh Recovery

Jing Gao, Ce Zheng, Laszlo Jeni et al.

CVPR 2025posterarXiv:2504.03006

citations

#2815

End-to-End HOI Reconstruction Transformer with Graph-based Encoding

Zhenrong Wang, Qi Zheng, Sihan Ma et al.

CVPR 2025highlightarXiv:2503.06012

citations

#2816

Balancing Two Classifiers via A Simplex ETF Structure for Model Calibration

Jiani Ni, He Zhao, Jintong Gao et al.

CVPR 2025posterarXiv:2504.10007

citations

#2817

Data Distributional Properties As Inductive Bias for Systematic Generalization

Felipe del Rio, Alain Raymond, Daniel Florea et al.

CVPR 2025posterarXiv:2502.20499

citations

#2818

CaMuViD: Calibration-Free Multi-View Detection

Amir Etefaghi Daryani, M. Usman Maqbool Bhutta, Byron Hernandez et al.

CVPR 2025poster

citations

#2819

TAROT: Towards Essentially Domain-Invariant Robustness with Theoretical Justification

Dongyoon Yang, Jihu Lee, Yongdai Kim

CVPR 2025posterarXiv:2505.06580

citations

#2820

TADFormer: Task-Adaptive Dynamic TransFormer for Efficient Multi-Task Learning

Seungmin Baek, Soyul Lee, Hayeon Jo et al.

CVPR 2025posterarXiv:2501.04293

citations

#2821

Instance-wise Supervision-level Optimization in Active Learning

Shinnosuke Matsuo, Riku Togashi, Ryoma Bise et al.

CVPR 2025posterarXiv:2503.06517

citations

#2822

VSNet: Focusing on the Linguistic Characteristics of Sign Language

Yuhao Li, Xinyue Chen, Hongkai Li et al.

CVPR 2025poster

citations

#2823

Dual Energy-Based Model with Open-World Uncertainty Estimation for Out-of-distribution Detection

Qi Chen, Hu Ding

CVPR 2025poster

citations

#2824

FFaceNeRF: Few-shot Face Editing in Neural Radiance Fields

Kwan Yun, Chaelin Kim, Hangyeul Shin et al.

CVPR 2025posterarXiv:2503.17095

citations

#2825

SocialMOIF: Multi-Order Intention Fusion for Pedestrian Trajectory Prediction

Kai Chen, Xiaodong Zhao, Yujie Huang et al.

CVPR 2025posterarXiv:2504.15616

citations

#2826

EASEMVC:Efficient Dual Selection Mechanism for Deep Multi-View Clustering

Baili Xiao, Zhibin Dong, KE LIANG et al.

CVPR 2025poster

citations

#2827

Targeted Forgetting of Image Subgroups in CLIP Models

Zeliang Zhang, Gaowen Liu, Charles Fleming et al.

CVPR 2025posterarXiv:2506.03117

citations

#2828

SemiDAViL: Semi-supervised Domain Adaptation with Vision-Language Guidance for Semantic Segmentation

Hritam Basak, Zhaozheng Yin

CVPR 2025posterarXiv:2504.06389

citations

#2829

Improving Editability in Image Generation with Layer-wise Memory

Daneul Kim, Jaeah Lee, Jaesik Park

CVPR 2025posterarXiv:2505.01079

citations

#2830

Semantic Line Combination Detector

JINWON KO, Dongkwon Jin, Chang-Su Kim

CVPR 2024posterarXiv:2404.18399

citations

#2831

DART: Disease-aware Image-Text Alignment and Self-correcting Re-alignment for Trustworthy Radiology Report Generation

Sang-Jun Park, Keun-Soo Heo, Dong-Hee Shin et al.

CVPR 2025posterarXiv:2504.11786

citations

#2832

Link-based Contrastive Learning for One-Shot Unsupervised Domain Adaptation

Yue Zhang, Mingyue Bin, Yuyang Zhang et al.

CVPR 2025poster

citations

#2833

PhysicsGen: Can Generative Models Learn from Images to Predict Complex Physical Relations?

Martin Spitznagel, Jan Vaillant, Janis Keuper

CVPR 2025posterarXiv:2503.05333

citations

#2834

CroCoDL: Cross-device Collaborative Dataset for Localization

Hermann Blum, Alessandro Mercurio, Joshua O'Reilly et al.

CVPR 2025poster

citations

#2835

Concept Lancet: Image Editing with Compositional Representation Transplant

Jinqi Luo, Tianjiao Ding, Kwan Ho Ryan Chan et al.

CVPR 2025posterarXiv:2504.02828

citations

#2836

Pose-Guided Temporal Enhancement for Robust Low-Resolution Hand Reconstruction

Kaixin Fan, Pengfei Ren, Jingyu Wang et al.

CVPR 2025poster

citations

#2837

EvOcc: Accurate Semantic Occupancy for Automated Driving Using Evidence Theory

Jonas Kälble, Sascha Wirges, Maxim Tatarchenko et al.

CVPR 2025poster

citations

#2838

Self-Supervised Learning for Color Spike Camera Reconstruction

Yanchen Dong, Ruiqin Xiong, Xiaopeng Fan et al.

CVPR 2025poster

citations

#2839

Boosting Point-Supervised Temporal Action Localization through Integrating Query Reformation and Optimal Transport

Mengnan Liu, Le Wang, Sanping Zhou et al.

CVPR 2025poster

citations

#2840

Classic Video Denoising in a Machine Learning World: Robust, Fast, and Controllable

Xin Jin, Simon Niklaus, Zhoutong Zhang et al.

CVPR 2025posterarXiv:2504.03136

citations

#2841

HELVIPAD: A Real-World Dataset for Omnidirectional Stereo Depth Estimation

Mehdi Zayene, Albias Havolli, Jannik Endres et al.

CVPR 2025highlightarXiv:2411.18335

citations

#2842

Dense Dispersed Structured Light for Hyperspectral 3D Imaging of Dynamic Scenes

Suhyun Shin, Seungwoo Yoon, Ryota Maeda et al.

CVPR 2025posterarXiv:2412.01140

citations

#2843

Self-Supervised Large Scale Point Cloud Completion for Archaeological Site Restoration

Aocheng Li, James R. Zimmer-Dauphinee, Rajesh Kalyanam et al.

CVPR 2025posterarXiv:2503.04030

citations

#2844

VRetouchEr: Learning Cross-frame Feature Interdependence with Imperfection Flow for Face Retouching in Videos

Wen Xue, Le Jiang, Lianxin Xie et al.

CVPR 2024poster

citations

#2845

Directional Label Diffusion Model for Learning from Noisy Labels

Senyu Hou, Gaoxia Jiang, Jia Zhang et al.

CVPR 2025poster

citations

#2846

SyncSDE: A Probabilistic Framework for Diffusion Synchronization

Hyunjun Lee, Hyunsoo Lee, Sookwan Han

CVPR 2025posterarXiv:2503.21555

citations

#2847

Attribute-Missing Multi-view Graph Clustering

Bowen Zhao, Qianqian Wang, Zhengming Ding et al.

CVPR 2025poster

citations

#2848

Homogeneous Dynamics Space for Heterogeneous Humans

Xinpeng Liu, Junxuan Liang, Chenshuo Zhang et al.

CVPR 2025posterarXiv:2412.06146

citations

#2849

Pseudo Visible Feature Fine-Grained Fusion for Thermal Object Detection

Ting Li, Mao Ye, Tianwen Wu et al.

CVPR 2025poster

citations

#2850

Vision-Guided Action: Enhancing 3D Human Motion Prediction with Gaze-informed Affordance in 3D Scenes

Ting Yu, Yi Lin, Jun Yu et al.

CVPR 2025poster

citations

#2851

Beyond Image Classification: A Video Benchmark and Dual-Branch Hybrid Discrimination Framework for Compositional Zero-Shot Learning

Dongyao Jiang, Haodong Jing, Yongqiang Ma et al.

CVPR 2025poster

citations

#2852

Dynamic Group Normalization: Spatio-Temporal Adaptation to Evolving Data Statistics

Yair Smadar, Assaf Hoogi

CVPR 2025poster

citations

#2853

D^3CTTA: Domain-Dependent Decorrelation for Continual Test-Time Adaption of 3D LiDAR Segmentation

Jichun Zhao, Haiyong Jiang, Haoxuan Song et al.

CVPR 2025poster

citations

#2854

ESC: Erasing Space Concept for Knowledge Deletion

Tae-Young Lee, Sundong Park, Minwoo Jeon et al.

CVPR 2025highlightarXiv:2504.02199

citations

#2855

Composing Parts for Expressive Object Generation

Harsh Rangwani, Aishwarya Agarwal, Kuldeep Kulkarni et al.

CVPR 2025posterarXiv:2406.10197

citations

#2856

Named Entity Driven Zero-Shot Image Manipulation

Zhida Feng, Li Chen, Jing Tian et al.

CVPR 2024poster

citations

#2857

Style Evolving along Chain-of-Thought for Unknown-Domain Object Detection

Zihao Zhang, Aming Wu, Yahong Han

CVPR 2025highlightarXiv:2503.09968

citations

#2858

Temporal Action Detection Model Compression by Progressive Block Drop

Xiaoyong Chen, Yong Guo, Jiaming Liang et al.

CVPR 2025posterarXiv:2503.16916

citations

#2859

HiLoTs: High-Low Temporal Sensitive Representation Learning for Semi-Supervised LiDAR Segmentation in Autonomous Driving

R.D. Lin, Pengcheng Weng, Yinqiao Wang et al.

CVPR 2025posterarXiv:2503.17752

citations

#2860

Mamba-Adaptor: State Space Model Adaptor for Visual Recognition

Fei Xie, Jiahao Nie, Yujin Tang et al.

CVPR 2025posterarXiv:2505.12685

citations

#2861

Neural 3D Strokes: Creating Stylized 3D Scenes with Vectorized 3D Strokes

Haobin Duan, Miao Wang, Yanxun Li et al.

CVPR 2024posterarXiv:2311.15637

citations

#2862

Multi-modal Contrastive Learning with Negative Sampling Calibration for Phenotypic Drug Discovery

Jiahua Rao, Hanjing Lin, Leyu Chen et al.

CVPR 2025poster

citations

#2863

Fitted Neural Lossless Image Compression

Zhe Zhang, Zhenzhong Chen, Shan Liu

CVPR 2025poster

citations

#2864

Data-free Universal Adversarial Perturbation with Pseudo-semantic Prior

Chanhui Lee, Yeonghwan Song, Jeany Son

CVPR 2025posterarXiv:2502.21048

citations

#2865

TIDE: Training Locally Interpretable Domain Generalization Models Enables Test-time Correction

Aishwarya Agarwal, Srikrishna Karanam, Vineet Gandhi

CVPR 2025highlightarXiv:2411.16788

citations

#2866

Enhancing Virtual Try-On with Synthetic Pairs and Error-Aware Noise Scheduling

Nannan Li, Kevin Shih, Bryan A. Plummer

CVPR 2025posterarXiv:2501.04666

citations

#2867

ReCon: Enhancing True Correspondence Discrimination through Relation Consistency for Robust Noisy Correspondence Learning

Quanxing Zha, Xin Liu, Shu-Juan Peng et al.

CVPR 2025posterarXiv:2502.19962

citations

#2868

Revisiting Fairness in Multitask Learning: A Performance-Driven Approach for Variance Reduction

Xiaohan Qin, Xiaoxing Wang, Junchi Yan

CVPR 2025poster

citations

#2869

MExD: An Expert-Infused Diffusion Model for Whole-Slide Image Classification

Jianwei Zhao, XIN LI, Fan Yang et al.

CVPR 2025posterarXiv:2503.12401

citations

#2870

Multi-modal Topology-embedded Graph Learning for Spatially Resolved Genes Prediction from Pathology Images with Prior Gene Similarity Information

Hang Shi, Chi Changxi, Peng Wan et al.

CVPR 2025poster

citations

#2871

DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer

Ho-Joong Kim, Yearang Lee, Jung-Ho Hong et al.

CVPR 2025posterarXiv:2505.05711

citations

#2872

Language-Assisted Debiasing and Smoothing for Foundation Model-Based Semi-Supervised Learning

Na Zheng, Xuemeng Song, Xue Dong et al.

CVPR 2025poster

citations

#2873

MEET: Towards Memory-Efficient Temporal Sparse Deep Neural Networks

Zeqi Zhu, Ibrahim Batuhan Akkaya, Luc Waeijen et al.

CVPR 2025poster

citations

#2874

SGCR: Spherical Gaussians for Efficient 3D Curve Reconstruction

Xinran Yang, Donghao Ji, Yuanqi Li et al.

CVPR 2025posterarXiv:2505.04668

citations

#2875

Real-time Acquisition and Reconstruction of Dynamic Volumes with Neural Structured Illumination

Yixin Zeng, Zoubin Bi, Yin Mingrui et al.

CVPR 2024poster

citations

#2876

3D Prior Is All You Need: Cross-Task Few-shot 2D Gaze Estimation

Yihua Cheng, Hengfei Wang, Zhongqun Zhang et al.

CVPR 2025posterarXiv:2502.04074

citations

#2877

Probabilistic Prompt Distribution Learning for Animal Pose Estimation

Jiyong Rao, Brian Nlong Zhao, Yu Wang

CVPR 2025posterarXiv:2503.16120

citations

#2878

Blind Bitstream-corrupted Video Recovery via Metadata-guided Diffusion Model

Shuyun Wang, Hu Zhang, Xin Shen et al.

CVPR 2025poster

citations

#2879

Argus: A Compact and Versatile Foundation Model for Vision

Weiming Zhuang, Chen Chen, Zhizhong Li et al.

CVPR 2025poster

citations

#2880

Sampling Innovation-Based Adaptive Compressive Sensing

Zhifu Tian, Tao Hu, Chaoyang Niu et al.

CVPR 2025posterarXiv:2503.13241

citations

#2881

Keep the Balance: A Parameter-Efficient Symmetrical Framework for RGB+X Semantic Segmentation

Jiaxin Cai, Jingze Su, Qi Li et al.

CVPR 2025poster

citations

#2882

Deep Change Monitoring: A Hyperbolic Representative Learning Framework and a Dataset for Long-term Fine-grained Tree Change Detection

Yante Li, Hanwen Qi, Haoyu Chen et al.

CVPR 2025highlightarXiv:2503.00643

citations

#2883

Separation of Powers: On Segregating Knowledge from Observation in LLM-enabled Knowledge-based Visual Question Answering

Zhen Yang, Zhuo Tao, Qi Chen et al.

CVPR 2025poster

citations

#2884

Zero-Shot Head Swapping in Real-World Scenarios

Sohyun Jeong, Taewoong Kang, Hyojin Jang et al.

CVPR 2025posterarXiv:2503.00861

citations

#2885

Anchor-Aware Similarity Cohesion in Target Frames Enables Predicting Temporal Moment Boundaries in 2D

Jiawei Tan, Hongxing Wang, Junwu Weng et al.

CVPR 2025poster

citations

#2886

PAVE: Patching and Adapting Video Large Language Models

Zhuoming Liu, Yiquan Li, Khoi D Nguyen et al.

CVPR 2025posterarXiv:2503.19794

citations

#2887

Minimal Interaction Seperated Tuning: A New Paradigm for Visual Adaptation

Ningyuan Tang, Minghao Fu, Jianxin Wu

CVPR 2025poster

citations

#2888

MAC-Ego3D: Multi-Agent Gaussian Consensus for Real-Time Collaborative Ego-Motion and Photorealistic 3D Reconstruction

Xiaohao Xu, Feng Xue, Shibo Zhao et al.

CVPR 2025posterarXiv:2412.09723

citations

#2889

NTClick: Achieving Precise Interactive Segmentation With Noise-tolerant Clicks

Chenyi Zhang, Ting Liu, Xiaochao Qu et al.

CVPR 2025highlight

citations

#2890

HeatFormer: A Neural Optimizer for Multiview Human Mesh Recovery

Yuto Matsubara, Ko Nishino

CVPR 2025posterarXiv:2412.04456

citations

#2891

SPARC: Score Prompting and Adaptive Fusion for Zero-Shot Multi-Label Recognition in Vision-Language Models

Kevin Miller, Aditya Gangrade, Samarth Mishra et al.

CVPR 2025posterarXiv:2502.16911

citations

#2892

SAMBLE: Shape-Specific Point Cloud Sampling for an Optimal Trade-Off Between Local Detail and Global Uniformity

Chengzhi Wu, Yuxin Wan, Hao Fu et al.

CVPR 2025posterarXiv:2504.19581

citations

#2893

HyperPose: Hypernetwork-Infused Camera Pose Localization and an Extended Cambridge Landmarks Dataset

Ron Ferens, Yosi Keller

CVPR 2025posterarXiv:2303.02610

citations

#2894

Foveated Instance Segmentation

Hongyi Zeng, Wenxuan Liu, Tianhua Xia et al.

CVPR 2025posterarXiv:2503.21854

citations

#2895

PHGC: Procedural Heterogeneous Graph Completion for Natural Language Task Verification in Egocentric Videos

Xun Jiang, Zhiyi Huang, Xing Xu et al.

CVPR 2025poster

citations

#2896

FSboard: Over 3 Million Characters of ASL Fingerspelling Collected via Smartphones

Manfred Georg, Garrett Tanzer, Esha Uboweja et al.

CVPR 2025posterarXiv:2407.15806

citations

#2897

Point-Cache: Test-time Dynamic and Hierarchical Cache for Robust and Generalizable Point Cloud Analysis

Hongyu Sun, Qiuhong Ke, Ming Cheng et al.

CVPR 2025posterarXiv:2503.12150

citations

#2898

Remote Photoplethysmography in Real-World and Extreme Lighting Scenarios

Hang Shao, lei luo, Jianjun Qian et al.

CVPR 2025posterarXiv:2503.11465

citations

#2899

RipVIS: Rip Currents Video Instance Segmentation Benchmark for Beach Monitoring and Safety

Andrei Dumitriu, Florin Tatui, Florin Miron et al.

CVPR 2025posterarXiv:2504.01128

citations

#2900

WildAvatar: Learning In-the-wild 3D Avatars from the Web

Zihao Huang, Shoukang Hu, Guangcong Wang et al.

CVPR 2025posterarXiv:2407.02165

citations

#2901

Two is Better than One: Efficient Ensemble Defense for Robust and Compact Models

Yoojin Jung, Byung Cheol Song

CVPR 2025posterarXiv:2504.04747

citations

#2902

Twinner: Shining Light on Digital Twins in a Few Snaps

Jesus Zarzar, Tom Monnier, Roman Shapovalov et al.

CVPR 2025posterarXiv:2503.08382

citations

#2903

PersonaHOI: Effortlessly Improving Face Personalization in Human-Object Interaction Generation

Xinting Hu, Haoran Wang, Jan Lenssen et al.

CVPR 2025poster

citations

#2904

Let Samples Speak: Mitigating Spurious Correlation by Exploiting the Clusterness of Samples

WEIWEI LI, Junzhuo Liu, Yuanyuan Ren et al.

CVPR 2025posterarXiv:2512.22874

citations

#2905

Leveraging Global Stereo Consistency for Category-Level Shape and 6D Pose Estimation from Stereo Images

Junning Qiu, Minglei Lu, Fei Wang et al.

CVPR 2025poster

citations

#2906

Libra-Merging: Importance-redundancy and Pruning-merging Trade-off for Acceleration Plug-in in Large Vision-Language Model

Longrong Yang, Dong Shen, Chaoxiang Cai et al.

CVPR 2025poster

citations

#2907

Symbolic Representation for Any-to-Any Generative Tasks

Jiaqi Chen, Xiaoye Zhu, Yue Wang et al.

CVPR 2025posterarXiv:2504.17261

citations

#2908

Towards Scalable Human-aligned Benchmark for Text-guided Image Editing

Suho Ryu, Kihyun Kim, Eugene Baek et al.

CVPR 2025highlightarXiv:2505.00502

citations

#2909

Hyperdimensional Uncertainty Quantification for Multimodal Uncertainty Fusion in Autonomous Vehicles Perception

Luke Chen, Junyao Wang, Trier Mortlock et al.

CVPR 2025posterarXiv:2503.20011

citations

#2910

MaSS13K: A Matting-level Semantic Segmentation Benchmark

Chenxi Xie, Minghan LI, Hui Zeng et al.

CVPR 2025posterarXiv:2503.18364

citations

#2911

MirrorVerse: Pushing Diffusion Models to Realistically Reflect the World

Ankit Dhiman, Manan Shah, R. Venkatesh Babu

CVPR 2025posterarXiv:2504.15397

citations

#2912

WISNet: Pseudo Label Generation on Unbalanced and Patch Annotated Waste Images

Shifan Zhang, Hongzi Zhu, Yinan He et al.

CVPR 2025poster

citations

#2913

Seeing A 3D World in A Grain of Sand

Yufan Zhang, Yu Ji, Yu Guo et al.

CVPR 2025posterarXiv:2503.00260

citations

#2914

Boost the Inference with Co-training: A Depth-guided Mutual Learning Framework for Semi-supervised Medical Polyp Segmentation

Yuxin Li, Zihao Zhu, Yuxiang Zhang et al.

CVPR 2025poster

citations

#2915

AirRoom: Objects Matter in Room Reidentification

Runmao Yao, Yi Du, Zhuoqun Chen et al.

CVPR 2025posterarXiv:2503.01130

citations

#2916

Saliuitl: Ensemble Salience Guided Recovery of Adversarial Patches against CNNs

Mauricio Byrd Victorica, György Dán, Henrik Sandberg

CVPR 2025poster

citations

#2917

Unlocking Generalization Power in LiDAR Point Cloud Registration

Zhenxuan Zeng, Qiao Wu, Xiyu Zhang et al.

CVPR 2025highlightarXiv:2503.10149

citations

#2918

Odd-One-Out: Anomaly Detection by Comparing with Neighbors

Ankan Kumar Bhunia, Changjian Li, Hakan Bilen

CVPR 2025posterarXiv:2406.20099

citations

#2919

GaPT-DAR: Category-level Garments Pose Tracking via Integrated 2D Deformation and 3D Reconstruction

Li Zhang, mingliang xu, Jianan Wang et al.

CVPR 2025poster

citations

#2920

F^3OCUS - Federated Finetuning of Vision-Language Foundation Models with Optimal Client Layer Updating Strategy via Multi-objective Meta-Heuristics

Pramit Saha, Felix Wagner, Divyanshu Mishra et al.

CVPR 2025highlight

citations

#2921

STEPS: Sequential Probability Tensor Estimation for Text-to-Image Hard Prompt Search

Yuning Qiu, Andong Wang, Chao Li et al.

CVPR 2025poster

citations

#2922

Medusa: A Multi-Scale High-order Contrastive Dual-Diffusion Approach for Multi-View Clustering

Liang Chen, Zhe Xue, Yawen Li et al.

CVPR 2025poster

citations

#2923

Ranking Distillation for Open-Ended Video Question Answering with Insufficient Labels

Tianming Liang, Chaolei Tan, Beihao Xia et al.

CVPR 2024posterarXiv:2403.14430

citations

#2924

Align-A-Video: Deterministic Reward Tuning of Image Diffusion Models for Consistent Video Editing

Shengzhi Wang, Yingkang Zhong, Jiangchuan Mu et al.

CVPR 2025poster

citations

#2925

Object Dynamics Modeling with Hierarchical Point Cloud-based Representations

Chanho Kim, Li Fuxin

CVPR 2024posterarXiv:2404.06044

citations

#2926

ArtiFade: Learning to Generate High-quality Subject from Blemished Images

Shuya Yang, Shaozhe Hao, Yukang Cao et al.

CVPR 2025posterarXiv:2409.03745

citations

#2927

GeoDepth: From Point-to-Depth to Plane-to-Depth Modeling for Self-Supervised Monocular Depth Estimation

Haifeng Wu, Shuhang Gu, Lixin Duan et al.

CVPR 2025poster

citations

#2928

GA3CE: Unconstrained 3D Gaze Estimation with Gaze-Aware 3D Context Encoding

Yuki Kawana, Shintaro Shiba, Quan Kong et al.

CVPR 2025posterarXiv:2505.10671

citations

#2929

OFER: Occluded Face Expression Reconstruction

Pratheba Selvaraju, Victoria Abrevaya, Timo Bolkart et al.

CVPR 2025posterarXiv:2410.21629

citations

#2930

Black Hole-Driven Identity Absorbing in Diffusion Models

Muhammad Shaheryar, Jong Taek Lee, Soon Ki Jung

CVPR 2025poster

citations

#2931

Random Conditioning with Distillation for Data-Efficient Diffusion Model Compression

Dohyun Kim, Sehwan Park, GeonHee Han et al.

CVPR 2025posterarXiv:2504.02011

citations

#2932

Towards Cost-Effective Learning: A Synergy of Semi-Supervised and Active Learning

Tianxiang Yin, Ningzhong Liu, Han Sun

CVPR 2025poster

citations

#2933

Acc3D: Accelerating Single Image to 3D Diffusion Models via Edge Consistency Guided Score Distillation

Kendong Liu, Zhiyu Zhu, Hui LIU et al.

CVPR 2025posterarXiv:2503.15975

citations

#2934

Hierarchical Gaussian Mixture Model Splatting for Efficient and Part Controllable 3D Generation

Qitong Yang, Mingtao Feng, Zijie Wu et al.

CVPR 2025poster

citations

#2935

ONDA-Pose: Occlusion-Aware Neural Domain Adaptation for Self-Supervised 6D Object Pose Estimation

Tao Tan, Qiulei Dong

CVPR 2025poster

citations

#2936

Relation-Rich Visual Document Generator for Visual Information Extraction

Zi-Han Jiang, Chien-Wei Lin, WeiHua Li et al.

CVPR 2025posterarXiv:2504.10659

citations

#2937

Explaining Domain Shifts in Language: Concept Erasing for Interpretable Image Classification

Zequn Zeng, Yudi Su, Jianqiao Sun et al.

CVPR 2025posterarXiv:2503.18483

citations

#2938

Uncertainty-Instructed Structure Injection for Generalizable HD Map Construction

Xiaolu Liu, Ruizi Yang, Song Wang et al.

CVPR 2025posterarXiv:2503.23109

citations

#2939

Touch2Shape: Touch-Conditioned 3D Diffusion for Shape Exploration and Reconstruction

Yuanbo Wang, Zhaoxuan Zhang, Jiajin Qiu et al.

CVPR 2025posterarXiv:2505.13091

citations

#2940

DirectTriGS: Triplane-based Gaussian Splatting Field Representation for 3D Generation

Xiaoliang Ju, Hongsheng Li

CVPR 2025posterarXiv:2503.06900

citations

#2941

VEU-Bench: Towards Comprehensive Understanding of Video Editing

Bozheng Li, Yongliang Wu, YI LU et al.

CVPR 2025highlightarXiv:2504.17828

citations

#2942

Difference Inversion: Interpolate and Isolate the Difference with Token Consistency for Image Analogy Generation

Hyunsoo Kim, Donghyun Kim, Suhyun Kim

CVPR 2025posterarXiv:2506.07750

citations

#2943

RCP-Bench: Benchmarking Robustness for Collaborative Perception Under Diverse Corruptions

Shihang Du, Sanqing Qu, Tianhang Wang et al.

CVPR 2025poster

citations

#2944

PIAD: Pose and Illumination agnostic Anomaly Detection

Kaichen Yang, Junjie Cao, Zeyu Bai et al.

CVPR 2025poster

citations

#2945

Type-R: Automatically Retouching Typos for Text-to-Image Generation

Wataru Shimoda, Naoto Inoue, Daichi Haraguchi et al.

CVPR 2025highlightarXiv:2411.18159

citations

#2946

Latent Space Imaging

Matheus Souza, Yidan Zheng, Kaizhang Kang et al.

CVPR 2025posterarXiv:2407.07052

citations

#2947

MonoPlace3D: Learning 3D-Aware Object Placement for 3D Monocular Detection

Rishubh Parihar, Srinjay Sarkar, Sarthak Vora et al.

CVPR 2025posterarXiv:2504.06801

citations

#2948

Implicit Correspondence Learning for Image-to-Point Cloud Registration

Xinjun Li, Wenfei Yang, Jiacheng Deng et al.

CVPR 2025highlight

citations

#2949

Simulator HC: Regression-based Online Simulation of Starting Problem-Solution Pairs for Homotopy Continuation in Geometric Vision

Xinyue Zhang, Zijia Dai, Wanting Xu et al.

CVPR 2025highlightarXiv:2411.03745

citations

#2950

The Photographer's Eye: Teaching Multimodal Large Language Models to See, and Critique Like Photographers

Daiqing Qi, Handong Zhao, Jing Shi et al.

CVPR 2025poster

citations

#2951

MAGE : Single Image to Material-Aware 3D via the Multi-View G-Buffer Estimation Model

Haoyuan Wang, Zhenwei Wang, Xiaoxiao Long et al.

CVPR 2025poster

citations

#2952

CoSER: Towards Consistent Dense Multiview Text-to-Image Generator for 3D Creation

Bonan Li, Zicheng Zhang, Xingyi Yang et al.

CVPR 2025highlight

citations

#2953

Video Language Model Pretraining with Spatio-temporal Masking

Yue Wu, Zhaobo Qi, Junshu Sun et al.

CVPR 2025poster

citations

#2954

GroundingFace: Fine-grained Face Understanding via Pixel Grounding Multimodal Large Language Model

Yue Han, Jiangning Zhang, Junwei Zhu et al.

CVPR 2025highlight

citations

#2955

Towards Natural Language-Based Document Image Retrieval: New Dataset and Benchmark

Hao Guo, Xugong Qin, Jun Jie Ou Yang et al.

CVPR 2025posterarXiv:2512.20174

citations

#2956

Percept, Memory, and Imagine: World Feature Simulating for Open-Domain Unknown Object Detection

Aming Wu, Cheng Deng

CVPR 2025poster

citations

#2957

Meta-Learning Hyperparameters for Parameter Efficient Fine-Tuning

Zichen Tian, Yaoyao Liu, Qianru Sun

CVPR 2025highlight

citations

#2958

Quad-Pixel Image Defocus Deblurring: A New Benchmark and Model

Hang Chen, Yin Xie, Xiaoxiu Peng et al.

CVPR 2025poster

citations

#2959

DynPose: Largely Improving the Efficiency of Human Pose Estimation by a Simple Dynamic Framework

Yalong Xu, Lin Zhao, Chen Gong et al.

CVPR 2025poster

citations

#2960

FRAME: Floor-aligned Representation for Avatar Motion from Egocentric Video

Andrea Boscolo Camiletto, Jian Wang, Eduardo Alvarado et al.

CVPR 2025highlightarXiv:2503.23094

citations

#2961

VIRES: Video Instance Repainting via Sketch and Text Guided Generation

Shuchen Weng, Haojie Zheng, Peixuan Zhang et al.

CVPR 2025posterarXiv:2411.16199

citations

#2962

PARC: A Quantitative Framework Uncovering the Symmetries within Vision Language Models

Jenny Schmalfuss, Nadine Chang, Vibashan VS et al.

CVPR 2025posterarXiv:2506.14808

citations

#2963

Sharp-It: A Multi-view to Multi-view Diffusion Model for 3D Synthesis and Manipulation

Yiftach Edelstein, Or Patashnik, Dana Cohen-Bar et al.

CVPR 2025posterarXiv:2412.02631

citations

#2964

HSI: A Holistic Style Injector for Arbitrary Style Transfer

Shuhao Zhang, Hui Kang, Yang Liu et al.

CVPR 2025posterarXiv:2502.04369

citations

#2965

Self-Supervised Cross-View Correspondence with Predictive Cycle Consistency

Alan Baade, Changan Chen

CVPR 2025highlight

citations

#2966

Nested Diffusion Models Using Hierarchical Latent Priors

Xiao Zhang, Ruoxi Jiang, Rebecca Willett et al.

CVPR 2025posterarXiv:2412.05984

citations

#2967

Feature Spectrum Learning for Remote Sensing Change Detection

Qi Zang, Dong Zhao, Shuang Wang et al.

CVPR 2025poster

citations

#2968

LiSu: A Dataset and Method for LiDAR Surface Normal Estimation

Dušan Malić, Christian Fruhwirth-Reisinger, Samuel Schulter et al.

CVPR 2025posterarXiv:2503.08601

citations

#2969

Transferable and Principled Efficiency for Open-Vocabulary Segmentation

Jingxuan Xu, Wuyang Chen, Yao Zhao et al.

CVPR 2024posterarXiv:2404.07448

citations

#2970

Take the Bull by the Horns: Learning to Segment Hard Samples

Yuan Guo, Jingyu Kong, Yu Wang et al.

CVPR 2025poster

citations

#2971

AdaptCMVC: Robust Adaption to Incremental Views in Continual Multi-view Clustering

Jing Wang, Songhe Feng, Kristoffer Knutsen Wickstrøm et al.

CVPR 2025poster

citations

#2972

IM-Portrait: Learning 3D-aware Video Diffusion for Photorealistic Talking Heads from Monocular VideosC

Yuan Li, Ziqian Bai, Feitong Tan et al.

CVPR 2025poster

citations

#2973

De^2Gaze: Deformable and Decoupled Representation Learning for 3D Gaze Estimation

Yunfeng Xiao, Xiaowei Bai, Baojun Chen et al.

CVPR 2025poster

citations

#2974

Soft Self-labeling and Potts Relaxations for Weakly-supervised Segmentation

Zhongwen Zhang, Yuri Boykov

CVPR 2025posterarXiv:2507.01721

citations

#2975

MaDCoW: Marginal Distortion Correction for Wide-Angle Photography with Arbitrary Objects

Kevin Zhang, Jia-Bin Huang, Jose Echevarria et al.

CVPR 2025poster

citations

#2976

DIO: Decomposable Implicit 4D Occupancy-Flow World Model

Christopher Diehl, Quinlan Sykora, Ben Agro et al.

CVPR 2025poster

citations

#2977

Improving Personalized Search with Regularized Low-Rank Parameter Updates

Fiona Ryan, Josef Sivic, Fabian Caba Heilbron et al.

CVPR 2025highlightarXiv:2506.10182

citations

#2978

COUNTS: Benchmarking Object Detectors and Multimodal Large Language Models under Distribution Shifts

Jiansheng Li, Xingxuan Zhang, Hao Zou et al.

CVPR 2025highlightarXiv:2504.10158

citations

#2979

Customized Condition Controllable Generation for Video Soundtrack

Fan Qi, KunSheng Ma, Changsheng Xu

CVPR 2025poster

citations

#2980

UMFN: Unified Multi-Domain Face Normalization for Joint Cross-domain Prototype Learning and Heterogeneous Face Recognition

Meng Pang, Wenjun Zhang, Nanrun Zhou et al.

CVPR 2025poster

citations

#2981

Provoking Multi-modal Few-Shot LVLM via Exploration-Exploitation In-Context Learning

Cheng Chen, Yunpeng Zhai, Yifan Zhao et al.

CVPR 2025posterarXiv:2506.09473

citations

#2982

LoKi: Low-dimensional KAN for Efficient Fine-tuning Image Models

Xuan Cai, Renjie Pan, Hua Yang

CVPR 2025poster

citations

#2983

EnliveningGS: Active Locomotion of 3DGS

Siyuan Shen, Tianjia Shao, Kun Zhou et al.

CVPR 2025poster

citations

#2984

Incorporating Dense Knowledge Alignment into Unified Multimodal Representation Models

Yuhao Cui, Xinxing Zu, Wenhua Zhang et al.

CVPR 2025poster

citations

#2985

A Semantic Knowledge Complementarity based Decoupling Framework for Semi-supervised Class-imbalanced Medical Image Segmentation

Zheng Zhang, Guanchun Yin, Bo Zhang et al.

CVPR 2025poster

citations

#2986

R2C: Mapping Room to Chessboard to Unlock LLM As Low-Level Action Planner

Ziyi Bai, Hanxuan Li, Bin Fu et al.

CVPR 2025poster

citations

#2987

LookingGlass: Generative Anamorphoses via Laplacian Pyramid Warping

Pascal Chang, Sergio Sancho, Jingwei Tang et al.

CVPR 2025posterarXiv:2504.08902

citations

#2988

Depth-Guided Bundle Sampling for Efficient Generalizable Neural Radiance Field Reconstruction

Li Fang, Hao Zhu, Longlong Chen et al.

CVPR 2025posterarXiv:2505.19793

citations

#2989

ReSpec: Relevance and Specificity Grounded Online Filtering for Learning on Video-Text Data Streams

Chris Dongjoo Kim, Jihwan Moon, Sangwoo Moon et al.

CVPR 2025posterarXiv:2504.14875

citations

#2990

ETAP: Event-based Tracking of Any Point

Friedhelm Hamann, Daniel Gehrig, Filbert Febryanto et al.

CVPR 2025highlightarXiv:2412.00133

citations

#2991

Adapting to Observation Length of Trajectory Prediction via Contrastive Learning

Ruiqi Qiu, JUN GONG, Xinyu Zhang et al.

CVPR 2025poster

citations

#2992

SinGS: Animatable Single-Image Human Gaussian Splats with Kinematic Priors

Yufan Wu, Xuanhong Chen, Wen Li et al.

CVPR 2025poster

citations

#2993

ADU: Adaptive Detection of Unknown Categories in Black-Box Domain Adaptation

Yushan Lai, Guowen Li, Haoyuan Liang et al.

CVPR 2025poster

citations

#2994

Graph Neural Network Combining Event Stream and Periodic Aggregation for Low-Latency Event-based Vision

Manon Dampfhoffer, Thomas Mesquida, Damien Joubert et al.

CVPR 2025highlight

citations

#2995

Link to the Past: Temporal Propagation for Fast 3D Human Reconstruction from Monocular Video

Marchellus Matthew, Nadhira Noor, In Kyu Park

CVPR 2025posterarXiv:2505.07333

citations

#2996

Spk2SRImgNet: Super-Resolve Dynamic Scene from Spike Stream via Motion Aligned Collaborative Filtering

Yuanlin Wang, Yiyang Zhang, Ruiqin Xiong et al.

CVPR 2025poster

citations

#2997

A New Statistical Model of Star Speckles for Learning to Detect and Characterize Exoplanets in Direct Imaging Observations

Theo Bodrito, Olivier Flasseur, Julien Mairal et al.

CVPR 2025posterarXiv:2503.17117

citations

#2998

Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models

Jinhui Yi, Syed Talal Wasim, Yanan Luo et al.

CVPR 2025posterarXiv:2412.18609

citations

#2999

Integral Fast Fourier Color Constancy

Wenjun Wei, Yanlin Qian, Huaian Chen et al.

CVPR 2025posterarXiv:2502.03494

citations

#3000

High-quality Point Cloud Oriented Normal Estimation via Hybrid Angular and Euclidean Distance Encoding

Yuanqi Li, Jingcheng Huang, Hongshen Wang et al.

CVPR 2025poster

citations

← Previous

1...13 14 15 16 17...28