Most Cited CVPR "animatable head avatars" Papers

5,589 papers found • Page 23 of 28

#4401

LP-Diff: Towards Improved Restoration of Real-World Degraded License Plate

Haoyan Gong, Zhenrong Zhang, Yuzheng Feng et al.

CVPR 2025highlight
2
citations
#4402

L-SWAG: Layer-Sample Wise Activation with Gradients Information for Zero-Shot NAS on Vision Transformers

Sofia Casarin, Sergio Escalera, Oswald Lanz

CVPR 2025arXiv:2505.07300
2
citations
#4403

Tripartite Weight-Space Ensemble for Few-Shot Class-Incremental Learning

Juntae Lee, Munawar Hayat, Sungrack Yun

CVPR 2025arXiv:2506.15720
2
citations
#4404

Where the Devil Hides: Deepfake Detectors Can No Longer Be Trusted

Shuaiwei Yuan, Junyu Dong, Yuezun Li

CVPR 2025arXiv:2505.08255
2
citations
#4405

Nested Diffusion Models Using Hierarchical Latent Priors

Xiao Zhang, Ruoxi Jiang, Rebecca Willett et al.

CVPR 2025arXiv:2412.05984
2
citations
#4406

Point Cloud Upsampling Using Conditional Diffusion Module with Adaptive Noise Suppression

Boqian Zhang, Shen Yang, Hao Chen et al.

CVPR 2025
2
citations
#4407

ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way

Jiazi Bu, Pengyang Ling, Pan Zhang et al.

CVPR 2025arXiv:2410.06241
2
citations
#4408

Do ImageNet-trained Models Learn Shortcuts? The Impact of Frequency Shortcuts on Generalization

Shunxin Wang, Raymond Veldhuis, Nicola Strisciuglio

CVPR 2025arXiv:2503.03519
2
citations
#4409

UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation

Qihui Zhang, Munan Ning, Zheyuan Liu et al.

CVPR 2025arXiv:2503.14941
2
citations
#4410

Neural Inverse Rendering from Propagating Light

Anagh Malik, Benjamin Attal, Andrew Xie et al.

CVPR 2025arXiv:2506.05347
2
citations
#4411

No Thing, Nothing: Highlighting Safety-Critical Classes for Robust LiDAR Semantic Segmentation in Adverse Weather

Junsung Park, HwiJeong Lee, Inha Kang et al.

CVPR 2025arXiv:2503.15910
2
citations
#4412

FlexGS: Train Once, Deploy Everywhere with Many-in-One Flexible 3D Gaussian Splatting

Hengyu Liu, Yuehao Wang, Chenxin Li et al.

CVPR 2025arXiv:2506.04174
2
citations
#4413

Shining Yourself: High-Fidelity Ornaments Virtual Try-on with Diffusion Model

Yingmao Miao, Zhanpeng Huang, Rui Han et al.

CVPR 2025arXiv:2503.16065
2
citations
#4414

A Unified, Resilient, and Explainable Adversarial Patch Detector

Vishesh Kumar, Akshay Agarwal

CVPR 2025
2
citations
#4415

Seek Common Ground While Reserving Differences: Semi-Supervised Image-Text Sentiment Recognition

Wuyou Xia, Guoli Jia, Sicheng Zhao et al.

CVPR 2025
2
citations
#4416

VISTREAM: Improving Computation Efficiency of Visual Streaming Perception via Law-of-Charge-Conservation Inspired Spiking Neural Network

Kang You, Ziling Wei, Jing Yan et al.

CVPR 2025
2
citations
#4417

Self-Evolving Visual Concept Library using Vision-Language Critics

Atharva Sehgal, Patrick Yuan, Ziniu Hu et al.

CVPR 2025arXiv:2504.00185
2
citations
#4418

MICAS: Multi-grained In-Context Adaptive Sampling for 3D Point Cloud Processing

Feifei Shao, Ping Liu, Zhao Wang et al.

CVPR 2025arXiv:2411.16773
2
citations
#4419

Split Adaptation for Pre-trained Vision Transformers

Lixu Wang, Bingqi Shang, Yi Li et al.

CVPR 2025arXiv:2503.00441
2
citations
#4420

Synthetic Visual Genome

Jae Sung Park, Zixian Ma, Linjie Li et al.

CVPR 2025arXiv:2506.07643
2
citations
#4421

Autoregressive Distillation of Diffusion Transformers

Yeongmin Kim, Sotiris Anagnostidis, Yuming Du et al.

CVPR 2025arXiv:2504.11295
2
citations
#4422

Adaptive Softassign via Hadamard-Equipped Sinkhorn

Binrui Shen, Qiang Niu, Shengxin Zhu

CVPR 2024arXiv:2309.13855
2
citations
#4423

DynaMoDe-NeRF: Motion-aware Deblurring Neural Radiance Field for Dynamic Scenes

Ashish Kumar, A. N. Rajagopalan

CVPR 2025
2
citations
#4424

From Prototypes to General Distributions: An Efficient Curriculum for Masked Image Modeling

Jinhong Lin, Cheng-En Wu, Huanran Li et al.

CVPR 2025arXiv:2411.10685
2
citations
#4425

Solving Instance Detection from an Open-World Perspective

Qianqian Shen, Yunhan Zhao, Nahyun Kwon et al.

CVPR 2025arXiv:2503.00359
2
citations
#4426

Leveraging 3D Geometric Priors in 2D Rotation Symmetry Detection

Ahyun Seo, Minsu Cho

CVPR 2025arXiv:2503.20235
2
citations
#4427

Potential Field Based Deep Metric Learning

Shubhang Bhatnagar, Narendra Ahuja

CVPR 2025arXiv:2405.18560
2
citations
#4428

A2XP: Towards Private Domain Generalization

Geunhyeok Yu, Hyoseok Hwang

CVPR 2024arXiv:2311.10339
2
citations
#4429

Self-Supervised Spatial Correspondence Across Modalities

Ayush Shrivastava, Andrew Owens

CVPR 2025arXiv:2506.03148
2
citations
#4430

Towards Better Vision-Inspired Vision-Language Models

Yun-Hao Cao, Kaixiang Ji, Ziyuan Huang et al.

CVPR 2024
2
citations
#4431

Evaluating Model Perception of Color Illusions in Photorealistic Scenes

Lingjun Mao, Zineng Tang, Alane Suhr

CVPR 2025arXiv:2412.06184
2
citations
#4432

ProReflow: Progressive Reflow with Decomposed Velocity

Lei Ke, Haohang Xu, Xuefei Ning et al.

CVPR 2025arXiv:2503.04824
2
citations
#4433

Exploiting Deblurring Networks for Radiance Fields

Haeyun Choi, Heemin Yang, Janghyeok Han et al.

CVPR 2025arXiv:2502.14454
2
citations
#4434

Hand-held Object Reconstruction from RGB Video with Dynamic Interaction

Shijian Jiang, Qi Ye, Rengan Xie et al.

CVPR 2025
2
citations
#4435

KMD: Koopman Multi-modality Decomposition for Generalized Brain Tumor Segmentation under Incomplete Modalities

Tianyi Liu, Haochuan Jiang, Kaizhu Huang

CVPR 2025
2
citations
#4436

The PanAf-FGBG Dataset: Understanding the Impact of Backgrounds in Wildlife Behaviour Recognition

Otto Brookes, Maksim Kukushkin, Majid Mirmehdi et al.

CVPR 2025arXiv:2502.21201
2
citations
#4437

beta-FFT: Nonlinear Interpolation and Differentiated Training Strategies for Semi-Supervised Medical Image Segmentation

Ming Hu, Jianfu Yin, Zhuangzhuang Ma et al.

CVPR 2025
2
citations
#4438

Generative Quanta Color Imaging

Vishal Purohit, Junjie Luo, Yiheng Chi et al.

CVPR 2024arXiv:2403.19066
2
citations
#4439

NeighborRetr: Balancing Hub Centrality in Cross-Modal Retrieval

Zengrong Lin, Zheng Wang, Tianwen Qian et al.

CVPR 2025arXiv:2503.10526
2
citations
#4440

PhyS-EdiT: Physics-aware Semantic Image Editing with Text Description

Ziqi Cai, Shuchen Weng, Yifei Xia et al.

CVPR 2025
2
citations
#4441

WinSyn: A High Resolution Testbed for Synthetic Data

Tom Kelly, John Femiani, Peter Wonka

CVPR 2024arXiv:2310.08471
2
citations
#4442

JiSAM: Alleviate Labeling Burden and Corner Case Problems in Autonomous Driving via Minimal Real-World Data

Runjian Chen, Wenqi Shao, Bo Zhang et al.

CVPR 2025arXiv:2503.08422
2
citations
#4443

Flexible Group Count Enables Hassle-Free Structured Pruning

Jiamu Zhang, Shaochen Zhong, Andrew Ye et al.

CVPR 2025
2
citations
#4444

Stop Walking in Circles! Bailing Out Early in Projected Gradient Descent

Philip Doldo, Derek Everett, Amol Khanna et al.

CVPR 2025arXiv:2503.19347
2
citations
#4445

Adapting Text-to-Image Generation with Feature Difference Instruction for Generic Image Restoration

Chao Wang, Hehe Fan, Huichen Yang et al.

CVPR 2025
2
citations
#4446

DTOS: Dynamic Time Object Sensing with Large Multimodal Model

Jirui Tian, Jinrong Zhang, Shenglan Liu et al.

CVPR 2025
2
citations
#4447

FreeUV: Ground-Truth-Free Realistic Facial UV Texture Recovery via Cross-Assembly Inference Strategy

Xingchao Yang, Takafumi Taketomi, Yuki Endo et al.

CVPR 2025arXiv:2503.17197
2
citations
#4448

ToonerGAN: Reinforcing GANs for Obfuscating Automated Facial Indexing

Kartik Thakral, Shashikant Prasad, Stuti Aswani et al.

CVPR 2024
2
citations
#4449

Using Powerful Prior Knowledge of Diffusion Model in Deep Unfolding Networks for Image Compressive Sensing

Chen Liao, Yan Shen, Dan Li et al.

CVPR 2025arXiv:2503.08429
2
citations
#4450

High-Fidelity Lightweight Mesh Reconstruction from Point Clouds

Chen Zhang, Wentao Wang, Ximeng Li et al.

CVPR 2025highlight
2
citations
#4451

MotionMap: Representing Multimodality in Human Pose Forecasting

Reyhaneh Hosseininejad, Megh Shukla, Saeed Saadatnejad et al.

CVPR 2025arXiv:2412.18883
2
citations
#4452

LookingGlass: Generative Anamorphoses via Laplacian Pyramid Warping

Pascal Chang, Sergio Sancho, Jingwei Tang et al.

CVPR 2025arXiv:2504.08902
2
citations
#4453

NeRFDeformer: NeRF Transformation from a Single View via 3D Scene Flows

Zhenggang Tang, Jason Ren, Xiaoming Zhao et al.

CVPR 2024arXiv:2406.10543
2
citations
#4454

Reversing Flow for Image Restoration

Haina Qin, Wenyang Luo, Bing Li et al.

CVPR 2025arXiv:2506.16961
2
citations
#4455

Do Your Best and Get Enough Rest for Continual Learning

Hankyul Kang, Gregor Seifer, Donghyun Lee et al.

CVPR 2025arXiv:2503.18371
2
citations
#4456

Certified Human Trajectory Prediction

Mohammadhossein Bahari, Saeed Saadatnejad, Amirhossein Askari Farsangi et al.

CVPR 2025arXiv:2403.13778
2
citations
#4457

ScaleLSD: Scalable Deep Line Segment Detection Streamlined

Zeran Ke, Bin Tan, Xianwei Zheng et al.

CVPR 2025arXiv:2506.09369
2
citations
#4458

On-Device Self-Supervised Learning of Low-Latency Monocular Depth from Only Events

Jesse Hagenaars, Yilun Wu, Federico Paredes Valles et al.

CVPR 2025arXiv:2412.06359
2
citations
#4459

LC-Mamba: Local and Continuous Mamba with Shifted Windows for Frame Interpolation

Min Wu Jeong, Chae Eun Rhee

CVPR 2025
2
citations
#4460

LotusFilter: Fast Diverse Nearest Neighbor Search via a Learned Cutoff Table

Yusuke Matsui

CVPR 2025arXiv:2506.04790
2
citations
#4461

HashPoint: Accelerated Point Searching and Sampling for Neural Rendering

Jiahao Ma, Miaomiao Liu, David Ahmedt-Aristizabal et al.

CVPR 2024highlightarXiv:2404.14044
2
citations
#4462

Bias for Action: Video Implicit Neural Representations with Bias Modulation

Alper Kayabasi, Anil Kumar Vadathya, Guha Balakrishnan et al.

CVPR 2025arXiv:2501.09277
2
citations
#4463

DiffCAM: Data-Driven Saliency Maps by Capturing Feature Differences

Xingjian Li, Qiming Zhao, Neelesh Bisht et al.

CVPR 2025highlight
2
citations
#4464

EIDT-V: Exploiting Intersections in Diffusion Trajectories for Model-Agnostic, Zero-Shot, Training-Free Text-to-Video Generation

Diljeet Jagpal, Xi Chen, Vinay P. Namboodiri

CVPR 2025arXiv:2504.06861
2
citations
#4465

Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning

Jinpeng Wang, Tianci Luo, Yaohua Zha et al.

CVPR 2025arXiv:2504.21263
2
citations
#4466

FLAVC: Learned Video Compression with Feature Level Attention

Chun Zhang, Heming Sun, Jiro Katto

CVPR 2025
2
citations
#4467

Hybrid Concept Bottleneck Models

Yang Liu, Tianwei Zhang, Shi Gu

CVPR 2025
2
citations
#4468

Segment Anything, Even Occluded

Wei-En Tai, Yu-Lin Shih, Cheng Sun et al.

CVPR 2025arXiv:2503.06261
2
citations
#4469

Style Quantization for Data-Efficient GAN Training

Jian Wang, Xin Lan, Ji-Zhe Zhou et al.

CVPR 2025arXiv:2503.24282
2
citations
#4470

Condensing Action Segmentation Datasets via Generative Network Inversion

Guodong Ding, Rongyu Chen, Angela Yao

CVPR 2025arXiv:2503.14112
2
citations
#4471

Robust Multi-Object 4D Generation for In-the-wild Videos

Wen-Hsuan Chu, Lei Ke, Jianmeng Liu et al.

CVPR 2025
2
citations
#4472

Recovering Dynamic 3D Sketches from Videos

Jaeah Lee, Changwoon Choi, Young Min Kim et al.

CVPR 2025arXiv:2503.20321
2
citations
#4473

Preconditioners for the Stochastic Training of Neural Fields

Shin-Fang Chng, Hemanth Saratchandran, Simon Lucey

CVPR 2025arXiv:2402.08784
2
citations
#4474

Provoking Multi-modal Few-Shot LVLM via Exploration-Exploitation In-Context Learning

Cheng Chen, Yunpeng Zhai, Yifan Zhao et al.

CVPR 2025arXiv:2506.09473
2
citations
#4475

VinaBench: Benchmark for Faithful and Consistent Visual Narratives

Silin Gao, Sheryl Mathew, Li Mi et al.

CVPR 2025arXiv:2503.20871
2
citations
#4476

Dual-Granularity Semantic Guided Sparse Routing Diffusion Model for General Pansharpening

Yinghui Xing, Qu Li Tao, Shizhou Zhang et al.

CVPR 2025
2
citations
#4477

Thin-Shell-SfT: Fine-Grained Monocular Non-rigid 3D Surface Tracking with Neural Deformation Fields

Navami Kairanda, Marc Habermann, Shanthika Shankar Naik et al.

CVPR 2025arXiv:2503.19976
2
citations
#4478

HOTFormerLoc: Hierarchical Octree Transformer for Versatile Lidar Place Recognition Across Ground and Aerial Views

Ethan Griffiths, Maryam Haghighat, Simon Denman et al.

CVPR 2025arXiv:2503.08140
2
citations
#4479

Universal Semi-Supervised Domain Adaptation by Mitigating Common-Class Bias

Wenyu Zhang, Qingmu Liu, Felix Ong et al.

CVPR 2024arXiv:2403.11234
2
citations
#4480

EBS-EKF: Accurate and High Frequency Event-based Star Tracking

Albert Reed, Connor Hashemi, Dennis Melamed et al.

CVPR 2025highlightarXiv:2503.20101
2
citations
#4481

VideoGuide: Improving Video Diffusion Models without Training Through a Teacher's Guide

Dohun Lee, Bryan Sangwoo Kim, Geon Yeong Park et al.

CVPR 2025arXiv:2410.04364
2
citations
#4482

SACB-Net: Spatial-awareness Convolutions for Medical Image Registration

Xinxing Cheng, Tianyang Zhang, Wenqi Lu et al.

CVPR 2025highlightarXiv:2503.19592
2
citations
#4483

VITED: Video Temporal Evidence Distillation

Yujie Lu, Yale Song, Lorenzo Torresani et al.

CVPR 2025arXiv:2503.12855
2
citations
#4484

Discovering Hidden Visual Concepts Beyond Linguistic Input in Infant Learning

Xueyi Ke, Satoshi Tsutsui, Yayun Zhang et al.

CVPR 2025arXiv:2501.05205
2
citations
#4485

Boosting Domain Incremental Learning: Selecting the Optimal Parameters is All You Need

Qiang Wang, Xiang Song, Yuhang He et al.

CVPR 2025arXiv:2505.23744
2
citations
#4486

T-CIL: Temperature Scaling using Adversarial Perturbation for Calibration in Class-Incremental Learning

Seong-Hyeon Hwang, Minsu Kim, Steven Euijong Whang

CVPR 2025arXiv:2503.22163
2
citations
#4487

SEC-Prompt: SEmantic Complementary Prompting for Few-Shot Class-Incremental Learning

Ye Liu, Meng Yang

CVPR 2025
2
citations
#4488

SPC-GS: Gaussian Splatting with Semantic-Prompt Consistency for Indoor Open-World Free-view Synthesis from Sparse Inputs

Guibiao Liao, Qing Li, Zhenyu Bao et al.

CVPR 2025arXiv:2503.12535
2
citations
#4489

Test-Time Domain Generalization via Universe Learning: A Multi-Graph Matching Approach for Medical Image Segmentation

Xingguo Lv, Xingbo Dong, Liwen Wang et al.

CVPR 2025arXiv:2503.13012
2
citations
#4490

VEU-Bench: Towards Comprehensive Understanding of Video Editing

Bozheng Li, Yongliang Wu, Yi Lu et al.

CVPR 2025highlightarXiv:2504.17828
2
citations
#4491

CorrBEV: Multi-View 3D Object Detection by Correlation Learning with Multi-modal Prototypes

Ziteng Xue, Mingzhe Guo, Heng Fan et al.

CVPR 2025
2
citations
#4492

PersonaHOI: Effortlessly Improving Face Personalization in Human-Object Interaction Generation

Xinting Hu, Haoran Wang, Jan Lenssen et al.

CVPR 2025
1
citations
#4493

HyperNVD: Accelerating Neural Video Decomposition via Hypernetworks

Maria Pilligua, Danna Xue, Javier Vazquez-Corral

CVPR 2025arXiv:2503.17276
1
citations
#4494

Uncertainty-Instructed Structure Injection for Generalizable HD Map Construction

Xiaolu Liu, Ruizi Yang, Song Wang et al.

CVPR 2025arXiv:2503.23109
1
citations
#4495

UWAV: Uncertainty-weighted Weakly-supervised Audio-Visual Video Parsing

Yung-Hsuan Lai, Janek Ebbers, Yu-Chiang Frank Wang et al.

CVPR 2025arXiv:2505.09615
1
citations
#4496

Directional Label Diffusion Model for Learning from Noisy Labels

Senyu Hou, Gaoxia Jiang, Jia Zhang et al.

CVPR 2025
1
citations
#4497

CroCoDL: Cross-device Collaborative Dataset for Localization

Hermann Blum, Alessandro Mercurio, Joshua O'Reilly et al.

CVPR 2025
1
citations
#4498

Classic Video Denoising in a Machine Learning World: Robust, Fast, and Controllable

Xin Jin, Simon Niklaus, Zhoutong Zhang et al.

CVPR 2025arXiv:2504.03136
1
citations
#4499

Deterministic Certification of Graph Neural Networks against Graph Poisoning Attacks with Arbitrary Perturbations

Jiate Li, Meng Pang, Yun Dong et al.

CVPR 2025arXiv:2503.18503
1
citations
#4500

Separation of Powers: On Segregating Knowledge from Observation in LLM-enabled Knowledge-based Visual Question Answering

Zhen Yang, Zhuo Tao, Qi Chen et al.

CVPR 2025
1
citations
#4501

PIDLoc: Cross-View Pose Optimization Network Inspired by PID Controllers

Wooju Lee, Juhye Park, Dasol Hong et al.

CVPR 2025arXiv:2503.02388
1
citations
#4502

Style Evolving along Chain-of-Thought for Unknown-Domain Object Detection

Zihao Zhang, Aming Wu, Yahong Han

CVPR 2025highlightarXiv:2503.09968
1
citations
#4503

IPoD: Implicit Field Learning with Point Diffusion for Generalizable 3D Object Reconstruction from Single RGB-D Images

Yushuang Wu, Luyue Shi, Junhao Cai et al.

CVPR 2024highlightarXiv:2404.00269
1
citations
#4504

The Photographer's Eye: Teaching Multimodal Large Language Models to See, and Critique Like Photographers

Daiqing Qi, Handong Zhao, Jing Shi et al.

CVPR 2025
1
citations
#4505

MEET: Towards Memory-Efficient Temporal Sparse Deep Neural Networks

Zeqi Zhu, Ibrahim Batuhan Akkaya, Luc Waeijen et al.

CVPR 2025
1
citations
#4506

MaDCoW: Marginal Distortion Correction for Wide-Angle Photography with Arbitrary Objects

Kevin Zhang, Jia-Bin Huang, Jose Echevarria et al.

CVPR 2025
1
citations
#4507

Touch2Shape: Touch-Conditioned 3D Diffusion for Shape Exploration and Reconstruction

Yuanbo Wang, Zhaoxuan Zhang, Jiajin Qiu et al.

CVPR 2025arXiv:2505.13091
1
citations
#4508

Open Ad-hoc Categorization with Contextualized Feature Learning

Zilin Wang, Sangwoo Mo, Stella X. Yu et al.

CVPR 2025arXiv:2512.16202
1
citations
#4509

ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization

Weiyao Wang, Pierre Gleize, Hao Tang et al.

CVPR 2024arXiv:2401.08937
1
citations
#4510

MExD: An Expert-Infused Diffusion Model for Whole-Slide Image Classification

Jianwei Zhao, Xin Li, Fan Yang et al.

CVPR 2025arXiv:2503.12401
1
citations
#4511

ReCon: Enhancing True Correspondence Discrimination through Relation Consistency for Robust Noisy Correspondence Learning

Quanxing Zha, Xin Liu, Shu-Juan Peng et al.

CVPR 2025arXiv:2502.19962
1
citations
#4512

Neural 3D Strokes: Creating Stylized 3D Scenes with Vectorized 3D Strokes

Haobin Duan, Miao Wang, Yanxun Li et al.

CVPR 2024arXiv:2311.15637
1
citations
#4513

Conformal Prediction and MLLM aided Uncertainty Quantification in Scene Graph Generation

Sayak Nag, Udita Ghosh, Calvin-Khang Ta et al.

CVPR 2025arXiv:2503.13947
1
citations
#4514

Benchmarking Object Detectors under Real-World Distribution Shifts in Satellite Imagery

Sara Al-Emadi, Yin Yang, Ferda Ofli

CVPR 2025arXiv:2503.19202
1
citations
#4515

TIDE: Training Locally Interpretable Domain Generalization Models Enables Test-time Correction

Aishwarya Agarwal, Srikrishna Karanam, Vineet Gandhi

CVPR 2025highlightarXiv:2411.16788
1
citations
#4516

CARL: A Framework for Equivariant Image Registration

Hastings Greer, Lin Tian, François-Xavier Vialard et al.

CVPR 2025arXiv:2405.16738
1
citations
#4517

Stop Learning it all to Mitigate Visual Hallucination, Focus on the Hallucination Target.

Dokyoon Yoon, Youngsook Song, Woomyoung Park

CVPR 2025arXiv:2506.11417
1
citations
#4518

Composing Parts for Expressive Object Generation

Harsh Rangwani, Aishwarya Agarwal, Kuldeep Kulkarni et al.

CVPR 2025arXiv:2406.10197
1
citations
#4519

Two is Better than One: Efficient Ensemble Defense for Robust and Compact Models

Yoojin Jung, Byung Cheol Song

CVPR 2025arXiv:2504.04747
1
citations
#4520

R2C: Mapping Room to Chessboard to Unlock LLM As Low-Level Action Planner

Ziyi Bai, Hanxuan Li, Bin Fu et al.

CVPR 2025
1
citations
#4521

Balancing Two Classifiers via A Simplex ETF Structure for Model Calibration

Jiani Ni, He Zhao, Jintong Gao et al.

CVPR 2025arXiv:2504.10007
1
citations
#4522

Sampling Innovation-Based Adaptive Compressive Sensing

Zhifu Tian, Tao Hu, Chaoyang Niu et al.

CVPR 2025arXiv:2503.13241
1
citations
#4523

Efficient Event-Based Object Detection: A Hybrid Neural Network with Spatial and Temporal Attention

Soikat Hasan Ahmed, Jan Finkbeiner, Emre Neftci

CVPR 2025arXiv:2403.10173
1
citations
#4524

VIRES: Video Instance Repainting via Sketch and Text Guided Generation

Shuchen Weng, Haojie Zheng, Peixuan Zhang et al.

CVPR 2025arXiv:2411.16199
1
citations
#4525

Universal Domain Adaptation for Semantic Segmentation

Seun-An Choe, Keon Hee Park, Jinwoo Choi et al.

CVPR 2025arXiv:2505.22458
1
citations
#4526

Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes

Ludwic Leonard, Nils Thuerey, Rüdiger Westermann

CVPR 2025highlightarXiv:2501.05226
1
citations
#4527

RCP-Bench: Benchmarking Robustness for Collaborative Perception Under Diverse Corruptions

Shihang Du, Sanqing Qu, Tianhang Wang et al.

CVPR 2025
1
citations
#4528

DIO: Decomposable Implicit 4D Occupancy-Flow World Model

Christopher Diehl, Quinlan Sykora, Ben Agro et al.

CVPR 2025
1
citations
#4529

Deep Video Inverse Tone Mapping Based on Temporal Clues

Yuyao Ye, Ning Zhang, Yang Zhao et al.

CVPR 2024
1
citations
#4530

Gaussian Splatting Feature Fields for (Privacy-Preserving) Visual Localization

Maxime Pietrantoni, Gabriela Csurka, Torsten Sattler

CVPR 2025arXiv:2507.23569
1
citations
#4531

Improving Editability in Image Generation with Layer-wise Memory

Daneul Kim, Jaeah Lee, Jaesik Park

CVPR 2025arXiv:2505.01079
1
citations
#4532

DAMM-Diffusion: Learning Divergence-Aware Multi-Modal Diffusion Model for Nanoparticles Distribution Prediction

Junjie Zhou, Shouju Wang, Yuxia Tang et al.

CVPR 2025highlightarXiv:2503.09491
1
citations
#4533

Probabilistic Prompt Distribution Learning for Animal Pose Estimation

Jiyong Rao, Brian Nlong Zhao, Yu Wang

CVPR 2025arXiv:2503.16120
1
citations
#4534

Open Vocabulary Semantic Scene Sketch Understanding

Ahmed Bourouis, Judith Fan, Yulia Gryaditskaya

CVPR 2024arXiv:2312.12463
1
citations
#4535

F^3OCUS - Federated Finetuning of Vision-Language Foundation Models with Optimal Client Layer Updating Strategy via Multi-objective Meta-Heuristics

Pramit Saha, Felix Wagner, Divyanshu Mishra et al.

CVPR 2025highlight
1
citations
#4536

FFaceNeRF: Few-shot Face Editing in Neural Radiance Fields

Kwan Yun, Chaelin Kim, Hangyeul Shin et al.

CVPR 2025arXiv:2503.17095
1
citations
#4537

A New Statistical Model of Star Speckles for Learning to Detect and Characterize Exoplanets in Direct Imaging Observations

Theo Bodrito, Olivier Flasseur, Julien Mairal et al.

CVPR 2025arXiv:2503.17117
1
citations
#4538

Recover and Match: Open-Vocabulary Multi-Label Recognition through Knowledge-Constrained Optimal Transport

Hao Tan, Zichang Tan, Jun Li et al.

CVPR 2025arXiv:2503.15337
1
citations
#4539

STEPS: Sequential Probability Tensor Estimation for Text-to-Image Hard Prompt Search

Yuning Qiu, Andong Wang, Chao Li et al.

CVPR 2025
1
citations
#4540

Let Samples Speak: Mitigating Spurious Correlation by Exploiting the Clusterness of Samples

Weiwei Li, Junzhuo Liu, Yuanyuan Ren et al.

CVPR 2025arXiv:2512.22874
1
citations
#4541

High-quality Point Cloud Oriented Normal Estimation via Hybrid Angular and Euclidean Distance Encoding

Yuanqi Li, Jingcheng Huang, Hongshen Wang et al.

CVPR 2025
1
citations
#4542

Pseudo Visible Feature Fine-Grained Fusion for Thermal Object Detection

Ting Li, Mao Ye, Tianwen Wu et al.

CVPR 2025
1
citations
#4543

G-FARS: Gradient-Field-based Auto-Regressive Sampling for 3D Part Grouping

Junfeng Cheng, Tania Stathaki

CVPR 2024arXiv:2405.06828
1
citations
#4544

Link-based Contrastive Learning for One-Shot Unsupervised Domain Adaptation

Yue Zhang, Mingyue Bin, Yuyang Zhang et al.

CVPR 2025
1
citations
#4545

Language-Assisted Debiasing and Smoothing for Foundation Model-Based Semi-Supervised Learning

Na Zheng, Xuemeng Song, Xue Dong et al.

CVPR 2025
1
citations
#4546

Towards General Robustness Verification of MaxPool-based Convolutional Neural Networks via Tightening Linear Approximation

Yuan Xiao, Shiqing Ma, Juan Zhai et al.

CVPR 2024arXiv:2406.00699
1
citations
#4547

NTClick: Achieving Precise Interactive Segmentation With Noise-tolerant Clicks

Chenyi Zhang, Ting Liu, Xiaochao Qu et al.

CVPR 2025highlight
1
citations
#4548

Transferable and Principled Efficiency for Open-Vocabulary Segmentation

Jingxuan Xu, Wuyang Chen, Yao Zhao et al.

CVPR 2024arXiv:2404.07448
1
citations
#4549

Dynamic Group Normalization: Spatio-Temporal Adaptation to Evolving Data Statistics

Yair Smadar, Assaf Hoogi

CVPR 2025
1
citations
#4550

Argus: A Compact and Versatile Foundation Model for Vision

Weiming Zhuang, Chen Chen, Zhizhong Li et al.

CVPR 2025
1
citations
#4551

ETAP: Event-based Tracking of Any Point

Friedhelm Hamann, Daniel Gehrig, Filbert Febryanto et al.

CVPR 2025highlightarXiv:2412.00133
1
citations
#4552

EASEMVC: Efficient Dual Selection Mechanism for Deep Multi-View Clustering

Baili Xiao, Zhibin Dong, Ke Liang et al.

CVPR 2025
1
citations
#4553

Multi-modal Topology-embedded Graph Learning for Spatially Resolved Genes Prediction from Pathology Images with Prior Gene Similarity Information

Hang Shi, Chi Changxi, Peng Wan et al.

CVPR 2025
1
citations
#4554

A Semantic Knowledge Complementarity based Decoupling Framework for Semi-supervised Class-imbalanced Medical Image Segmentation

Zheng Zhang, Guanchun Yin, Bo Zhang et al.

CVPR 2025
1
citations
#4555

Unbiasing through Textual Descriptions: Mitigating Representation Bias in Video Benchmarks

Nina Shvetsova, Arsha Nagrani, Bernt Schiele et al.

CVPR 2025arXiv:2503.18637
1
citations
#4556

UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines

Chen Tang, Xinzhu Ma, Encheng Su et al.

CVPR 2025arXiv:2503.20748
1
citations
#4557

Efficient Personalization of Quantized Diffusion Model without Backpropagation

Hoigi Seo, Wongi Jeong, Kyungryeol Lee et al.

CVPR 2025arXiv:2503.14868
1
citations
#4558

GLane3D: Detecting Lanes with Graph of 3D Keypoints

Halil İbrahim Öztürk, Muhammet Esat Kalfaoglu, Ozsel Kilinc

CVPR 2025arXiv:2503.23882
1
citations
#4559

An Image-like Diffusion Method for Human-Object Interaction Detection

Xiaofei Hui, Haoxuan Qu, Hossein Rahmani et al.

CVPR 2025arXiv:2503.18134
1
citations
#4560

Leveraging Global Stereo Consistency for Category-Level Shape and 6D Pose Estimation from Stereo Images

Junning Qiu, Minglei Lu, Fei Wang et al.

CVPR 2025
1
citations
#4561

Percept, Memory, and Imagine: World Feature Simulating for Open-Domain Unknown Object Detection

Aming Wu, Cheng Deng

CVPR 2025
1
citations
#4562

SGCR: Spherical Gaussians for Efficient 3D Curve Reconstruction

Xinran Yang, Donghao Ji, Yuanqi Li et al.

CVPR 2025arXiv:2505.04668
1
citations
#4563

ReSpec: Relevance and Specificity Grounded Online Filtering for Learning on Video-Text Data Streams

Chris Dongjoo Kim, Jihwan Moon, Sangwoo Moon et al.

CVPR 2025arXiv:2504.14875
1
citations
#4564

Real-time Acquisition and Reconstruction of Dynamic Volumes with Neural Structured Illumination

Yixin Zeng, Zoubin Bi, Yin Mingrui et al.

CVPR 2024
1
citations
#4565

Flash3D: Super-scaling Point Transformers through Joint Hardware-Geometry Locality

Liyan Chen, Gregory P. Meyer, Zaiwei Zhang et al.

CVPR 2025highlightarXiv:2412.16481
1
citations
#4566

Relation-Rich Visual Document Generator for Visual Information Extraction

Zi-Han Jiang, Chien-Wei Lin, WeiHua Li et al.

CVPR 2025arXiv:2504.10659
1
citations
#4567

UNEM: UNrolled Generalized EM for Transductive Few-Shot Learning

Long Zhou, Fereshteh Shakeri, Aymen Sadraoui et al.

CVPR 2025arXiv:2412.16739
1
citations
#4568

Video Language Model Pretraining with Spatio-temporal Masking

Yue Wu, Zhaobo Qi, Junshu Sun et al.

CVPR 2025
1
citations
#4569

OFER: Occluded Face Expression Reconstruction

Pratheba Selvaraju, Victoria Abrevaya, Timo Bolkart et al.

CVPR 2025arXiv:2410.21629
1
citations
#4570

Blind Bitstream-corrupted Video Recovery via Metadata-guided Diffusion Model

Shuyun Wang, Hu Zhang, Xin Shen et al.

CVPR 2025
1
citations
#4571

NN-Former: Rethinking Graph Structure in Neural Architecture Representation

Ruihan Xu, Haokui Zhang, Yaowei Wang et al.

CVPR 2025arXiv:2507.00880
1
citations
#4572

ONDA-Pose: Occlusion-Aware Neural Domain Adaptation for Self-Supervised 6D Object Pose Estimation

Tao Tan, Qiulei Dong

CVPR 2025
1
citations
#4573

Towards Cost-Effective Learning: A Synergy of Semi-Supervised and Active Learning

Tianxiang Yin, Ningzhong Liu, Han Sun

CVPR 2025
1
citations
#4574

Named Entity Driven Zero-Shot Image Manipulation

Zhida Feng, Li Chen, Jing Tian et al.

CVPR 2024
1
citations
#4575

Visual and Semantic Prompt Collaboration for Generalized Zero-Shot Learning

Huajie Jiang, Zhengxian Li, Xiaohan Yu et al.

CVPR 2025arXiv:2503.23030
1
citations
#4576

Black Hole-Driven Identity Absorbing in Diffusion Models

Muhammad Shaheryar, Jong Taek Lee, Soon Ki Jung

CVPR 2025
1
citations
#4577

ADU: Adaptive Detection of Unknown Categories in Black-Box Domain Adaptation

Yushan Lai, Guowen Li, Haoyuan Liang et al.

CVPR 2025
1
citations
#4578

POMP: Physics-constrainable Motion Generative Model through Phase Manifolds

Bin Ji, Ye Pan, Zhimeng Liu et al.

CVPR 2025
1
citations
#4579

AirRoom: Objects Matter in Room Reidentification

Runmao Yao, Yi Du, Zhuoqun Chen et al.

CVPR 2025arXiv:2503.01130
1
citations
#4580

Temporal Action Detection Model Compression by Progressive Block Drop

Xiaoyong Chen, Yong Guo, Jiaming Liang et al.

CVPR 2025arXiv:2503.16916
1
citations
#4581

SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting

Dongliang Luo, Hanshen Zhu, Ziyang Zhang et al.

CVPR 2025arXiv:2504.09966
1
citations
#4582

Instance-wise Supervision-level Optimization in Active Learning

Shinnosuke Matsuo, Riku Togashi, Ryoma Bise et al.

CVPR 2025arXiv:2503.06517
1
citations
#4583

Type-R: Automatically Retouching Typos for Text-to-Image Generation

Wataru Shimoda, Naoto Inoue, Daichi Haraguchi et al.

CVPR 2025highlightarXiv:2411.18159
1
citations
#4584

SketchFusion: Learning Universal Sketch Features through Fusing Foundation Models

Subhadeep Koley, Tapas Kumar Dutta, Aneeshan Sain et al.

CVPR 2025arXiv:2503.14129
1
citations
#4585

Reducing Class-wise Confusion for Incremental Learning with Disentangled Manifolds

Huitong Chen, Yu Wang, Yan Fan et al.

CVPR 2025arXiv:2503.17677
1
citations
#4586

Point-Cache: Test-time Dynamic and Hierarchical Cache for Robust and Generalizable Point Cloud Analysis

Hongyu Sun, Qiuhong Ke, Ming Cheng et al.

CVPR 2025arXiv:2503.12150
1
citations
#4587

Advancing Manga Analysis: Comprehensive Segmentation Annotations for the Manga109 Dataset

Minshan Xie, Jian Lin, Hanyuan Liu et al.

CVPR 2025
1
citations
#4588

SyncSDE: A Probabilistic Framework for Diffusion Synchronization

Hyunjun Lee, Hyunsoo Lee, Sookwan Han

CVPR 2025arXiv:2503.21555
1
citations
#4589

Incremental Object Keypoint Learning

Mingfu Liang, Jiahuan Zhou, Xu Zou et al.

CVPR 2025arXiv:2503.20248
1
citations
#4590

SPARC: Score Prompting and Adaptive Fusion for Zero-Shot Multi-Label Recognition in Vision-Language Models

Kevin Miller, Aditya Gangrade, Samarth Mishra et al.

CVPR 2025arXiv:2502.16911
1
citations
#4591

SCSA: A Plug-and-Play Semantic Continuous-Sparse Attention for Arbitrary Semantic Style Transfer

Chunnan Shang, Zhizhong Wang, Hongwei Wang et al.

CVPR 2025highlightarXiv:2503.04119
1
citations
#4592

LiSu: A Dataset and Method for LiDAR Surface Normal Estimation

Dušan Malić, Christian Fruhwirth-Reisinger, Samuel Schulter et al.

CVPR 2025arXiv:2503.08601
1
citations
#4593

RipVIS: Rip Currents Video Instance Segmentation Benchmark for Beach Monitoring and Safety

Andrei Dumitriu, Florin Tatui, Florin Miron et al.

CVPR 2025arXiv:2504.01128
1
citations
#4594

Foveated Instance Segmentation

Hongyi Zeng, Wenxuan Liu, Tianhua Xia et al.

CVPR 2025arXiv:2503.21854
1
citations
#4595

Keep the Balance: A Parameter-Efficient Symmetrical Framework for RGB+X Semantic Segmentation

Jiaxin Cai, Jingze Su, Qi Li et al.

CVPR 2025
1
citations
#4596

Perturb-and-Revise: Flexible 3D Editing with Generative Trajectories

Susung Hong, Johanna Suvi Karras, Ricardo Martin et al.

CVPR 2025arXiv:2412.05279
1
citations
#4597

Perceptual Inductive Bias Is What You Need Before Contrastive Learning

Junru Zhao, Tianqin Li, Dunhan Jiang et al.

CVPR 2025arXiv:2506.01201
1
citations
#4598

Identity-preserving Distillation Sampling by Fixed-Point Iterator

SeonHwa Kim, Jiwon Kim, Soobin Park et al.

CVPR 2025arXiv:2502.19930
1
citations
#4599

End-to-End HOI Reconstruction Transformer with Graph-based Encoding

Zhenrong Wang, Qi Zheng, Sihan Ma et al.

CVPR 2025highlightarXiv:2503.06012
1
citations
#4600

SVLTA: Benchmarking Vision-Language Temporal Alignment via Synthetic Video Situation

Hao Du, Bo Wu, Yan Lu et al.

CVPR 2025arXiv:2504.05925
1
citations