Most Cited CVPR "mixed tail generic chaining" Papers

#2801

Enhancing Dance-to-Music Generation via Negative Conditioning Latent Diffusion Model

Changchang Sun, Gaowen Liu, Charles Fleming et al.

CVPR 2025 poster arXiv:2503.22138
2
citations
#2802

Diffusion-based Event Generation for High-Quality Image Deblurring

Xinan Xie, Qing Zhang, Wei-Shi Zheng

CVPR 2025 poster
2
citations
#2803

Flexible Group Count Enables Hassle-Free Structured Pruning

Jiamu Zhang, Shaochen Zhong, Andrew Ye et al.

CVPR 2025 poster
2
citations
#2804

MoST: Efficient Monarch Sparse Tuning for 3D Representation Learning

Xu Han, Yuan Tang, Jinfeng Xu et al.

CVPR 2025 poster arXiv:2503.18368
2
citations
#2805

SACB-Net: Spatial-awareness Convolutions for Medical Image Registration

Xinxing Cheng, Tianyang Zhang, Wenqi Lu et al.

CVPR 2025 highlight arXiv:2503.19592
2
citations
#2806

DH-Set: Improving Vision-Language Alignment with Diverse and Hybrid Set-Embeddings Learning

Kun Zhang, Jingyu Li, Zhe Li et al.

CVPR 2025 poster
2
citations
#2807

Person De-reidentification: A Variation-guided Identity Shift Modeling

Yi-Xing Peng, Yu-Ming Tang, Kun-Yu Lin et al.

CVPR 2025 poster
2
citations
#2808

Split Adaptation for Pre-trained Vision Transformers

Lixu Wang, Bingqi Shang, Yi Li et al.

CVPR 2025 poster arXiv:2503.00441
2
citations
#2809

MICAS: Multi-grained In-Context Adaptive Sampling for 3D Point Cloud Processing

Feifei Shao, Ping Liu, Zhao Wang et al.

CVPR 2025 poster arXiv:2411.16773
2
citations
#2810

PhyS-EdiT: Physics-aware Semantic Image Editing with Text Description

Ziqi Cai, Shuchen Weng, Yifei Xia et al.

CVPR 2025 poster
2
citations
#2811

BOE-ViT: Boosting Orientation Estimation with Equivariance in Self-Supervised 3D Subtomogram Alignment

Runmin Jiang, Jackson Daggett, Shriya Pingulkar et al.

CVPR 2025 poster
2
citations
#2812

CASP: Compression of Large Multimodal Models Based on Attention Sparsity

Mohsen Gholami, Mohammad Akbari, Kevin Cannons et al.

CVPR 2025 highlight arXiv:2503.05936
2
citations
#2813

Thin-Shell-SfT: Fine-Grained Monocular Non-rigid 3D Surface Tracking with Neural Deformation Fields

Navami Kairanda, Marc Habermann, Shanthika Shankar Naik et al.

CVPR 2025 poster arXiv:2503.19976
2
citations
#2814

DV-Matcher: Deformation-based Non-rigid Point Cloud Matching Guided by Pre-trained Visual Features

Zhangquan Chen, Puhua Jiang, Ruqi Huang

CVPR 2025 poster arXiv:2408.08568
2
citations
#2815

Few-shot Personalized Scanpath Prediction

Ruoyu Xue, Jingyi Xu, Sounak Mondal et al.

CVPR 2025 poster arXiv:2504.05499
2
citations
#2816

EAP-GS: Efficient Augmentation of Pointcloud for 3D Gaussian Splatting in Few-shot Scene Reconstruction

Dongrui Dai, Yuxiang Xing

CVPR 2025 poster
2
citations
#2817

Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning

Jinpeng Wang, Tianci Luo, Yaohua Zha et al.

CVPR 2025 poster arXiv:2504.21263
2
citations
#2818

VITED: Video Temporal Evidence Distillation

Yujie Lu, Yale Song, Lorenzo Torresani et al.

CVPR 2025 poster arXiv:2503.12855
2
citations
#2819

COSMIC: Clique-Oriented Semantic Multi-space Integration for Robust CLIP Test-Time Adaptation

Fanding Huang, Jingyan Jiang, Qinting Jiang et al.

CVPR 2025 poster arXiv:2503.23388
2
citations
#2820

Adapting Dense Matching for Homography Estimation with Grid-based Acceleration

Kaining Zhang, Yuxin Deng, Jiayi Ma et al.

CVPR 2025 poster
2
citations
#2821

Boltzmann Attention Sampling for Image Analysis with Small Objects

Theodore Zhao, Sid Kiblawi, Mu Wei et al.

CVPR 2025 poster arXiv:2503.02841
2
citations
#2822

RC-AutoCalib: An End-to-End Radar-Camera Automatic Calibration Network

Van-Tin Luu, Yong-Lin Cai, Vu-Hoang Tran et al.

CVPR 2025 poster arXiv:2505.22427
2
citations
#2823

Test-Time Fine-Tuning of Image Compression Models for Multi-Task Adaptability

Unki Park, Seongmoon Jeong, Jang Youngchan et al.

CVPR 2025 poster
2
citations
#2824

VideoGuide: Improving Video Diffusion Models without Training Through a Teacher's Guide

Dohun Lee, Bryan Sangwoo Kim, Geon Yeong Park et al.

CVPR 2025 poster arXiv:2410.04364
2
citations
#2825

Project-Probe-Aggregate: Efficient Fine-Tuning for Group Robustness

Beier Zhu, Jiequan Cui, Hanwang Zhang et al.

CVPR 2025 highlight arXiv:2503.09487
2
citations
#2826

Adapting Text-to-Image Generation with Feature Difference Instruction for Generic Image Restoration

Chao Wang, Hehe Fan, Huichen Yang et al.

CVPR 2025 poster
2
citations
#2827

Segment This Thing: Foveated Tokenization for Efficient Point-Prompted Segmentation

Tanner Schmidt, Richard Newcombe

CVPR 2025 poster arXiv:2506.11131
2
citations
#2828

DynaMoDe-NeRF: Motion-aware Deblurring Neural Radiance Field for Dynamic Scenes

Ashish Kumar, A. N. Rajagopalan

CVPR 2025 poster
2
citations
#2829

Generative Map Priors for Collaborative BEV Semantic Segmentation

Jiahui Fu, Yue Gong, Luting Wang et al.

CVPR 2025 poster
2
citations
#2830

LOCORE: Image Re-ranking with Long-Context Sequence Modeling

Zilin Xiao, Pavel Suma, Ayush Sachdeva et al.

CVPR 2025 poster arXiv:2503.21772
2
citations
#2831

ToNNO: Tomographic Reconstruction of a Neural Network’s Output for Weakly Supervised Segmentation of 3D Medical Images

Marius Schmidt-Mengin, Alexis Benichoux, Shibeshih Belachew et al.

CVPR 2024 poster arXiv:2404.13103
2
citations
#2832

CorrBEV: Multi-View 3D Object Detection by Correlation Learning with Multi-modal Prototypes

Ziteng Xue, Mingzhe Guo, Heng Fan et al.

CVPR 2025 poster
2
citations
#2833

Progressive Correspondence Regenerator for Robust 3D Registration

Guiyu Zhao, Sheng Ao, Ye Zhang et al.

CVPR 2025 poster arXiv:2502.02163
2
citations
#2834

Self-Supervised Spatial Correspondence Across Modalities

Ayush Shrivastava, Andrew Owens

CVPR 2025 poster arXiv:2506.03148
2
citations
#2835

Preconditioners for the Stochastic Training of Neural Fields

Shin-Fang Chng, Hemanth Saratchandran, Simon Lucey

CVPR 2025 poster arXiv:2402.08784
2
citations
#2836

Hierarchical Flow Diffusion for Efficient Frame Interpolation

Yang Hai, Guo Wang, Tan Su et al.

CVPR 2025 poster arXiv:2504.00380
2
citations
#2837

Sample- and Parameter-Efficient Auto-Regressive Image Models

Elad Amrani, Leonid Karlinsky, Alex M. Bronstein

CVPR 2025 poster arXiv:2411.15648
2
citations
#2838

Observation-Guided Diffusion Probabilistic Models

Junoh Kang, Jinyoung Choi, Sungik Choi et al.

CVPR 2024 poster arXiv:2310.04041
2
citations
#2839

SyncSDE: A Probabilistic Framework for Diffusion Synchronization

Hyunjun Lee, Hyunsoo Lee, Sookwan Han

CVPR 2025 poster arXiv:2503.21555
1
citation
#2840

AdaptCMVC: Robust Adaption to Incremental Views in Continual Multi-view Clustering

Jing Wang, Songhe Feng, Kristoffer Knutsen Wickstrøm et al.

CVPR 2025 poster
1
citation
#2841

DynPose: Largely Improving the Efficiency of Human Pose Estimation by a Simple Dynamic Framework

Yalong Xu, Lin Zhao, Chen Gong et al.

CVPR 2025 poster
1
citation
#2842

DART: Disease-aware Image-Text Alignment and Self-correcting Re-alignment for Trustworthy Radiology Report Generation

Sang-Jun Park, Keun-Soo Heo, Dong-Hee Shin et al.

CVPR 2025 poster arXiv:2504.11786
1
citation
#2843

Composing Parts for Expressive Object Generation

Harsh Rangwani, Aishwarya Agarwal, Kuldeep Kulkarni et al.

CVPR 2025 poster arXiv:2406.10197
1
citation
#2844

Percept, Memory, and Imagine: World Feature Simulating for Open-Domain Unknown Object Detection

Aming Wu, Cheng Deng

CVPR 2025 poster
1
citation
#2845

CaMuViD: Calibration-Free Multi-View Detection

Amir Etefaghi Daryani, M. Usman Maqbool Bhutta, Byron Hernandez et al.

CVPR 2025 poster
1
citation
#2846

Data Distributional Properties As Inductive Bias for Systematic Generalization

Felipe del Rio, Alain Raymond, Daniel Florea et al.

CVPR 2025 poster arXiv:2502.20499
1
citation
#2847

Zero-Shot Head Swapping in Real-World Scenarios

Sohyun Jeong, Taewoong Kang, Hyojin Jang et al.

CVPR 2025 poster arXiv:2503.00861
1
citation
#2848

Revisiting Fairness in Multitask Learning: A Performance-Driven Approach for Variance Reduction

Xiaohan Qin, Xiaoxing Wang, Junchi Yan

CVPR 2025 poster
1
citation
#2849

Self-Supervised Learning for Color Spike Camera Reconstruction

Yanchen Dong, Ruiqin Xiong, Xiaopeng Fan et al.

CVPR 2025 poster
1
citation
#2850

ShapeShifter: 3D Variations Using Multiscale and Sparse Point-Voxel Diffusion

Nissim Maruani, Wang Yifan, Matthew Fisher et al.

CVPR 2025 poster arXiv:2502.02187
1
citation
#2851

HSI: A Holistic Style Injector for Arbitrary Style Transfer

Shuhao Zhang, Hui Kang, Yang Liu et al.

CVPR 2025 poster arXiv:2502.04369
1
citation
#2852

ADU: Adaptive Detection of Unknown Categories in Black-Box Domain Adaptation

Yushan Lai, Guowen Li, Haoyuan Liang et al.

CVPR 2025 poster
1
citation
#2853

Dynamic Group Normalization: Spatio-Temporal Adaptation to Evolving Data Statistics

Yair Smadar, Assaf Hoogi

CVPR 2025 poster
1
citation
#2854

TAROT: Towards Essentially Domain-Invariant Robustness with Theoretical Justification

Dongyoon Yang, Jihu Lee, Yongdai Kim

CVPR 2025 poster arXiv:2505.06580
1
citation
#2855

Real-time Acquisition and Reconstruction of Dynamic Volumes with Neural Structured Illumination

Yixin Zeng, Zoubin Bi, Yin Mingrui et al.

CVPR 2024 poster
1
citation
#2856

DiSRT-In-Bed: Diffusion-Based Sim-to-Real Transfer Framework for In-Bed Human Mesh Recovery

Jing Gao, Ce Zheng, Laszlo Jeni et al.

CVPR 2025 poster arXiv:2504.03006
1
citation
#2857

Pose-Guided Temporal Enhancement for Robust Low-Resolution Hand Reconstruction

Kaixin Fan, Pengfei Ren, Jingyu Wang et al.

CVPR 2025 poster
1
citation
#2858

ODHSR: Online Dense 3D Reconstruction of Humans and Scenes from Monocular Videos

Zetong Zhang, Manuel Kaufmann, Lixin Xue et al.

CVPR 2025 poster arXiv:2504.13167
1
citation
#2859

Perturb-and-Revise: Flexible 3D Editing with Generative Trajectories

Susung Hong, Johanna Suvi Karras, Ricardo Martin et al.

CVPR 2025 poster arXiv:2412.05279
1
citation
#2860

Beyond Image Classification: A Video Benchmark and Dual-Branch Hybrid Discrimination Framework for Compositional Zero-Shot Learning

Dongyao Jiang, Haodong Jing, Yongqiang Ma et al.

CVPR 2025 poster
1
citation
#2861

HyperPose: Hypernetwork-Infused Camera Pose Localization and an Extended Cambridge Landmarks Dataset

Ron Ferens, Yosi Keller

CVPR 2025 poster arXiv:2303.02610
1
citation
#2862

Targeted Forgetting of Image Subgroups in CLIP Models

Zeliang Zhang, Gaowen Liu, Charles Fleming et al.

CVPR 2025 poster arXiv:2506.03117
1
citation
#2863

Graph Neural Network Combining Event Stream and Periodic Aggregation for Low-Latency Event-based Vision

Manon Dampfhoffer, Thomas Mesquida, Damien Joubert et al.

CVPR 2025 highlight
1
citation
#2864

HeatFormer: A Neural Optimizer for Multiview Human Mesh Recovery

Yuto Matsubara, Ko Nishino

CVPR 2025 poster arXiv:2412.04456
1
citation
#2865

Self-Supervised Large Scale Point Cloud Completion for Archaeological Site Restoration

Aocheng Li, James R. Zimmer-Dauphinee, Rajesh Kalyanam et al.

CVPR 2025 poster arXiv:2503.04030
1
citation
#2866

MetricGrids: Arbitrary Nonlinear Approximation with Elementary Metric Grids based Implicit Neural Representation

Shu Wang, Yanbo Gao, Shuai Li et al.

CVPR 2025 highlight arXiv:2503.10000
1
citation
#2867

Difference Inversion: Interpolate and Isolate the Difference with Token Consistency for Image Analogy Generation

Hyunsoo Kim, Donghyun Kim, Suhyun Kim

CVPR 2025 poster arXiv:2506.07750
1
citation
#2868

GlyphMastero: A Glyph Encoder for High-Fidelity Scene Text Editing

Tong Wang, Ting Liu, Xiaochao Qu et al.

CVPR 2025 poster arXiv:2505.04915
1
citation
#2869

Mamba-Adaptor: State Space Model Adaptor for Visual Recognition

Fei Xie, Jiahao Nie, Yujin Tang et al.

CVPR 2025 poster arXiv:2505.12685
1
citation
#2870

Foveated Instance Segmentation

Hongyi Zeng, Wenxuan Liu, Tianhua Xia et al.

CVPR 2025 poster arXiv:2503.21854
1
citation
#2871

SemiDAViL: Semi-supervised Domain Adaptation with Vision-Language Guidance for Semantic Segmentation

Hritam Basak, Zhaozheng Yin

CVPR 2025 poster arXiv:2504.06389
1
citation
#2872

F^3OCUS - Federated Finetuning of Vision-Language Foundation Models with Optimal Client Layer Updating Strategy via Multi-objective Meta-Heuristics

Pramit Saha, Felix Wagner, Divyanshu Mishra et al.

CVPR 2025 highlight
1
citation
#2873

SCSA: A Plug-and-Play Semantic Continuous-Sparse Attention for Arbitrary Semantic Style Transfer

Chunnan Shang, Zhizhong Wang, Hongwei Wang et al.

CVPR 2025 highlight arXiv:2503.04119
1
citation
#2874

Advancing Manga Analysis: Comprehensive Segmentation Annotations for the Manga109 Dataset

Minshan Xie, Jian Lin, Hanyuan Liu et al.

CVPR 2025 poster
1
citation
#2875

HELVIPAD: A Real-World Dataset for Omnidirectional Stereo Depth Estimation

Mehdi Zayene, Albias Havolli, Jannik Endres et al.

CVPR 2025 highlight arXiv:2411.18335
1
citation
#2876

Two is Better than One: Efficient Ensemble Defense for Robust and Compact Models

Yoojin Jung, Byung Cheol Song

CVPR 2025 poster arXiv:2504.04747
1
citation
#2877

R2C: Mapping Room to Chessboard to Unlock LLM As Low-Level Action Planner

Ziyi Bai, Hanxuan Li, Bin Fu et al.

CVPR 2025 poster
1
citation
#2878

A Semantic Knowledge Complementarity based Decoupling Framework for Semi-supervised Class-imbalanced Medical Image Segmentation

Zheng Zhang, Guanchun Yin, Bo Zhang et al.

CVPR 2025 poster
1
citation
#2879

High-quality Point Cloud Oriented Normal Estimation via Hybrid Angular and Euclidean Distance Encoding

Yuanqi Li, Jingcheng Huang, Hongshen Wang et al.

CVPR 2025 poster
1
citation
#2880

MExD: An Expert-Infused Diffusion Model for Whole-Slide Image Classification

Jianwei Zhao, Xin Li, Fan Yang et al.

CVPR 2025 poster arXiv:2503.12401
1
citation
#2881

Object Dynamics Modeling with Hierarchical Point Cloud-based Representations

Chanho Kim, Li Fuxin

CVPR 2024 poster arXiv:2404.06044
1
citation
#2882

Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models

Jinhui Yi, Syed Talal Wasim, Yanan Luo et al.

CVPR 2025 poster arXiv:2412.18609
1
citation
#2883

Explicit Depth-Aware Blurry Video Frame Interpolation Guided by Differential Curves

Yan Zaoming, Pengcheng Lei, Tingting Wang et al.

CVPR 2025 poster
1
citation
#2884

PIDLoc: Cross-View Pose Optimization Network Inspired by PID Controllers

Wooju Lee, Juhye Park, Dasol Hong et al.

CVPR 2025 poster arXiv:2503.02388
1
citation
#2885

UWAV: Uncertainty-weighted Weakly-supervised Audio-Visual Video Parsing

Yung-Hsuan Lai, Janek Ebbers, Yu-Chiang Frank Wang et al.

CVPR 2025 poster arXiv:2505.09615
1
citation
#2886

Dual Energy-Based Model with Open-World Uncertainty Estimation for Out-of-distribution Detection

Qi Chen, Hu Ding

CVPR 2025 poster
1
citation
#2887

GaPT-DAR: Category-level Garments Pose Tracking via Integrated 2D Deformation and 3D Reconstruction

Li Zhang, Mingliang Xu, Jianan Wang et al.

CVPR 2025 poster
1
citation
#2888

Implicit Correspondence Learning for Image-to-Point Cloud Registration

Xinjun Li, Wenfei Yang, Jiacheng Deng et al.

CVPR 2025 highlight
1
citation
#2889

Homogeneous Dynamics Space for Heterogeneous Humans

Xinpeng Liu, Junxuan Liang, Chenshuo Zhang et al.

CVPR 2025 poster arXiv:2412.06146
1
citation
#2890

Classic Video Denoising in a Machine Learning World: Robust, Fast, and Controllable

Xin Jin, Simon Niklaus, Zhoutong Zhang et al.

CVPR 2025 poster arXiv:2504.03136
1
citation
#2891

Argus: A Compact and Versatile Foundation Model for Vision

Weiming Zhuang, Chen Chen, Zhizhong Li et al.

CVPR 2025 poster
1
citation
#2892

Boosting Point-Supervised Temporal Action Localization through Integrating Query Reformation and Optimal Transport

Mengnan Liu, Le Wang, Sanping Zhou et al.

CVPR 2025 poster
1
citation
#2893

An Image-like Diffusion Method for Human-Object Interaction Detection

Xiaofei Hui, Haoxuan Qu, Hossein Rahmani et al.

CVPR 2025 poster arXiv:2503.18134
1
citation
#2894

Leveraging Global Stereo Consistency for Category-Level Shape and 6D Pose Estimation from Stereo Images

Junning Qiu, Minglei Lu, Fei Wang et al.

CVPR 2025 poster
1
citation
#2895

AirRoom: Objects Matter in Room Reidentification

Runmao Yao, Yi Du, Zhuoqun Chen et al.

CVPR 2025 poster arXiv:2503.01130
1
citation
#2896

Boost the Inference with Co-training: A Depth-guided Mutual Learning Framework for Semi-supervised Medical Polyp Segmentation

Yuxin Li, Zihao Zhu, Yuxiang Zhang et al.

CVPR 2025 poster
1
citation
#2897

Type-R: Automatically Retouching Typos for Text-to-Image Generation

Wataru Shimoda, Naoto Inoue, Daichi Haraguchi et al.

CVPR 2025 highlight arXiv:2411.18159
1
citation
#2898

Unlocking Generalization Power in LiDAR Point Cloud Registration

Zhenxuan Zeng, Qiao Wu, Xiyu Zhang et al.

CVPR 2025 highlight arXiv:2503.10149
1
citation
#2899

Soft Self-labeling and Potts Relaxations for Weakly-supervised Segmentation

Zhongwen Zhang, Yuri Boykov

CVPR 2025 poster arXiv:2507.01721
1
citation
#2900

Spk2SRImgNet: Super-Resolve Dynamic Scene from Spike Stream via Motion Aligned Collaborative Filtering

Yuanlin Wang, Yiyang Zhang, Ruiqin Xiong et al.

CVPR 2025 poster
1
citation
#2901

Meta-Learning Hyperparameters for Parameter Efficient Fine-Tuning

Zichen Tian, Yaoyao Liu, Qianru Sun

CVPR 2025 highlight
1
citation
#2902

Creating Your Editable 3D Photorealistic Avatar with Tetrahedron-constrained Gaussian Splatting

Hanxi Liu, Yifang Men, Zhouhui Lian

CVPR 2025 highlight arXiv:2504.20403
1
citation
#2903

Anchor-Aware Similarity Cohesion in Target Frames Enables Predicting Temporal Moment Boundaries in 2D

Jiawei Tan, Hongxing Wang, Junwu Weng et al.

CVPR 2025 poster
1
citation
#2904

Polarized Color Screen Matting

Kenji Enomoto, Scott Cohen, Brian Price et al.

CVPR 2025 highlight
1
citation
#2905

Understanding Multi-layered Transmission Matrices

Marina Alterman, Anat Levin

CVPR 2025 highlight arXiv:2410.23864
1
citation
#2906

STEPS: Sequential Probability Tensor Estimation for Text-to-Image Hard Prompt Search

Yuning Qiu, Andong Wang, Chao Li et al.

CVPR 2025 poster
1
citation
#2907

Black Hole-Driven Identity Absorbing in Diffusion Models

Muhammad Shaheryar, Jong Taek Lee, Soon Ki Jung

CVPR 2025 poster
1
citation
#2908

Point-Cache: Test-time Dynamic and Hierarchical Cache for Robust and Generalizable Point Cloud Analysis

Hongyu Sun, Qiuhong Ke, Ming Cheng et al.

CVPR 2025 poster arXiv:2503.12150
1
citation
#2909

SAMBLE: Shape-Specific Point Cloud Sampling for an Optimal Trade-Off Between Local Detail and Global Uniformity

Chengzhi Wu, Yuxin Wan, Hao Fu et al.

CVPR 2025 poster arXiv:2504.19581
1
citation
#2910

Keep the Balance: A Parameter-Efficient Symmetrical Framework for RGB+X Semantic Segmentation

Jiaxin Cai, Jingze Su, Qi Li et al.

CVPR 2025 poster
1
citation
#2911

OFER: Occluded Face Expression Reconstruction

Pratheba Selvaraju, Victoria Abrevaya, Timo Bolkart et al.

CVPR 2025 poster arXiv:2410.21629
1
citation
#2912

Instance-wise Supervision-level Optimization in Active Learning

Shinnosuke Matsuo, Riku Togashi, Ryoma Bise et al.

CVPR 2025 poster arXiv:2503.06517
1
citation
#2913

Link to the Past: Temporal Propagation for Fast 3D Human Reconstruction from Monocular Video

Marchellus Matthew, Nadhira Noor, In Kyu Park

CVPR 2025 poster arXiv:2505.07333
1
citation
#2914

Enhancing Virtual Try-On with Synthetic Pairs and Error-Aware Noise Scheduling

Nannan Li, Kevin Shih, Bryan A. Plummer

CVPR 2025 poster arXiv:2501.04666
1
citation
#2915

Align-A-Video: Deterministic Reward Tuning of Image Diffusion Models for Consistent Video Editing

Shengzhi Wang, Yingkang Zhong, Jiangchuan Mu et al.

CVPR 2025 poster
1
citation
#2916

Depth-Guided Bundle Sampling for Efficient Generalizable Neural Radiance Field Reconstruction

Li Fang, Hao Zhu, Longlong Chen et al.

CVPR 2025 poster arXiv:2505.19793
1
citation
#2917

Relation-Rich Visual Document Generator for Visual Information Extraction

Zi-Han Jiang, Chien-Wei Lin, WeiHua Li et al.

CVPR 2025 poster arXiv:2504.10659
1
citation
#2918

De^2Gaze: Deformable and Decoupled Representation Learning for 3D Gaze Estimation

Yunfeng Xiao, Xiaowei Bai, Baojun Chen et al.

CVPR 2025 poster
1
citation
#2919

WildAvatar: Learning In-the-wild 3D Avatars from the Web

Zihao Huang, Shoukang Hu, Guangcong Wang et al.

CVPR 2025 poster arXiv:2407.02165
1
citation
#2920

VIRES: Video Instance Repainting via Sketch and Text Guided Generation

Shuchen Weng, Haojie Zheng, Peixuan Zhang et al.

CVPR 2025 poster arXiv:2411.16199
1
citation
#2921

Medusa: A Multi-Scale High-order Contrastive Dual-Diffusion Approach for Multi-View Clustering

Liang Chen, Zhe Xue, Yawen Li et al.

CVPR 2025 poster
1
citation
#2922

Style Evolving along Chain-of-Thought for Unknown-Domain Object Detection

Zihao Zhang, Aming Wu, Yahong Han

CVPR 2025 highlight arXiv:2503.09968
1
citation
#2923

Latent Space Imaging

Matheus Souza, Yidan Zheng, Kaizhang Kang et al.

CVPR 2025 poster arXiv:2407.07052
1
citation
#2924

Transferable and Principled Efficiency for Open-Vocabulary Segmentation

Jingxuan Xu, Wuyang Chen, Yao Zhao et al.

CVPR 2024 poster arXiv:2404.07448
1
citation
#2925

ReCon: Enhancing True Correspondence Discrimination through Relation Consistency for Robust Noisy Correspondence Learning

Quanxing Zha, Xin Liu, Shu-Juan Peng et al.

CVPR 2025 poster arXiv:2502.19962
1
citation
#2926

TIDE: Training Locally Interpretable Domain Generalization Models Enables Test-time Correction

Aishwarya Agarwal, Srikrishna Karanam, Vineet Gandhi

CVPR 2025 highlight arXiv:2411.16788
1
citation
#2927

Touch2Shape: Touch-Conditioned 3D Diffusion for Shape Exploration and Reconstruction

Yuanbo Wang, Zhaoxuan Zhang, Jiajin Qiu et al.

CVPR 2025 poster arXiv:2505.13091
1
citation
#2928

VEU-Bench: Towards Comprehensive Understanding of Video Editing

Bozheng Li, Yongliang Wu, Yi Lu et al.

CVPR 2025 highlight arXiv:2504.17828
1
citation
#2929

PIAD: Pose and Illumination agnostic Anomaly Detection

Kaichen Yang, Junjie Cao, Zeyu Bai et al.

CVPR 2025 poster
1
citation
#2930

Video Language Model Pretraining with Spatio-temporal Masking

Yue Wu, Zhaobo Qi, Junshu Sun et al.

CVPR 2025 poster
1
citation
#2931

EnliveningGS: Active Locomotion of 3DGS

Siyuan Shen, Tianjia Shao, Kun Zhou et al.

CVPR 2025 poster
1
citation
#2932

Dense Dispersed Structured Light for Hyperspectral 3D Imaging of Dynamic Scenes

Suhyun Shin, Seungwoo Yoon, Ryota Maeda et al.

CVPR 2025 poster arXiv:2412.01140
1
citation
#2933

MaDCoW: Marginal Distortion Correction for Wide-Angle Photography with Arbitrary Objects

Kevin Zhang, Jia-Bin Huang, Jose Echevarria et al.

CVPR 2025 poster
1
citation
#2934

DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer

Ho-Joong Kim, Yearang Lee, Jung-Ho Hong et al.

CVPR 2025 poster arXiv:2505.05711
1
citation
#2935

Take the Bull by the Horns: Learning to Segment Hard Samples

Yuan Guo, Jingyu Kong, Yu Wang et al.

CVPR 2025 poster
1
citation
#2936

Multi-modal Topology-embedded Graph Learning for Spatially Resolved Genes Prediction from Pathology Images with Prior Gene Similarity Information

Hang Shi, Chi Changxi, Peng Wan et al.

CVPR 2025 poster
1
citation
#2937

Language-Assisted Debiasing and Smoothing for Foundation Model-Based Semi-Supervised Learning

Na Zheng, Xuemeng Song, Xue Dong et al.

CVPR 2025 poster
1
citation
#2938

SinGS: Animatable Single-Image Human Gaussian Splats with Kinematic Priors

Yufan Wu, Xuanhong Chen, Wen Li et al.

CVPR 2025 poster
1
citation
#2939

Pseudo Visible Feature Fine-Grained Fusion for Thermal Object Detection

Ting Li, Mao Ye, Tianwen Wu et al.

CVPR 2025 poster
1
citation
#2940

Seeing A 3D World in A Grain of Sand

Yufan Zhang, Yu Ji, Yu Guo et al.

CVPR 2025 poster arXiv:2503.00260
1
citation
#2941

Explaining Domain Shifts in Language: Concept Erasing for Interpretable Image Classification

Zequn Zeng, Yudi Su, Jianqiao Sun et al.

CVPR 2025 poster arXiv:2503.18483
1
citation
#2942

Visual and Semantic Prompt Collaboration for Generalized Zero-Shot Learning

Huajie Jiang, Zhengxian Li, Xiaohan Yu et al.

CVPR 2025 poster arXiv:2503.23030
1
citation
#2943

The Photographer's Eye: Teaching Multimodal Large Language Models to See, and Critique Like Photographers

Daiqing Qi, Handong Zhao, Jing Shi et al.

CVPR 2025 poster
1
citation
#2944

ONDA-Pose: Occlusion-Aware Neural Domain Adaptation for Self-Supervised 6D Object Pose Estimation

Tao Tan, Qiulei Dong

CVPR 2025 poster
1
citation
#2945

ETAP: Event-based Tracking of Any Point

Friedhelm Hamann, Daniel Gehrig, Filbert Febryanto et al.

CVPR 2025 highlight arXiv:2412.00133
1
citation
#2946

LiSu: A Dataset and Method for LiDAR Surface Normal Estimation

Dušan Malić, Christian Fruhwirth-Reisinger, Samuel Schulter et al.

CVPR 2025 poster arXiv:2503.08601
1
citation
#2947

Incorporating Dense Knowledge Alignment into Unified Multimodal Representation Models

Yuhao Cui, Xinxing Zu, Wenhua Zhang et al.

CVPR 2025 poster
1
citation
#2948

Unbiasing through Textual Descriptions: Mitigating Representation Bias in Video Benchmarks

Nina Shvetsova, Arsha Nagrani, Bernt Schiele et al.

CVPR 2025 poster arXiv:2503.18637
1
citation
#2949

Revisiting Generative Replay for Class Incremental Object Detection

Shizhou Zhang, Xueqiang Lv, Yinghui Xing et al.

CVPR 2025 poster
1
citation
#2950

PARC: A Quantitative Framework Uncovering the Symmetries within Vision Language Models

Jenny Schmalfuss, Nadine Chang, Vibashan VS et al.

CVPR 2025 poster arXiv:2506.14808
1
citation
#2951

CoSER: Towards Consistent Dense Multiview Text-to-Image Generator for 3D Creation

Bonan Li, Zicheng Zhang, Xingyi Yang et al.

CVPR 2025 highlight
1
citation
#2952

NTClick: Achieving Precise Interactive Segmentation With Noise-tolerant Clicks

Chenyi Zhang, Ting Liu, Xiaochao Qu et al.

CVPR 2025 highlight
1
citation
#2953

Fitted Neural Lossless Image Compression

Zhe Zhang, Zhenzhong Chen, Shan Liu

CVPR 2025 poster
1
citation
#2954

SVLTA: Benchmarking Vision-Language Temporal Alignment via Synthetic Video Situation

Hao Du, Bo Wu, Yan Lu et al.

CVPR 2025 poster arXiv:2504.05925
1
citation
#2955

MAC-Ego3D: Multi-Agent Gaussian Consensus for Real-Time Collaborative Ego-Motion and Photorealistic 3D Reconstruction

Xiaohao Xu, Feng Xue, Shibo Zhao et al.

CVPR 2025 poster arXiv:2412.09723
1
citation
#2956

Separation of Powers: On Segregating Knowledge from Observation in LLM-enabled Knowledge-based Visual Question Answering

Zhen Yang, Zhuo Tao, Qi Chen et al.

CVPR 2025 poster
1
citation
#2957

Attribute-Missing Multi-view Graph Clustering

Bowen Zhao, Qianqian Wang, Zhengming Ding et al.

CVPR 2025 poster
1
citation
#2958

GroundingFace: Fine-grained Face Understanding via Pixel Grounding Multimodal Large Language Model

Yue Han, Jiangning Zhang, Junwei Zhu et al.

CVPR 2025 highlight
1
citation
#2959

Balancing Two Classifiers via A Simplex ETF Structure for Model Calibration

Jiani Ni, He Zhao, Jintong Gao et al.

CVPR 2025 poster arXiv:2504.10007
1
citation
#2960

FRAME: Floor-aligned Representation for Avatar Motion from Egocentric Video

Andrea Boscolo Camiletto, Jian Wang, Eduardo Alvarado et al.

CVPR 2025 highlight arXiv:2503.23094
1
citation
#2961

Twinner: Shining Light on Digital Twins in a Few Snaps

Jesus Zarzar, Tom Monnier, Roman Shapovalov et al.

CVPR 2025 poster arXiv:2503.08382
1
citation
#2962

Customized Condition Controllable Generation for Video Soundtrack

Fan Qi, KunSheng Ma, Changsheng Xu

CVPR 2025 poster
1
citation
#2963

Remote Photoplethysmography in Real-World and Extreme Lighting Scenarios

Hang Shao, Lei Luo, Jianjun Qian et al.

CVPR 2025 poster arXiv:2503.11465
1
citation
#2964

SPARC: Score Prompting and Adaptive Fusion for Zero-Shot Multi-Label Recognition in Vision-Language Models

Kevin Miller, Aditya Gangrade, Samarth Mishra et al.

CVPR 2025 poster arXiv:2502.16911
1
citation
#2965

Self-Supervised Cross-View Correspondence with Predictive Cycle Consistency

Alan Baade, Changan Chen

CVPR 2025 highlight
1
citation
#2966

DirectTriGS: Triplane-based Gaussian Splatting Field Representation for 3D Generation

Xiaoliang Ju, Hongsheng Li

CVPR 2025 poster arXiv:2503.06900
1
citation
#2967

Adapting to Observation Length of Trajectory Prediction via Contrastive Learning

Ruiqi Qiu, Jun Gong, Xinyu Zhang et al.

CVPR 2025 poster
1
citation
#2968

Data-free Universal Adversarial Perturbation with Pseudo-semantic Prior

Chanhui Lee, Yeonghwan Song, Jeany Son

CVPR 2025 poster arXiv:2502.21048
1
citation
#2969

Towards Natural Language-Based Document Image Retrieval: New Dataset and Benchmark

Hao Guo, Xugong Qin, Jun Jie Ou Yang et al.

CVPR 2025 poster arXiv:2512.20174
1
citation
#2970

Unified Reconstruction of Static and Dynamic Scenes from Events

Qiyao Gao, Peiqi Duan, Hanyue Lou et al.

CVPR 2025 highlight
1
citation
#2971

ArtiFade: Learning to Generate High-quality Subject from Blemished Images

Shuya Yang, Shaozhe Hao, Yukang Cao et al.

CVPR 2025 poster arXiv:2409.03745
1
citation
#2972

HiLoTs: High-Low Temporal Sensitive Representation Learning for Semi-Supervised LiDAR Segmentation in Autonomous Driving

R.D. Lin, Pengcheng Weng, Yinqiao Wang et al.

CVPR 2025 poster arXiv:2503.17752
1
citation
#2973

MonoPlace3D: Learning 3D-Aware Object Placement for 3D Monocular Detection

Rishubh Parihar, Srinjay Sarkar, Sarthak Vora et al.

CVPR 2025 poster arXiv:2504.06801
1
citation
#2974

Towards Cost-Effective Learning: A Synergy of Semi-Supervised and Active Learning

Tianxiang Yin, Ningzhong Liu, Han Sun

CVPR 2025 poster
1
citation
#2975

ESC: Erasing Space Concept for Knowledge Deletion

Tae-Young Lee, Sundong Park, Minwoo Jeon et al.

CVPR 2025 highlight arXiv:2504.02199
1
citation
#2976

Automatic Spectral Calibration of Hyperspectral Images: Method, Dataset and Benchmark

Zhuoran Du, Shaodi You, Cheng Cheng et al.

CVPR 2025 poster
1
citation
#2977

ReSpec: Relevance and Specificity Grounded Online Filtering for Learning on Video-Text Data Streams

Chris Dongjoo Kim, Jihwan Moon, Sangwoo Moon et al.

CVPR 2025 poster arXiv:2504.14875
1
citation
#2978

NN-Former: Rethinking Graph Structure in Neural Architecture Representation

Ruihan Xu, Haokui Zhang, Yaowei Wang et al.

CVPR 2025 poster arXiv:2507.00880
1
citation
#2979

Active Event-based Stereo Vision

Jianing Li, Yunjian Zhang, Haiqian Han et al.

CVPR 2025 poster
1
citation
#2980

POMP: Physics-constrainable Motion Generative Model through Phase Manifolds

Bin Ji, Ye Pan, Zhimeng Liu et al.

CVPR 2025 poster
1
citation
#2981

Uncertainty-Instructed Structure Injection for Generalizable HD Map Construction

Xiaolu Liu, Ruizi Yang, Song Wang et al.

CVPR 2025 poster arXiv:2503.23109
1
citation
#2982

Let Samples Speak: Mitigating Spurious Correlation by Exploiting the Clusterness of Samples

Weiwei Li, Junzhuo Liu, Yuanyuan Ren et al.

CVPR 2025 poster arXiv:2512.22874
1
citation
#2983

Deep Video Inverse Tone Mapping Based on Temporal Clues

Yuyao Ye, Ning Zhang, Yang Zhao et al.

CVPR 2024 poster
1
citation
#2984

Symbolic Representation for Any-to-Any Generative Tasks

Jiaqi Chen, Xiaoye Zhu, Yue Wang et al.

CVPR 2025 poster arXiv:2504.17261
1
citation
#2985

Sampling Innovation-Based Adaptive Compressive Sensing

Zhifu Tian, Tao Hu, Chaoyang Niu et al.

CVPR 2025 poster arXiv:2503.13241
1
citation
#2986

Hyperdimensional Uncertainty Quantification for Multimodal Uncertainty Fusion in Autonomous Vehicles Perception

Luke Chen, Junyao Wang, Trier Mortlock et al.

CVPR 2025 poster arXiv:2503.20011
1
citation
#2987

PersonaHOI: Effortlessly Improving Face Personalization in Human-Object Interaction Generation

Xinting Hu, Haoran Wang, Jan Lenssen et al.

CVPR 2025 poster
1
citation
#2988

EvOcc: Accurate Semantic Occupancy for Automated Driving Using Evidence Theory

Jonas Kälble, Sascha Wirges, Maxim Tatarchenko et al.

CVPR 2025 poster
1
citation
#2989

Nested Diffusion Models Using Hierarchical Latent Priors

Xiao Zhang, Ruoxi Jiang, Rebecca Willett et al.

CVPR 2025 poster arXiv:2412.05984
1
citation
#2990

Temporal Action Detection Model Compression by Progressive Block Drop

Xiaoyong Chen, Yong Guo, Jiaming Liang et al.

CVPR 2025 poster arXiv:2503.16916
1
citation
#2991

CURSOR: Scalable Mixed-Order Hypergraph Matching with CUR Decomposition

Qixuan Zheng, Ming Zhang, Hong Yan

CVPR 2024 poster arXiv:2402.16594
1
citation
#2992

Can Text-to-Video Generation help Video-Language Alignment?

Luca Zanella, Massimiliano Mancini, Willi Menapace et al.

CVPR 2025 poster arXiv:2503.18507
1
citation
#2993

Semantic Line Combination Detector

Jinwon Ko, Dongkwon Jin, Chang-Su Kim

CVPR 2024 poster arXiv:2404.18399
1
citation
#2994

UMFN: Unified Multi-Domain Face Normalization for Joint Cross-domain Prototype Learning and Heterogeneous Face Recognition

Meng Pang, Wenjun Zhang, Nanrun Zhou et al.

CVPR 2025 poster
1
citation
#2995

Identity-preserving Distillation Sampling by Fixed-Point Iterator

SeonHwa Kim, Jiwon Kim, Soobin Park et al.

CVPR 2025 poster arXiv:2502.19930
1
citation
#2996

Non-Rigid Structure-from-Motion: Temporally-Smooth Procrustean Alignment and Spatially-Variant Deformation Modeling

Jiawei Shi, Hui Deng, Yuchao Dai

CVPR 2024 poster arXiv:2405.04309
1
citation
#2997

TADFormer: Task-Adaptive Dynamic TransFormer for Efficient Multi-Task Learning

Seungmin Baek, Soyul Lee, Hayeon Jo et al.

CVPR 2025 poster arXiv:2501.04293
1
citation
#2998

Vision-Guided Action: Enhancing 3D Human Motion Prediction with Gaze-informed Affordance in 3D Scenes

Ting Yu, Yi Lin, Jun Yu et al.

CVPR 2025 poster
1
citation
#2999

Odd-One-Out: Anomaly Detection by Comparing with Neighbors

Ankan Kumar Bhunia, Changjian Li, Hakan Bilen

CVPR 2025 poster arXiv:2406.20099
1
citation
#3000

Dual Semantic Guidance for Open Vocabulary Semantic Segmentation

ZhengYang Wang, Tingliang Feng, Fan Lyu et al.

CVPR 2025 poster
1
citation