Most Cited ECCV "adaptive prompt refinement" Papers

2,387 papers found • Page 6 of 12

#1001

Improving Domain Generalization in Self-Supervised Monocular Depth Estimation via Stabilized Adversarial Training

Yuanqi Yao, Gang Wu, Kui Jiang et al.

ECCV 2024posterarXiv:2411.02149
7
citations
#1002

Probabilistic Weather Forecasting with Deterministic Guidance-based Diffusion Model

Donggeun Yoon, Minseok Seo, Doyi Kim et al.

ECCV 2024poster
7
citations
#1003

FipTR: A Simple yet Effective Transformer Framework for Future Instance Prediction in Autonomous Driving

Xingtai Gui, Tengteng Huang, Haonan Shao et al.

ECCV 2024posterarXiv:2404.12867
7
citations
#1004

Learning Cross-hand Policies of High-DOF Reaching and Grasping

Qijin She, Shishun Zhang, Yunfan Ye et al.

ECCV 2024posterarXiv:2404.09150
7
citations
#1005

Removing Distributional Discrepancies in Captions Improves Image-Text Alignment

Mu Cai, Haotian Liu, Yuheng Li et al.

ECCV 2024posterarXiv:2410.00905
7
citations
#1006

External Knowledge Enhanced 3D Scene Generation from Sketch

Zijie Wu, Mingtao Feng, Yaonan Wang et al.

ECCV 2024posterarXiv:2403.14121
7
citations
#1007

CONDA: Condensed Deep Association Learning for Co-Salient Object Detection.

Long Li, Nian Liu, Dingwen Zhang et al.

ECCV 2024posterarXiv:2409.01021
7
citations
#1008

AvatarPose: Avatar-guided 3D Pose Estimation of Close Human Interaction from Sparse Multi-view Videos

Feichi Lu, Zijian Dong, Jie Song et al.

ECCV 2024posterarXiv:2408.02110
7
citations
#1009

Diffusion Prior-Based Amortized Variational Inference for Noisy Inverse Problems

Sojin Lee, Dogyun Park, Inho Kong et al.

ECCV 2024posterarXiv:2407.16125
7
citations
#1010

Self-Supervised Audio-Visual Soundscape Stylization

Tingle Li, Renhao Wang, Po-Yao Huang et al.

ECCV 2024posterarXiv:2409.14340
7
citations
#1011

VISAGE: Video Instance Segmentation with Appearance-Guided Enhancement

Hanjung Kim, Jaehyun Kang, Miran Heo et al.

ECCV 2024posterarXiv:2312.04885
7
citations
#1012

Improving Robustness to Model Inversion Attacks via Sparse Coding Architectures

Sayanton Vhaduri Dibbo, Adam Breuer, Juston Moore et al.

ECCV 2024posterarXiv:2403.14772
7
citations
#1013

Direct Distillation between Different Domains

Jialiang Tang, Shuo Chen, Gang Niu et al.

ECCV 2024posterarXiv:2401.06826
7
citations
#1014

Data Augmentation via Latent Diffusion for Saliency Prediction

Bahar Aydemir, Deblina Bhattacharjee, Tong Zhang et al.

ECCV 2024posterarXiv:2409.07307
7
citations
#1015

MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection

Youngmin Oh, Hyung-Il Kim, Seong Tae Kim et al.

ECCV 2024posterarXiv:2407.16448
7
citations
#1016

Gradient-Aware for Class-Imbalanced Semi-supervised Medical Image Segmentation

Wenbo Qi, Jiafei Wu, S. C. Chan

ECCV 2024poster
7
citations
#1017

SemReg: Semantics Constrained Point Cloud Registration

Sheldon Fung, Xuequan Lu, Dasith de Silva Edirimuni et al.

ECCV 2024poster
7
citations
#1018

Category-level Object Detection, Pose Estimation and Reconstruction from Stereo Images

Chuanrui Zhang, Yonggen Ling, Minglei Lu et al.

ECCV 2024posterarXiv:2407.06984
7
citations
#1019

Learning to Complement and to Defer to Multiple Users

Zheng Zhang, Wenjie Ai, Kevin Wells et al.

ECCV 2024posterarXiv:2407.07003
7
citations
#1020

OneTrack: Demystifying the Conflict Between Detection and Tracking in End-to-End 3D Trackers

Qitai Wang, Jiawei He, Yuntao Chen et al.

ECCV 2024poster
7
citations
#1021

Zero-Shot Multi-Object Scene Completion

Shun Iwase, Katherine Liu, Vitor Guizilini et al.

ECCV 2024posterarXiv:2403.14628
7
citations
#1022

PARIS3D: Reasoning-based 3D Part Segmentation Using Large Multimodal Model

Amrin Kareem, Jean Lahoud, Hisham Cholakkal

ECCV 2024posterarXiv:2404.03836
7
citations
#1023

Affective Visual Dialog: A Large-Scale Benchmark for Emotional Reasoning Based on Visually Grounded Conversations

KILICHBEK HAYDAROV, Xiaoqian Shen, Avinash Madasu et al.

ECCV 2024posterarXiv:2308.16349
7
citations
#1024

Elegantly Written: Disentangling Writer and Character Styles for Enhancing Online Chinese Handwriting

Yu Liu, Fatimah binti Khalid, Lei Wang et al.

ECCV 2024poster
7
citations
#1025

Personalized Video Relighting With an At-Home Light Stage

Jun Myeong Choi, Max Christman, Roni Sengupta

ECCV 2024posterarXiv:2311.08843
7
citations
#1026

Fisher Calibration for Backdoor-Robust Heterogeneous Federated Learning

Wenke Huang, Mang Ye, zekun shi et al.

ECCV 2024poster
7
citations
#1027

Synergy of Sight and Semantics: Visual Intention Understanding with CLIP

Qu Yang, Mang Ye, Dacheng Tao

ECCV 2024poster
7
citations
#1028

High-Fidelity and Transferable NeRF Editing by Frequency Decomposition

YISHENG HE, Weihao Yuan, Siyu Zhu et al.

ECCV 2024posterarXiv:2404.02514
7
citations
#1029

FrePolad: Frequency-Rectified Point Latent Diffusion for Point Cloud Generation

Chenliang Zhou, Fangcheng Zhong, Param Hanji et al.

ECCV 2024posterarXiv:2311.12090
7
citations
#1030

Rawformer: Unpaired Raw-to-Raw Translation for Learnable Camera ISPs

Georgy Perevozchikov, Nancy Mehta, Mahmoud Afifi et al.

ECCV 2024posterarXiv:2404.10700
7
citations
#1031

DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors

Zizheng Yan, Jiapeng Zhou, Fanpeng Meng et al.

ECCV 2024posterarXiv:2407.16260
7
citations
#1032

Background Adaptation with Residual Modeling for Exemplar-Free Class-Incremental Semantic Segmentation

Anqi Zhang, Guangyu Gao

ECCV 2024posterarXiv:2407.09838
7
citations
#1033

DiffCD: A Symmetric Differentiable Chamfer Distance for Neural Implicit Surface Fitting

Linus Härenstam-Nielsen, Lu Sang, Abhishek Saroha et al.

ECCV 2024posterarXiv:2407.17058
7
citations
#1034

BlinkVision: A Benchmark for Optical Flow, Scene Flow and Point Tracking Estimation using RGB Frames and Events

Yijin Li, Yichen Shen, Zhaoyang Huang et al.

ECCV 2024posterarXiv:2410.20451
7
citations
#1035

Agglomerative Token Clustering

Joakim Bruslund Haurum, Sergio Escalera, Graham W. Taylor et al.

ECCV 2024posterarXiv:2409.11923
7
citations
#1036

Camera-LiDAR Cross-modality Gait Recognition

Wenxuan Guo, Yingping Liang, Zhiyu Pan et al.

ECCV 2024posterarXiv:2407.02038
7
citations
#1037

Unmasking Bias in Diffusion Model Training

Hu Yu, Li Shen, Jie Huang et al.

ECCV 2024posterarXiv:2310.08442
7
citations
#1038

Image-adaptive 3D Lookup Tables for Real-time Image Enhancement with Bilateral Grids

Wontae Kim, Nam Ik Cho

ECCV 2024poster
7
citations
#1039

PaPr: Training-Free One-Step Patch Pruning with Lightweight ConvNets for Faster Inference

Tanvir Mahmud, Burhaneddin Yaman, Chun-Hao Liu et al.

ECCV 2024posterarXiv:2403.16020
7
citations
#1040

Reshaping the Online Data Buffering and Organizing Mechanism for Continual Test-Time Adaptation

Zhilin Zhu, Xiaopeng Hong, Zhiheng Ma et al.

ECCV 2024posterarXiv:2407.09367
7
citations
#1041

Text-Guided Video Masked Autoencoder

David Fan, Jue Wang, Shuai Liao et al.

ECCV 2024posterarXiv:2408.00759
7
citations
#1042

Layout-Corrector: Alleviating Layout Sticking Phenomenon in Discrete Diffusion Model

Shoma Iwai, Atsuki Osanai, Shunsuke Kitada et al.

ECCV 2024posterarXiv:2409.16689
7
citations
#1043

Audio-visual Generalized Zero-shot Learning the Easy Way

Shentong Mo, Pedro Morgado

ECCV 2024posterarXiv:2407.13095
7
citations
#1044

Enhancing Tampered Text Detection through Frequency Feature Fusion and Decomposition

Zhongxi Chen, Shen Chen, Taiping Yao et al.

ECCV 2024poster
7
citations
#1045

Robust Zero-Shot Crowd Counting and Localization with Adaptive Resolution SAM

Jia Wan, qiangqiang wu, Wei Lin et al.

ECCV 2024posterarXiv:2402.17514
7
citations
#1046

HPE-Li: WiFi-enabled Lightweight Dual Selective Kernel Convolution for Human Pose Estimation

Gian Toan D., Tien Dac Lai, Thien Van Luong et al.

ECCV 2024poster
7
citations
#1047

Temporal Residual Jacobians for Rig-free Motion Transfer

Sanjeev Muralikrishnan, Niladri Shekhar Dutt, Siddhartha Chaudhuri et al.

ECCV 2024posterarXiv:2407.14958
7
citations
#1048

Click Prompt Learning with Optimal Transport for Interactive Segmentation

Jie Liu, haochen wang, Wenzhe Yin et al.

ECCV 2024poster
7
citations
#1049

Concise Plane Arrangements for Low-Poly Surface and Volume Modelling

Raphael Sulzer, Florent Lafarge

ECCV 2024posterarXiv:2404.06154
7
citations
#1050

From Fake to Real: Pretraining on Balanced Synthetic Images to Prevent Spurious Correlations in Image Recognition

Maan Qraitem, Kate Saenko, Bryan Plummer

ECCV 2024posterarXiv:2308.04553
7
citations
#1051

Frontier-enhanced Topological Memory with Improved Exploration Awareness for Embodied Visual Navigation

Xinru Cui, Qiming Liu, Zhe Liu et al.

ECCV 2024poster
7
citations
#1052

Unsupervised Multi-modal Medical Image Registration via Invertible Translation

Mengjie Guo

ECCV 2024poster
7
citations
#1053

PoseEmbroider: Towards a 3D, Visual, Semantic-aware Human Pose Representation

Ginger Delmas, Philippe Weinzaepfel, Francesc Moreno et al.

ECCV 2024posterarXiv:2409.06535
7
citations
#1054

Latent-INR: A Flexible Framework for Implicit Representations of Videos with Discriminative Semantics

Shishira R Maiya, Anubhav Anubhav, Matthew Gwilliam et al.

ECCV 2024posterarXiv:2408.02672
7
citations
#1055

Learn to Memorize and to Forget: A Continual Learning Perspective of Dynamic SLAM

Baicheng Li, Zike Yan, Dong Wu et al.

ECCV 2024posterarXiv:2407.13338
7
citations
#1056

Token Compensator: Altering Inference Cost of Vision Transformer without Re-Tuning

Shibo Jie, Yehui Tang, Jianyuan Guo et al.

ECCV 2024posterarXiv:2408.06798
7
citations
#1057

Shedding More Light on Robust Classifiers under the lens of Energy-based Models

Mujtaba Hussain Mirza, Maria Rosaria Briglia, Senad Beadini et al.

ECCV 2024posterarXiv:2407.06315
7
citations
#1058

Dissolving Is Amplifying: Towards Fine-Grained Anomaly Detection

Jian Shi, Pengyi Zhang, Ni Zhang et al.

ECCV 2024posterarXiv:2302.14696
7
citations
#1059

Elucidating the Hierarchical Nature of Behavior with Masked Autoencoders

Lucas Stoffl, Andy Bonnetto, Stéphane D'Ascoli et al.

ECCV 2024poster
7
citations
#1060

Overcome Modal Bias in Multi-modal Federated Learning via Balanced Modality Selection

Yunfeng Fan, Wenchao Xu, Haozhao Wang et al.

ECCV 2024posterarXiv:2401.00403
7
citations
#1061

A high-quality robust diffusion framework for corrupted dataset

Quan Dao, Binh Ta, Tung Pham et al.

ECCV 2024posterarXiv:2311.17101
7
citations
#1062

Hetecooper: Feature Collaboration Graph for Heterogeneous Collaborative Perception

Congzhang Shao, Guiyang Luo, Quan Yuan et al.

ECCV 2024poster
7
citations
#1063

PairingNet: A Learning-based Pair-searching and -matching Network for Image Fragments

rixin zhou, Ding Xia, YI ZHANG et al.

ECCV 2024posterarXiv:2312.08704
7
citations
#1064

Fast Training of Diffusion Transformer with Extreme Masking for 3D Point Clouds Generation

Shentong Mo, Enze Xie, Yue Wu et al.

ECCV 2024posterarXiv:2312.07231
7
citations
#1065

Modelling Competitive Behaviors in Autonomous Driving Under Generative World Model

Guanren Qiao, Guiliang Liu, Guorui Quan et al.

ECCV 2024poster
7
citations
#1066

Fast Encoding and Decoding for Implicit Video Representation

Hao Chen, Saining Xie, Ser-Nam Lim et al.

ECCV 2024posterarXiv:2409.19429
7
citations
#1067

NePhi: Neural Deformation Fields for Approximately Diffeomorphic Medical Image Registration

Lin Tian, Thomas H Greer, Raul San Jose Estepar et al.

ECCV 2024posterarXiv:2309.07322
7
citations
#1068

Occlusion-Aware Seamless Segmentation

Yihong Cao, Jiaming Zhang, Hao Shi et al.

ECCV 2024posterarXiv:2407.02182
6
citations
#1069

Efficient Diffusion-Driven Corruption Editor for Test-Time Adaptation

Yeongtak Oh, Jonghyun Lee, Jooyoung Choi et al.

ECCV 2024posterarXiv:2403.10911
6
citations
#1070

Two-Stage Active Learning for Efficient Temporal Action Segmentation

Yuhao Su, Ehsan Elhamifar

ECCV 2024poster
6
citations
#1071

DreamView: Injecting View-specific Text Guidance into Text-to-3D Generation

Junkai Yan, Yipeng Gao, Qize Yang et al.

ECCV 2024posterarXiv:2404.06119
6
citations
#1072

Semantically Guided Representation Learning For Action Anticipation

Anxhelo Diko, Danilo Avola, Bardh Prenkaj et al.

ECCV 2024posterarXiv:2407.02309
6
citations
#1073

Spherical World-Locking for Audio-Visual Localization in Egocentric Videos

Heeseung Yun, Ruohan Gao, Ishwarya Ananthabhotla et al.

ECCV 2024posterarXiv:2408.05364
6
citations
#1074

Constructing Concept-based Models to Mitigate Spurious Correlations with Minimal Human Effort

Jeeyung Kim, Ze Wang, Qiang Qiu

ECCV 2024posterarXiv:2407.08947
6
citations
#1075

MeshVPR: Citywide Visual Place Recognition Using 3D Meshes

Gabriele Berton, Lorenz Junglas, Riccardo Zaccone et al.

ECCV 2024posterarXiv:2406.02776
6
citations
#1076

Domain-Adaptive 2D Human Pose Estimation via Dual Teachers in Extremely Low-Light Conditions

Yihao Ai, Yifei Qi, Bo Wang et al.

ECCV 2024posterarXiv:2407.15451
6
citations
#1077

SPIN: Hierarchical Segmentation with Subpart Granularity in Natural Images

josh myers-dean, Jarek T Reynolds, Brian Price et al.

ECCV 2024posterarXiv:2407.09686
6
citations
#1078

Event Trojan: Asynchronous Event-based Backdoor Attacks

Ruofei Wang, Qing Guo, Haoliang Li et al.

ECCV 2024posterarXiv:2407.06838
6
citations
#1079

3D-GOI: 3D GAN Omni-Inversion for Multifaceted and Multi-object Editing

Haoran Li, Long Ma, Haolin Shi et al.

ECCV 2024posterarXiv:2311.12050
6
citations
#1080

Edge-Guided Fusion and Motion Augmentation for Event-Image Stereo

Fengan Zhao, Qianang Zhou, Junlin Xiong

ECCV 2024poster
6
citations
#1081

LiDAR-based All-weather 3D Object Detection via Prompting and Distilling 4D Radar

Yujeong Chae, HYEONSEONG KIM, Changgyoon Oh et al.

ECCV 2024poster
6
citations
#1082

LMT-GP: Combined Latent Mean-Teacher and Gaussian Process for Semi-supervised Low-light Image Enhancement

Ye Yu, Fengxin Chen, Jun Yu et al.

ECCV 2024posterarXiv:2408.16235
6
citations
#1083

ProDepth: Boosting Self-Supervised Multi-Frame Monocular Depth with Probabilistic Fusion

Sungmin Woo, Wonjoon Lee, Woo Jin Kim et al.

ECCV 2024posterarXiv:2407.09303
6
citations
#1084

AFreeCA: Annotation-Free Counting for All

Adriano DAlessandro, Ali Mahdavi-Amiri, Ghassan Hamarneh

ECCV 2024posterarXiv:2403.04943
6
citations
#1085

AlignDiff: Aligning Diffusion Models for General Few-Shot Segmentation

Ri-Zhao Qiu, Yu-Xiong Wang, Kris Hauser

ECCV 2024poster
6
citations
#1086

Long-range Turbulence Mitigation: A Large-scale Dataset and A Coarse-to-fine Framework

Shengqi Xu, Run Sun, Yi Chang et al.

ECCV 2024posterarXiv:2407.08377
6
citations
#1087

Interleaving One-Class and Weakly-Supervised Models with Adaptive Thresholding for Unsupervised Video Anomaly Detection

Yongwei Nie, Hao Huang, Chengjiang Long et al.

ECCV 2024posterarXiv:2401.13551
6
citations
#1088

STSP: Spatial-Temporal Subspace Projection for Video Class-incremental Learning

Hao CHENG, SIYUAN YANG, Chong Wang et al.

ECCV 2024poster
6
citations
#1089

E3M: Zero-Shot Spatio-Temporal Video Grounding with Expectation-Maximization Multimodal Modulation

Peijun Bao, Zihao Shao, Wenhan Yang et al.

ECCV 2024poster
6
citations
#1090

Adapt without Forgetting: Distill Proximity from Dual Teachers in Vision-Language Models

MENGYU ZHENG, Yehui Tang, Zhiwei Hao et al.

ECCV 2024poster
6
citations
#1091

ADMap: Anti-disturbance Framework for Vectorized HD Map Construction

Haotian Hu, Fanyi Wang, Yaonong Wang et al.

ECCV 2024poster
6
citations
#1092

ML-SemReg: Boosting Point Cloud Registration with Multi-level Semantic Consistency

Shaocheng Yan, Pengcheng Shi, Jiayuan Li

ECCV 2024posterarXiv:2407.09862
6
citations
#1093

DEVIAS: Learning Disentangled Video Representations of Action and Scene

Kyungho Bae, Youngrae Kim, Geo Ahn et al.

ECCV 2024posterarXiv:2312.00826
6
citations
#1094

PolyRoom: Room-aware Transformer for Floorplan Reconstruction

Yuzhou Liu, Lingjie Zhu, Xiaodong Ma et al.

ECCV 2024posterarXiv:2407.10439
6
citations
#1095

Delving Deep into Engagement Prediction of Short Videos

dasong Li, Wenjie Li, Baili Lu et al.

ECCV 2024posterarXiv:2410.00289
6
citations
#1096

De-confounded Gaze Estimation

Ziyang Liang, Yiwei Bao, Feng Lu

ECCV 2024poster
6
citations
#1097

Rethinking LiDAR Domain Generalization: Single Source as Multiple Density Domains

Jaeyeul Kim, Jungwan Woo, Jeonghoon Kim et al.

ECCV 2024posterarXiv:2312.12098
6
citations
#1098

This Probably Looks Exactly Like That: An Invertible Prototypical Network

Zachariah Carmichael, Timothy Redgrave, Daniel Gonzalez Cedre et al.

ECCV 2024posterarXiv:2407.12200
6
citations
#1099

VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding

Ofir Abramovich, Niv Nayman, Sharon Fogel et al.

ECCV 2024posterarXiv:2407.12594
6
citations
#1100

DA-BEV: Unsupervised Domain Adaptation for Bird's Eye View Perception

Kai Jiang, Jiaxing Huang, Weiying Xie et al.

ECCV 2024posterarXiv:2401.08687
6
citations
#1101

DHR: Dual Features-Driven Hierarchical Rebalancing in Inter- and Intra-Class Regions for Weakly-Supervised Semantic Segmentation

Sanghyun Jo, Fei Pan, In-Jae Yu et al.

ECCV 2024posterarXiv:2404.00380
6
citations
#1102

Better Regression Makes Better Test-time Adaptive 3D Object Detection

Jiakang Yuan, Bo Zhang, Kaixiong Gong et al.

ECCV 2024poster
6
citations
#1103

Domain Generalization of 3D Object Detection by Density-Resampling

Shuangzhi Li, Lei Ma, Xingyu Li

ECCV 2024posterarXiv:2311.10845
6
citations
#1104

Noise-assisted Prompt Learning for Image Forgery Detection and Localization

Dong Li, Jiaying Zhu, Xueyang Fu et al.

ECCV 2024poster
6
citations
#1105

Pseudo-keypoint RKHS Learning for Self-supervised 6DoF Pose Estimation

Yangzheng Wu, Michael Alan Greenspan

ECCV 2024posterarXiv:2311.09500
6
citations
#1106

MAP-ADAPT: Real-Time Quality-Adaptive Semantic 3D Maps

Jianhao Zheng, Daniel Barath, Marc Pollefeys et al.

ECCV 2024posterarXiv:2406.05849
6
citations
#1107

VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition

Ahmad Khaliq, Ming Xu, Stephen Hausler et al.

ECCV 2024posterarXiv:2409.19293
6
citations
#1108

Watching it in Dark: A Target-aware Representation Learning Framework for High-Level Vision Tasks in Low Illumination

Yunan LI, Yihao Zhang, Shoude Li et al.

ECCV 2024poster
6
citations
#1109

Test-Time Stain Adaptation with Diffusion Models for Histopathology Image Classification

Cheng-Chang Tsai, Yuan-Chih Chen, Chun-Shien Lu

ECCV 2024poster
6
citations
#1110

Bidirectional Uncertainty-Based Active Learning for Open-Set Annotation

ChenChen Zong, Ye-Wen Wang, Kun-Peng Ning et al.

ECCV 2024posterarXiv:2402.15198
6
citations
#1111

Adaptive Multi-task Learning for Few-shot Object Detection

Yan Ren, Yanling Li, Wai-Kin Adams Kong

ECCV 2024poster
6
citations
#1112

Quanta Video Restoration

PRATEEK CHENNURI, Yiheng Chi, Enze Jiang et al.

ECCV 2024posterarXiv:2410.14994
6
citations
#1113

Viewpoint textual inversion: discovering scene representations and 3D view control in 2D diffusion models

James Burgess, Kuan-Chieh Wang, Serena Yeung-Levy

ECCV 2024posterarXiv:2309.07986
6
citations
#1114

Finding NeMo: Negative-mined Mosaic Augmentation for Referring Image Segmentation

Seongsu Ha, Chaeyun Kim, Donghwa Kim et al.

ECCV 2024posterarXiv:2411.01494
6
citations
#1115

Differentiable Product Quantization for Memory Efficient Camera Relocalization

Zakaria Laskar, Iaroslav Melekhov, Assia Benbihi et al.

ECCV 2024posterarXiv:2407.15540
6
citations
#1116

DECOLLAGE: 3D Detailization by Controllable, Localized, and Learned Geometry Enhancement

Qimin Chen, Zhiqin Chen, Vladimir Kim et al.

ECCV 2024posterarXiv:2409.06129
6
citations
#1117

CMD: A Cross Mechanism Domain Adaptation Dataset for 3D Object Detection

Jinhao Deng, Wei Ye, Hai Wu et al.

ECCV 2024poster
6
citations
#1118

Region-centric Image-Language Pretraining for Open-Vocabulary Detection

Dahun Kim, Anelia Angelova, Weicheng Kuo

ECCV 2024posterarXiv:2310.00161
6
citations
#1119

PDiscoFormer: Relaxing Part Discovery Constraints with Vision Transformers

Ananthu Aniraj, Cassio F. Dantas, Dino Ienco et al.

ECCV 2024posterarXiv:2407.04538
6
citations
#1120

Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas

Fabio Quattrini, Vittorio Pippi, Silvia Cascianelli et al.

ECCV 2024posterarXiv:2408.15660
6
citations
#1121

Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment

Yuxiao Chen, Kai Li, Wentao Bao et al.

ECCV 2024posterarXiv:2409.16145
6
citations
#1122

SPVLoc: Semantic Panoramic Viewport Matching for 6D Camera Localization in Unseen Environments

Niklas Gard, Anna Hilsmann, Peter Eisert

ECCV 2024posterarXiv:2404.10527
6
citations
#1123

GenQ: Quantization in Low Data Regimes with Generative Synthetic Data

YUHANG LI, Youngeun Kim, Donghyun Lee et al.

ECCV 2024posterarXiv:2312.05272
6
citations
#1124

Investigating Style Similarity in Diffusion Models

Gowthami Somepalli, Anubhav Anubhav, Kamal Gupta et al.

ECCV 2024poster
6
citations
#1125

Geometry Fidelity for Spherical Images

Anders Christensen, Nooshin Mojab, Khushman Patel et al.

ECCV 2024posterarXiv:2407.18207
6
citations
#1126

Conceptual Codebook Learning for Vision-Language Models

Yi Zhang, Ke Yu, Siqi Wu et al.

ECCV 2024posterarXiv:2407.02350
6
citations
#1127

X-InstructBLIP: A Framework for Aligning Image, 3D, Audio, Video to LLMs and its Emergent Cross-modal Reasoning

Artemis Panagopoulou, Le Xue, Ning Yu et al.

ECCV 2024poster
6
citations
#1128

Risk-Aware Self-Consistent Imitation Learning for Trajectory Planning in Autonomous Driving

Yixuan Fan, Ya-Li Li, Shengjin Wang

ECCV 2024poster
6
citations
#1129

LLMCO4MR: LLMs-aided Neural Combinatorial Optimization for Ancient Manuscript Restoration from Fragments with Case Studies on Dunhuang

Yuqing Zhang, Hangqi Li, Shengyu Zhang et al.

ECCV 2024poster
6
citations
#1130

Hierarchical Conditioning of Diffusion Models Using Tree-of-Life for Studying Species Evolution

Mridul Khurana, Arka Daw, M. Maruf et al.

ECCV 2024posterarXiv:2408.00160
6
citations
#1131

Operational Open-Set Recognition and PostMax Refinement

Steve Cruz, Ryan Rabinowitz, Manuel Günther et al.

ECCV 2024poster
6
citations
#1132

Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Prompts

Jianhao Li, Tianyu Sun, Zhongdao Wang et al.

ECCV 2024posterarXiv:2407.11382
6
citations
#1133

UDA-Bench: Revisiting Common Assumptions in Unsupervised Domain Adaptation Using a Standardized Framework

Tarun Kalluri, Sreyas Ravichandran, Manmohan Chandraker

ECCV 2024posterarXiv:2409.15264
6
citations
#1134

Discovering Novel Actions from Open World Egocentric Videos with Object-Grounded Visual Commonsense Reasoning

Sanjoy Kundu, Shubham Trehan, Sathyanarayanan Aakur

ECCV 2024posterarXiv:2305.16602
5
citations
#1135

Active Generation for Image Classification

Tao Huang, Jiaqi Liu, Shan You et al.

ECCV 2024posterarXiv:2403.06517
5
citations
#1136

AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation

Jiannan Ge, Lingxi Xie, Hongtao Xie et al.

ECCV 2024posterarXiv:2404.05667
5
citations
#1137

Temporal Residual Guided Diffusion Framework for Event-Driven Video Reconstruction

Lin Zhu, Yunlong Zheng, Yijun Zhang et al.

ECCV 2024posterarXiv:2407.10636
5
citations
#1138

S^3D-NeRF: Single-Shot Speech-Driven Neural Radiance Field for High Fidelity Talking Head Synthesis

Dongze Li, Kang Zhao, WEI WANG et al.

ECCV 2024posterarXiv:2408.09347
5
citations
#1139

The Gaussian Discriminant Variational Autoencoder (GdVAE): A Self-Explainable Model with Counterfactual Explanations

Anselm Haselhoff, Kevin Trelenberg, Fabian Küppers et al.

ECCV 2024posterarXiv:2409.12952
5
citations
#1140

Revisit Event Generation Model: Self-Supervised Learning of Event-to-Video Reconstruction with Implicit Neural Representations

Zipeng Wang, yunfan lu, LIN WANG

ECCV 2024posterarXiv:2407.18500
5
citations
#1141

Text-Anchored Score Composition: Tackling Condition Misalignment in Text-to-Image Diffusion Models

Luozhou Wang, Guibao Shen, Wenhang Ge et al.

ECCV 2024posterarXiv:2306.14408
5
citations
#1142

Understanding Multi-compositional learning in Vision and Language models via Category Theory

Sotirios Panagiotis Takis Chytas, Hyunwoo J. Kim, Vikas Singh

ECCV 2024poster
5
citations
#1143

Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation

Yuwen Pan, Rui Sun, Naisong Luo et al.

ECCV 2024posterarXiv:2408.13838
5
citations
#1144

AddMe: Zero-shot Group-photo Synthesis by Inserting People into Scenes

Dongxu Yue, Maomao Li, Yunfei Liu et al.

ECCV 2024poster
5
citations
#1145

cDP-MIL: Robust Multiple Instance Learning via Cascaded Dirichlet Process

Yihang Chen, TSAI HOR CHAN, Guosheng Yin et al.

ECCV 2024posterarXiv:2407.11448
5
citations
#1146

Revisiting Calibration of Wide-Angle Radially Symmetric Cameras

Andrea Porfiri Dal Cin, Francesco Azzoni, Giacomo Boracchi et al.

ECCV 2024poster
5
citations
#1147

SpecFormer: Guarding Vision Transformer Robustness via Maximum Singular Value Penalization

Xixu Hu, Runkai Zheng, Jindong Wang et al.

ECCV 2024posterarXiv:2402.03317
5
citations
#1148

FastPCI: Motion-Structure Guided Fast Point Cloud Frame Interpolation

tianyu zhang, Guocheng Qian, Jin Xie et al.

ECCV 2024posterarXiv:2410.19573
5
citations
#1149

DECap: Towards Generalized Explicit Caption Editing via Diffusion Mechanism

Zhen Wang, Xinyun Jiang, Jun Xiao et al.

ECCV 2024posterarXiv:2311.14920
5
citations
#1150

WebRPG: Automatic Web Rendering Parameters Generation for Visual Presentation

Zirui Shao, Feiyu Gao, Hangdi Xing et al.

ECCV 2024posterarXiv:2407.15502
5
citations
#1151

Efficient Depth-Guided Urban View Synthesis

sheng miao, Jiaxin Huang, Dongfeng Bai et al.

ECCV 2024posterarXiv:2407.12395
5
citations
#1152

Raising the Ceiling: Conflict-Free Local Feature Matching with Dynamic View Switching

Xiaoyong Lu, Songlin Du

ECCV 2024posterarXiv:2407.07789
5
citations
#1153

Long-Tail Temporal Action Segmentation with Group-wise Temporal Logit Adjustment

Zhanzhong Pang, Fadime Sener, Shrinivas Ramasubramanian et al.

ECCV 2024posterarXiv:2408.09919
5
citations
#1154

Spike-Temporal Latent Representation for Energy-Efficient Event-to-Video Reconstruction

Jianxiong Tang, Jian-Huang Lai, Lingxiao Yang et al.

ECCV 2024poster
5
citations
#1155

EgoPoseFormer: A Simple Baseline for Stereo Egocentric 3D Human Pose Estimation

Chenhongyi Yang, Anastasia Tkach, Shreyas Hampali et al.

ECCV 2024posterarXiv:2403.18080
5
citations
#1156

Active Coarse-to-Fine Segmentation of Moveable Parts from Real Images

Ruiqi Wang, Akshay Gadi Patil, Fenggen Yu et al.

ECCV 2024posterarXiv:2303.11530
5
citations
#1157

FinePseudo: Improving Pseudo-Labelling through Temporal-Alignablity for Semi-Supervised Fine-Grained Action Recognition

Ishan Rajendrakumar Dave, Mamshad Nayeem Rizve, Shah Mubarak

ECCV 2024posterarXiv:2409.01448
5
citations
#1158

Open Vocabulary Multi-Label Video Classification

Rohit Gupta, Mamshad Nayeem Rizve, Jayakrishnan Unnikrishnan et al.

ECCV 2024posterarXiv:2407.09073
5
citations
#1159

GAReT: Cross-view Video Geolocalization with Adapters and Auto-Regressive Transformers

Manu S Pillai, Mamshad Nayeem Rizve, Shah Mubarak

ECCV 2024posterarXiv:2408.02840
5
citations
#1160

Temporal As a Plugin: Unsupervised Video Denoising with Pre-Trained Image Denoisers

Zixuan Fu, Lanqing Guo, Chong Wang et al.

ECCV 2024posterarXiv:2409.11256
5
citations
#1161

Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation

Xinyu Yang, Hossein Rahmani, Sue Black et al.

ECCV 2024posterarXiv:2402.17891
5
citations
#1162

Region-Aware Sequence-to-Sequence Learning for Hyperspectral Denoising

JiaHua Xiao, Yang Liu, Xing Wei

ECCV 2024poster
5
citations
#1163

Adaptive Multi-modal Fusion of Spatially Variant Kernel Refinement with Diffusion Model for Blind Image Super-Resolution

Junxiong Lin, Yan Wang, Zeng Tao et al.

ECCV 2024posterarXiv:2403.05808
5
citations
#1164

OvSW: Overcoming Silent Weights for Accurate Binary Neural Networks

JINGYANG XIANG, Zuohui Chen, Siqi Li et al.

ECCV 2024posterarXiv:2407.05257
5
citations
#1165

Surface-Centric Modeling for High-Fidelity Generalizable Neural Surface Reconstruction

Rui Peng, Shihe Shen, Kaiqiang Xiong et al.

ECCV 2024posterarXiv:2409.03634
5
citations
#1166

Scaling Up Personalized Image Aesthetic Assessment via Task Vector Customization

Jooyeol Yun, Choo Jaegul

ECCV 2024posterarXiv:2407.07176
5
citations
#1167

Reprojection Errors as Prompts for Efficient Scene Coordinate Regression

Ting-Ru Liu, Hsuan-Kung Yang, Jou-Min Liu et al.

ECCV 2024posterarXiv:2409.04178
5
citations
#1168

FREST: Feature RESToration for Semantic Segmentation under Multiple Adverse Conditions

Sohyun Lee, Namyup Kim, Sungyeon Kim et al.

ECCV 2024posterarXiv:2407.13437
5
citations
#1169

LiveHPS++: Robust and Coherent Motion Capture in Dynamic Free Environment

Yiming Ren, Xiao Han, Yichen Yao et al.

ECCV 2024posterarXiv:2407.09833
5
citations
#1170

DATENeRF: Depth-Aware Text-based Editing of NeRFs

Sara Rojas Martinez, Julien Philip, Kai Zhang et al.

ECCV 2024posterarXiv:2404.04526
5
citations
#1171

Reliable Spatial-Temporal Voxels For Multi-Modal Test-Time Adaptation

Haozhi Cao, Yuecong Xu, Jianfei Yang et al.

ECCV 2024poster
5
citations
#1172

General Geometry-aware Weakly Supervised 3D Object Detection

Guowen Zhang, Junsong Fan, Liyi Chen et al.

ECCV 2024posterarXiv:2407.13748
5
citations
#1173

Instance-dependent Noisy-label Learning with Graphical Model Based Noise-rate Estimation

Arpit Garg, Cuong Cao Nguyen, RAFAEL FELIX et al.

ECCV 2024posterarXiv:2305.19486
5
citations
#1174

Task-Driven Uncertainty Quantification in Inverse Problems via Conformal Prediction

Jeffrey Wen, Rizwan Ahmad, Phillip Schniter

ECCV 2024posterarXiv:2405.18527
5
citations
#1175

Event-based Mosaicing Bundle Adjustment

Shuang Guo, Guillermo Gallego

ECCV 2024posterarXiv:2409.07365
5
citations
#1176

Correspondence-Free SE(3) Point Cloud Registration in RKHS via Unsupervised Equivariant Learning

Ray Zhang, Zheming Zhou, Min Sun et al.

ECCV 2024posterarXiv:2407.20223
5
citations
#1177

Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning

Jihai Zhang, Xiang Lan, Xiaoye Qu et al.

ECCV 2024posterarXiv:2402.11816
5
citations
#1178

RaFE: Generative Radiance Fields Restoration

Zhongkai Wu, Ziyu Wan, Jing Zhang et al.

ECCV 2024posterarXiv:2404.03654
5
citations
#1179

Leveraging Hierarchical Feature Sharing for Efficient Dataset Condensation

Haizhong Zheng, Jiachen Sun, Shutong Wu et al.

ECCV 2024posterarXiv:2310.07506
5
citations
#1180

A Fair Ranking and New Model for Panoptic Scene Graph Generation

Julian Lorenz, Alexander Pest, Daniel Kienzle et al.

ECCV 2024posterarXiv:2407.09216
5
citations
#1181

ProSub: Probabilistic Open-Set Semi-Supervised Learning with Subspace-Based Out-of-Distribution Detection

Erik Wallin, Lennart Svensson, Fredrik Kahl et al.

ECCV 2024posterarXiv:2407.11735
5
citations
#1182

Using My Artistic Style? You Must Obtain My Authorization

Xiuli Bi, Haowei Liu, Weisheng Li et al.

ECCV 2024poster
5
citations
#1183

Style-Extracting Diffusion Models for Semi-Supervised Histopathology Segmentation

Mathias Öttl, Frauke Wilm, Jana Steenpass et al.

ECCV 2024posterarXiv:2403.14429
5
citations
#1184

Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object Detection

Zhili Chen, Shuangjie Xu, Maosheng Ye et al.

ECCV 2024posterarXiv:2407.15354
5
citations
#1185

Mahalanobis Distance-based Multi-view Optimal Transport for Multi-view Crowd Localization

Qi Zhang, Kaiyi Zhang, Antoni Chan et al.

ECCV 2024posterarXiv:2409.01726
5
citations
#1186

OpenKD: Opening Prompt Diversity for Zero- and Few-shot Keypoint Detection

Changsheng Lu, Zheyuan Liu, Piotr Koniusz

ECCV 2024posterarXiv:2409.19899
5
citations
#1187

Stable Video Portraits

Mirela Ostrek, Justus Thies

ECCV 2024posterarXiv:2409.18083
5
citations
#1188

FastCAD: Real-Time CAD Retrieval and Alignment from Scans and Videos

Florian Langer, Jihong Ju, Georgi Dikov et al.

ECCV 2024posterarXiv:2403.15161
5
citations
#1189

Bucketed Ranking-based Losses for Efficient Training of Object Detectors

Feyza Yavuz, Baris Can Cam, Adnan Harun Dogan et al.

ECCV 2024posterarXiv:2407.14204
5
citations
#1190

SelfGeo: Self-supervised and Geodesic-consistent Estimation of Keypoints on Deformable Shapes

Mohammad Zohaib, Luca Cosmo, Alessio Del Bue

ECCV 2024posterarXiv:2408.02291
5
citations
#1191

CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings

Cristina Mata, Kanchana N Ranasinghe, Michael S Ryoo

ECCV 2024posterarXiv:2507.07125
5
citations
#1192

SuperFedNAS: Cost-Efficient Federated Neural Architecture Search for On-Device Inference

Alind Khare, Animesh Agrawal, Aditya Annavajjala et al.

ECCV 2024posterarXiv:2301.10879
5
citations
#1193

HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization

Sakib Reza, Yuexi Zhang, Mohsen Moghaddam et al.

ECCV 2024posterarXiv:2408.06437
5
citations
#1194

nuCraft: Crafting High Resolution 3D Semantic Occupancy for Unified 3D Scene Understanding

Benjin Zhu, zhe wang, Hongsheng LI

ECCV 2024poster
5
citations
#1195

Learning from the Web: Language Drives Weakly-Supervised Incremental Learning for Semantic Segmentation

Chang Liu, Giulia Rizzoli, Pietro Zanuttigh et al.

ECCV 2024posterarXiv:2407.13363
5
citations
#1196

Minimalist Vision with Freeform Pixels

Jeremy Klotz, Shree Nayar

ECCV 2024posterarXiv:2501.00142
5
citations
#1197

Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs

Jeongkee Lim, Yusung Kim

ECCV 2024posterarXiv:2408.02261
5
citations
#1198

Boost Your NeRF: A Model-Agnostic Mixture of Experts Framework for High Quality and Efficient Rendering

Francesco Di Sario, Riccardo Renzulli, Marco Grangetto et al.

ECCV 2024posterarXiv:2407.10389
5
citations
#1199

Local and Global Flatness for Federated Domain Generalization

Hao Yan, Yuhong Guo

ECCV 2024poster
5
citations
#1200

Efficient Training with Denoised Neural Weights

Yifan Gong, Zheng Zhan, Yanyu Li et al.

ECCV 2024posterarXiv:2407.11966
5
citations