Most Cited ECCV "visual prompt strategy" Papers

2,387 papers found • Page 6 of 12

#1001

Toward Tiny and High-quality Facial Makeup with Data Amplify Learning

Qiaoqiao Jin, Xuanhong Chen, Meiguang Jin et al.

ECCV 2024arXiv:2403.15033

citations

#1002

DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion Consistency

Xiaojing Zhong, Xinyi Huang, Xiaofeng Yang et al.

ECCV 2024arXiv:2408.07481

citations

#1003

Learning to Generate Conditional Tri-plane for 3D-aware Expression Controllable Portrait Animation

Taekyung Ki, Dongchan Min, Gyeongsu Chae

ECCV 2024arXiv:2404.00636

citations

#1004

Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors

Ruicheng Wang, Jianfeng Xiang, Jiaolong Yang et al.

ECCV 2024arXiv:2403.11503

citations

#1005

FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models

Wei Wu, Qingnan Fan, Shuai Qin et al.

ECCV 2024arXiv:2404.11895

citations

#1006

BlenderAlchemy: Editing 3D Graphics with Vision-Language Models

Ian Huang, Guandao Yang, Leonidas Guibas

ECCV 2024arXiv:2404.17672

citations

#1007

Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation

Shuangrui Ding, Rui Qian, Haohang Xu et al.

ECCV 2024arXiv:2311.17893

citations

#1008

Self-Supervised Any-Point Tracking by Contrastive Random Walks

Ayush Shrivastava, Andrew Owens

ECCV 2024arXiv:2409.16288

citations

#1009

ProMerge: Prompt and Merge for Unsupervised Instance Segmentation

Dylan Li, Gyungin Shin

ECCV 2024arXiv:2409.18961

citations

#1010

Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models

Xiaoyu Zhu, Hao Zhou, Pengfei Xing et al.

ECCV 2024arXiv:2407.13642

citations

#1011

Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning

Thanh Thong Nguyen, Yi Bin, Xiaobao Wu et al.

ECCV 2024arXiv:2407.03788

citations

#1012

EDformer: Transformer-Based Event Denoising Across Varied Noise Levels

Bin Jiang, Bo Xiong, Bohan Qu et al.

ECCV 2024

citations

#1013

CPM: Class-conditional Prompting Machine for Audio-visual Segmentation

Yuanhong Chen, Chong Wang, Yuyuan Liu et al.

ECCV 2024arXiv:2407.05358

citations

#1014

MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment

Anurag Das, Xinting Hu, Li Jiang et al.

ECCV 2024arXiv:2407.21654

citations

#1015

Few-shot NeRF by Adaptive Rendering Loss Regularization

Qingshan Xu, Xuanyu Yi, Jianyao Xu et al.

ECCV 2024arXiv:2410.17839

citations

#1016

GarmentAligner: Text-to-Garment Generation via Retrieval-augmented Multi-level Corrections

Shiyue Zhang, Zheng Chong, Xujie Zhang et al.

ECCV 2024arXiv:2408.12352

citations

#1017

Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation

Mengchen Zhang, Tong Wu, Tai Wang et al.

ECCV 2024arXiv:2409.18261

citations

#1018

Nonverbal Interaction Detection

Jianan Wei, Tianfei Zhou, Yi Yang et al.

ECCV 2024arXiv:2407.08133

citations

#1019

Efficient Few-Shot Action Recognition via Multi-Level Post-Reasoning

Cong Wu, Xiao-Jun Wu, Linze Li et al.

ECCV 2024

citations

#1020

Deblur e-NeRF: NeRF from Motion-Blurred Events under High-speed or Low-light Conditions

Weng Fei Low, Gim Hee Lee

ECCV 2024arXiv:2409.17988

citations

#1021

Neural Spectral Decomposition for Dataset Distillation

Yang Shaolei, Shen Cheng, Mingbo Hong et al.

ECCV 2024arXiv:2408.16236

citations

#1022

CaesarNeRF: Calibrated Semantic Representation for Few-Shot Generalizable Neural Rendering

Haidong Zhu, Tianyu Ding, Tianyi Chen et al.

ECCV 2024arXiv:2311.15510

citations

#1023

OneVOS: Unifying Video Object Segmentation with All-in-One Transformer Framework

Wanyun Li, Pinxue Guo, Xinyu Zhou et al.

ECCV 2024arXiv:2403.08682

citations

#1024

SHERL: Synthesizing High Accuracy and Efficient Memory for Resource-Limited Transfer Learning

Haiwen Diao, Bo Wan, Xu Jia et al.

ECCV 2024arXiv:2407.07523

citations

#1025

Arbitrary-Scale Video Super-Resolution with Structural and Textural Priors

Wei Shang, Dongwei Ren, Wanying Zhang et al.

ECCV 2024arXiv:2407.09919

citations

#1026

RoadPainter: Points Are Ideal Navigators for Topology transformER

Zhongxing Ma, Liang Shuang, Yongkun Wen et al.

ECCV 2024arXiv:2407.15349

citations

#1027

DiscoMatch: Fast Discrete Optimisation for Geometrically Consistent 3D Shape Matching

Paul Roetzer, Ahmed Abbas, Dongliang Cao et al.

ECCV 2024arXiv:2310.08230

citations

#1028

MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion

Lehong Wu, Lilang Lin, Jiahang Zhang et al.

ECCV 2024arXiv:2409.10473

citations

#1029

Multiscale Sliced Wasserstein Distances as Perceptual Color Difference Measures

Jiaqi He, Zhihua Wang, Leon Wang et al.

ECCV 2024arXiv:2407.10181

citations

#1030

DEAL: Disentangle and Localize Concept-level Explanations for VLMs

Tang Li, Mengmeng Ma, Xi Peng

ECCV 2024arXiv:2407.14412

citations

#1031

DiffusionPen: Towards Controlling the Style of Handwritten Text Generation

Konstantina Nikolaidou, George Retsinas, Giorgos Sfikas et al.

ECCV 2024arXiv:2409.06065

citations

#1032

Weakly-supervised Camera Localization by Ground-to-satellite Image Registration

Yujiao Shi, Hongdong Li, Akhil Perincherry et al.

ECCV 2024arXiv:2409.06471

citations

#1033

NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model

Zhongqun Zhang, Hengfei Wang, Ziwei Yu et al.

ECCV 2024arXiv:2407.12727

citations

#1034

ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation

Yi Zhang, Yun Tang, Wenjie Ruan et al.

ECCV 2024arXiv:2402.15429

citations

#1035

SwapAnything: Enabling Arbitrary Object Swapping in Personalized Image Editing

Jing Gu, Nanxuan Zhao, Wei Xiong et al.

ECCV 2024

citations

#1036

Unlocking Attributes' Contribution to Successful Camouflage: A Combined Textual and Visual Analysis Strategy

Hong Zhang, Yixuan Lyu, Qian Yu et al.

ECCV 2024

citations

#1037

IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation

Yuanhao Zhai, Kevin Lin, Linjie Li et al.

ECCV 2024arXiv:2407.10937

citations

#1038

Class-Agnostic Object Counting with Text-to-Image Diffusion Model

Xiaofei Hui, Qian Wu, Hossein Rahmani et al.

ECCV 2024

citations

#1039

CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts

Yichao Cai, Yuhang Liu, Zhen Zhang et al.

ECCV 2024arXiv:2311.16445

citations

#1040

Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding

Ruihuang Li, Zhengqiang Zhang, Chenhang He et al.

ECCV 2024arXiv:2407.09781

citations

#1041

Inf-DiT: Upsampling any-resolution image with memory-efficient diffusion transformer

Zhuoyi Yang, Heyang Jiang, Wenyi Hong et al.

ECCV 2024arXiv:2405.04312

citations

#1042

TimeCraft: Navigate Weakly-Supervised Temporal Grounded Video Question Answering via Bi-directional Reasoning

Huabin Liu, Xiao Ma, Cheng Zhong et al.

ECCV 2024

citations

#1043

DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation

Haibo Yang, Yang Chen, Yingwei Pan et al.

ECCV 2024arXiv:2409.07454

citations

#1044

Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion

Yu Cao, Shaogang Gong

ECCV 2024arXiv:2407.07249

citations

#1045

Placing Objects in Context via Inpainting for Out-of-distribution Segmentation

Pau de Jorge Aranda, Riccardo Volpi, Puneet Dokania et al.

ECCV 2024arXiv:2402.16392

citations

#1046

Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models

Jinrui Zhang, Teng Wang, Haigang Zhang et al.

ECCV 2024arXiv:2407.11422

citations

#1047

Diff-Reg: Diffusion Model in Doubly Stochastic Matrix Space for Registration Problem

Qianliang Wu, Haobo Jiang, Lei Luo et al.

ECCV 2024

citations

#1048

CoLeaF: A Contrastive-Collaborative Learning Framework for Weakly Supervised Audio-Visual Video Parsing

Faegheh Sardari, Armin Mustafa, Philip JB Jackson et al.

ECCV 2024arXiv:2405.10690

citations

#1049

SAVE: Protagonist Diversification with Structure Agnostic Video Editing

Yeji Song, Wonsik Shin, Junsoo Lee et al.

ECCV 2024arXiv:2312.02503

citations

#1050

DreamStruct: Understanding Slides and User Interfaces via Synthetic Data Generation

Yi-Hao Peng, Faria Huq, Yue Jiang et al.

ECCV 2024arXiv:2410.00201

citations

#1051

Timestep-Aware Correction for Quantized Diffusion Models

Yuzhe Yao, Feng Tian, Jun Chen et al.

ECCV 2024arXiv:2407.03917

citations

#1052

INTRA: Interaction Relationship-aware Weakly Supervised Affordance Grounding

Jiha Jang, Hoigi Seo, Se Young Chun

ECCV 2024arXiv:2409.06210

citations

#1053

OmniSSR: Zero-shot Omnidirectional Image Super-Resolution using Stable Diffusion Model

Runyi Li, Xuhan Sheng, Weiqi Li et al.

ECCV 2024arXiv:2404.10312

citations

#1054

TCAN: Animating Human Images with Temporally Consistent Pose Guidance using Diffusion Models

Jeongho Kim, Min-Jung Kim, Junsoo Lee et al.

ECCV 2024arXiv:2407.09012

citations

#1055

Hybrid Video Diffusion Models with 2D Triplane and 3D Wavelet Representation

Kihong Kim, Haneol Lee, Jihye Park et al.

ECCV 2024arXiv:2402.13729

citations

#1056

Salience-Based Adaptive Masking: Revisiting Token Dynamics for Enhanced Pre-training

Hyesong Choi, Hyejin Park, Kwang Moo Yi et al.

ECCV 2024arXiv:2404.08327

citations

#1057

Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation

Jiawei Han, Kaiqi Liu, Wei Li et al.

ECCV 2024arXiv:2408.10537

citations

#1058

TF-FAS: Twofold-Element Fine-Grained Semantic Guidance for Generalizable Face Anti-Spoofing

Xudong Wang, Ke-Yue Zhang, Taiping Yao et al.

ECCV 2024

citations

#1059

EgoPet: Egomotion and Interaction Data from an Animal's Perspective

Amir Bar, Arya Bakhtiar, Danny L Tran et al.

ECCV 2024arXiv:2404.09991

citations

#1060

Real-time 3D-aware Portrait Editing from a Single Image

Qingyan Bai, Zifan Shi, Yinghao Xu et al.

ECCV 2024arXiv:2402.14000

citations

#1061

Q&A Prompts: Discovering Rich Visual Clues through Mining Question-Answer Prompts for VQA requiring Diverse World Knowledge

Haibo Wang, Weifeng Ge

ECCV 2024arXiv:2401.10712

citations

#1062

MAGR: Manifold-Aligned Graph Regularization for Continual Action Quality Assessment

Kanglei Zhou, Liyuan Wang, Xingxing Zhang et al.

ECCV 2024arXiv:2403.04398

citations

#1063

Real-time Holistic Robot Pose Estimation with Unknown States

Shikun Ban, Juling Fan, Xiaoxuan Ma et al.

ECCV 2024arXiv:2402.05655

citations

#1064

RT-Pose: A 4D Radar-Tensor based 3D Human Pose Estimation and Localization Benchmark

Yuan-Hao Ho, Jen-Hao Cheng, Sheng Yao Kuan et al.

ECCV 2024arXiv:2407.13930

citations

#1065

X-Former: Unifying Contrastive and Reconstruction Learning for MLLMs

Swetha Sirnam, Jinyu Yang, Tal Neiman et al.

ECCV 2024arXiv:2407.13851

citations

#1066

Learning to Make Keypoints Sub-Pixel Accurate

Shinjeong Kim, Marc Pollefeys, Daniel Barath

ECCV 2024arXiv:2407.11668

citations

#1067

DNI: Dilutional Noise Initialization for Diffusion Video Editing

Sunjae Yoon, Gwanhyeong Koo, Ji Woo Hong et al.

ECCV 2024arXiv:2409.13037

citations

#1068

JointDreamer: Ensuring Geometry Consistency and Text Congruence in Text-to-3D Generation via Joint Score Distillation

ChenHan Jiang, Yihan Zeng, Tianyang Hu et al.

ECCV 2024arXiv:2407.12291

citations

#1069

Uncertainty-aware sign language video retrieval with probability distribution modeling

Xuan Wu, Hongxiang Li, Yuanjiang Luo et al.

ECCV 2024arXiv:2405.19689

citations

#1070

CoSIGN: Few-Step Guidance of ConSIstency Model to Solve General INverse Problems

Jiankun Zhao, Bowen Song, Liyue Shen

ECCV 2024arXiv:2407.12676

citations

#1071

Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective

Fangzhou Song, Bin Zhu, Yanbin Hao et al.

ECCV 2024arXiv:2312.04763

citations

#1072

Unrolled Decomposed Unpaired Learning for Controllable Low-Light Video Enhancement

Lingyu Zhu, Wenhan Yang, Baoliang Chen et al.

ECCV 2024arXiv:2408.12316

citations

#1073

Recursive Visual Programming

Jiaxin Ge, Sanjay Subramanian, Baifeng Shi et al.

ECCV 2024arXiv:2312.02249

citations

#1074

Rasterized Edge Gradients: Handling Discontinuities Differentially

Stanislav Pidhorskyi, Tomas Simon, Gabriel Schwartz et al.

ECCV 2024arXiv:2405.02508

citations

#1075

Leveraging Thermal Modality to Enhance Reconstruction in Low-Light Conditions

Jiacong Xu, Mingqian Liao, Ram Prabhakar Kathirvel et al.

ECCV 2024arXiv:2403.14053

citations

#1076

Distributionally Robust Loss for Long-Tailed Multi-Label Image Classification

Dekun Lin, Zhe Cui, Rui Chen et al.

ECCV 2024

citations

#1077

VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space

Guénolé Fiche, Simon Leglaive, Xavier Alameda-Pineda et al.

ECCV 2024arXiv:2312.08291

citations

#1078

Length-Aware Motion Synthesis via Latent Diffusion

Alessio Sampieri, Alessio Palma, Indro Spinelli et al.

ECCV 2024arXiv:2407.11532

citations

#1079

Free Lunch for Gait Recognition: A Novel Relation Descriptor

Jilong Wang, Saihui Hou, Yan Huang et al.

ECCV 2024arXiv:2308.11487

citations

#1080

The Role of Masking for Efficient Supervised Knowledge Distillation of Vision Transformers

Seungwoo Son, Jegwang Ryu, Namhoon Lee et al.

ECCV 2024arXiv:2302.10494

citations

#1081

Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models

Samuele Poppi, Tobia Poppi, Federico Cocchi et al.

ECCV 2024arXiv:2311.16254

citations

#1082

Strike a Balance in Continual Panoptic Segmentation

Jinpeng Chen, Runmin Cong, Yuxuan Luo et al.

ECCV 2024arXiv:2407.16354

citations

#1083

Event-Aided Time-To-Collision Estimation for Autonomous Driving

Jinghang Li, Bangyan Liao, Xiuyuan Lu et al.

ECCV 2024arXiv:2407.07324

citations

#1084

Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning

Fanyue Wei, Wei Zeng, Zhenyang Li et al.

ECCV 2024arXiv:2407.06642

citations

#1085

BEAF: Observing BEfore-AFter Changes to Evaluate Hallucination in Vision-language Models

Ye-Bin Moon, Nam Hyeon-Woo, Wonseok Choi et al.

ECCV 2024arXiv:2407.13442

citations

#1086

Surf-D: Generating High-Quality Surfaces of Arbitrary Topologies Using Diffusion Models

Zhengming Yu, Zhiyang Dou, Xiaoxiao Long et al.

ECCV 2024arXiv:2311.17050

citations

#1087

ByteEdit: Boost, Comply and Accelerate Generative Image Editing

Yuxi Ren, Jie Wu, Yanzuo Lu et al.

ECCV 2024arXiv:2404.04860

citations

#1088

Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data

Tuo Feng, Wenguan Wang, Ruijie Quan et al.

ECCV 2024arXiv:2407.10200

citations

#1089

AdaDistill: Adaptive Knowledge Distillation for Deep Face Recognition

Fadi Boutros, Vitomir Struc, Naser Damer

ECCV 2024arXiv:2407.01332

citations

#1090

Learn from the Learnt: Source-Free Active Domain Adaptation via Contrastive Sampling and Visual Persistence

Mengyao Lyu, Tianxiang Hao, Xinhao Xu et al.

ECCV 2024arXiv:2407.18899

citations

#1091

Idempotent Unsupervised Representation Learning for Skeleton-Based Action Recognition

Lilang Lin, Lehong Wu, Jiahang Zhang et al.

ECCV 2024arXiv:2410.20349

citations

#1092

Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection

Xingyu Peng, Yan Bai, Chen Gao et al.

ECCV 2024arXiv:2407.08931

citations

#1093

Motion and Structure from Event-based Normal Flow

Zhongyang Ren, Bangyan Liao, Delei Kong et al.

ECCV 2024arXiv:2407.12239

citations

#1094

Exploiting Semantic Reconstruction to Mitigate Hallucinations in Vision-Language Models

Minchan Kim, Minyeong Kim, Junik Bae et al.

ECCV 2024arXiv:2403.16167

citations

#1095

Adversarially Robust Distillation by Reducing the Student-Teacher Variance Gap

Junhao Dong, Piotr Koniusz, Junxi Chen et al.

ECCV 2024

citations

#1096

Hiding Imperceptible Noise in Curvature-Aware Patches for 3D Point Cloud Attack

Mingyu Yang, Daizong Liu, Keke Tang et al.

ECCV 2024

citations

#1097

Part2Object: Hierarchical Unsupervised 3D Instance Segmentation

Cheng Shi, Yulin Zhang, Bin Yang et al.

ECCV 2024arXiv:2407.10084

citations

#1098

Self-Adapting Large Visual-Language Models to Edge Devices across Visual Modalities

Kaiwen Cai, ZheKai Duan, Gaowen Liu et al.

ECCV 2024arXiv:2403.04908

citations

#1099

RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception

Shen Jianbing, Chunliang Li, Wencheng Han et al.

ECCV 2024arXiv:2407.10876

citations

#1100

Siamese Vision Transformers are Scalable Audio-visual Learners

Yan-Bo Lin, Gedas Bertasius

ECCV 2024arXiv:2403.19638

citations

#1101

Layer-Wise Relevance Propagation with Conservation Property for ResNet

Seitaro Otsuki, Tsumugi Iida, Félix Doublet et al.

ECCV 2024arXiv:2407.09115

citations

#1102

NOVUM: Neural Object Volumes for Robust Object Classification

Artur Jesslen, Guofeng Zhang, Angtian Wang et al.

ECCV 2024arXiv:2305.14668

citations

#1103

AFF-ttention! Affordances and Attention models for Short-Term Object Interaction Anticipation

Lorenzo Mur Labadia, Ruben Martinez-Cantin, Jose J Guerrero et al.

ECCV 2024arXiv:2406.01194

citations

#1104

Temporal-Mapping Photography for Event Cameras

Yuhan Bao, Lei Sun, Yuqin Ma et al.

ECCV 2024arXiv:2403.06443

citations

#1105

Graph Neural Network Causal Explanation via Neural Causal Models

Arman Behnam, Binghui Wang

ECCV 2024arXiv:2407.09378

citations

#1106

Improving Adversarial Transferability via Model Alignment

Avery Ma, Amir-massoud Farahmand, Yangchen Pan et al.

ECCV 2024arXiv:2311.18495

citations

#1107

Leveraging temporal contextualization for video action recognition

Minji Kim, Dongyoon Han, Taekyung Kim et al.

ECCV 2024arXiv:2404.09490

citations

#1108

Emerging Property of Masked Token for Effective Pre-training

Hyesong Choi, Hunsang Lee, Seyoung Joung et al.

ECCV 2024arXiv:2404.08330

citations

#1109

Enhancing Perceptual Quality in Video Super-Resolution through Temporally-Consistent Detail Synthesis using Diffusion Models

Claudio Rota, Marco Buzzelli, Joost Van de Weijer

ECCV 2024arXiv:2311.15908

citations

#1110

Generating Physically Realistic and Directable Human Motions from Multi-Modal Inputs

Aayam Shrestha, Pan Liu, German Ros et al.

ECCV 2024arXiv:2502.05641

citations

#1111

NVS-Adapter: Plug-and-Play Novel View Synthesis from a Single Image

Yoonwoo Jeong, Jinwoo Lee, Chiheon Kim et al.

ECCV 2024arXiv:2312.07315

citations

#1112

VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous Driving

Yibo Liu, Zheyuan Yang, Guile Wu et al.

ECCV 2024arXiv:2407.06516

citations

#1113

Safeguard Text-to-Image Diffusion Models with Human Feedback Inversion

Sanghyun Kim, Seohyeon Jung, Balhae Kim et al.

ECCV 2024arXiv:2407.21032

citations

#1114

Training-free Composite Scene Generation for Layout-to-Image Synthesis

Jiaqi Liu, Tao Huang, Chang Xu

ECCV 2024arXiv:2407.13609

citations

#1115

Volumetric Rendering with Baked Quadrature Fields

Gopal Sharma, Daniel Rebain, Kwang Moo Yi et al.

ECCV 2024arXiv:2312.02202

citations

#1116

T-MAE: Temporal Masked Autoencoders for Point Cloud Representation Learning

Weijie Wei, Fatemeh Karimi Nejadasl, Theo Gevers et al.

ECCV 2024arXiv:2312.10217

citations

#1117

PMT: Progressive Mean Teacher via Exploring Temporal Consistency for Semi-Supervised Medical Image Segmentation

Ning Gao, Sanping Zhou, Le Wang et al.

ECCV 2024arXiv:2409.05122

citations

#1118

RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting

Qi Wang, Ruijie Lu, Xudong Xu et al.

ECCV 2024arXiv:2406.02461

citations

#1119

DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation

Soojin Jang, JungMin Yun, JuneHyoung Kwon et al.

ECCV 2024arXiv:2409.15801

citations

#1120

Propose, Assess, Search: Harnessing LLMs for Goal-Oriented Planning in Instructional Videos

Mohaiminul Islam, Tushar Nagarajan, Huiyu Wang et al.

ECCV 2024arXiv:2409.20557

citations

#1121

Adaptive Multi-head Contrastive Learning

Lei Wang, Piotr Koniusz, Tom Gedeon et al.

ECCV 2024arXiv:2310.05615

citations

#1122

Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation

Pengfei Wang, Yuxi Wang, Shuai Li et al.

ECCV 2024arXiv:2407.13362

citations

#1123

Instruction Tuning-free Visual Token Complement for Multimodal LLMs

Dongsheng Wang, Jiequan Cui, Miaoge Li et al.

ECCV 2024arXiv:2408.05019

citations

#1124

REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models

Agneet Chatterjee, Yiran Luo, Tejas Gokhale et al.

ECCV 2024arXiv:2408.02231

citations

#1125

Domain Shifting: A Generalized Solution for Heterogeneous Cross-Modality Person Re-Identification

Yan Jiang, Xu Cheng, Hao Yu et al.

ECCV 2024

citations

#1126

PISR: Polarimetric Neural Implicit Surface Reconstruction for Textureless and Specular Objects

Guangcheng Chen, Yicheng He, Li He et al.

ECCV 2024arXiv:2409.14331

citations

#1127

Multi-Person Pose Forecasting with Individual Interaction Perceptron and Prior Learning

Peng Xiao, Yi Xie, Xuemiao Xu et al.

ECCV 2024

citations

#1128

Open-Set Recognition in the Age of Vision-Language Models

Dimity Miller, Niko Suenderhauf, Alex Kenna et al.

ECCV 2024arXiv:2403.16528

citations

#1129

DoubleTake: Geometry Guided Depth Estimation

Mohamed Sayed, Filippo Aleotti, Jamie Watson et al.

ECCV 2024arXiv:2406.18387

citations

#1130

CloudFixer: Test-Time Adaptation for 3D Point Clouds via Diffusion-Guided Geometric Transformation

Hajin Shim, Changhun Kim, Eunho Yang

ECCV 2024arXiv:2407.16193

citations

#1131

Self-supervised visual learning from interactions with objects

Arthur Aubret, Céline Teulière, Jochen Triesch

ECCV 2024arXiv:2407.06704

citations

#1132

MVDD: Multi-View Depth Diffusion Models

Zhen Wang, Qiangeng Xu, Feitong Tan et al.

ECCV 2024arXiv:2312.04875

citations

#1133

Skews in the Phenomenon Space Hinder Generalization in Text-to-Image Generation

Yingshan Chang, Yasi Zhang, Zhiyuan Fang et al.

ECCV 2024arXiv:2403.16394

citations

#1134

Exploring the Feature Extraction and Relation Modeling For Light-Weight Transformer Tracking

Jikai Zheng, Mingjiang Liang, Shaoli Huang et al.

ECCV 2024

citations

#1135

Curved Diffusion: A Generative Model With Optical Geometry Control

Andrey Voynov, Amir Hertz, Moab Arar et al.

ECCV 2024arXiv:2311.17609

citations

#1136

AnimateMe: 4D Facial Expressions via Diffusion Models

Dimitrios Gerogiannis, Foivos Paraperas Papantoniou, Rolandos Alexandros Potamias et al.

ECCV 2024arXiv:2403.17213

citations

#1137

Imaging Interiors: An Implicit Solution to Electromagnetic Inverse Scattering Problems

Ziyuan Luo, Boxin Shi, Haoliang Li et al.

ECCV 2024arXiv:2407.09352

citations

#1138

Ponymation: Learning Articulated 3D Animal Motions from Unlabeled Online Videos

Keqiang Sun, Dori Litvak, Yunzhi Zhang et al.

ECCV 2024arXiv:2312.13604

citations

#1139

KDProR: A Knowledge-Decoupling Probabilistic Framework for Video-Text Retrieval

Xianwei Zhuang, Hongxiang Li, Xuxin Cheng et al.

ECCV 2024

citations

#1140

Implicit Filtering for Learning Neural Signed Distance Functions from 3D Point Clouds

Shengtao Li, Ge Gao, Yudong Liu et al.

ECCV 2024arXiv:2407.13342

citations

#1141

ABC Easy as 123: A Blind Counter for Exemplar-Free Multi-Class Class-agnostic Counting

Michael A Hobley, Victor Adrian Prisacariu

ECCV 2024arXiv:2309.04820

citations

#1142

Towards More Practical Group Activity Detection: A New Benchmark and Model

Dongkeun Kim, Youngkil Song, Minsu Cho et al.

ECCV 2024arXiv:2312.02878

citations

#1143

CanonicalFusion: Generating Drivable 3D Human Avatars from Multiple Images

Jisu Shin, Junmyeong Lee, Seongmin Lee et al.

ECCV 2024arXiv:2407.04345

citations

#1144

Certifiably Robust Image Watermark

Zhengyuan Jiang, Moyang Guo, Yuepeng Hu et al.

ECCV 2024arXiv:2407.04086

citations

#1145

Brain Netflix: Scaling Data to Reconstruct Videos from Brain Signals

Camilo Fosco, Benjamin Lahner, Bowen Pan et al.

ECCV 2024

citations

#1146

EraseDraw: Learning to Insert Objects by Erasing Them from Images

Alper Canberk, Maksym Bondarenko, Ege Ozguroglu et al.

ECCV 2024

citations

#1147

Fully Authentic Visual Question Answering Dataset from Online Communities

Chongyan Chen, Mengchen Liu, Noel C Codella et al.

ECCV 2024arXiv:2311.15562

citations

#1148

Un-EVIMO: Unsupervised Event-based Independent Motion Segmentation

Ziyun Wang, Jinyuan Guo, Kostas Daniilidis

ECCV 2024arXiv:2312.00114

citations

#1149

3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance

Xiaoxu Xu, Yitian Yuan, Jinlong Li et al.

ECCV 2024arXiv:2407.09826

citations

#1150

Towards Physical World Backdoor Attacks against Skeleton Action Recognition

Qichen Zheng, Yi Yu, Siyuan Yang et al.

ECCV 2024arXiv:2408.08671

citations

#1151

Diffusion Prior-Based Amortized Variational Inference for Noisy Inverse Problems

Sojin Lee, Dogyun Park, Inho Kong et al.

ECCV 2024arXiv:2407.16125

citations

#1152

MetaCap: Meta-learning Priors from Multi-View Imagery for Sparse-view Human Performance Capture and Rendering

Guoxing Sun, Rishabh Dabral, Pascal Fua et al.

ECCV 2024arXiv:2403.18820

citations

#1153

Intrinsic Single-Image HDR Reconstruction

Sebastian Dille, Chris Careaga, Yagiz Aksoy

ECCV 2024arXiv:2409.13803

citations

#1154

Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model

Danni Yang, Ruohan Dong, Jiayi Ji et al.

ECCV 2024arXiv:2407.05352

citations

#1155

ZoLA: Zero-Shot Creative Long Animation Generation with Short Video Model

Fu-Yun Wang, Zhaoyang Huang, Qiang Ma et al.

ECCV 2024

citations

#1156

Self-Supervised Representation Learning for Adversarial Attack Detection

Yi Li, Plamen Angelov, Neeraj Suri

ECCV 2024arXiv:2407.04382

citations

#1157

CountFormer: Multi-View Crowd Counting Transformer

Hong Mo, Xiong Zhang, Jianchao Tan et al.

ECCV 2024arXiv:2407.02047

citations

#1158

RoScenes: A Large-scale Multi-view 3D Dataset for Roadside Perception

Xiaosu Zhu, Hualian Sheng, Sijia Cai et al.

ECCV 2024arXiv:2405.09883

citations

#1159

Towards Reliable Advertising Image Generation Using Human Feedback

Zhenbang Du, Wei Feng, Haohan Wang et al.

ECCV 2024arXiv:2408.00418

citations

#1160

MinD-3D: Reconstruct High-quality 3D objects in Human Brain

Jianxiong Gao, Yuqian Fu, Yun Wang et al.

ECCV 2024arXiv:2312.07485

citations

#1161

WRIM-Net: Wide-Ranging Information Mining Network for Visible-Infrared Person Re-Identification

Yonggan Wu, Ling-Chao Meng, Yuan Zichao et al.

ECCV 2024arXiv:2408.10624

citations

#1162

PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion

Runsong Zhu, Shi Qiu, Qianyi Wu et al.

ECCV 2024arXiv:2410.10659

citations

#1163

ActionVOS: Actions as Prompts for Video Object Segmentation

Liangyang Ouyang, Ruicong Liu, Yifei Huang et al.

ECCV 2024arXiv:2407.07402

citations

#1164

FlexAttention for Efficient High-Resolution Vision-Language Models

Junyan Li, Delin Chen, Tianle Cai et al.

ECCV 2024arXiv:2407.20228

citations

#1165

EGIC: Enhanced Low-Bit-Rate Generative Image Compression Guided by Semantic Segmentation

Nikolai Körber, Eduard Kromer, Andreas Siebert et al.

ECCV 2024arXiv:2309.03244

citations

#1166

PredBench: Benchmarking Spatio-Temporal Prediction across Diverse Disciplines

Zidong Wang, Zeyu Lu, Di Huang et al.

ECCV 2024arXiv:2407.08418

citations

#1167

DynoSurf: Neural Deformation-based Temporally Consistent Dynamic Surface Reconstruction

Yuxin Yao, Siyu Ren, Junhui Hou et al.

ECCV 2024arXiv:2403.11586

citations

#1168

Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation

Zhaoyang Li, Yuan Wang, Wangkai Li et al.

ECCV 2024arXiv:2408.13752

citations

#1169

3D Small Object Detection with Dynamic Spatial Pruning

Xiuwei Xu, Zhihao Sun, Ziwei Wang et al.

ECCV 2024arXiv:2305.03716

citations

#1170

DiffuX2CT: Diffusion Learning to Reconstruct CT Images from Biplanar X-Rays

Baochang Zhang, Zhi Qiao, Runkun Liu et al.

ECCV 2024arXiv:2407.13545

citations

#1171

GeometrySticker: Enabling Ownership Claim of Recolorized Neural Radiance Fields

Xiufeng Huang, Ka Chun Cheung, Simon See et al.

ECCV 2024arXiv:2407.13390

citations

#1172

GenRC: Generative 3D Room Completion from Sparse Image Collections

Ming-Feng Li, Yueh-Feng Ku, Hong-Xuan Yen et al.

ECCV 2024arXiv:2407.12939

citations

#1173

AdaDiffSR: Adaptive Region-aware Dynamic acceleration Diffusion Model for Real-World Image Super-Resolution

Yuanting Fan, Chengxu Liu, Nengzhong Yin et al.

ECCV 2024arXiv:2410.17752

citations

#1174

Dual-level Adaptive Self-Labeling for Novel Class Discovery in Point Cloud Segmentation

Ruijie Xu, Chuyu Zhang, Hui Ren et al.

ECCV 2024arXiv:2407.12489

citations

#1175

FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation

Xinzhi Mu, Li Chen, Bohan Chen et al.

ECCV 2024arXiv:2406.08392

citations

#1176

T-CorresNet: Template Guided 3D Point Cloud Completion with Correspondence Pooling Query Generation Strategy

Fan Duan, Jiahao Yu, Li Chen

ECCV 2024arXiv:2407.05008

citations

#1177

Rethinking Video Deblurring with Wavelet-Aware Dynamic Transformer and Diffusion Model

Chen Rao, Guangyuan Li, Zehua Lan et al.

ECCV 2024arXiv:2408.13459

citations

#1178

Learning Diffusion Models for Multi-View Anomaly Detection

Chieh Liu, Yu-Min Chu, Ting-I Hsieh et al.

ECCV 2024

citations

#1179

Make a Strong Teacher with Label Assistance: A Novel Knowledge Distillation Approach for Semantic Segmentation

Shoumeng Qiu, Jie Chen, Xinrun Li et al.

ECCV 2024arXiv:2407.13254

citations

#1180

Head360: Learning a Parametric 3D Full-Head for Free-View Synthesis in 360°

Yuxiao He, Yiyu Zhuang, Yanwen Wang et al.

ECCV 2024arXiv:2408.00296

citations

#1181

PQ-SAM: Post-training Quantization for Segment Anything Model

Xiaoyu Liu, Xin Ding, Lei Yu et al.

ECCV 2024

citations

#1182

CityGuessr: City-Level Video Geo-Localization on a Global Scale

Parth Parag Kulkarni, Gaurav Kumar Nayak, Shah Mubarak

ECCV 2024arXiv:2411.06344

citations

#1183

SCP-Diff: Spatial-Categorical Joint Prior for Diffusion Based Semantic Image Synthesis

Huan-ang Gao, Mingju Gao, Jiaju Li et al.

ECCV 2024arXiv:2403.09638

citations

#1184

PiTe: Pixel-Temporal Alignment for Large Video-Language Model

Yang Liu, Pengxiang Ding, Siteng Huang et al.

ECCV 2024arXiv:2409.07239

citations

#1185

SMILe: Leveraging Submodular Mutual Information For Robust Few-Shot Object Detection

Anay Majee, Ryan X Sharp, Rishabh Iyer

ECCV 2024arXiv:2407.02665

citations

#1186

LG-Gaze: Learning Geometry-aware Continuous Prompts for Language-Guided Gaze Estimation

Pengwei Yin, Jingjing Wang, Guanzhong Zeng et al.

ECCV 2024arXiv:2411.08606

citations

#1187

Quantized Prompt for Efficient Generalization of Vision-Language Models

Tianxiang Hao, Xiaohan Ding, Juexiao Feng et al.

ECCV 2024arXiv:2407.10704

citations

#1188

On Pretraining Data Diversity for Self-Supervised Learning

Hasan Abed El Kader Hammoud, Tuhin Das, Fabio Pizzati et al.

ECCV 2024arXiv:2403.13808

citations

#1189

Preventing Catastrophic Overfitting in Fast Adversarial Training: A Bi-level Optimization Perspective

Zhaoxin Wang, Handing Wang, Cong Tian et al.

ECCV 2024arXiv:2407.12443

citations

#1190

WorldPose: A World Cup Dataset for Global 3D Human Pose Estimation

Tianjian Jiang, Johsan Billingham, Sebastian Müksch et al.

ECCV 2024arXiv:2501.02771

citations

#1191

Statewide Visual Geolocalization in the Wild

Florian Fervers, Sebastian Bullinger, Christoph Bodensteiner et al.

ECCV 2024arXiv:2409.16763

citations

#1192

Bi-TTA: Bidirectional Test-Time Adapter for Remote Physiological Measurement

Haodong Li, Hao Lu, Yingcong Chen

ECCV 2024arXiv:2409.17316

citations

#1193

SLIM: Spuriousness Mitigation with Minimal Human Annotations

Xiwei Xuan, Ziquan Deng, Hsuan-Tien Lin et al.

ECCV 2024arXiv:2407.05594

citations

#1194

Animate Your Motion: Turning Still Images into Dynamic Videos

Mingxiao Li, Bo Wan, Marie-Francine Moens et al.

ECCV 2024arXiv:2403.10179

citations

#1195

On the Vulnerability of Skip Connections to Model Inversion Attacks

Jun Hao Koh, Sy-Tuyen Ho, Ngoc-Bao Nguyen et al.

ECCV 2024arXiv:2409.01696

citations

#1196

Learning Semantic Latent Directions for Accurate and Controllable Human Motion Prediction

Guowei Xu, Jiale Tao, Wen Li et al.

ECCV 2024arXiv:2407.11494

citations

#1197

Quality Assured: Rethinking Annotation Strategies in Imaging AI

Tim Rädsch, Annika Reinke, Vivienn Weru et al.

ECCV 2024arXiv:2407.17596

citations

#1198

DualDn: Dual-domain Denoising via Differentiable ISP

Ruikang Li, Yujin Wang, Shiqi Chen et al.

ECCV 2024arXiv:2409.18783

citations

#1199

Compress3D: a Compressed Latent Space for 3D Generation from a Single Image

Bowen Zhang, Tianyu Yang, Yu Li et al.

ECCV 2024arXiv:2403.13524

citations

#1200

Efficient and Versatile Robust Fine-Tuning of Zero-shot Models

Sungyeon Kim, Boseung Jeong, Donghyun Kim et al.

ECCV 2024arXiv:2408.05749

citations
