Most Cited ECCV "inverse inference" Papers

2,387 papers found • Page 6 of 12

#1001

Toward Tiny and High-quality Facial Makeup with Data Amplify Learning

Qiaoqiao Jin, Xuanhong Chen, Meiguang Jin et al.

ECCV 2024arXiv:2403.15033
11
citations
#1002

DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion Consistency

Xiaojing Zhong, Xinyi Huang, Xiaofeng Yang et al.

ECCV 2024arXiv:2408.07481
11
citations
#1003

Learning to Generate Conditional Tri-plane for 3D-aware Expression Controllable Portrait Animation

Taekyung Ki, Dongchan Min, Gyeongsu Chae

ECCV 2024arXiv:2404.00636
11
citations
#1004

Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors

Ruicheng Wang, Jianfeng Xiang, Jiaolong Yang et al.

ECCV 2024arXiv:2403.11503
11
citations
#1005

FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models

Wei WU, Qingnan Fan, Shuai Qin et al.

ECCV 2024arXiv:2404.11895
11
citations
#1006

BlenderAlchemy: Editing 3D Graphics with Vision-Language Models

Ian Huang, Guandao Yang, Leonidas Guibas

ECCV 2024arXiv:2404.17672
11
citations
#1007

Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation

Shuangrui Ding, Rui Qian, Haohang Xu et al.

ECCV 2024arXiv:2311.17893
11
citations
#1008

Self-Supervised Any-Point Tracking by Contrastive Random Walks

Ayush Shrivastava, Andrew Owens

ECCV 2024arXiv:2409.16288
11
citations
#1009

ProMerge: Prompt and Merge for Unsupervised Instance Segmentation

Dylan Li, Gyungin Shin

ECCV 2024arXiv:2409.18961
11
citations
#1010

Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models

Xiaoyu Zhu, Hao Zhou, Pengfei Xing et al.

ECCV 2024arXiv:2407.13642
11
citations
#1011

Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning

Thanh Thong Nguyen, Yi Bin, Xiaobao Wu et al.

ECCV 2024arXiv:2407.03788
11
citations
#1012

EDformer: Transformer-Based Event Denoising Across Varied Noise Levels

Bin Jiang, Bo Xiong, Bohan Qu et al.

ECCV 2024
11
citations
#1013

CPM: Class-conditional Prompting Machine for Audio-visual Segmentation

Yuanhong Chen, Chong Wang, Yuyuan Liu et al.

ECCV 2024arXiv:2407.05358
11
citations
#1014

MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment

Anurag Das, Xinting Hu, Li Jiang et al.

ECCV 2024arXiv:2407.21654
11
citations
#1015

Few-shot NeRF by Adaptive Rendering Loss Regularization

Qingshan Xu, Xuanyu Yi, Jianyao Xu et al.

ECCV 2024arXiv:2410.17839
11
citations
#1016

GarmentAligner: Text-to-Garment Generation via Retrieval-augmented Multi-level Corrections

Shiyue Zhang, Zheng Chong, Xujie Zhang et al.

ECCV 2024arXiv:2408.12352
11
citations
#1017

Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation

Mengchen Zhang, Tong Wu, Tai Wang et al.

ECCV 2024arXiv:2409.18261
11
citations
#1018

Nonverbal Interaction Detection

Jianan Wei, Tianfei Zhou, Yi Yang et al.

ECCV 2024arXiv:2407.08133
11
citations
#1019

Efficient Few-Shot Action Recognition via Multi-Level Post-Reasoning

Cong Wu, Xiao-Jun Wu, Linze Li et al.

ECCV 2024
11
citations
#1020

Deblur e-NeRF: NeRF from Motion-Blurred Events under High-speed or Low-light Conditions

Weng Fei Low, Gim Hee Lee

ECCV 2024arXiv:2409.17988
11
citations
#1021

Neural Spectral Decomposition for Dataset Distillation

Yang Shaolei, Shen Cheng, Mingbo Hong et al.

ECCV 2024arXiv:2408.16236
11
citations
#1022

CaesarNeRF: Calibrated Semantic Representation for Few-Shot Generalizable Neural Rendering

Haidong Zhu, Tianyu Ding, Tianyi Chen et al.

ECCV 2024arXiv:2311.15510
11
citations
#1023

OneVOS: Unifying Video Object Segmentation with All-in-One Transformer Framework

Wanyun Li, Pinxue Guo, Xinyu Zhou et al.

ECCV 2024arXiv:2403.08682
11
citations
#1024

SHERL: Synthesizing High Accuracy and Efficient Memory for Resource-Limited Transfer Learning

Haiwen Diao, Bo Wan, XU JIA et al.

ECCV 2024arXiv:2407.07523
11
citations
#1025

Arbitrary-Scale Video Super-Resolution with Structural and Textural Priors

Wei Shang, Dongwei Ren, Wanying Zhang et al.

ECCV 2024arXiv:2407.09919
11
citations
#1026

RoadPainter: Points Are Ideal Navigators for Topology transformER

Zhongxing Ma, Liang Shuang, Yongkun Wen et al.

ECCV 2024arXiv:2407.15349
11
citations
#1027

DiscoMatch: Fast Discrete Optimisation for Geometrically Consistent 3D Shape Matching

Paul Roetzer, Ahmed Abbas, Dongliang Cao et al.

ECCV 2024arXiv:2310.08230
11
citations
#1028

MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion

Lehong Wu, Lilang Lin, Jiahang Zhang et al.

ECCV 2024arXiv:2409.10473
11
citations
#1029

Multiscale Sliced Wasserstein Distances as Perceptual Color Difference Measures

Jiaqi He, Zhihua Wang, Leon Wang et al.

ECCV 2024arXiv:2407.10181
11
citations
#1030

DEAL: Disentangle and Localize Concept-level Explanations for VLMs

Tang Li, Mengmeng Ma, Xi Peng

ECCV 2024arXiv:2407.14412
11
citations
#1031

DiffusionPen: Towards Controlling the Style of Handwritten Text Generation

KONSTANTINA NIKOLAIDOU, George Retsinas, Giorgos Sfikas et al.

ECCV 2024arXiv:2409.06065
11
citations
#1032

Weakly-supervised Camera Localization by Ground-to-satellite Image Registration

Yujiao Shi, HONGDONG LI, Akhil Perincherry et al.

ECCV 2024arXiv:2409.06471
11
citations
#1033

NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model

Zhongqun Zhang, Hengfei Wang, Ziwei Yu et al.

ECCV 2024arXiv:2407.12727
11
citations
#1034

ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation

Yi Zhang, Yun Tang, Wenjie Ruan et al.

ECCV 2024arXiv:2402.15429
11
citations
#1035

SwapAnything: Enabling Arbitrary Object Swapping in Personalized Image Editing

Jing Gu, Nanxuan Zhao, Wei Xiong et al.

ECCV 2024
11
citations
#1036

Unlocking Attributes' Contribution to Successful Camouflage: A Combined Textual and Visual Analysis Strategy

Hong Zhang, Yixuan Lyu, Qian Yu et al.

ECCV 2024
11
citations
#1037

IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation

Yuanhao Zhai, Kevin Lin, Linjie Li et al.

ECCV 2024arXiv:2407.10937
11
citations
#1038

Class-Agnostic Object Counting with Text-to-Image Diffusion Model

Xiaofei Hui, Qian Wu, Hossein Rahmani et al.

ECCV 2024
11
citations
#1039

CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts

Yichao Cai, Yuhang Liu, Zhen Zhang et al.

ECCV 2024arXiv:2311.16445
11
citations
#1040

Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding

Ruihuang Li, Zhengqiang ZHANG, Chenhang He et al.

ECCV 2024arXiv:2407.09781
11
citations
#1041

Inf-DiT: Upsampling any-resolution image with memory-efficient diffusion transformer.

Zhuoyi Yang, Heyang Jiang, Wenyi Hong et al.

ECCV 2024arXiv:2405.04312
11
citations
#1042

TimeCraft: Navigate Weakly-Supervised Temporal Grounded Video Question Answering via Bi-directional Reasoning

Huabin Liu, Xiao Ma, Cheng Zhong et al.

ECCV 2024
11
citations
#1043

DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation

Haibo Yang, Yang Chen, Yingwei Pan et al.

ECCV 2024arXiv:2409.07454
11
citations
#1044

Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion

Yu Cao, Shaogang Gong

ECCV 2024arXiv:2407.07249
11
citations
#1045

Placing Objects in Context via Inpainting for Out-of-distribution Segmentation

Pau de Jorge Aranda, Riccardo Volpi, Puneet Dokania et al.

ECCV 2024arXiv:2402.16392
11
citations
#1046

Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models

Jinrui Zhang, Teng Wang, Haigang Zhang et al.

ECCV 2024arXiv:2407.11422
11
citations
#1047

Diff-Reg: Diffusion Model in Doubly Stochastic Matrix Space for Registration Problem

Qianliang Wu, Haobo Jiang, Lei Luo et al.

ECCV 2024
11
citations
#1048

CoLeaF: A Contrastive-Collaborative Learning Framework for Weakly Supervised Audio-Visual Video Parsing

Faegheh Sardari, Armin Mustafa, Philip JB Jackson et al.

ECCV 2024arXiv:2405.10690
11
citations
#1049

SAVE: Protagonist Diversification with Structure Agnostic Video Editing

Yeji Song, Wonsik Shin, Junsoo Lee et al.

ECCV 2024arXiv:2312.02503
11
citations
#1050

DreamStruct: Understanding Slides and User Interfaces via Synthetic Data Generation

Yi-Hao Peng, Faria Huq, Yue Jiang et al.

ECCV 2024arXiv:2410.00201
11
citations
#1051

Timestep-Aware Correction for Quantized Diffusion Models

Yuzhe YAO, Feng Tian, Jun Chen et al.

ECCV 2024arXiv:2407.03917
11
citations
#1052

INTRA: Interaction Relationship-aware Weakly Supervised Affordance Grounding

jiha jang, Hoigi Seo, Se Young Chun

ECCV 2024arXiv:2409.06210
11
citations
#1053

OmniSSR: Zero-shot Omnidirectional Image Super-Resolution using Stable Diffusion Model

Runyi Li, Xuhan SHENG, Weiqi Li et al.

ECCV 2024arXiv:2404.10312
11
citations
#1054

TCAN: Animating Human Images with Temporally Consistent Pose Guidance using Diffusion Models

Jeongho Kim, Min-Jung Kim, Junsoo Lee et al.

ECCV 2024arXiv:2407.09012
11
citations
#1055

Hybrid Video Diffusion Models with 2D Triplane and 3D Wavelet Representation

Kihong Kim, Haneol Lee, Jihye Park et al.

ECCV 2024arXiv:2402.13729
11
citations
#1056

Salience-Based Adaptive Masking: Revisiting Token Dynamics for Enhanced Pre-training

Hyesong Choi, Hyejin Park, Kwang Moo Yi et al.

ECCV 2024arXiv:2404.08327
11
citations
#1057

Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation

Jiawei Han, Kaiqi Liu, Wei Li et al.

ECCV 2024arXiv:2408.10537
11
citations
#1058

TF-FAS: Twofold-Element Fine-Grained Semantic Guidance for Generalizable Face Anti-Spoofing

Xudong Wang, Ke-Yue Zhang, Taiping Yao et al.

ECCV 2024
11
citations
#1059

EgoPet: Egomotion and Interaction Data from an Animal's Perspective

Amir Bar, Arya Bakhtiar, Danny L Tran et al.

ECCV 2024arXiv:2404.09991
11
citations
#1060

Real-time 3D-aware Portrait Editing from a Single Image

Qingyan Bai, Zifan Shi, Yinghao Xu et al.

ECCV 2024arXiv:2402.14000
11
citations
#1061

Q&A Prompts: Discovering Rich Visual Clues through Mining Question-Answer Prompts for VQA requiring Diverse World Knowledge

Haibo Wang, Weifeng Ge

ECCV 2024arXiv:2401.10712
11
citations
#1062

MAGR: Manifold-Aligned Graph Regularization for Continual Action Quality Assessment

Kanglei Zhou, Liyuan Wang, Xingxing Zhang et al.

ECCV 2024arXiv:2403.04398
11
citations
#1063

Real-time Holistic Robot Pose Estimation with Unknown States

Shikun Ban, Juling Fan, Xiaoxuan Ma et al.

ECCV 2024arXiv:2402.05655
11
citations
#1064

RT-Pose: A 4D Radar-Tensor based 3D Human Pose Estimation and Localization Benchmark

Yuan-Hao Ho, Jen-Hao Cheng, Sheng Yao Kuan et al.

ECCV 2024arXiv:2407.13930
11
citations
#1065

X-Former: Unifying Contrastive and Reconstruction Learning for MLLMs

Swetha Sirnam, Jinyu Yang, Tal Neiman et al.

ECCV 2024arXiv:2407.13851
11
citations
#1066

Learning to Make Keypoints Sub-Pixel Accurate

Shinjeong Kim, Marc Pollefeys, Daniel Barath

ECCV 2024arXiv:2407.11668
10
citations
#1067

DNI: Dilutional Noise Initialization for Diffusion Video Editing

Sunjae Yoon, Gwanhyeong Koo, Ji Woo Hong et al.

ECCV 2024arXiv:2409.13037
10
citations
#1068

JointDreamer: Ensuring Geometry Consistency and Text Congruence in Text-to-3D Generation via Joint Score Distillation

ChenHan Jiang, Yihan Zeng, Tianyang Hu et al.

ECCV 2024arXiv:2407.12291
10
citations
#1069

Uncertainty-aware sign language video retrieval with probability distribution modeling

Xuan Wu, Hongxiang Li, yuanjiang luo et al.

ECCV 2024arXiv:2405.19689
10
citations
#1070

CoSIGN: Few-Step Guidance of ConSIstency Model to Solve General INverse Problems

Jiankun Zhao, Bowen Song, Liyue Shen

ECCV 2024arXiv:2407.12676
10
citations
#1071

Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective

Fangzhou Song, Bin Zhu, Yanbin Hao et al.

ECCV 2024arXiv:2312.04763
10
citations
#1072

Unrolled Decomposed Unpaired Learning for Controllable Low-Light Video Enhancement

Lingyu Zhu, Wenhan Yang, Baoliang Chen et al.

ECCV 2024arXiv:2408.12316
10
citations
#1073

Recursive Visual Programming

Jiaxin Ge, Sanjay Subramanian, Baifeng Shi et al.

ECCV 2024arXiv:2312.02249
10
citations
#1074

Rasterized Edge Gradients: Handling Discontinuities Differentially

Stanislav Pidhorskyi, Tomas Simon, Gabriel Schwartz et al.

ECCV 2024arXiv:2405.02508
10
citations
#1075

Leveraging Thermal Modality to Enhance Reconstruction in Low-Light Conditions

Jiacong Xu, Mingqian Liao, Ram Prabhakar Kathirvel et al.

ECCV 2024arXiv:2403.14053
10
citations
#1076

Distributionally Robust Loss for Long-Tailed Multi-Label Image Classification

Dekun Lin, Zhe Cui, Rui Chen et al.

ECCV 2024
10
citations
#1077

VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space

Guénolé Fiche, Simon Leglaive, Xavier Alameda-Pineda et al.

ECCV 2024arXiv:2312.08291
10
citations
#1078

Length-Aware Motion Synthesis via Latent Diffusion

Alessio Sampieri, Alessio Palma, Indro Spinelli et al.

ECCV 2024arXiv:2407.11532
10
citations
#1079

Free Lunch for Gait Recognition: A Novel Relation Descriptor

Jilong Wang, Saihui Hou, Yan Huang et al.

ECCV 2024arXiv:2308.11487
10
citations
#1080

The Role of Masking for Efficient Supervised Knowledge Distillation of Vision Transformers

Seungwoo Son, Jegwang Ryu, Namhoon Lee et al.

ECCV 2024arXiv:2302.10494
10
citations
#1081

Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models

Samuele Poppi, Tobia Poppi, Federico Cocchi et al.

ECCV 2024arXiv:2311.16254
10
citations
#1082

Strike a Balance in Continual Panoptic Segmentation

Jinpeng Chen, Runmin Cong, Yuxuan Luo et al.

ECCV 2024arXiv:2407.16354
10
citations
#1083

Event-Aided Time-To-Collision Estimation for Autonomous Driving

Jinghang Li, Bangyan Liao, Xiuyuan LU et al.

ECCV 2024arXiv:2407.07324
10
citations
#1084

Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning

Fanyue Wei, Wei Zeng, Zhenyang Li et al.

ECCV 2024arXiv:2407.06642
10
citations
#1085

BEAF: Observing BEfore-AFter Changes to Evaluate Hallucination in Vision-language Models

Ye-Bin Moon, Nam Hyeon-Woo, Wonseok Choi et al.

ECCV 2024arXiv:2407.13442
10
citations
#1086

Surf-D: Generating High-Quality Surfaces of Arbitrary Topologies Using Diffusion Models

Zhengming Yu, Zhiyang Dou, Xiaoxiao Long et al.

ECCV 2024arXiv:2311.17050
10
citations
#1087

ByteEdit: Boost, Comply and Accelerate Generative Image Editing

YUXI REN, Jie Wu, Yanzuo Lu et al.

ECCV 2024arXiv:2404.04860
10
citations
#1088

Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data

Tuo FENG, Wenguan Wang, Ruijie Quan et al.

ECCV 2024arXiv:2407.10200
10
citations
#1089

AdaDistill: Adaptive Knowledge Distillation for Deep Face Recognition

Fadi Boutros, Vitomir Struc, Naser Damer

ECCV 2024arXiv:2407.01332
10
citations
#1090

Learn from the Learnt: Source-Free Active Domain Adaptation via Contrastive Sampling and Visual Persistence

Mengyao Lyu, Tianxiang Hao, Xinhao Xu et al.

ECCV 2024arXiv:2407.18899
10
citations
#1091

Idempotent Unsupervised Representation Learning for Skeleton-Based Action Recognition

Lilang Lin, Lehong Wu, Jiahang Zhang et al.

ECCV 2024arXiv:2410.20349
10
citations
#1092

Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection

Xingyu Peng, Yan Bai, Chen Gao et al.

ECCV 2024arXiv:2407.08931
10
citations
#1093

Motion and Structure from Event-based Normal Flow

Zhongyang Ren, Bangyan Liao, Delei Kong et al.

ECCV 2024arXiv:2407.12239
10
citations
#1094

Exploiting Semantic Reconstruction to Mitigate Hallucinations in Vision-Language Models

Minchan Kim, Minyeong Kim, Junik Bae et al.

ECCV 2024arXiv:2403.16167
10
citations
#1095

Adversarially Robust Distillation by Reducing the Student-Teacher Variance Gap

Junhao Dong, Piotr Koniusz, Junxi Chen et al.

ECCV 2024
10
citations
#1096

Hiding Imperceptible Noise in Curvature-Aware Patches for 3D Point Cloud Attack

Mingyu Yang, Daizong Liu, Keke Tang et al.

ECCV 2024
10
citations
#1097

Part2Object: Hierarchical Unsupervised 3D Instance Segmentation

cheng Shi, Yulin zhang, Bin Yang et al.

ECCV 2024arXiv:2407.10084
10
citations
#1098

Self-Adapting Large Visual-Language Models to Edge Devices across Visual Modalities

Kaiwen Cai, ZheKai Duan, Gaowen Liu et al.

ECCV 2024arXiv:2403.04908
10
citations
#1099

RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception

Shen Jianbing, Chunliang Li, Wencheng Han et al.

ECCV 2024arXiv:2407.10876
10
citations
#1100

Siamese Vision Transformers are Scalable Audio-visual Learners

Yan-Bo Lin, Gedas Bertasius

ECCV 2024arXiv:2403.19638
10
citations
#1101

Layer-Wise Relevance Propagation with Conservation Property for ResNet

Seitaro Otsuki, Tsumugi Iida, Félix Doublet et al.

ECCV 2024arXiv:2407.09115
10
citations
#1102

NOVUM: Neural Object Volumes for Robust Object Classification

Artur Jesslen, Guofeng Zhang, Angtian Wang et al.

ECCV 2024arXiv:2305.14668
10
citations
#1103

AFF-ttention! Affordances and Attention models for Short-Term Object Interaction Anticipation

Lorenzo Mur Labadia, Ruben Martinez-Cantin, Jose J Guerrero et al.

ECCV 2024arXiv:2406.01194
10
citations
#1104

Temporal-Mapping Photography for Event Cameras

Yuhan Bao, Lei Sun, Yuqin Ma et al.

ECCV 2024arXiv:2403.06443
10
citations
#1105

Graph Neural Network Causal Explanation via Neural Causal Models

Arman Behnam, Binghui Wang

ECCV 2024arXiv:2407.09378
10
citations
#1106

Improving Adversarial Transferability via Model Alignment

Avery Ma, Amir-massoud Farahmand, Yangchen Pan et al.

ECCV 2024arXiv:2311.18495
10
citations
#1107

Leveraging temporal contextualization for video action recognition

Minji Kim, Dongyoon Han, Taekyung Kim et al.

ECCV 2024arXiv:2404.09490
10
citations
#1108

Emerging Property of Masked Token for Effective Pre-training

Hyesong Choi, Hunsang Lee, Seyoung Joung et al.

ECCV 2024arXiv:2404.08330
10
citations
#1109

Enhancing Perceptual Quality in Video Super-Resolution through Temporally-Consistent Detail Synthesis using Diffusion Models

Claudio Rota, Marco Buzzelli, Joost Van de Weijer

ECCV 2024arXiv:2311.15908
10
citations
#1110

Generating Physically Realistic and Directable Human Motions from Multi-Modal Inputs

Aayam Shrestha, Pan Liu, German Ros et al.

ECCV 2024arXiv:2502.05641
10
citations
#1111

NVS-Adapter: Plug-and-Play Novel View Synthesis from a Single Image

Yoonwoo Jeong, Jinwoo Lee, Chiheon Kim et al.

ECCV 2024arXiv:2312.07315
10
citations
#1112

VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous Driving

Yibo Liu, Zheyuan Yang, Guile Wu et al.

ECCV 2024arXiv:2407.06516
10
citations
#1113

Safeguard Text-to-Image Diffusion Models with Human Feedback Inversion

Sanghyun Kim, Seohyeon Jung, Balhae Kim et al.

ECCV 2024arXiv:2407.21032
10
citations
#1114

Training-free Composite Scene Generation for Layout-to-Image Synthesis

Jiaqi Liu, Tao Huang, Chang Xu

ECCV 2024arXiv:2407.13609
10
citations
#1115

Volumetric Rendering with Baked Quadrature Fields

Gopal Sharma, Daniel Rebain, Kwang Moo Yi et al.

ECCV 2024arXiv:2312.02202
10
citations
#1116

T-MAE: Temporal Masked Autoencoders for Point Cloud Representation Learning

Weijie Wei, Fatemeh Karimi Nejadasl, Theo Gevers et al.

ECCV 2024arXiv:2312.10217
10
citations
#1117

PMT: Progressive Mean Teacher via Exploring Temporal Consistency for Semi-Supervised Medical Image Segmentation

Ning Gao, Sanping Zhou, Le Wang et al.

ECCV 2024arXiv:2409.05122
10
citations
#1118

RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting

Qi Wang, Ruijie Lu, Xudong XU et al.

ECCV 2024arXiv:2406.02461
10
citations
#1119

DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation

Soojin Jang, JungMin Yun, JuneHyoung Kwon et al.

ECCV 2024arXiv:2409.15801
10
citations
#1120

Propose, Assess, Search: Harnessing LLMs for Goal-Oriented Planning in Instructional Videos

Mohaiminul Islam, Tushar Nagarajan, Huiyu Wang et al.

ECCV 2024arXiv:2409.20557
10
citations
#1121

Adaptive Multi-head Contrastive Learning

Lei Wang, Piotr Koniusz, Tom Gedeon et al.

ECCV 2024arXiv:2310.05615
10
citations
#1122

Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation

Pengfei Wang, Yuxi Wang, Shuai Li et al.

ECCV 2024arXiv:2407.13362
10
citations
#1123

Instruction Tuning-free Visual Token Complement for Multimodal LLMs

Dongsheng Wang, Jiequan Cui, Miaoge Li et al.

ECCV 2024arXiv:2408.05019
10
citations
#1124

REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models

Agneet Chatterjee, Yiran Luo, Tejas Gokhale et al.

ECCV 2024arXiv:2408.02231
10
citations
#1125

Domain Shifting: A Generalized Solution for Heterogeneous Cross-Modality Person Re-Identification

Yan Jiang, Xu Cheng, Hao Yu et al.

ECCV 2024
10
citations
#1126

PISR: Polarimetric Neural Implicit Surface Reconstruction for Textureless and Specular Objects

Guangcheng Chen, Yicheng He, Li He et al.

ECCV 2024arXiv:2409.14331
10
citations
#1127

Multi-Person Pose Forecasting with Individual Interaction Perceptron and Prior Learning

Peng Xiao, Yi Xie, Xuemiao Xu et al.

ECCV 2024
10
citations
#1128

Open-Set Recognition in the Age of Vision-Language Models

Dimity Miller, Niko Suenderhauf, Alex Kenna et al.

ECCV 2024arXiv:2403.16528
10
citations
#1129

DoubleTake: Geometry Guided Depth Estimation

Mohamed Sayed, Filippo Aleotti, Jamie Watson et al.

ECCV 2024arXiv:2406.18387
10
citations
#1130

CloudFixer: Test-Time Adaptation for 3D Point Clouds via Diffusion-Guided Geometric Transformation

Hajin Shim, Changhun Kim, Eunho Yang

ECCV 2024arXiv:2407.16193
10
citations
#1131

Self-supervised visual learning from interactions with objects

Arthur Aubret, Céline Teulière, Jochen Triesch

ECCV 2024arXiv:2407.06704
10
citations
#1132

MVDD: Multi-View Depth Diffusion Models

Zhen Wang, Qiangeng Xu, Feitong Tan et al.

ECCV 2024arXiv:2312.04875
10
citations
#1133

Skews in the Phenomenon Space Hinder Generalization in Text-to-Image Generation

Yingshan Chang, Yasi Zhang, Zhiyuan Fang et al.

ECCV 2024arXiv:2403.16394
10
citations
#1134

Exploring the Feature Extraction and Relation Modeling For Light-Weight Transformer Tracking

Jikai Zheng, Mingjiang Liang, Shaoli Huang et al.

ECCV 2024
10
citations
#1135

Curved Diffusion: A Generative Model With Optical Geometry Control

Andrey Voynov, Amir Hertz, Moab Arar et al.

ECCV 2024arXiv:2311.17609
10
citations
#1136

AnimateMe: 4D Facial Expressions via Diffusion Models

Dimitrios Gerogiannis, Foivos Paraperas Papantoniou, Rolandos Alexandros Potamias et al.

ECCV 2024arXiv:2403.17213
10
citations
#1137

Imaging Interiors: An Implicit Solution to Electromagnetic Inverse Scattering Problems

Ziyuan Luo, Boxin Shi, Haoliang Li et al.

ECCV 2024arXiv:2407.09352
10
citations
#1138

Ponymation: Learning Articulated 3D Animal Motions from Unlabeled Online Videos

Keqiang Sun, Dori Litvak, Yunzhi Zhang et al.

ECCV 2024arXiv:2312.13604
10
citations
#1139

KDProR: A Knowledge-Decoupling Probabilistic Framework for Video-Text Retrieval

Xianwei Zhuang, Hongxiang Li, Xuxin Cheng et al.

ECCV 2024
10
citations
#1140

Implicit Filtering for Learning Neural Signed Distance Functions from 3D Point Clouds

Shengtao Li, Ge Gao, Yudong Liu et al.

ECCV 2024arXiv:2407.13342
10
citations
#1141

ABC Easy as 123: A Blind Counter for Exemplar-Free Multi-Class Class-agnostic Counting

Michael A Hobley, Victor Adrian Prisacariu

ECCV 2024arXiv:2309.04820
10
citations
#1142

Towards More Practical Group Activity Detection: A New Benchmark and Model

Dongkeun Kim, Youngkil Song, Minsu Cho et al.

ECCV 2024arXiv:2312.02878
10
citations
#1143

CanonicalFusion: Generating Drivable 3D Human Avatars from Multiple Images

Jisu Shin, Junmyeong Lee, Seongmin Lee et al.

ECCV 2024arXiv:2407.04345
10
citations
#1144

Certifiably Robust Image Watermark

Zhengyuan Jiang, Moyang Guo, Yuepeng Hu et al.

ECCV 2024arXiv:2407.04086
9
citations
#1145

Brain Netflix: Scaling Data to Reconstruct Videos from Brain Signals

Camilo Fosco, Benjamin Lahner, Bowen Pan et al.

ECCV 2024
9
citations
#1146

EraseDraw : Learning to Insert Objects by Erasing Them from Images

Alper Canberk, Maksym Bondarenko, Ege Ozguroglu et al.

ECCV 2024
9
citations
#1147

Fully Authentic Visual Question Answering Dataset from Online Communities

Chongyan Chen, Mengchen Liu, Noel C Codella et al.

ECCV 2024arXiv:2311.15562
9
citations
#1148

Un-EVIMO: Unsupervised Event-based Independent Motion Segmentation

Ziyun Wang, Jinyuan Guo, Kostas Daniilidis

ECCV 2024arXiv:2312.00114
9
citations
#1149

3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance

Xiaoxu Xu, Yitian Yuan, Jinlong Li et al.

ECCV 2024arXiv:2407.09826
9
citations
#1150

Towards Physical World Backdoor Attacks against Skeleton Action Recognition

Qichen Zheng, Yi Yu, SIYUAN YANG et al.

ECCV 2024arXiv:2408.08671
9
citations
#1151

Diffusion Prior-Based Amortized Variational Inference for Noisy Inverse Problems

Sojin Lee, Dogyun Park, Inho Kong et al.

ECCV 2024arXiv:2407.16125
9
citations
#1152

MetaCap: Meta-learning Priors from Multi-View Imagery for Sparse-view Human Performance Capture and Rendering

Guoxing Sun, Rishabh Dabral, Pascal Fua et al.

ECCV 2024arXiv:2403.18820
9
citations
#1153

Intrinsic Single-Image HDR Reconstruction

Sebastian Dille, Chris Careaga, Yagiz Aksoy

ECCV 2024arXiv:2409.13803
9
citations
#1154

Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model

Danni Yang, Ruohan Dong, Jiayi Ji et al.

ECCV 2024arXiv:2407.05352
9
citations
#1155

ZoLA: Zero-Shot Creative Long Animation Generation with Short Video Model

Fu-Yun Wang, Zhaoyang Huang, Qiang Ma et al.

ECCV 2024
9
citations
#1156

Self-Supervised Representation Learning for Adversarial Attack Detection

Yi Li, Plamen Angelov, Neeraj Suri

ECCV 2024arXiv:2407.04382
9
citations
#1157

CountFormer: Multi-View Crowd Counting Transformer

Hong Mo, Xiong Zhang, Jianchao Tan et al.

ECCV 2024arXiv:2407.02047
9
citations
#1158

RoScenes: A Large-scale Multi-view 3D Dataset for Roadside Perception

Xiaosu Zhu, Hualian Sheng, Sijia Cai et al.

ECCV 2024arXiv:2405.09883
9
citations
#1159

Towards Reliable Advertising Image Generation Using Human Feedback

Zhenbang Du, Wei Feng, Haohan Wang et al.

ECCV 2024arXiv:2408.00418
9
citations
#1160

MinD-3D: Reconstruct High-quality 3D objects in Human Brain

Jianxiong Gao, Yuqian Fu, Yun Wang et al.

ECCV 2024arXiv:2312.07485
9
citations
#1161

WRIM-Net: Wide-Ranging Information Mining Network for Visible-Infrared Person Re-Identification

Yonggan Wu, Ling-Chao Meng, Yuan Zichao et al.

ECCV 2024arXiv:2408.10624
9
citations
#1162

PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion

Runsong Zhu, Shi Qiu, Qianyi Wu et al.

ECCV 2024arXiv:2410.10659
9
citations
#1163

ActionVOS: Actions as Prompts for Video Object Segmentation

LIANGYANG OUYANG, Ruicong Liu, Yifei Huang et al.

ECCV 2024arXiv:2407.07402
9
citations
#1164

FlexAttention for Efficient High-Resolution Vision-Language Models

Junyan Li, Delin Chen, Tianle Cai et al.

ECCV 2024arXiv:2407.20228
9
citations
#1165

EGIC: Enhanced Low-Bit-Rate Generative Image Compression Guided by Semantic Segmentation

Nikolai Körber, Eduard Kromer, Andreas Siebert et al.

ECCV 2024arXiv:2309.03244
9
citations
#1166

PredBench: Benchmarking Spatio-Temporal Prediction across Diverse Disciplines

Zidong Wang, Zeyu Lu, Di Huang et al.

ECCV 2024arXiv:2407.08418
9
citations
#1167

DynoSurf: Neural Deformation-based Temporally Consistent Dynamic Surface Reconstruction

Yuxin Yao, Siyu Ren, Junhui Hou et al.

ECCV 2024arXiv:2403.11586
9
citations
#1168

Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation

Zhaoyang Li, Yuan Wang, Wangkai Li et al.

ECCV 2024arXiv:2408.13752
9
citations
#1169

3D Small Object Detection with Dynamic Spatial Pruning

Xiuwei Xu, Zhihao Sun, Ziwei Wang et al.

ECCV 2024arXiv:2305.03716
9
citations
#1170

DiffuX2CT: Diffusion Learning to Reconstruct CT Images from Biplanar X-Rays

Baochang Zhang, Zhi Qiao, Runkun Liu et al.

ECCV 2024arXiv:2407.13545
9
citations
#1171

GeometrySticker: Enabling Ownership Claim of Recolorized Neural Radiance Fields

Xiufeng HUANG, Ka Chun Cheung, Simon See et al.

ECCV 2024arXiv:2407.13390
9
citations
#1172

GenRC: Generative 3D Room Completion from Sparse Image Collections

Ming-Feng Li, Yueh-Feng Ku, Hong-Xuan Yen et al.

ECCV 2024arXiv:2407.12939
9
citations
#1173

AdaDiffSR: Adaptive Region-aware Dynamic acceleration Diffusion Model for Real-World Image Super-Resolution

Yuanting Fan, Chengxu Liu, Nengzhong Yin et al.

ECCV 2024arXiv:2410.17752
9
citations
#1174

Dual-level Adaptive Self-Labeling for Novel Class Discovery in Point Cloud Segmentation

Ruijie Xu, Chuyu Zhang, Hui Ren et al.

ECCV 2024arXiv:2407.12489
9
citations
#1175

FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation

Xinzhi MU, Li Chen, Bohan CHEN et al.

ECCV 2024arXiv:2406.08392
9
citations
#1176

T-CorresNet: Template Guided 3D Point Cloud Completion with Correspondence Pooling Query Generation Strategy

Fan Duan, Jiahao Yu, Li Chen

ECCV 2024arXiv:2407.05008
9
citations
#1177

Rethinking Video Deblurring with Wavelet-Aware Dynamic Transformer and Diffusion Model

chen rao, Guangyuan Li, Zehua Lan et al.

ECCV 2024arXiv:2408.13459
9
citations
#1178

Learning Diffusion Models for Multi-View Anomaly Detection

Chieh Liu, Yu-Min Chu, Ting-I Hsieh et al.

ECCV 2024
9
citations
#1179

Make a Strong Teacher with Label Assistance: A Novel Knowledge Distillation Approach for Semantic Segmentation

Shoumeng Qiu, Jie Chen, Xinrun Li et al.

ECCV 2024arXiv:2407.13254
9
citations
#1180

Head360: Learning a Parametric 3D Full-Head for Free-View Synthesis in 360°

Yuxiao He, Yiyu Zhuang, Yanwen Wang et al.

ECCV 2024arXiv:2408.00296
9
citations
#1181

PQ-SAM: Post-training Quantization for Segment Anything Model

Xiaoyu Liu, Xin Ding, Lei Yu et al.

ECCV 2024
9
citations
#1182

CityGuessr: City-Level Video Geo-Localization on a Global Scale

Parth Parag Kulkarni, Gaurav Kumar Nayak, Shah Mubarak

ECCV 2024arXiv:2411.06344
9
citations
#1183

SCP-Diff: Spatial-Categorical Joint Prior for Diffusion Based Semantic Image Synthesis

Huan-ang Gao, Mingju Gao, Jiaju Li et al.

ECCV 2024arXiv:2403.09638
9
citations
#1184

PiTe: Pixel-Temporal Alignment for Large Video-Language Model

Yang Liu, Pengxiang Ding, Siteng Huang et al.

ECCV 2024arXiv:2409.07239
9
citations
#1185

SMILe: Leveraging Submodular Mutual Information For Robust Few-Shot Object Detection

Anay Majee, Ryan X Sharp, Rishabh Iyer

ECCV 2024arXiv:2407.02665
9
citations
#1186

LG-Gaze: Learning Geometry-aware Continuous Prompts for Language-Guided Gaze Estimation

Pengwei Yin, Jingjing Wang, Guanzhong Zeng et al.

ECCV 2024arXiv:2411.08606
9
citations
#1187

Quantized Prompt for Efficient Generalization of Vision-Language Models

Tianxiang Hao, Xiaohan Ding, Juexiao Feng et al.

ECCV 2024arXiv:2407.10704
9
citations
#1188

On Pretraining Data Diversity for Self-Supervised Learning

Hasan Abed El Kader Hammoud, Tuhin Das, Fabio Pizzati et al.

ECCV 2024arXiv:2403.13808
9
citations
#1189

Preventing Catastrophic Overfitting in Fast Adversarial Training: A Bi-level Optimization Perspective

Zhaoxin Wang, Handing Wang, Cong Tian et al.

ECCV 2024arXiv:2407.12443
9
citations
#1190

WorldPose: A World Cup Dataset for Global 3D Human Pose Estimation

Tianjian Jiang, Johsan Billingham, Sebastian Müksch et al.

ECCV 2024arXiv:2501.02771
9
citations
#1191

Statewide Visual Geolocalization in the Wild

Florian Fervers, Sebastian Bullinger, Christoph Bodensteiner et al.

ECCV 2024arXiv:2409.16763
9
citations
#1192

Bi-TTA: Bidirectional Test-Time Adapter for Remote Physiological Measurement

Haodong LI, Hao LU, Yingcong Chen

ECCV 2024arXiv:2409.17316
9
citations
#1193

SLIM: Spuriousness Mitigation with Minimal Human Annotations

Xiwei Xuan, Ziquan Deng, Hsuan-Tien Lin et al.

ECCV 2024arXiv:2407.05594
9
citations
#1194

Animate Your Motion: Turning Still Images into Dynamic Videos

Mingxiao Li, Bo Wan, Marie-Francine Moens et al.

ECCV 2024arXiv:2403.10179
9
citations
#1195

On the Vulnerability of Skip Connections to Model Inversion Attacks

Jun Hao Koh, Sy-Tuyen Ho, Ngoc-Bao Nguyen et al.

ECCV 2024arXiv:2409.01696
9
citations
#1196

Learning Semantic Latent Directions for Accurate and Controllable Human Motion Prediction

Guowei Xu, Jiale Tao, Wen Li et al.

ECCV 2024arXiv:2407.11494
9
citations
#1197

Quality Assured: Rethinking Annotation Strategies in Imaging AI

Tim Rädsch, Annika Reinke, Vivienn Weru et al.

ECCV 2024arXiv:2407.17596
9
citations
#1198

DualDn: Dual-domain Denoising via Differentiable ISP

Ruikang Li, Yujin Wang, Shiqi Chen et al.

ECCV 2024arXiv:2409.18783
9
citations
#1199

Compress3D: a Compressed Latent Space for 3D Generation from a Single Image

Bowen Zhang, Tianyu Yang, Yu Li et al.

ECCV 2024arXiv:2403.13524
9
citations
#1200

Efficient and Versatile Robust Fine-Tuning of Zero-shot Models

Sungyeon Kim, Boseung Jeong, Donghyun Kim et al.

ECCV 2024arXiv:2408.05749
9
citations