Most Cited 2024 "ventral stream selectivity" Papers

#8001

Restoring Images in Adverse Weather Conditions via Histogram Transformer

Shangquan Sun, Wenqi Ren, Xinwei Gao et al.

ECCV 2024 • arXiv:2407.10172
#8002

G2fR: Frequency Regularization in Grid-based Feature Encoding Neural Radiance Fields

Shuxiang Xie, Shuyi Zhou, Ken Sakurada et al.

ECCV 2024
#8003

The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation

Yi Yao, Chan-Feng Hsu, Jhe-Hao Lin et al.

ECCV 2024 • arXiv:2407.12579
#8004

Eliminating Feature Ambiguity for Few-Shot Segmentation

Qianxiong Xu, Guosheng Lin, Chen Change Loy et al.

ECCV 2024 • arXiv:2407.09842
#8005

GENIXER: Empowering Multimodal Large Language Models as a Powerful Data Generator

Hengyuan Zhao, Pan Zhou, Mike Zheng Shou

ECCV 2024 • arXiv:2312.06731
#8006

PreLAR: World Model Pre-training with Learnable Action Representation

Lixuan Zhang, Meina Kan, Shiguang Shan et al.

ECCV 2024
#8007

FreestyleRet: Retrieving Images from Style-Diversified Queries

Hao Li, Yanhao Jia, Peng Jin et al.

ECCV 2024 • arXiv:2312.02428
#8008

Elevating All Zero-Shot Sketch-Based Image Retrieval Through Multimodal Prompt Learning

Mainak Singha, Ankit Jha, Divyam Gupta et al.

ECCV 2024 • arXiv:2407.04207
#8009

3DFG-PIFu: 3D Feature Grids for Human Digitization from Sparse Views

Kennard Yanting Chan, Fayao Liu, Guosheng Lin et al.

ECCV 2024
#8010

SIGMA: Sinkhorn-Guided Masked Video Modeling

Mohammadreza Salehi, Michael Dorkenwald, Fida Mohammad Thoker et al.

ECCV 2024 • arXiv:2407.15447
#8011

Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis

Basile Van Hoorick, Rundi Wu, Ege Ozguroglu et al.

ECCV 2024 • arXiv:2405.14868
#8012

Distribution Alignment for Fully Test-Time Adaptation with Dynamic Online Data Streams

Ziqiang Wang, Zhixiang Chi, Yanan Wu et al.

ECCV 2024 • arXiv:2407.12128
#8013

SemTrack: A Large-scale Dataset for Semantic Tracking in the Wild

Pengfei Wang, Xiaofei Hui, Jing Wu et al.

ECCV 2024
#8014

Text to Layer-wise 3D Clothed Human Generation

Junting Dong, Qi Fang, Zehuan Huang et al.

ECCV 2024 • arXiv:2404.16748
#8015

Fully Sparse 3D Occupancy Prediction

Haisong Liu, Yang Chen, Haiguang Wang et al.

ECCV 2024 • arXiv:2312.17118
#8016

High-Fidelity 3D Textured Shapes Generation by Sparse Encoding and Adversarial Decoding

Qi Zuo, Xiaodong Gu, Yuan Dong et al.

ECCV 2024
#8017

Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation

Mengchen Zhang, Tong Wu, Tai Wang et al.

ECCV 2024 • arXiv:2409.18261
#8018

Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis

Qi Sun, Hang Zhou, Wengang Zhou et al.

ECCV 2024 • arXiv:2407.05388
#8019

Learning Exhaustive Correlation for Spectral Super-Resolution: Where Spatial-Spectral Attention Meets Linear Dependence

Hongyuan Wang, Lizhi Wang, Jiang Xu et al.

ECCV 2024 • arXiv:2312.12833
#8020

SUMix: Mixup with Semantic and Uncertain Information

Huafeng Qin, Xin Jin, Hongyu Zhu et al.

ECCV 2024 • arXiv:2407.07805
#8021

EAFormer: Scene Text Segmentation with Edge-Aware Transformers

Haiyang Yu, Teng Fu, Bin Li et al.

ECCV 2024 • arXiv:2407.17020
#8022

Zero-Shot Detection of AI-Generated Images

Davide Cozzolino, Giovanni Poggi, Matthias Niessner et al.

ECCV 2024 • arXiv:2409.15875
#8023

TCC-Det: Temporarily consistent cues for weakly-supervised 3D detection

Jan Skvrna, Lukas Neumann

ECCV 2024
#8024

Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis

Yuanhao Cai, Yixun Liang, Jiahao Wang et al.

ECCV 2024 • arXiv:2403.04116
#8025

Improving Intervention Efficacy via Concept Realignment in Concept Bottleneck Models

Nishad Singhi, Jae Myung Kim, Karsten Roth et al.

ECCV 2024 • arXiv:2405.01531
#8026

Gaze Target Detection Based on Head-Local-Global Coordination

Yaokun Yang, Feng Lu

ECCV 2024
#8027

3DSA: Multi-View 3D Human Pose Estimation With 3D Space Attention Mechanisms

Po Han Chen, Chia-Chi Tsai

ECCV 2024
#8028

An Economic Framework for 6-DoF Grasp Detection

Xiao-Ming Wu, Jia-Feng Cai, Jian-Jian Jiang et al.

ECCV 2024 • arXiv:2407.08366
#8029

CLIP-Guided Generative Networks for Transferable Targeted Adversarial Attacks

Hao Fang, Jiawei Kong, Bin Chen et al.

ECCV 2024 • arXiv:2407.10179
#8030

Progressive Classifier and Feature Extractor Adaptation for Unsupervised Domain Adaptation on Point Clouds

Zicheng Wang, Zhen Zhao, Yiming Wu et al.

ECCV 2024 • arXiv:2311.16474
#8031

RISurConv: Rotation Invariant Surface Attention-Augmented Convolutions for 3D Point Cloud Classification and Segmentation

Zhiyuan Zhang, Licheng Yang, Zhiyu Xiang

ECCV 2024 • arXiv:2408.06110
#8032

Projecting Points to Axes: Oriented Object Detection via Point-Axis Representation

Zeyang Zhao, Qilong Xue, Yifan Bai et al.

ECCV 2024 • arXiv:2407.08489
#8033

SparseSSP: 3D Subcellular Structure Prediction from Sparse-View Transmitted Light Images

Jintu Zheng, Yi Ding, Qizhe Liu et al.

ECCV 2024 • arXiv:2407.02159
#8034

MemBN: Robust Test-Time Adaptation via Batch Norm with Statistics Memory

Juwon Kang, Nayeong Kim, Jungseul Ok et al.

ECCV 2024
#8035

Synchronous Diffusion for Unsupervised Smooth Non-Rigid 3D Shape Matching

Dongliang Cao, Zorah Laehner, Florian Bernard

ECCV 2024 • arXiv:2407.08244
#8036

EgoPoser: Robust Real-Time Egocentric Pose Estimation from Sparse and Intermittent Observations Everywhere

Jiaxi Jiang, Paul Streli, Manuel Meier et al.

ECCV 2024 • arXiv:2308.06493
#8037

Decoupling Common and Unique Representations for Multimodal Self-supervised Learning

Yi Wang, Conrad M Albrecht, Nassim Ait Ali Braham et al.

ECCV 2024 • arXiv:2309.05300
#8038

D-SCo: Dual-Stream Conditional Diffusion for Monocular Hand-Held Object Reconstruction

Bowen Fu, Gu Wang, Chenyangguang Zhang et al.

ECCV 2024 • arXiv:2311.14189
#8039

Linearly Controllable GAN: Unsupervised Feature Categorization and Decomposition for Image Generation and Manipulation

Sehyung Lee, Mijung Kim, Yeongnam Chae et al.

ECCV 2024
#8040

CoherentGS: Sparse Novel View Synthesis with Coherent 3D Gaussians

Avinash Paliwal, Wei Ye, Jinhui Xiong et al.

ECCV 2024 • arXiv:2403.19495
#8041

Unleashing the Power of Prompt-driven Nucleus Instance Segmentation

Zhongyi Shui, Yunlong Zhang, Kai Yao et al.

ECCV 2024 • arXiv:2311.15939
#8042

3DEgo: 3D Editing on the Go!

Umar Khalid, Hasan Iqbal, Azib Farooq et al.

ECCV 2024 • arXiv:2407.10102
#8043

Domain-adaptive Video Deblurring via Test-time Blurring

Jin-Ting He, Fu-Jen Tsai, Jia-Hao Wu et al.

ECCV 2024 • arXiv:2407.09059
#8044

DreamMotion: Space-Time Self-Similar Score Distillation for Zero-Shot Video Editing

Hyeonho Jeong, Jinho Chang, Geon Yeong Park et al.

ECCV 2024 • arXiv:2403.12002
#8045

Cross-Domain Learning for Video Anomaly Detection with Limited Supervision

Yashika Jain, Ali Dabouei, Min Xu

ECCV 2024 • arXiv:2408.05191
#8046

OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models

Kong Zhe, Yong Zhang, Tianyu Yang et al.

ECCV 2024 • arXiv:2403.10983
#8047

Enhancing Optimization Robustness in 1-bit Neural Networks through Stochastic Sign Descent

NianHui Guo, Hong Guo, Christoph Meinel et al.

ECCV 2024
#8048

BeNeRF: Neural Radiance Fields from a Single Blurry Image and Event Stream

Wenpu Li, Pian Wan, Peng Wang et al.

ECCV 2024 • arXiv:2407.02174
#8049

SCP-Diff: Spatial-Categorical Joint Prior for Diffusion Based Semantic Image Synthesis

Huan-ang Gao, Mingju Gao, Jiaju Li et al.

ECCV 2024 • arXiv:2403.09638
#8050

PoseAugment: Generative Human Pose Data Augmentation with Physical Plausibility for IMU-based Motion Capture

Zhuojun Li, Chun Yu, Chen Liang et al.

ECCV 2024 • arXiv:2409.14101
#8051

GiT: Towards Generalist Vision Transformer through Universal Language Interface

Haiyang Wang, Hao Tang, Li Jiang et al.

ECCV 2024 • arXiv:2403.09394
#8052

Improving Unsupervised Domain Adaptation: A Pseudo-Candidate Set Approach

Aveen Dayal, Rishabh Lalla, Linga Reddy Cenkeramaddi et al.

ECCV 2024
#8053

Integer-Valued Training and Spike-driven Inference Spiking Neural Network for High-performance and Energy-efficient Object Detection

Xinhao Luo, Man Yao, Yuhong Chou et al.

ECCV 2024 • arXiv:2407.20708
#8054

S-JEPA: A Joint Embedding Predictive Architecture for Skeletal Action Recognition

Mohamed Abdelfattah, Alexandre Alahi

ECCV 2024
#8055

ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation

Yi Zhang, Yun Tang, Wenjie Ruan et al.

ECCV 2024 • arXiv:2402.15429
#8056

Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos

Akshay Paruchuri, Samuel Ehrenstein, Shuxian Wang et al.

ECCV 2024 • arXiv:2403.17915
#8057

6DoF Head Pose Estimation through Explicit Bidirectional Interaction with Face Geometry

Sungho Chun, Ju Yong Chang

ECCV 2024
#8058

Masked Angle-Aware Autoencoder for Remote Sensing Images

Zhihao Li, Biao Hou, Siteng Ma et al.

ECCV 2024 • arXiv:2408.01946
#8059

Multi-modal Relation Distillation for Unified 3D Representation Learning

Huiqun Wang, Yiping Bao, Panwang Pan et al.

ECCV 2024 • arXiv:2407.14007
#8060

Diff3DETR: Agent-based Diffusion Model for Semi-supervised 3D Object Detection

Jiacheng Deng, Jiahao Lu, Tianzhu Zhang

ECCV 2024
#8061

Segment3D: Learning Fine-Grained Class-Agnostic 3D Segmentation without Manual Labels

Rui Huang, Songyou Peng, Ayca Takmaz et al.

ECCV 2024 • arXiv:2312.17232
#8062

Efficient Training of Spiking Neural Networks with Multi-Parallel Implicit Stream Architecture

Zhigao Cao, Meng Li, Xiashuang Wang et al.

ECCV 2024
#8063

Deep Patch Visual SLAM

Lahav Lipson, Zachary Teed, Jia Deng

ECCV 2024 • arXiv:2408.01654
#8064

LiteSAM is Actually what you Need for segment Everything

Jianhai Fu, Yuanjie Yu, Ningchuan Li et al.

ECCV 2024
#8065

Visual Prompting via Partial Optimal Transport

Mengyu Zheng, Zhiwei Hao, Yehui Tang et al.

ECCV 2024
#8066

AdaCLIP: Adapting CLIP with Hybrid Learnable Prompts for Zero-Shot Anomaly Detection

Yunkang Cao, Jiangning Zhang, Luca Frittoli et al.

ECCV 2024 • arXiv:2407.15795
#8067

Pathformer3D: A 3D Scanpath Transformer for 360° Images

Rong Quan, Yantao Lai, Mengyu Qiu et al.

ECCV 2024 • arXiv:2407.10563
#8068

Asymmetric Mask Scheme for Self-Supervised Real Image Denoising

Xiangyu Liao, Tianheng Zheng, Jiayu Zhong et al.

ECCV 2024 • arXiv:2407.06514
#8069

FlexAttention for Efficient High-Resolution Vision-Language Models

Junyan Li, Delin Chen, Tianle Cai et al.

ECCV 2024 • arXiv:2407.20228
#8070

EGIC: Enhanced Low-Bit-Rate Generative Image Compression Guided by Semantic Segmentation

Nikolai Körber, Eduard Kromer, Andreas Siebert et al.

ECCV 2024 • arXiv:2309.03244
#8071

Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection

Xingyu Peng, Yan Bai, Chen Gao et al.

ECCV 2024 • arXiv:2407.08931
#8072

E3V-K5: An Authentic Benchmark for Redefining Video-Based Energy Expenditure Estimation

Shengxuming Zhang, Lei Jin, Yifan Wang et al.

ECCV 2024
#8073

Robust Incremental Structure-from-Motion with Hybrid Features

Shaohui Liu, Yidan Gao, Tianyi Zhang et al.

ECCV 2024 • arXiv:2409.19811
#8074

Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models

Longxiang Tang, Zhuotao Tian, Kai Li et al.

ECCV 2024 • arXiv:2407.05342
#8075

Trajectory-aligned Space-time Tokens for Few-shot Action Recognition

Pulkit Kumar, Namitha Padmanabhan, Luke Luo et al.

ECCV 2024 • arXiv:2407.18249
#8076

U-COPE: Taking a Further Step to Universal 9D Category-level Object Pose Estimation

Li Zhang, Weiqing Meng, Yan Zhong et al.

ECCV 2024
#8077

Neural graphics texture compression supporting random access

Farzad Farhadzadeh, Qiqi Hou, Hoang Le et al.

ECCV 2024 • arXiv:2407.00021
#8078

ControlCap: Controllable Region-level Captioning

Yuzhong Zhao, Liu Yue, Zonghao Guo et al.

ECCV 2024 • arXiv:2401.17910
#8079

Watch Your Steps: Local Image and Scene Editing by Text Instructions

Ashkan Mirzaei, Tristan T Aumentado-Armstrong, Marcus A Brubaker et al.

ECCV 2024 • arXiv:2308.08947
#8080

A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties

Junfei Xiao, Ziqi Zhou, Wenxuan Li et al.

ECCV 2024 • arXiv:2312.13764
#8081

Fast View Synthesis of Casual Videos with Soup-of-Planes

Yao-Chih Lee, Zhoutong Zhang, Kevin Blackburn-Matzen et al.

ECCV 2024 • arXiv:2312.02135
#8082

Confidence Self-Calibration for Multi-Label Class-Incremental Learning

Kaile Du, Yifan Zhou, Fan Lyu et al.

ECCV 2024 • arXiv:2403.12559
#8083

SlotLifter: Slot-guided Feature Lifting for Learning Object-Centric Radiance Fields

Yu Liu, Baoxiong Jia, Yixin Chen et al.

ECCV 2024 • arXiv:2408.06697
#8084

Towards High-Quality 3D Motion Transfer with Realistic Apparel Animation

Rong Wang, Wei Mao, Changsheng Lu et al.

ECCV 2024 • arXiv:2407.11266
#8085

AnyHome: Open-Vocabulary Large-Scale Indoor Scene Generation with First-Person View Exploration

Rao Fu, Zehao Wen, Zichen Liu et al.

ECCV 2024
#8086

iMatching: Imperative Correspondence Learning

Chen Wang, Dasong Gao, Yun-Jou Lin et al.

ECCV 2024
#8087

Appearance-based Refinement for Object-Centric Motion Segmentation

Junyu Xie, Weidi Xie, Andrew Zisserman

ECCV 2024 • arXiv:2312.11463
#8088

MTKD: Multi-Teacher Knowledge Distillation for Image Super-Resolution

Yuxuan Jiang, Chen Feng, Fan Zhang et al.

ECCV 2024 • arXiv:2404.09571
#8089

DEAL: Disentangle and Localize Concept-level Explanations for VLMs

Tang Li, Mengmeng Ma, Xi Peng

ECCV 2024 • arXiv:2407.14412
#8090

ReMatching: Low-Resolution Representations for Scalable Shape Correspondence

Filippo Maggioli, Daniele Baieri, Emanuele Rodola et al.

ECCV 2024 • arXiv:2305.09274
#8091

Global Structure-from-Motion Revisited

Linfei Pan, Daniel Barath, Marc Pollefeys et al.

ECCV 2024 • arXiv:2407.20219
#8092

MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation

Kunpeng Song, Yizhe Zhu, Bingchen Liu et al.

ECCV 2024 • arXiv:2404.05674
#8093

HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts

Wonjae Kim, Sanghyuk Chun, Taekyung Kim et al.

ECCV 2024 • arXiv:2404.17507
#8094

Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers

Chi-Pin Huang, Kai-Po Chang, Chung-Ting Tsai et al.

ECCV 2024 • arXiv:2311.17717
#8095

Bridging the Gap: Studio-like Avatar Creation from a Monocular Phone Capture

ShahRukh Athar, Shunsuke Saito, Stanislav Pidhorskyi et al.

ECCV 2024 • arXiv:2407.19593
#8096

Expressive Whole-Body 3D Gaussian Avatar

Gyeongsik Moon, Takaaki Shiratori, Shunsuke Saito

ECCV 2024 • arXiv:2407.21686
#8097

Strike a Balance in Continual Panoptic Segmentation

Jinpeng Chen, Runmin Cong, Yuxuan Luo et al.

ECCV 2024 • arXiv:2407.16354
#8098

TrajPrompt: Aligning Color Trajectory with Vision-Language Representations

Li-Wu Tsao, Hao-Tang Tsui, Yu-Rou Tuan et al.

ECCV 2024
#8099

Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects

Zicong Fan, Takehiko Ohkawa, Linlin Yang et al.

ECCV 2024 • arXiv:2403.16428
#8100

Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors

Jae Joong Lee, Bosheng Li, Sara Beery et al.

ECCV 2024 • arXiv:2407.10330
#8101

DomainFusion: Generalizing To Unseen Domains with Latent Diffusion Models

Yuyang Huang, Yabo Chen, Yuchen Liu et al.

ECCV 2024
#8102

Unsqueeze [CLS] Bottleneck to Learn Rich Representations

Qing Su, Shihao Ji

ECCV 2024 • arXiv:2407.17671
#8103

Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion Models

Yasi Zhang, Peiyu Yu, Ying Nian Wu

ECCV 2024 • arXiv:2404.07389
#8104

Three Things We Need to Know About Transferring Stable Diffusion to Visual Dense Prediction Tasks

Manyuan Zhang, Guanglu Song, Xiaoyu Shi et al.

ECCV 2024
#8105

A Direct Approach to Viewing Graph Solvability

Federica Arrigoni, Andrea Fusiello, Tomas Pajdla

ECCV 2024
#8106

Parrot Captions Teach CLIP to Spot Text

Yiqi Lin, Conghui He, Alex Jinpeng Wang et al.

ECCV 2024 • arXiv:2312.14232
#8107

Versatile Incremental Learning: Towards Class and Domain-Agnostic Incremental Learning

Minyeong Park, Jae-Ho Lee, Gyeong-Moon Park

ECCV 2024 • arXiv:2409.10956
#8108

Solving Motion Planning Tasks with a Scalable Generative Model

Yihan Hu, Siqi Chai, Zhening Yang et al.

ECCV 2024 • arXiv:2407.02797
#8109

PLOT: Text-based Person Search with Part Slot Attention for Corresponding Part Discovery

Jicheol Park, Dongwon Kim, Boseung Jeong et al.

ECCV 2024 • arXiv:2409.13475
#8110

Prompt-Driven Contrastive Learning for Transferable Adversarial Attacks

Hunmin Yang, Jongoh Jeong, Kuk-Jin Yoon

ECCV 2024 • arXiv:2407.20657
#8111

Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change Captioning

Yunbin Tu, Liang Li, Li Su et al.

ECCV 2024 • arXiv:2407.11683
#8112

Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation

Fu-Yun Wang, Xiaoshi Wu, Zhaoyang Huang et al.

ECCV 2024 • arXiv:2403.13745
#8113

Physically Plausible Color Correction for Neural Radiance Fields

Qi Zhang, Ying Feng, Hongdong Li

ECCV 2024
#8114

LLM as Copilot for Coarse-grained Vision-and-Language Navigation

Yanyuan Qiao, Qianyi Liu, Jiajun Liu et al.

ECCV 2024
#8115

Motion-Guided Latent Diffusion for Temporally Consistent Real-world Video Super-resolution

Xi Yang, Chenhang He, Jianqi Ma et al.

ECCV 2024 • arXiv:2312.00853
#8116

MAD-DR: Map Compression for Visual Localization with Matchness Aware Descriptor Dimension Reduction

Qiang Wang

ECCV 2024
#8117

SAFARI: Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation

Sayan Nag, Koustava Goswami, Srikrishna Karanam

ECCV 2024
#8118

BlazeBVD: Make Scale-Time Equalization Great Again for Blind Video Deflickering

Xinmin Qiu, Congying Han, Zicheng Zhang et al.

ECCV 2024 • arXiv:2403.06243
#8119

A Closer Look at GAN Priors: Exploiting Intermediate Features for Enhanced Model Inversion Attacks

Yixiang Qiu, Hao Fang, Hongyao Yu et al.

ECCV 2024 • arXiv:2407.13863
#8120

Relightable 3D Gaussians: Realistic Point Cloud Relighting with BRDF Decomposition and Ray Tracing

Jian Gao, Chun Gu, Youtian Lin et al.

ECCV 2024 • arXiv:2311.16043
#8121

Text2Place: Affordance-aware Text Guided Human Placement

Rishubh Parihar, Harsh Gupta, Sachidanand VS et al.

ECCV 2024 • arXiv:2407.15446
#8122

TAPTR: Tracking Any Point with Transformers as Detection

Hongyang Li, Hao Zhang, Shilong Liu et al.

ECCV 2024 • arXiv:2403.13042
#8123

Textual Grounding for Open-vocabulary Visual Information Extraction in Layout-diversified Documents

Mengjun Cheng, Chengquan Zhang, Chang Liu et al.

ECCV 2024
#8124

Textual Knowledge Matters: Cross-Modality Co-Teaching for Generalized Visual Class Discovery

Haiyang Zheng, Pu Nan, Wenjing Li et al.

ECCV 2024 • arXiv:2403.07369
#8125

Multimodal Cross-Domain Few-Shot Learning for Egocentric Action Recognition

Masashi Hatano, Ryo Hachiuma, Ryo Fujii et al.

ECCV 2024 • arXiv:2405.19917
#8126

AdvDiff: Generating Unrestricted Adversarial Examples using Diffusion Models

Xuelong Dai, Kaisheng Liang, Bin Xiao

ECCV 2024 • arXiv:2307.12499
#8127

ST-LDM: A Universal Framework for Text-Grounded Object Generation in Real Images

Xiangtian Xue, Jiasong Wu, Youyong Kong et al.

ECCV 2024 • arXiv:2403.10004
#8128

Category Adaptation Meets Projected Distillation in Generalized Continual Category Discovery

Grzegorz Rypesc, Daniel Marczak, Sebastian Cygert et al.

ECCV 2024
#8129

Uncertainty Calibration with Energy Based Instance-wise Scaling in the Wild Dataset

Mijoo Kim, Junseok Kwon

ECCV 2024 • arXiv:2407.12330
#8130

OLAF: A Plug-and-Play Framework for Enhanced Multi-object Multi-part Scene Parsing

Pranav Gupta, Rishubh Singh, Pradeep Shenoy et al.

ECCV 2024 • arXiv:2411.02858
#8131

DiffPMAE: Diffusion Masked Autoencoders for Point Cloud Reconstruction

Yanlong Li, Chamara Madarasingha, Kanchana Thilakarathna

ECCV 2024 • arXiv:2312.03298
#8132

Motion Aware Event Representation-driven Image Deblurring

Zhijing Sun, Xueyang Fu, Longzhuo Huang et al.

ECCV 2024
#8133

GroupDiff: Diffusion-based Group Portrait Editing

Yuming Jiang, Nanxuan Zhao, Qing Liu et al.

ECCV 2024 • arXiv:2409.14379
#8134

Privacy-Preserving Adaptive Re-Identification without Image Transfer

Hamza Rami, Jhony H. Giraldo, Nicolas Winckler et al.

ECCV 2024 • arXiv:2407.12589
#8135

TexDreamer: Towards Zero-Shot High-Fidelity 3D Human Texture Generation

Yufei Liu, Junwei Zhu, Junshu Tang et al.

ECCV 2024 • arXiv:2403.12906
#8136

Geospecific View Generation - Geometry-Context Aware High-resolution Ground View Inference from Satellite Views

Ningli Xu, Rongjun Qin

ECCV 2024 • arXiv:2407.08061
#8137

Co-Student: Collaborating Strong and Weak Students for Sparsely Annotated Object Detection

Lianjun Wu, Jiangxiao Han, Zengqiang Zheng et al.

ECCV 2024
#8138

Revisiting Feature Disentanglement Strategy in Diffusion Training and Breaking Conditional Independence Assumption in Sampling

Wonwoong Cho, Hareesh Ravi, Midhun Harikumar et al.

ECCV 2024
#8139

ProMerge: Prompt and Merge for Unsupervised Instance Segmentation

Dylan Li, Gyungin Shin

ECCV 2024 • arXiv:2409.18961
#8140

Open-Vocabulary Camouflaged Object Segmentation

Youwei Pang, Xiaoqi Zhao, JiaMing Zuo et al.

ECCV 2024 • arXiv:2311.11241
#8141

GLARE: Low Light Image Enhancement via Generative Latent Feature based Codebook Retrieval

Han Zhou, Wei Dong, Xiaohong Liu et al.

ECCV 2024 • arXiv:2407.12431
#8142

Fully Authentic Visual Question Answering Dataset from Online Communities

Chongyan Chen, Mengchen Liu, Noel C Codella et al.

ECCV 2024 • arXiv:2311.15562
#8143

Panel-Specific Degradation Representation for Raw Under-Display Camera Image Restoration

Youngjin Oh, Keuntek Lee, Jooyoung Lee et al.

ECCV 2024
#8144

Diffusion-Guided Weakly Supervised Semantic Segmentation

Sung-Hoon Yoon, Hoyong Kwon, Jaeseok Jeong et al.

ECCV 2024
#8145

Online Vectorized HD Map Construction using Geometry

Zhixin Zhang, Yiyuan Zhang, Xiaohan Ding et al.

ECCV 2024 • arXiv:2312.03341
#8146

Click-Gaussian: Interactive Segmentation to Any 3D Gaussians

Seokhun Choi, Hyeonseop Song, Jaechul Kim et al.

ECCV 2024 • arXiv:2407.11793
#8147

MANIKIN: Biomechanically Accurate Neural Inverse Kinematics for Human Motion Estimation

Jiaxi Jiang, Paul Streli, Xuejing Luo et al.

ECCV 2024
#8148

Disentangled Generation and Aggregation for Robust Radiance Fields

Shihe Shen, Huachen Gao, Wangze Xu et al.

ECCV 2024 • arXiv:2409.15715
#8149

Online Temporal Action Localization with Memory-Augmented Transformer

Youngkil Song, Dongkeun Kim, Minsu Cho et al.

ECCV 2024 • arXiv:2408.02957
#8150

JointDreamer: Ensuring Geometry Consistency and Text Congruence in Text-to-3D Generation via Joint Score Distillation

ChenHan Jiang, Yihan Zeng, Tianyang Hu et al.

ECCV 2024 • arXiv:2407.12291
#8151

FLAT: Flux-aware Imperceptible Adversarial Attacks on 3D Point Clouds

Keke Tang, Lujie Huang, Weilong Peng et al.

ECCV 2024
#8152

Multi-branch Collaborative Learning Network for 3D Visual Grounding

Zhipeng Qian, Yiwei Ma, Zhekai Lin et al.

ECCV 2024 • arXiv:2407.05363
#8153

Progressive Proxy Anchor Propagation for Unsupervised Semantic Segmentation

Hyun Seok Seong, WonJun Moon, SuBeen Lee et al.

ECCV 2024 • arXiv:2407.12463
#8154

Revisit Human-Scene Interaction via Space Occupancy

Xinpeng Liu, Haowen Hou, Yanchao Yang et al.

ECCV 2024 • arXiv:2312.02700
#8155

Towards Stable 3D Object Detection

Jiabao Wang, Qiang Meng, Guochao Liu et al.

ECCV 2024 • arXiv:2407.04305
#8156

FYI: Flip Your Images for Dataset Distillation

Byunggwan Son, Youngmin Oh, Donghyeon Baek et al.

ECCV 2024 • arXiv:2407.08113
#8157

Attention Decomposition for Cross-Domain Semantic Segmentation

Liqiang He, Sinisa Todorovic

ECCV 2024
#8158

Controllable Contextualized Image Captioning: Directing the Visual Narrative through User-Defined Highlights

Shunqi Mao, Chaoyi Zhang, Hang Su et al.

ECCV 2024 • arXiv:2407.11449
#8159

Diffusion Models as Optimizers for Efficient Planning in Offline RL

Renming Huang, Yunqiang Pei, Guoqing Wang et al.

ECCV 2024 • arXiv:2407.16142
#8160

HiDiffusion: Unlocking Higher-Resolution Creativity and Efficiency in Pretrained Diffusion Models

Shen Zhang, Zhaowei Chen, Zhenyu Zhao et al.

ECCV 2024 • arXiv:2311.17528
#8161

Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning

Meixuan Li, Tianyu Li, Guoqing Wang et al.

ECCV 2024 • arXiv:2403.10252
#8162

MasterWeaver: Taming Editability and Face Identity for Personalized Text-to-Image Generation

Yuxiang Wei, Zhilong Ji, Jinfeng Bai et al.

ECCV 2024 • arXiv:2405.05806
#8163

PointRegGPT: Boosting 3D Point Cloud Registration using Generative Point-Cloud Pairs for Training

Suyi Chen, Hao Xu, Haipeng Li et al.

ECCV 2024 • arXiv:2407.14054
#8164

DIFFender: Diffusion-Based Adversarial Defense against Patch Attacks

Caixin Kang, Yinpeng Dong, Zhengyi Wang et al.

ECCV 2024 • arXiv:2306.09124
#8165

DriveLM: Driving with Graph Visual Question Answering

Chonghao Sima, Katrin Renz, Kashyap Chitta et al.

ECCV 2024 • arXiv:2312.14150
#8166

Q&A Prompts: Discovering Rich Visual Clues through Mining Question-Answer Prompts for VQA requiring Diverse World Knowledge

Haibo Wang, Weifeng Ge

ECCV 2024 • arXiv:2401.10712
#8167

CatchBackdoor: Backdoor Detection via Critical Trojan Neural Path Fuzzing

Haibo Jin, Ruoxi Chen, Jinyin Chen et al.

ECCV 2024 • arXiv:2112.13064
#8168

HARIVO: Harnessing Text-to-Image Models for Video Generation

Mingi Kwon, Seoung Wug Oh, Yang Zhou et al.

ECCV 2024 • arXiv:2410.07763
#8169

WRIM-Net: Wide-Ranging Information Mining Network for Visible-Infrared Person Re-Identification

Yonggan Wu, Ling-Chao Meng, Yuan Zichao et al.

ECCV 2024 • arXiv:2408.10624
#8170

Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models

Chao Gong, Kai Chen, Zhipeng Wei et al.

ECCV 2024 • arXiv:2407.12383
#8171

Length-Aware Motion Synthesis via Latent Diffusion

Alessio Sampieri, Alessio Palma, Indro Spinelli et al.

ECCV 2024 • arXiv:2407.11532
#8172

Improving image synthesis with diffusion-negative sampling

Alakh Desai, Nuno Vasconcelos

ECCV 2024 • arXiv:2411.05473
#8173

SignGen: End-to-End Sign Language Video Generation with Latent Diffusion

Fan Qi, Yu Duan, Changsheng Xu et al.

ECCV 2024
#8174

Idling Neurons, Appropriately Lenient Workload During Fine-tuning Leads to Better Generalization

Hongjing Niu, Hanting Li, Bin Li et al.

ECCV 2024
#8175

GRA: Detecting Oriented Objects through Group-wise Rotating and Attention

Jiangshan Wang, Yifan Pu, Yizeng Han et al.

ECCV 2024 • arXiv:2403.11127
#8176

Track Everything Everywhere Fast and Robustly

Yunzhou Song, Jiahui Lei, Ziyun Wang et al.

ECCV 2024 • arXiv:2403.17931
#8177

Label-free Neural Semantic Image Synthesis

Jiayi Wang, Kevin Alexander Laube, Yumeng Li et al.

ECCV 2024 • arXiv:2407.01790
#8178

Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning

Fanyue Wei, Wei Zeng, Zhenyang Li et al.

ECCV 2024 • arXiv:2407.06642
#8179

HiEI: A Universal Framework for Generating High-quality Emerging Images from Natural Images

Jingmeng Li, Lukang Fu, Surun Yang et al.

ECCV 2024
#8180

Nonverbal Interaction Detection

Jianan Wei, Tianfei Zhou, Yi Yang et al.

ECCV 2024 • arXiv:2407.08133
#8181

The Sky's the Limit: Relightable Outdoor Scenes via a Sky-pixel Constrained Illumination Prior and Outside-In Visibility

James Gardner, Evgenii Kashin, Bernhard Egger et al.

ECCV 2024
#8182

Neural Spectral Decomposition for Dataset Distillation

Yang Shaolei, Shen Cheng, Mingbo Hong et al.

ECCV 2024 • arXiv:2408.16236
#8183

Causality-inspired Discriminative Feature Learning in Triple Domains for Gait Recognition

Haijun Xiong, Bin Feng, Xinggang Wang et al.

ECCV 2024 • arXiv:2407.12519
#8184

Understanding and Mitigating Human-Labelling Errors in Supervised Contrastive Learning

Zijun Long, Lipeng Zhuang, George W Killick et al.

ECCV 2024 • arXiv:2403.06289
#8185

Mew: Multiplexed Immunofluorescence Image Analysis through an Efficient Multiplex Network

Sukwon Yun, Jie Peng, Alexandro E Trevino et al.

ECCV 2024 • arXiv:2407.17857
#8186

HERGen: Elevating Radiology Report Generation with Longitudinal Data

Fuying Wang, Shenghui Du, Lequan Yu

ECCV 2024 • arXiv:2407.15158
#8187

Labeled Data Selection for Category Discovery

Bingchen Zhao, Nico Lang, Serge Belongie et al.

ECCV 2024 • arXiv:2406.04898
#8188

Hierarchical Unsupervised Relation Distillation for Source Free Domain Adaptation

Bowei Xing, Xianghua Ying, Ruibin Wang et al.

ECCV 2024
#8189

GMT: Enhancing Generalizable Neural Rendering via Geometry-Driven Multi-Reference Texture Transfer

Youngho Yoon, Hyun-Kurl Jang, Kuk-Jin Yoon

ECCV 2024 • arXiv:2410.00672
#8190

SNeRV: Spectra-preserving Neural Representation for Video

Jina Kim, Jihoo Lee, Jewon Kang

ECCV 2024 • arXiv:2501.01681
#8191

WPS-SAM: Towards Weakly-Supervised Part Segmentation with Foundation Models

Xinjian Wu, Ruisong Zhang, Jie Qin et al.

ECCV 2024 • arXiv:2407.10131
#8192

Analysis-by-Synthesis Transformer for Single-View 3D Reconstruction

Dian Jia, Xiaoqian Ruan, Kun Xia et al.

ECCV 2024
#8193

DMiT: Deformable Mipmapped Tri-Plane Representation for Dynamic Scenes

Jing-Wen Yang, Jia-Mu Sun, Yong-Liang Yang et al.

ECCV 2024
#8194

Bayesian Self-Training for Semi-Supervised 3D Segmentation

Ozan Unal, Christos Sakaridis, Luc Van Gool

ECCV 2024 • arXiv:2409.08102
#8195

Motion-Oriented Compositional Neural Radiance Fields for Monocular Dynamic Human Modeling

Jaehyeok Kim, Dongyoon Wee, Dan Xu

ECCV 2024 • arXiv:2407.11962
#8196

Tiny Models are the Computational Saver for Large Models

Qingyuan Wang, Barry Cardiff, Antoine Frappé et al.

ECCV 2024 • arXiv:2403.17726
#8197

AddressCLIP: Empowering Vision-Language Models for City-wide Image Address Localization

Shixiong Xu, Chenghao Zhang, Lubin Fan et al.

ECCV 2024 • arXiv:2407.08156
#8198

Unveiling Privacy Risks in Stochastic Neural Networks Training: Effective Image Reconstruction from Gradients

Yiming Chen, Xiangyu Yang, Nikos Deligiannis

ECCV 2024
#8199

Head360: Learning a Parametric 3D Full-Head for Free-View Synthesis in 360°

Yuxiao He, Yiyu Zhuang, Yanwen Wang et al.

ECCV 2024 • arXiv:2408.00296
#8200

KMTalk: Speech-Driven 3D Facial Animation with Key Motion Embedding

Zhihao Xu, Shengjie Gong, Jiapeng Tang et al.

ECCV 2024 • arXiv:2409.01113