Most Cited 2024 "conditional score networks" Papers

12,324 papers found • Page 42 of 62

#8201

Post-training Quantization with Progressive Calibration and Activation Relaxing for Text-to-Image Diffusion Models

Siao Tang, Xin Wang, Hong Chen et al.

ECCV 2024
#8202

DISCO: Embodied Navigation and Interaction via Differentiable Scene Semantics and Dual-level Control

Xinyu Xu, Shengcheng Luo, Yanchao Yang et al.

ECCV 2024arXiv:2407.14758
#8203

LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents

Shilong Liu, Hao Cheng, Haotian Liu et al.

ECCV 2024arXiv:2311.05437
#8204

Align before Collaborate: Mitigating Feature Misalignment for Robust Multi-Agent Perception

Dingkang Yang, Ke Li, Dongling Xiao et al.

ECCV 2024
#8205

Textual Query-Driven Mask Transformer for Domain Generalized Segmentation

Byeonghyun Pak, Byeongju Woo, Sunghwan Kim et al.

ECCV 2024arXiv:2407.09033
#8206

Arbitrary-Scale Video Super-Resolution with Structural and Textural Priors

Wei Shang, Dongwei Ren, Wanying Zhang et al.

ECCV 2024arXiv:2407.09919
#8207

Regulating Model Reliance on Non-Robust Features by Smoothing Input Marginal Density

Peiyu Yang, Naveed Akhtar, Shah Mubarak et al.

ECCV 2024arXiv:2407.04370
#8208

Combining Generative and Geometry Priors for Wide-Angle Portrait Correction

Lan Yao, Chaofeng Chen, Xiaoming Li et al.

ECCV 2024arXiv:2410.09911
#8209

To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Unsafe Images ... For Now

Yimeng Zhang, jinghan jia, Xin Chen et al.

ECCV 2024arXiv:2310.11868
#8210

StereoGlue: Joint Feature Matching and Robust Estimation

Daniel Barath, Dmytro Mishkin, Luca Cavalli et al.

ECCV 2024
#8211

MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model

Muyao Niu, Xiaodong Cun, Xintao Wang et al.

ECCV 2024arXiv:2405.20222
#8212

Object-Aware NIR-to-Visible Translation

Yunyi Gao, Lin Gu, Qiankun Liu et al.

ECCV 2024
#8213

DualDn: Dual-domain Denoising via Differentiable ISP

Ruikang Li, Yujin Wang, Shiqi Chen et al.

ECCV 2024arXiv:2409.18783
#8214

Syn-to-Real Domain Adaptation for Point Cloud Completion via Part-based Approach

Yunseo Yang, Jihun Kim, Kuk-Jin Yoon

ECCV 2024
#8215

Finding Meaning in Points: Weakly Supervised Semantic Segmentation for Event Cameras

Hoonhee Cho, Sung-Hoon Yoon, Hyeokjun Kweon et al.

ECCV 2024arXiv:2407.11216
#8216

StableDrag: Stable Dragging for Point-based Image Editing

Yutao Cui, Xiaotong Zhao, Guozhen Zhang et al.

ECCV 2024arXiv:2403.04437
#8217

Phase Concentration and Shortcut Suppression for Weakly Supervised Semantic Segmentation

Hoyong Kwon, Jaeseok Jeong, Sung-Hoon Yoon et al.

ECCV 2024
#8218

Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation

Hyunwoo Yu, Yubin Cho, Beoungwoo Kang et al.

ECCV 2024arXiv:2407.17261
#8219

AdaIFL: Adaptive Image Forgery Localization via a Dynamic and Importance-aware Transformer Network

Yuxi Li, Fuyuan Cheng, Wangbo Yu et al.

ECCV 2024
#8220

Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively

Haobo Yuan, Xiangtai Li, Chong Zhou et al.

ECCV 2024arXiv:2401.02955
#8221

Event-based Head Pose Estimation: Benchmark and Method

jiahui yuan, Hebei Li, Yansong Peng et al.

ECCV 2024
#8222

Robustness Tokens: Towards Adversarial Robustness of Transformers

Brian Pulfer, Yury Belousov, Slava Voloshynovskiy

ECCV 2024arXiv:2503.10191
#8223

EINet: Point Cloud Completion via Extrapolation and Interpolation

Pingping Cai, Canyu Zhang, LINGJIA SHI et al.

ECCV 2024
#8224

Bridging the Gap Between Human Motion and Action Semantics via Kinematics Phrases

Xinpeng Liu, Yong-Lu Li, AILING ZENG et al.

ECCV 2024arXiv:2310.04189
#8225

ReCON: Training-Free Acceleration for Text-to-Image Synthesis with Retrieval of Concept Prompt Trajectories

Chen-yi Lu, Shubham Agarwal, Mehrab Tanjim et al.

ECCV 2024
#8226

Free-ATM: Harnessing Free Attention Masks for Representation Learning on Diffusion-Generated Images

Junhao Zhang, Mutian Xu, Jay Zhangjie Wu et al.

ECCV 2024
#8227

Learning Unsigned Distance Functions from Multi-view Images with Volume Rendering Priors

Wen Yuan Zhang, Kanle Shi, Yushen Liu et al.

ECCV 2024
#8228

Assessing Sample Quality via the Latent Space of Generative Models

Jingyi Xu, Hieu Le, Dimitris Samaras

ECCV 2024arXiv:2407.15171
#8229

Responsible Visual Editing

Minheng Ni, Yeli Shen, Yabin Zhang et al.

ECCV 2024arXiv:2404.05580
#8230

Distributed Active Client Selection With Noisy Clients Using Model Association Scores

Kwang In Kim

ECCV 2024
#8231

SCPNet: Unsupervised Cross-modal Homography Estimation via Intra-modal Self-supervised Learning

Runmin Zhang, Jun Ma, Lun Luo et al.

ECCV 2024arXiv:2407.08148
#8232

Clearer Frames, Anytime: Resolving Velocity Ambiguity in Video Frame Interpolation

Zhihang Zhong, Gurunandan Krishnan, Xiao Sun et al.

ECCV 2024
#8233

MotionDirector: Motion Customization of Text-to-Video Diffusion Models

Rui Zhao, Yuchao Gu, Jay Zhangjie Wu et al.

ECCV 2024arXiv:2310.08465
#8234

ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction

Shaozhe Hao, Kai Han, Zhengyao Lv et al.

ECCV 2024arXiv:2407.07077
#8235

AttentionHand: Text-driven Controllable Hand Image Generation for 3D Hand Reconstruction in the Wild

Junho Park, Kyeongbo Kong, Suk-Ju Kang

ECCV 2024arXiv:2407.18034
#8236

OccGen: Generative Multi-modal 3D Occupancy Prediction for Autonomous Driving

Guoqing Wang, Zhongdao Wang, Pin Tang et al.

ECCV 2024arXiv:2404.15014
#8237

Probabilistic Image-Driven Traffic Modeling via Remote Sensing

Scott Workman, Armin Hadzic

ECCV 2024arXiv:2403.05521
#8238

VideoStudio: Generating Consistent-Content and Multi-Scene Videos

Fuchen Long, Zhaofan Qiu, Ting Yao et al.

ECCV 2024arXiv:2401.01256
#8239

Occupancy as Set of Points

Yiang Shi, Tianheng Cheng, Qian Zhang et al.

ECCV 2024arXiv:2407.04049
#8240

UAV First-Person Viewers Are Radiance Field Learners

Liqi Yan, Qifan Wang, Junhan Zhao et al.

ECCV 2024
#8241

Knowledge-enhanced Visual-Language Pretraining for Computational Pathology

Xiao Zhou, Xiaoman Zhang, Chaoyi Wu et al.

ECCV 2024arXiv:2404.09942
#8242

Pick-a-back: Selective Device-to-Device Knowledge Transfer in Federated Continual Learning

JinYi Yoon, HyungJune Lee

ECCV 2024
#8243

Situated Instruction Following

So Yeon Min, Xavier Puig, Devendra Singh Chaplot et al.

ECCV 2024arXiv:2407.12061
#8244

Holodepth: Programmable Depth-Varying Projection via Computer-Generated Holography

Dorian Chan, Matthew O'Toole, Sizhuo Ma et al.

ECCV 2024
#8245

Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators

Yifan Pu, Xia Zhuofan, Jiayi Guo et al.

ECCV 2024arXiv:2408.05710
#8246

Two-Stage Video Shadow Detection via Temporal-Spatial Adaption

Xin Duan, Yu Cao, Lei Zhu et al.

ECCV 2024
#8247

Semi-Supervised Video Desnowing Network via Temporal Decoupling Experts and Distribution-Driven Contrastive Regularization

Hongtao Wu, Yijun Yang, Angelica I Aviles-Rivero et al.

ECCV 2024arXiv:2410.07901
#8248

CLIP-DINOiser: Teaching CLIP a few DINO tricks for open-vocabulary semantic segmentation

Monika Wysoczanska, Oriane Siméoni, Michaël Ramamonjisoa et al.

ECCV 2024
#8249

FMBoost: Boosting Latent Diffusion with Flow Matching

Johannes Schusterbauer-Fischer, Ming Gui, Pingchuan Ma et al.

ECCV 2024
#8250

M^2Depth: Self-supervised Two-Frame Multi-camera Metric Depth Estimation

Yingshuang Zou, Yikang Ding, Xi Qiu et al.

ECCV 2024
#8251

FoundPose: Unseen Object Pose Estimation with Foundation Features

Evin Pınar Örnek, Yann Labbé, Bugra Tekin et al.

ECCV 2024arXiv:2311.18809
#8252

Diffusion Models as Data Mining Tools

Ioannis Siglidis, Aleksander Holynski, Alexei Efros et al.

ECCV 2024arXiv:2408.02752
#8253

SPHINX: A Mixer of Weights, Visual Embeddings and Image Scales for Multi-modal Large Language Models

Ziyi Lin, Dongyang Liu, Renrui Zhang et al.

ECCV 2024
#8254

Improving Adversarial Transferability via Model Alignment

Avery Ma, Amir-massoud Farahmand, Yangchen Pan et al.

ECCV 2024arXiv:2311.18495
#8255

RealGen: Retrieval Augmented Generation for Controllable Traffic Scenarios

Wenhao Ding, Yulong Cao, DING ZHAO et al.

ECCV 2024arXiv:2312.13303
#8256

Embodied Understanding of Driving Scenarios

Yunsong Zhou, Linyan Huang, Qingwen Bu et al.

ECCV 2024arXiv:2403.04593
#8257

Factorizing Text-to-Video Generation by Explicit Image Conditioning

Rohit Girdhar, Mannat Singh, Andrew Brown et al.

ECCV 2024arXiv:2311.10709
#8258

Computing the Lipschitz constant needed for fast scene recovery from CASSI measurements

Niels Chr. Overgaard, Anders Holst

ECCV 2024
#8259

DatasetNeRF: Efficient 3D-aware Data Factory with Generative Radiance Fields

Yu Chi, Fangneng Zhan, Sibo Wu et al.

ECCV 2024arXiv:2311.12063
#8260

Cut out the Middleman: Revisiting Pose-based Gait Recognition

YANG FU, Saihui Hou, Shibei Meng et al.

ECCV 2024
#8261

FedHARM: Harmonizing Model Architectural Diversity in Federated Learning

Anestis Kastellos, Athanasios Psaltis, Charalampos Z Patrikakis et al.

ECCV 2024
#8262

Thinking Outside the BBox: Unconstrained Generative Object Compositing

Gemma Canet Tarrés, Zhe Lin, Zhifei Zhang et al.

ECCV 2024arXiv:2409.04559
#8263

EAGLES: Efficient Accelerated 3D Gaussians with Lightweight EncodingS

Sharath Girish, Kamal Gupta, Abhinav Shrivastava

ECCV 2024arXiv:2312.04564
#8264

Caltech Aerial RGB-Thermal Dataset in the Wild

Connor Lee, Matthew Anderson, Nikhil Ranganathan et al.

ECCV 2024arXiv:2403.08997
#8265

UL-VIO: Ultra-lightweight Visual-Inertial Odometry with Noise Robust Test-time Adaptation

Jinho Park, Se Young Chun, Mingoo Seok

ECCV 2024arXiv:2409.13106
#8266

DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks

Sarah Jabbour, Gregory Kondas, Ella Kazerooni et al.

ECCV 2024arXiv:2407.14509
#8267

Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time

Sanjoy Chowdhury, Sayan Nag, Subhrajyoti Dasgupta et al.

ECCV 2024arXiv:2407.01851
#8268

BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion

Gwanghyun Kim, Hayeon Kim, Hoigi Seo et al.

ECCV 2024arXiv:2404.04544
#8269

MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning

Vishal Nedungadi, Ankit Kariryaa, Stefan Oehmcke et al.

ECCV 2024arXiv:2405.02771
#8270

All You Need is Your Voice: Emotional Face Representation with Audio Perspective for Emotional Talking Face Generation

Seongho Kim, Byung Cheol Song

ECCV 2024
#8271

POET: Prompt Offset Tuning for Continual Human Action Adaptation

Prachi Garg, Joseph K J, Vineeth N Balasubramanian et al.

ECCV 2024arXiv:2504.18059
#8272

TrafficNight : An Aerial Multimodal Benchmark For Nighttime Vehicle Surveillance

Guoxing Zhang, Yiming Liu, xiaoyu yang et al.

ECCV 2024
#8273

Loc3Diff: Local Diffusion for 3D Human Head Synthesis and Editing

Yushi Lan, Feitong Tan, Qiangeng Xu et al.

ECCV 2024
#8274

Learning to Distinguish Samples for Generalized Category Discovery

Fengxiang Yang, Pu Nan, Wenjing Li et al.

ECCV 2024
#8275

COM Kitchens: An Unedited Overhead-view Procedural Videos Dataset a Vision-Language Benchmark

Atsushi Hashimoto, Koki Maeda, Tosho Hirasawa et al.

ECCV 2024
#8276

WBP: Training-time Backdoor Attacks through Hardware-based Weight Bit Poisoning

Kunbei Cai, Zhenkai Zhang, Qian Lou et al.

ECCV 2024
#8277

Towards Dual Transparent Liquid Level Estimation in Biomedical Lab: Dataset, Methods and Practice

Xiayu Wang, Ke Ma, Ruiyun Zhong et al.

ECCV 2024
#8278

Delving into Adversarial Robustness on Document Tampering Localization

Huiru Shao, Zhuang Qian, Kaizhu Huang et al.

ECCV 2024
#8279

VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks

Xiangxiang Chu, Jianlin Su, Bo Zhang et al.

ECCV 2024arXiv:2403.00522
#8280

HVCLIP: High-dimensional Vector in CLIP for Unsupervised Domain Adaptation

Noranart Vesdapunt, Kah Kuen Fu, Yue Wu et al.

ECCV 2024
#8281

Improving 3D Semi-supervised Learning by Effectively Utilizing All Unlabelled Data

Sneha Paul, Zachary Patterson, Nizar Bouguila

ECCV 2024arXiv:2409.13977
#8282

Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and Retention

Zuyao Chen, Jinlin Wu, Zhen Lei et al.

ECCV 2024arXiv:2311.10988
#8283

MART: MultiscAle Relational Transformer Networks for Multi-agent Trajectory Prediction

Seongju Lee, Junseok Lee, Yeonguk Yu et al.

ECCV 2024arXiv:2407.21635
#8284

SPARO: Selective Attention for Robust and Compositional Transformer Encodings for Vision

Ankit Vani, Bac Nguyen, Samuel Lavoie et al.

ECCV 2024arXiv:2404.15721
#8285

Leveraging Text Localization for Scene Text Removal via Text-aware Masked Image Modeling

Zixiao Wang, Hongtao Xie, YuXin Wang et al.

ECCV 2024arXiv:2409.13431
#8286

Towards Robust Event-based Networks for Nighttime via Unpaired Day-to-Night Event Translation

Yuhwan Jeong, Hoonhee Cho, Kuk-Jin Yoon

ECCV 2024arXiv:2407.10703
#8287

CLAMP-ViT: Contrastive Data-Free Learning for Adaptive Post-Training Quantization of ViTs

Akshat Ramachandran, Souvik Kundu, Tushar Krishna

ECCV 2024arXiv:2407.05266
#8288

A Riemannian Approach for Spatiotemporal Analysis and Generation of 4D Tree-shaped Structures

Tahmina Khanam, Mohammed Bennamoun, Guan Wang et al.

ECCV 2024arXiv:2408.12443
#8289

Robustness Preserving Fine-tuning using Neuron Importance

Guangrui Li, Rahul Duggal, Aaditya Singh et al.

ECCV 2024
#8290

ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation

Mengcheng Lan, Chaofeng Chen, Yiping Ke et al.

ECCV 2024arXiv:2408.04883
#8291

Similarity of Neural Architectures using Adversarial Attack Transferability

Jaehui Hwang, Dongyoon Han, Byeongho Heo et al.

ECCV 2024arXiv:2210.11407
#8292

Dual-Rain: Video Rain Removal using Assertive and Gentle Teachers

Tingting Chen, Beibei Lin, Yeying Jin et al.

ECCV 2024
#8293

Unsupervised Variational Translator for Bridging Image Restoration and High-Level Vision Tasks

Jiawei Wu, Zhi Jin

ECCV 2024arXiv:2408.08149
#8294

Fast Point Cloud Geometry Compression with Context-based Residual Coding and INR-based Refinement

Hao Xu, Xi Zhang, Xiaolin Wu

ECCV 2024arXiv:2408.02966
#8295

Scene-Conditional 3D Object Stylization and Composition

Jinghao Zhou, Tomas Jakab, Philip Torr et al.

ECCV 2024arXiv:2312.12419
#8296

Forbes: Face Obfuscation Rendering via Backpropagation Refinement Scheme

Jintae Kim, Seungwon Yang, Seong-Gyun Jeong et al.

ECCV 2024arXiv:2407.14170
#8297

Information Bottleneck Based Data Correction in Continual Learning

Shuai Chen, mingyi zhang, Junge Zhang et al.

ECCV 2024
#8298

SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning

Bac Nguyen, Stefan Uhlich, Fabien Cardinaux et al.

ECCV 2024arXiv:2407.03036
#8299

Generalizing to Unseen Domains via Text-guided Augmentation

Daiqing Qi, Handong Zhao, Aidong Zhang et al.

ECCV 2024
#8300

Contextual Correspondence Matters: Bidirectional Graph Matching for Video Summarization

yunzuo zhang, Yameng Liu

ECCV 2024
#8301

Lost in Translation: Latent Concept Misalignment in Text-to-Image Diffusion Models

Juntu Zhao, Junyu Deng, Yixin Ye et al.

ECCV 2024arXiv:2408.00230
#8302

Adaptive Multi-head Contrastive Learning

Lei Wang, Piotr Koniusz, Tom Gedeon et al.

ECCV 2024arXiv:2310.05615
#8303

Easing 3D Pattern Reasoning with Side-view Features for Semantic Scene Completion

Linxi Huan, Mingyue Dong, Linwei Yue et al.

ECCV 2024
#8304

Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression

Animesh Sinha, Bo Sun, Anmol Kalia et al.

ECCV 2024arXiv:2311.10794
#8305

High-Quality Mesh Blendshape Generation from Face Videos via Neural Inverse Rendering

Xin Ming, Jiawei Li, Jingwang Ling et al.

ECCV 2024arXiv:2401.08398
#8306

Early Anticipation of Driving Maneuvers

Abdul Wasi Lone, Shankar Gangisetty, Shyam Nandan et al.

ECCV 2024
#8307

SG-NeRF: Neural Surface Reconstruction with Scene Graph Optimization

Yiyang Chen, Siyan Dong, Xulong Wang et al.

ECCV 2024arXiv:2407.12667
#8308

InfoNorm: Mutual Information Shaping of Normals for Sparse-View Reconstruction

Xulong Wang, Siyan Dong, Youyi Zheng et al.

ECCV 2024arXiv:2407.12661
#8309

DreamReward: Aligning Human Preference in Text-to-3D Generation

junliang ye, Fangfu Liu, Qixiu Li et al.

ECCV 2024
#8310

CadVLM: Bridging Language and Vision in the Generation of Parametric CAD Sketches

Sifan Wu, Amir Hosein Khasahmadi, Mor Katz et al.

ECCV 2024arXiv:2409.17457
#8311

Towards Image Ambient Lighting Normalization

Florin-Alexandru Vasluianu, Tim Seizinger, Zongwei Wu et al.

ECCV 2024arXiv:2403.18730
#8312

FedHide: Federated Learning by Hiding in the Neighbors

Hyunsin Park, Sungrack Yun

ECCV 2024arXiv:2409.07808
#8313

HoloADMM: High-Quality Holographic Complex Field Recovery

Mazen Mel, Paul Springer, Pietro Zanuttigh et al.

ECCV 2024
#8314

Towards Open-World Object-based Anomaly Detection via Self-Supervised Outlier Synthesis

Brian Isaac Medina, Yona Falinie Abdul Gaus, Neelanjan Bhowmik et al.

ECCV 2024arXiv:2407.15763
#8315

LPViT: Low-Power Semi-structured Pruning for Vision Transformers

KAIXIN Xu, Zhe Wang, Chunyun Chen et al.

ECCV 2024arXiv:2407.02068
#8316

Weighted Ensemble Models Are Strong Continual Learners

Imad Eddine Marouf, Subhankar Roy, Enzo Tartaglione et al.

ECCV 2024arXiv:2312.08977
#8317

GGRt: Towards Generalizable 3D Gaussians without Pose Priors in Real-Time

Hao Li, Yuanyuan Gao, Dingwen Zhang et al.

ECCV 2024
#8318

Learning Equilibrium Transformation for Gamut Expansion and Color Restoration

JUN XIAO, Changjian Shui, Zhi-Song Liu et al.

ECCV 2024
#8319

Physics-informed Knowledge Transfer for Underwater Monocular Depth Estimation

Jinghe Yang, Mingming Gong, Ye Pu

ECCV 2024
#8320

Robust Nearest Neighbors for Source-Free Domain Adaptation under Class Distribution Shift

Antonio Tejero-de-Pablos, Riku Togashi, Mayu Otani et al.

ECCV 2024
#8321

Chains of Diffusion Models

Yanheng Wei, Lianghua Huang, Zhi-Fan Wu et al.

ECCV 2024
#8322

Learning Neural Deformation Representation for 4D Dynamic Shape Generation

Gyojin Han, Jiwan Hur, Jaehyun Choi et al.

ECCV 2024
#8323

LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models

Yabin Zhang, Wenjie Zhu, Chenhang He et al.

ECCV 2024arXiv:2407.08966
#8324

Leveraging Representations from Intermediate Encoder-blocks for Synthetic Image Detection

Christos Koutlis, Symeon Papadopoulos

ECCV 2024arXiv:2402.19091
#8325

Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance

Donghoon Ahn, Hyoungwon Cho, Jaewon Min et al.

ECCV 2024arXiv:2403.17377
#8326

Oulu Remote-photoplethysmography Physical Domain Attacks Database (ORPDAD)

Marko Savic, Guoying Zhao

ECCV 2024
#8327

DoubleTake: Geometry Guided Depth Estimation

Mohamed Sayed, Filippo Aleotti, Jamie Watson et al.

ECCV 2024arXiv:2406.18387
#8328

Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition

Sergio Izquierdo, Javier Civera

ECCV 2024arXiv:2407.02422
#8329

TAG: Text Prompt Augmentation for Zero-Shot Out-of-Distribution Detection

Xixi Liu, Christopher Zach

ECCV 2024
#8330

Continual Learning and Unknown Object Discovery in 3D Scenes via Self-Distillation

Mohamed El Amine Boudjoghra, Jean Lahoud, Salman Khan et al.

ECCV 2024
#8331

Lost and Found: Overcoming Detector Failures in Online Multi-Object Tracking

Lorenzo Vaquero, Yihong XU, Xavier Alameda-Pineda et al.

ECCV 2024arXiv:2407.10151
#8332

A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment

Tianhe Wu, Kede Ma, Jie Liang et al.

ECCV 2024arXiv:2403.10854
#8333

DreamScene: 3D Gaussian-based Text-to-3D Scene Generation via Formation Pattern Sampling

Haoran Li, Haolin Shi, Wenli Zhang et al.

ECCV 2024arXiv:2404.03575
#8334

Can Textual Semantics Mitigate Sounding Object Segmentation Preference?

Yaoting Wang, Peiwen Sun, Yuanchao Li et al.

ECCV 2024arXiv:2407.10947
#8335

MLPHand: Real Time Multi-View 3D Hand Reconstruction via MLP Modeling

Jian Yang, Jiakun Li, Guoming Li et al.

ECCV 2024
#8336

How Far Can a 1-Pixel Camera Go? Solving Vision Tasks using Photoreceptors and Computationally Designed Visual Morphology

Andrei Atanov, Rishubh Singh, Jiawei Fu et al.

ECCV 2024
#8337

MONTRAGE: Monitoring Training for Attribution of Generative Diffusion Models

Jonathan Brokman, Omer Hofman, Roman Vainshtein et al.

ECCV 2024
#8338

AdversariaLeak: External Information Leakage Attack Using Adversarial Samples on Face Recognition Systems

Roye Katzav, Amit Giloni, Edita Grolman et al.

ECCV 2024
#8339

Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label Classifier

Prantik Howlader, Srijan Das, Hieu Le et al.

ECCV 2024arXiv:2407.04036
#8340

Spiking Wavelet Transformer

Yuetong Fang, Ziqing Wang, Lingfeng Zhang et al.

ECCV 2024arXiv:2403.11138
#8341

WAVE: Warping DDIM Inversion Features for Zero-shot Text-to-Video Editing

Yutang Feng, Sicheng Gao, Yuxiang Bao et al.

ECCV 2024
#8342

COD: Learning Conditional Invariant Representation for Domain Adaptation Regression

Hao-Ran Yang, Chuan-Xian Ren, You-Wei Luo

ECCV 2024arXiv:2408.06638
#8343

RANRAC: Robust Neural Scene Representations via Random Ray Consensus

Benno Buschmann, Andreea Dogaru, Elmar Eisemann et al.

ECCV 2024arXiv:2312.09780
#8344

LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model

Runhui Huang, Kaixin Cai, Jianhua Han et al.

ECCV 2024arXiv:2403.11929
#8345

Few-shot Defect Image Generation based on Consistency Modeling

Qingfeng Shi, Jing Wei, Fei Shen et al.

ECCV 2024arXiv:2408.00372
#8346

CLIP-DPO: Vision-Language Models as a Source of Preference for Fixing Hallucinations in LVLMs

Yassine Ouali, Adrian Bulat, Brais Martinez et al.

ECCV 2024arXiv:2408.10433
#8347

Get Your Embedding Space in Order: Domain-Adaptive Regression for Forest Monitoring

Sizhuo Li, Dimitri Gominski, Martin Brandt et al.

ECCV 2024arXiv:2405.00514
#8348

Curved Diffusion: A Generative Model With Optical Geometry Control

Andrey Voynov, Amir Hertz, Moab Arar et al.

ECCV 2024arXiv:2311.17609
#8349

AnimateMe: 4D Facial Expressions via Diffusion Models

Dimitrios Gerogiannis, Foivos Paraperas Papantoniou, Rolandos Alexandros Potamias et al.

ECCV 2024arXiv:2403.17213
#8350

LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis

Kevin Xie, Tianshi Cao, Jonathan P Lorraine et al.

ECCV 2024arXiv:2403.15385
#8351

iNeMo: Incremental Neural Mesh Models for Robust Class-Incremental Learning

Tom Fischer, Yaoyao Liu, Artur Jesslen et al.

ECCV 2024arXiv:2407.09271
#8352

Pose Guided Fine-Grained Sign Language Video Generation

Tongkai Shi, Lianyu Hu, Fanhua Shang et al.

ECCV 2024
#8353

SeA: Semantic Adversarial Augmentation for Last Layer Features from Unsupervised Representation Learning

Qi Qian, Yuanhong Xu, JUHUA HU

ECCV 2024arXiv:2408.13351
#8354

3D Reconstruction of Objects in Hands without Real World 3D Supervision

Aditya Prakash, Matthew Chang, Matthew Jin et al.

ECCV 2024arXiv:2305.03036
#8355

To Supervise or Not to Supervise: Understanding and Addressing the Key Challenges of Point Cloud Transfer Learning

Souhail Hadgi, Lei Li, Maks Ovsjanikov

ECCV 2024arXiv:2403.17869
#8356

Optimization-based Uncertainty Attribution Via Learning Informative Perturbations

Hanjing Wang, Bashirul Azam Biswas, Qiang Ji

ECCV 2024
#8357

A Diffusion Model for Simulation Ready Coronary Anatomy with Morpho-skeletal Control

Karim Kadry, Shreya Gupta, Jonas Sogbadji et al.

ECCV 2024arXiv:2407.15631
#8358

Evaluating the Adversarial Robustness of Semantic Segmentation: Trying Harder Pays Off

Levente Ferenc Halmosi, Bálint Mohos, Márk Jelasity

ECCV 2024arXiv:2407.09150
#8359

AdaDiff: Accelerating Diffusion Models through Step-Wise Adaptive Computation

Shengkun Tang, Yaqing Wang, Caiwen Ding et al.

ECCV 2024arXiv:2309.17074
#8360

Object-Aware Query Perturbation for Cross-Modal Image-Text Retrieval

Naoya Sogi, Takashi Shibata, Makoto Terao

ECCV 2024arXiv:2407.12346
#8361

Ex2Eg-MAE: A Framework for Adaptation of Exocentric Video Masked Autoencoders for Egocentric Social Role Understanding

Minh Tran, Yelin Kim, Che-Chun Su et al.

ECCV 2024
#8362

Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning

Thanh Thong Nguyen, Yi Bin, Xiaobao Wu et al.

ECCV 2024arXiv:2407.03788
#8363

GRiT: A Generative Region-to-text Transformer for Object Understanding

Jialian Wu, Jianfeng Wang, Zhengyuan Yang et al.

ECCV 2024arXiv:2212.00280
#8364

LRSLAM: Low-rank Representation of Signed Distance Fields in Dense Visual SLAM System

Hongbeen Park, Minjeong Park, Giljoo Nam et al.

ECCV 2024arXiv:2506.10567
#8365

Learning Representation for Multitask Learning through Self-Supervised Auxiliary Learning

Seokwon Shin, Hyungrok Do, Youngdoo Son

ECCV 2024
#8366

BAGS: Blur Agnostic Gaussian Splatting through Multi-Scale Kernel Modeling

Cheng Peng, Yutao Tang, Yifan Zhou et al.

ECCV 2024arXiv:2403.04926
#8367

DPA-Net: Structured 3D Abstraction from Sparse Views via Differentiable Primitive Assembly

Fenggen Yu, Yiming Qian, Xu Zhang et al.

ECCV 2024arXiv:2404.00875
#8368

An Empirical Study and Analysis of Text-to-Image Generation Using Large Language Model-Powered Textual Representation

Zhiyu Tan, Mengping Yang, Luozheng Qin et al.

ECCV 2024arXiv:2405.12914
#8369

Generalizable Symbolic Optimizer Learning

Xiaotian Song, Peng Zeng, Yanan Sun et al.

ECCV 2024
#8370

On the Vulnerability of Skip Connections to Model Inversion Attacks

Jun Hao Koh, Sy-Tuyen Ho, Ngoc-Bao Nguyen et al.

ECCV 2024arXiv:2409.01696
#8371

Reinforcement Learning via Auxillary Task Distillation

Abhinav Narayan Harish, Larry Heck, Josiah P Hanna et al.

ECCV 2024
#8372

Plug and Play: A Representation Enhanced Domain Adapter for Collaborative Perception

TIANYOU LUO, Quan Yuan, Yuchen Xia et al.

ECCV 2024
#8373

Follow the Rules: Reasoning for Video Anomaly Detection with Large Language Models

Yuchen Yang, Kwonjoon Lee, Behzad Dariush et al.

ECCV 2024arXiv:2407.10299
#8374

Motion Keyframe Interpolation for Any Human Skeleton using Point Cloud-based Human Motion Data Homogenisation

Clinton Mo, Kun Hu, Chengjiang Long et al.

ECCV 2024
#8375

Improving Hyperbolic Representations via Gromov-Wasserstein Regularization

yifei Yang, Wonjun Lee, Dongmian Zou et al.

ECCV 2024arXiv:2407.10495
#8376

Dense Hand-Object(HO) GraspNet with Full Grasping Taxonomy and Dynamics

Woojin Cho, Jihyun Lee, Minjae Yi et al.

ECCV 2024arXiv:2409.04033
#8377

Depth-Aware Blind Image Decomposition for Real-World Adverse Weather Recovery

Chao Wang, Zhedong Zheng, Ruijie Quan et al.

ECCV 2024
#8378

DreamSampler: Unifying Diffusion Sampling and Score Distillation for Image Manipulation

Jeongsol Kim, Geon Yeong Park, Jong Chul Ye

ECCV 2024arXiv:2403.11415
#8379

PreciseControl: Enhancing Text-To-Image Diffusion Models with Fine-Grained Attribute Control

Rishubh Parihar, Sachidanand VS, Sabariswaran Mani et al.

ECCV 2024arXiv:2408.05083
#8380

SRPose: Two-view Relative Pose Estimation with Sparse Keypoints

Rui Yin, Yulun Zhang, Zherong Pan et al.

ECCV 2024arXiv:2407.08199
#8381

Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models

Xiaoshi Wu, Yiming Hao, Manyuan Zhang et al.

ECCV 2024arXiv:2405.00760
#8382

Efficient Vision Transformers with Partial Attention

Xuan-Thuy Vo, Duy-Linh Nguyen, Adri Priadana et al.

ECCV 2024
#8383

Generalized Coverage for More Robust Low-Budget Active Learning

Wonho Bae, Junhyug Noh, Danica J. Sutherland

ECCV 2024arXiv:2407.12212
#8384

Kinetic Typography Diffusion Model

Seonmi Park, Inhwan Bae, Seunghyun Shin et al.

ECCV 2024arXiv:2407.10476
#8385

R.A.C.E.: Robust Adversarial Concept Erasure for Secure Text-to-Image Diffusion Model

Changhoon Kim, Kyle Min, Yezhou Yang

ECCV 2024arXiv:2405.16341
#8386

Efficient 3D-Aware Facial Image Editing via Attribute-Specific Prompt Learning

Amandeep Kumar, Muhammad Awais, Sanath Narayan et al.

ECCV 2024arXiv:2406.04413
#8387

Embracing Events and Frames with Hierarchical Feature Refinement Network for Object Detection

Hu Cao, Zehua Zhang, Yan Xia et al.

ECCV 2024arXiv:2407.12582
#8388

Unsupervised Representation Learning by Balanced Self Attention Matching

Daniel Shalam, Simon Korman

ECCV 2024arXiv:2408.02014
#8389

Identity-Consistent Diffusion Network for Grading Knee Osteoarthritis Progression in Radiographic Imaging

Wenhua Wu, Kun Hu, Wenxi Yue et al.

ECCV 2024arXiv:2407.21381
#8390

Make-Your-3D: Fast and Consistent Subject-Driven 3D Content Generation

Fangfu Liu, Hanyang Wang, Weiliang Chen et al.

ECCV 2024arXiv:2403.09625
#8391

Teach CLIP to Develop a Number Sense for Ordinal Regression

Yao DU, Qiang Zhai, Weihang Dai et al.

ECCV 2024arXiv:2408.03574
#8392

Compact 3D Scene Representation via Self-Organizing Gaussian Grids

Wieland Morgenstern, Florian Barthel, Anna Hilsmann et al.

ECCV 2024arXiv:2312.13299
#8393

Linking in Style: Understanding learned features in deep learning models

Maren Wehrheim, Pamela Osuna Vargas, Matthias Kaschube

ECCV 2024arXiv:2409.16865
#8394

Instant Uncertainty Calibration of NeRFs Using a Meta-Calibrator

Niki Amini-Naieni, Tomas Jakab, Andrea Vedaldi et al.

ECCV 2024arXiv:2312.02350
#8395

SHIC: Shape-Image Correspondences with no Keypoint Supervision

Aleksandar Shtedritski, Christian Rupprecht, Andrea Vedaldi

ECCV 2024arXiv:2407.18907
#8396

SceneTeller: Language-to-3D Scene Generation

Basak Melis Ocal, Maxim Tatarchenko, Sezer Karaoglu et al.

ECCV 2024
#8397

MagMax: Leveraging Model Merging for Seamless Continual Learning

Daniel Marczak, Bartlomiej Twardowski, Tomasz Trzcinski et al.

ECCV 2024arXiv:2407.06322
#8398

Debiasing surgeon: fantastic weights and how to find them

Remi Nahon, Ivan Luiz De Moura Matos, Van-Tam Nguyen et al.

ECCV 2024arXiv:2403.14200
#8399

Spline-based Transformers

Prashanth Chandran, Agon Serifi, Markus Gross et al.

ECCV 2024arXiv:2504.02797
#8400

Efficient NeRF Optimization - Not All Samples Remain Equally Hard

Juuso Korhonen, Goutham Rangu, Hamed Rezazadegan Tavakoli et al.

ECCV 2024arXiv:2408.03193