Most Cited ECCV "semantic diversity" Papers

2,387 papers found • Page 12 of 12

#2201

Plain-Det: A Plain Multi-Dataset Object Detector

cheng Shi, yuchen zhu, Sibei Yang

ECCV 2024arXiv:2407.10083
#2202

Lost in Translation: Modern Neural Networks Still Struggle With Small Realistic Image Transformations

Ofir Shifman, Yair Weiss

ECCV 2024arXiv:2404.07153
#2203

m&m’s: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks

Zixian Ma, Weikai Huang, Jieyu Zhang et al.

ECCV 2024arXiv:2403.11085
#2204

SENC: Handling Self-collision in Neural Cloth Simulation

Zhouyingcheng Liao, Sinan Wang, Taku Komura

ECCV 2024arXiv:2407.12479
#2205

SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding

Baoxiong Jia, Yixin Chen, Huangyue Yu et al.

ECCV 2024arXiv:2401.09340
#2206

Convex Relaxations for Manifold-Valued Markov Random Fields with Approximation Guarantees

Robin Kenis, Emanuel Laude, Panagiotis Patrinos

ECCV 2024
#2207

ReLoo: Reconstructing Humans Dressed in Loose Garments from Monocular Video in the Wild

Chen Guo, Tianjian Jiang, Manuel Kaufmann et al.

ECCV 2024arXiv:2409.15269
#2208

Controlling the World by Sleight of Hand

Sruthi Sudhakar, Ruoshi Liu, Basile Van Hoorick et al.

ECCV 2024arXiv:2408.07147
#2209

Omni-Recon: Harnessing Image-based Rendering for General-Purpose Neural Radiance Fields

Yonggan Fu, Huaizhi Qu, Zhifan Ye et al.

ECCV 2024arXiv:2403.11131
#2210

Gated Temporal Diffusion for Stochastic Long-term Dense Anticipation

Olga Zatsarynna, Emad Bahrami, Yazan Abu Farha et al.

ECCV 2024arXiv:2407.11954
#2211

Scalar Function Topology Divergence: Comparing Topology of 3D Objects

Ilya Trofimov, Daria Voronkova, Eduard Tulchinskii et al.

ECCV 2024arXiv:2407.08364
#2212

Pseudo-Labelling Should Be Aware of Disguising Channel Activations

Changrui Chen, Kurt Debattista, Jungong Han

ECCV 2024
#2213

See and Think: Embodied Agent in Virtual Environment

Zhonghan Zhao, Xuan Wang, Wenhao Chai et al.

ECCV 2024arXiv:2311.15209
#2214

ScanReason: Empowering 3D Visual Grounding with Reasoning Capabilities

CHENMING ZHU, Tai Wang, Wenwei Zhang et al.

ECCV 2024arXiv:2407.01525
#2215

Salience-Based Adaptive Masking: Revisiting Token Dynamics for Enhanced Pre-training

Hyesong Choi, Hyejin Park, Kwang Moo Yi et al.

ECCV 2024arXiv:2404.08327
#2216

Causal Subgraphs and Information Bottlenecks: Redefining OOD Robustness in Graph Neural Networks

Weizhi An, Wenliang Zhong, Feng Jiang et al.

ECCV 2024
#2217

4D Contrastive Superflows are Dense 3D Representation Learners

Xiang Xu, Lingdong Kong, Hui Shuai et al.

ECCV 2024arXiv:2407.06190
#2218

PISR: Polarimetric Neural Implicit Surface Reconstruction for Textureless and Specular Objects

Guangcheng Chen, Yicheng He, Li He et al.

ECCV 2024arXiv:2409.14331
#2219

Reconstruction and Simulation of Elastic Objects with Spring-Mass 3D Gaussians

Licheng Zhong, Hong-Xing Yu, Jiajun Wu et al.

ECCV 2024arXiv:2403.09434
#2220

Revisiting Domain-Adaptive Object Detection in Adverse Weather by the Generation and Composition of High-Quality Pseudo-Labels

Rui Zhao, Huibin Yan, Shuoyao Wang

ECCV 2024
#2221

GTPT: Group-based Token Pruning Transformer for Efficient Human Pose Estimation

Haonan Wang, Jie Liu, Jie Tang et al.

ECCV 2024arXiv:2407.10756
#2222

Energy-induced Explicit quantification for Multi-modality MRI fusion

Xiaoming Qi, Yuan Zhang, Tong Wang et al.

ECCV 2024
#2223

LoA-Trans: Enhancing Visual Grounding by Location-Aware Transformers

Ziling Huang, Shin’ichi Satoh

ECCV 2024
#2224

Self-supervised Shape Completion via Involution and Implicit Correspondences

Mengya Liu, Ajad Chhatkuli, Janis Postels et al.

ECCV 2024arXiv:2409.15939
#2225

3D Single-object Tracking in Point Clouds with High Temporal Variation

Qiao Wu, Kun Sun, Pei An et al.

ECCV 2024arXiv:2408.02049
#2226

RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation

Luis Li, Hubert P. H. Shum, Toby P Breckon

ECCV 2024arXiv:2407.10159
#2227

Defect Spectrum: A Granular Look of Large-scale Defect Datasets with Rich Semantics

Shuai Yang, ZhiFei Chen, Pengguang Chen et al.

ECCV 2024arXiv:2310.17316
#2228

Robust Fitting on a Gate Quantum Computer

Frances Yang, Michele Sasdelli, Tat-Jun Chin

ECCV 2024arXiv:2409.02006
#2229

Correspondences of the Third Kind: Camera Pose Estimation from Object Reflection

Kohei Yamashita, Vincent Lepetit, Ko Nishino

ECCV 2024arXiv:2312.04527
#2230

Shapefusion: 3D localized human diffusion models

Rolandos Alexandros Potamias, Michael Tarasiou, Stylianos Ploumpis et al.

ECCV 2024
#2231

Rejection Sampling IMLE: Designing Priors for Better Few-Shot Image Synthesis

Chirag Vashist, Shichong Peng, Ke Li

ECCV 2024arXiv:2409.17439
#2232

Occluded Gait Recognition with Mixture of Experts: An Action Detection Perspective

Panjian Huang, Yunjie Peng, Saihui Hou et al.

ECCV 2024
#2233

Semi-Supervised Teacher-Reference-Student Architecture for Action Quality Assessment

Wulian Yun, Mengshi Qi, Fei Peng et al.

ECCV 2024arXiv:2407.19675
#2234

3D Congealing: 3D-Aware Image Alignment in the Wild

Yunzhi Zhang, Zizhang Li, Amit Raj et al.

ECCV 2024arXiv:2404.02125
#2235

GlobalPointer: Large-Scale Plane Adjustment with Bi-Convex Relaxation

Bangyan Liao, Zhenjun Zhao, Lu Chen et al.

ECCV 2024arXiv:2407.13537
#2236

Superpixel-informed Implicit Neural Representation for Multi-Dimensional Data

Jiayi Li, Xi-Le Zhao, Jian-Li Wang et al.

ECCV 2024arXiv:2411.11356
#2237

SceneScript: Reconstructing Scenes With An Autoregressive Structured Language Model

Armen Avetisyan, Christopher Xie, Henry Howard-Jenkins et al.

ECCV 2024arXiv:2403.13064
#2238

Unsupervised Exposure Correction

Ruodai Cui, Li Niu, Guosheng Hu

ECCV 2024arXiv:2507.17252
#2239

Implicit Filtering for Learning Neural Signed Distance Functions from 3D Point Clouds

Shengtao Li, Ge Gao, Yudong Liu et al.

ECCV 2024arXiv:2407.13342
#2240

MMBENCH: Is Your Multi-Modal Model an All-around Player?

Yuan Liu, Haodong Duan, Yuanhan Zhang et al.

ECCV 2024arXiv:2307.06281
#2241

HyTAS: A Hyperspectral Image Transformer Architecture Search Benchmark and Analysis

Fangqin Zhou, Mert Kilickaya, Joaquin Vanschoren et al.

ECCV 2024arXiv:2407.16269
#2242

LiDAR-Event Stereo Fusion with Hallucinations

Luca Bartolomei, Matteo Poggi, Andrea Conti et al.

ECCV 2024arXiv:2408.04633
#2243

Dual-Path Adversarial Lifting for Domain Shift Correction in Online Test-time Adaptation

Yushun Tang, Shuoshuo Chen, Zhihe Lu et al.

ECCV 2024arXiv:2408.13983
#2244

Rethinking and Improving Visual Prompt Selection for In-Context Learning Segmentation Framework

Wei Suo, Lanqing Lai, Mengyang Sun et al.

ECCV 2024
#2245

Cross-Input Certified Training for Universal Perturbations

Changming Xu, Gagandeep Singh

ECCV 2024arXiv:2405.09176
#2246

Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation

Juncheng Ma, Peiwen Sun, Yaoting Wang et al.

ECCV 2024arXiv:2407.11820
#2247

QUAR-VLA: Vision-Language-Action Model for Quadruped Robots

Pengxiang Ding, Han Zhao, Wenjie Zhang et al.

ECCV 2024arXiv:2312.14457
#2248

ShareGPT4V: Improving Large Multi-Modal Models with Better Captions

Lin Chen, Jinsong Li, Xiaoyi Dong et al.

ECCV 2024arXiv:2311.12793
#2249

Textual-Visual Logic Challenge: Understanding and Reasoning in Text-to-Image Generation

Peixi Xiong, Michael A Kozuch, Nilesh Jain

ECCV 2024
#2250

When and How do negative prompts take effect?

Yuanhao Ban, Ruochen Wang, Tianyi Zhou et al.

ECCV 2024
#2251

Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation

Yunhao Gou, Kai Chen, Zhili LIU et al.

ECCV 2024arXiv:2403.09572
#2252

Training A Small Emotional Vision Language Model for Visual Art Comprehension

Jing Zhang, Liang Zheng, Meng Wang et al.

ECCV 2024arXiv:2403.11150
#2253

Spectral Subsurface Scattering for Material Classification

Haejoon Lee, Aswin C. Sankaranarayanan

ECCV 2024
#2254

LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model

Runhui Huang, Kaixin Cai, Jianhua Han et al.

ECCV 2024arXiv:2403.11929
#2255

RANRAC: Robust Neural Scene Representations via Random Ray Consensus

Benno Buschmann, Andreea Dogaru, Elmar Eisemann et al.

ECCV 2024arXiv:2312.09780
#2256

COD: Learning Conditional Invariant Representation for Domain Adaptation Regression

Hao-Ran Yang, Chuan-Xian Ren, You-Wei Luo

ECCV 2024arXiv:2408.06638
#2257

Few-shot Defect Image Generation based on Consistency Modeling

Qingfeng Shi, Jing Wei, Fei Shen et al.

ECCV 2024arXiv:2408.00372
#2258

CLIP-DPO: Vision-Language Models as a Source of Preference for Fixing Hallucinations in LVLMs

Yassine Ouali, Adrian Bulat, Brais Martinez et al.

ECCV 2024arXiv:2408.10433
#2259

WAVE: Warping DDIM Inversion Features for Zero-shot Text-to-Video Editing

Yutang Feng, Sicheng Gao, Yuxiang Bao et al.

ECCV 2024
#2260

Spiking Wavelet Transformer

Yuetong Fang, Ziqing Wang, Lingfeng Zhang et al.

ECCV 2024arXiv:2403.11138
#2261

Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label Classifier

Prantik Howlader, Srijan Das, Hieu Le et al.

ECCV 2024arXiv:2407.04036
#2262

AdversariaLeak: External Information Leakage Attack Using Adversarial Samples on Face Recognition Systems

Roye Katzav, Amit Giloni, Edita Grolman et al.

ECCV 2024
#2263

Get Your Embedding Space in Order: Domain-Adaptive Regression for Forest Monitoring

Sizhuo Li, Dimitri Gominski, Martin Brandt et al.

ECCV 2024arXiv:2405.00514
#2264

MONTRAGE: Monitoring Training for Attribution of Generative Diffusion Models

Jonathan Brokman, Omer Hofman, Roman Vainshtein et al.

ECCV 2024
#2265

Curved Diffusion: A Generative Model With Optical Geometry Control

Andrey Voynov, Amir Hertz, Moab Arar et al.

ECCV 2024arXiv:2311.17609
#2266

How Far Can a 1-Pixel Camera Go? Solving Vision Tasks using Photoreceptors and Computationally Designed Visual Morphology

Andrei Atanov, Rishubh Singh, Jiawei Fu et al.

ECCV 2024
#2267

MLPHand: Real Time Multi-View 3D Hand Reconstruction via MLP Modeling

Jian Yang, Jiakun Li, Guoming Li et al.

ECCV 2024
#2268

Can Textual Semantics Mitigate Sounding Object Segmentation Preference?

Yaoting Wang, Peiwen Sun, Yuanchao Li et al.

ECCV 2024arXiv:2407.10947
#2269

DreamScene: 3D Gaussian-based Text-to-3D Scene Generation via Formation Pattern Sampling

Haoran Li, Haolin Shi, Wenli Zhang et al.

ECCV 2024arXiv:2404.03575
#2270

AnimateMe: 4D Facial Expressions via Diffusion Models

Dimitrios Gerogiannis, Foivos Paraperas Papantoniou, Rolandos Alexandros Potamias et al.

ECCV 2024arXiv:2403.17213
#2271

A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment

Tianhe Wu, Kede Ma, Jie Liang et al.

ECCV 2024arXiv:2403.10854
#2272

Lost and Found: Overcoming Detector Failures in Online Multi-Object Tracking

Lorenzo Vaquero, Yihong XU, Xavier Alameda-Pineda et al.

ECCV 2024arXiv:2407.10151
#2273

LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis

Kevin Xie, Tianshi Cao, Jonathan P Lorraine et al.

ECCV 2024arXiv:2403.15385
#2274

Continual Learning and Unknown Object Discovery in 3D Scenes via Self-Distillation

Mohamed El Amine Boudjoghra, Jean Lahoud, Salman Khan et al.

ECCV 2024
#2275

TAG: Text Prompt Augmentation for Zero-Shot Out-of-Distribution Detection

Xixi Liu, Christopher Zach

ECCV 2024
#2276

iNeMo: Incremental Neural Mesh Models for Robust Class-Incremental Learning

Tom Fischer, Yaoyao Liu, Artur Jesslen et al.

ECCV 2024arXiv:2407.09271
#2277

Pose Guided Fine-Grained Sign Language Video Generation

Tongkai Shi, Lianyu Hu, Fanhua Shang et al.

ECCV 2024
#2278

Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition

Sergio Izquierdo, Javier Civera

ECCV 2024arXiv:2407.02422
#2279

DoubleTake: Geometry Guided Depth Estimation

Mohamed Sayed, Filippo Aleotti, Jamie Watson et al.

ECCV 2024arXiv:2406.18387
#2280

Oulu Remote-photoplethysmography Physical Domain Attacks Database (ORPDAD)

Marko Savic, Guoying Zhao

ECCV 2024
#2281

SeA: Semantic Adversarial Augmentation for Last Layer Features from Unsupervised Representation Learning

Qi Qian, Yuanhong Xu, JUHUA HU

ECCV 2024arXiv:2408.13351
#2282

Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance

Donghoon Ahn, Hyoungwon Cho, Jaewon Min et al.

ECCV 2024arXiv:2403.17377
#2283

Leveraging Representations from Intermediate Encoder-blocks for Synthetic Image Detection

Christos Koutlis, Symeon Papadopoulos

ECCV 2024arXiv:2402.19091
#2284

LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models

Yabin Zhang, Wenjie Zhu, Chenhang He et al.

ECCV 2024arXiv:2407.08966
#2285

Learning Neural Deformation Representation for 4D Dynamic Shape Generation

Gyojin Han, Jiwan Hur, Jaehyun Choi et al.

ECCV 2024
#2286

3D Reconstruction of Objects in Hands without Real World 3D Supervision

Aditya Prakash, Matthew Chang, Matthew Jin et al.

ECCV 2024arXiv:2305.03036
#2287

Chains of Diffusion Models

Yanheng Wei, Lianghua Huang, Zhi-Fan Wu et al.

ECCV 2024
#2288

To Supervise or Not to Supervise: Understanding and Addressing the Key Challenges of Point Cloud Transfer Learning

Souhail Hadgi, Lei Li, Maks Ovsjanikov

ECCV 2024arXiv:2403.17869
#2289

Robust Nearest Neighbors for Source-Free Domain Adaptation under Class Distribution Shift

Antonio Tejero-de-Pablos, Riku Togashi, Mayu Otani et al.

ECCV 2024
#2290

Optimization-based Uncertainty Attribution Via Learning Informative Perturbations

Hanjing Wang, Bashirul Azam Biswas, Qiang Ji

ECCV 2024
#2291

Physics-informed Knowledge Transfer for Underwater Monocular Depth Estimation

Jinghe Yang, Mingming Gong, Ye Pu

ECCV 2024
#2292

Learning Equilibrium Transformation for Gamut Expansion and Color Restoration

JUN XIAO, Changjian Shui, Zhi-Song Liu et al.

ECCV 2024
#2293

SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs

Yang Miao, Francis Engelmann, Olga Vysotska et al.

ECCV 2024arXiv:2404.00469
#2294

GGRt: Towards Generalizable 3D Gaussians without Pose Priors in Real-Time

Hao Li, Yuanyuan Gao, Dingwen Zhang et al.

ECCV 2024
#2295

A Diffusion Model for Simulation Ready Coronary Anatomy with Morpho-skeletal Control

Karim Kadry, Shreya Gupta, Jonas Sogbadji et al.

ECCV 2024arXiv:2407.15631
#2296

Weighted Ensemble Models Are Strong Continual Learners

Imad Eddine Marouf, Subhankar Roy, Enzo Tartaglione et al.

ECCV 2024arXiv:2312.08977
#2297

LPViT: Low-Power Semi-structured Pruning for Vision Transformers

KAIXIN Xu, Zhe Wang, Chunyun Chen et al.

ECCV 2024arXiv:2407.02068
#2298

Towards Open-World Object-based Anomaly Detection via Self-Supervised Outlier Synthesis

Brian Isaac Medina, Yona Falinie Abdul Gaus, Neelanjan Bhowmik et al.

ECCV 2024arXiv:2407.15763
#2299

Evaluating the Adversarial Robustness of Semantic Segmentation: Trying Harder Pays Off

Levente Ferenc Halmosi, Bálint Mohos, Márk Jelasity

ECCV 2024arXiv:2407.09150
#2300

HoloADMM: High-Quality Holographic Complex Field Recovery

Mazen Mel, Paul Springer, Pietro Zanuttigh et al.

ECCV 2024
#2301

AdaDiff: Accelerating Diffusion Models through Step-Wise Adaptive Computation

Shengkun Tang, Yaqing Wang, Caiwen Ding et al.

ECCV 2024arXiv:2309.17074
#2302

FedHide: Federated Learning by Hiding in the Neighbors

Hyunsin Park, Sungrack Yun

ECCV 2024arXiv:2409.07808
#2303

Towards Image Ambient Lighting Normalization

Florin-Alexandru Vasluianu, Tim Seizinger, Zongwei Wu et al.

ECCV 2024arXiv:2403.18730
#2304

CadVLM: Bridging Language and Vision in the Generation of Parametric CAD Sketches

Sifan Wu, Amir Hosein Khasahmadi, Mor Katz et al.

ECCV 2024arXiv:2409.17457
#2305

DreamReward: Aligning Human Preference in Text-to-3D Generation

junliang ye, Fangfu Liu, Qixiu Li et al.

ECCV 2024
#2306

InfoNorm: Mutual Information Shaping of Normals for Sparse-View Reconstruction

Xulong Wang, Siyan Dong, Youyi Zheng et al.

ECCV 2024arXiv:2407.12661
#2307

SG-NeRF: Neural Surface Reconstruction with Scene Graph Optimization

Yiyang Chen, Siyan Dong, Xulong Wang et al.

ECCV 2024arXiv:2407.12667
#2308

Early Anticipation of Driving Maneuvers

Abdul Wasi Lone, Shankar Gangisetty, Shyam Nandan et al.

ECCV 2024
#2309

High-Quality Mesh Blendshape Generation from Face Videos via Neural Inverse Rendering

Xin Ming, Jiawei Li, Jingwang Ling et al.

ECCV 2024arXiv:2401.08398
#2310

Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression

Animesh Sinha, Bo Sun, Anmol Kalia et al.

ECCV 2024arXiv:2311.10794
#2311

Object-Aware Query Perturbation for Cross-Modal Image-Text Retrieval

Naoya Sogi, Takashi Shibata, Makoto Terao

ECCV 2024arXiv:2407.12346
#2312

Ex2Eg-MAE: A Framework for Adaptation of Exocentric Video Masked Autoencoders for Egocentric Social Role Understanding

Minh Tran, Yelin Kim, Che-Chun Su et al.

ECCV 2024
#2313

Easing 3D Pattern Reasoning with Side-view Features for Semantic Scene Completion

Linxi Huan, Mingyue Dong, Linwei Yue et al.

ECCV 2024
#2314

Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning

Thanh Thong Nguyen, Yi Bin, Xiaobao Wu et al.

ECCV 2024arXiv:2407.03788
#2315

Adaptive Multi-head Contrastive Learning

Lei Wang, Piotr Koniusz, Tom Gedeon et al.

ECCV 2024arXiv:2310.05615
#2316

Lost in Translation: Latent Concept Misalignment in Text-to-Image Diffusion Models

Juntu Zhao, Junyu Deng, Yixin Ye et al.

ECCV 2024arXiv:2408.00230
#2317

Contextual Correspondence Matters: Bidirectional Graph Matching for Video Summarization

yunzuo zhang, Yameng Liu

ECCV 2024
#2318

GRiT: A Generative Region-to-text Transformer for Object Understanding

Jialian Wu, Jianfeng Wang, Zhengyuan Yang et al.

ECCV 2024arXiv:2212.00280
#2319

LRSLAM: Low-rank Representation of Signed Distance Fields in Dense Visual SLAM System

Hongbeen Park, Minjeong Park, Giljoo Nam et al.

ECCV 2024arXiv:2506.10567
#2320

Learning Representation for Multitask Learning through Self-Supervised Auxiliary Learning

Seokwon Shin, Hyungrok Do, Youngdoo Son

ECCV 2024
#2321

Generalizing to Unseen Domains via Text-guided Augmentation

Daiqing Qi, Handong Zhao, Aidong Zhang et al.

ECCV 2024
#2322

BAGS: Blur Agnostic Gaussian Splatting through Multi-Scale Kernel Modeling

Cheng Peng, Yutao Tang, Yifan Zhou et al.

ECCV 2024arXiv:2403.04926
#2323

SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning

Bac Nguyen, Stefan Uhlich, Fabien Cardinaux et al.

ECCV 2024arXiv:2407.03036
#2324

DPA-Net: Structured 3D Abstraction from Sparse Views via Differentiable Primitive Assembly

Fenggen Yu, Yiming Qian, Xu Zhang et al.

ECCV 2024arXiv:2404.00875
#2325

An Empirical Study and Analysis of Text-to-Image Generation Using Large Language Model-Powered Textual Representation

Zhiyu Tan, Mengping Yang, Luozheng Qin et al.

ECCV 2024arXiv:2405.12914
#2326

Information Bottleneck Based Data Correction in Continual Learning

Shuai Chen, mingyi zhang, Junge Zhang et al.

ECCV 2024
#2327

Forbes: Face Obfuscation Rendering via Backpropagation Refinement Scheme

Jintae Kim, Seungwon Yang, Seong-Gyun Jeong et al.

ECCV 2024arXiv:2407.14170
#2328

Generalizable Symbolic Optimizer Learning

Xiaotian Song, Peng Zeng, Yanan Sun et al.

ECCV 2024
#2329

Scene-Conditional 3D Object Stylization and Composition

Jinghao Zhou, Tomas Jakab, Philip Torr et al.

ECCV 2024arXiv:2312.12419
#2330

Fast Point Cloud Geometry Compression with Context-based Residual Coding and INR-based Refinement

Hao Xu, Xi Zhang, Xiaolin Wu

ECCV 2024arXiv:2408.02966
#2331

On the Vulnerability of Skip Connections to Model Inversion Attacks

Jun Hao Koh, Sy-Tuyen Ho, Ngoc-Bao Nguyen et al.

ECCV 2024arXiv:2409.01696
#2332

Unsupervised Variational Translator for Bridging Image Restoration and High-Level Vision Tasks

Jiawei Wu, Zhi Jin

ECCV 2024arXiv:2408.08149
#2333

Reinforcement Learning via Auxillary Task Distillation

Abhinav Narayan Harish, Larry Heck, Josiah P Hanna et al.

ECCV 2024
#2334

Dual-Rain: Video Rain Removal using Assertive and Gentle Teachers

Tingting Chen, Beibei Lin, Yeying Jin et al.

ECCV 2024
#2335

Similarity of Neural Architectures using Adversarial Attack Transferability

Jaehui Hwang, Dongyoon Han, Byeongho Heo et al.

ECCV 2024arXiv:2210.11407
#2336

Plug and Play: A Representation Enhanced Domain Adapter for Collaborative Perception

TIANYOU LUO, Quan Yuan, Yuchen Xia et al.

ECCV 2024
#2337

Follow the Rules: Reasoning for Video Anomaly Detection with Large Language Models

Yuchen Yang, Kwonjoon Lee, Behzad Dariush et al.

ECCV 2024arXiv:2407.10299
#2338

ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation

Mengcheng Lan, Chaofeng Chen, Yiping Ke et al.

ECCV 2024arXiv:2408.04883
#2339

Robustness Preserving Fine-tuning using Neuron Importance

Guangrui Li, Rahul Duggal, Aaditya Singh et al.

ECCV 2024
#2340

A Riemannian Approach for Spatiotemporal Analysis and Generation of 4D Tree-shaped Structures

Tahmina Khanam, Mohammed Bennamoun, Guan Wang et al.

ECCV 2024arXiv:2408.12443
#2341

CLAMP-ViT: Contrastive Data-Free Learning for Adaptive Post-Training Quantization of ViTs

Akshat Ramachandran, Souvik Kundu, Tushar Krishna

ECCV 2024arXiv:2407.05266
#2342

Towards Robust Event-based Networks for Nighttime via Unpaired Day-to-Night Event Translation

Yuhwan Jeong, Hoonhee Cho, Kuk-Jin Yoon

ECCV 2024arXiv:2407.10703
#2343

E.T. the Exceptional Trajectory: Text-to-camera-trajectory generation with character awareness

Robin Courant, Nicolas Dufour, Xi WANG et al.

ECCV 2024arXiv:2407.01516
#2344

Motion Keyframe Interpolation for Any Human Skeleton using Point Cloud-based Human Motion Data Homogenisation

Clinton Mo, Kun Hu, Chengjiang Long et al.

ECCV 2024
#2345

Leveraging Text Localization for Scene Text Removal via Text-aware Masked Image Modeling

Zixiao Wang, Hongtao Xie, YuXin Wang et al.

ECCV 2024arXiv:2409.13431
#2346

Improving Hyperbolic Representations via Gromov-Wasserstein Regularization

yifei Yang, Wonjun Lee, Dongmian Zou et al.

ECCV 2024arXiv:2407.10495
#2347

SPARO: Selective Attention for Robust and Compositional Transformer Encodings for Vision

Ankit Vani, Bac Nguyen, Samuel Lavoie et al.

ECCV 2024arXiv:2404.15721
#2348

On the Topology Awareness and Generalization Performance of Graph Neural Networks

Junwei Su, Chuan Wu

ECCV 2024arXiv:2403.04482
#2349

Dense Hand-Object(HO) GraspNet with Full Grasping Taxonomy and Dynamics

Woojin Cho, Jihyun Lee, Minjae Yi et al.

ECCV 2024arXiv:2409.04033
#2350

MART: MultiscAle Relational Transformer Networks for Multi-agent Trajectory Prediction

Seongju Lee, Junseok Lee, Yeonguk Yu et al.

ECCV 2024arXiv:2407.21635
#2351

Depth-Aware Blind Image Decomposition for Real-World Adverse Weather Recovery

Chao Wang, Zhedong Zheng, Ruijie Quan et al.

ECCV 2024
#2352

DreamSampler: Unifying Diffusion Sampling and Score Distillation for Image Manipulation

Jeongsol Kim, Geon Yeong Park, Jong Chul Ye

ECCV 2024arXiv:2403.11415
#2353

Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and Retention

Zuyao Chen, Jinlin Wu, Zhen Lei et al.

ECCV 2024arXiv:2311.10988
#2354

Improving 3D Semi-supervised Learning by Effectively Utilizing All Unlabelled Data

Sneha Paul, Zachary Patterson, Nizar Bouguila

ECCV 2024arXiv:2409.13977
#2355

PreciseControl: Enhancing Text-To-Image Diffusion Models with Fine-Grained Attribute Control

Rishubh Parihar, Sachidanand VS, Sabariswaran Mani et al.

ECCV 2024arXiv:2408.05083
#2356

HVCLIP: High-dimensional Vector in CLIP for Unsupervised Domain Adaptation

Noranart Vesdapunt, Kah Kuen Fu, Yue Wu et al.

ECCV 2024
#2357

VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks

Xiangxiang Chu, Jianlin Su, Bo Zhang et al.

ECCV 2024arXiv:2403.00522
#2358

SRPose: Two-view Relative Pose Estimation with Sparse Keypoints

Rui Yin, Yulun Zhang, Zherong Pan et al.

ECCV 2024arXiv:2407.08199
#2359

Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models

Xiaoshi Wu, Yiming Hao, Manyuan Zhang et al.

ECCV 2024arXiv:2405.00760
#2360

Delving into Adversarial Robustness on Document Tampering Localization

Huiru Shao, Zhuang Qian, Kaizhu Huang et al.

ECCV 2024
#2361

Towards Dual Transparent Liquid Level Estimation in Biomedical Lab: Dataset, Methods and Practice

Xiayu Wang, Ke Ma, Ruiyun Zhong et al.

ECCV 2024
#2362

WBP: Training-time Backdoor Attacks through Hardware-based Weight Bit Poisoning

Kunbei Cai, Zhenkai Zhang, Qian Lou et al.

ECCV 2024
#2363

COM Kitchens: An Unedited Overhead-view Procedural Videos Dataset a Vision-Language Benchmark

Atsushi Hashimoto, Koki Maeda, Tosho Hirasawa et al.

ECCV 2024
#2364

Efficient Vision Transformers with Partial Attention

Xuan-Thuy Vo, Duy-Linh Nguyen, Adri Priadana et al.

ECCV 2024
#2365

Generalized Coverage for More Robust Low-Budget Active Learning

Wonho Bae, Junhyug Noh, Danica J. Sutherland

ECCV 2024arXiv:2407.12212
#2366

Learning to Distinguish Samples for Generalized Category Discovery

Fengxiang Yang, Pu Nan, Wenjing Li et al.

ECCV 2024
#2367

Kinetic Typography Diffusion Model

Seonmi Park, Inhwan Bae, Seunghyun Shin et al.

ECCV 2024arXiv:2407.10476
#2368

Loc3Diff: Local Diffusion for 3D Human Head Synthesis and Editing

Yushi Lan, Feitong Tan, Qiangeng Xu et al.

ECCV 2024
#2369

TrafficNight : An Aerial Multimodal Benchmark For Nighttime Vehicle Surveillance

Guoxing Zhang, Yiming Liu, xiaoyu yang et al.

ECCV 2024
#2370

POET: Prompt Offset Tuning for Continual Human Action Adaptation

Prachi Garg, Joseph K J, Vineeth N Balasubramanian et al.

ECCV 2024arXiv:2504.18059
#2371

R.A.C.E.: Robust Adversarial Concept Erasure for Secure Text-to-Image Diffusion Model

Changhoon Kim, Kyle Min, Yezhou Yang

ECCV 2024arXiv:2405.16341
#2372

All You Need is Your Voice: Emotional Face Representation with Audio Perspective for Emotional Talking Face Generation

Seongho Kim, Byung Cheol Song

ECCV 2024
#2373

MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning

Vishal Nedungadi, Ankit Kariryaa, Stefan Oehmcke et al.

ECCV 2024arXiv:2405.02771
#2374

BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion

Gwanghyun Kim, Hayeon Kim, Hoigi Seo et al.

ECCV 2024arXiv:2404.04544
#2375

Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time

Sanjoy Chowdhury, Sayan Nag, Subhrajyoti Dasgupta et al.

ECCV 2024arXiv:2407.01851
#2376

DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks

Sarah Jabbour, Gregory Kondas, Ella Kazerooni et al.

ECCV 2024arXiv:2407.14509
#2377

Efficient 3D-Aware Facial Image Editing via Attribute-Specific Prompt Learning

Amandeep Kumar, Muhammad Awais, Sanath Narayan et al.

ECCV 2024arXiv:2406.04413
#2378

Embracing Events and Frames with Hierarchical Feature Refinement Network for Object Detection

Hu Cao, Zehua Zhang, Yan Xia et al.

ECCV 2024arXiv:2407.12582
#2379

UL-VIO: Ultra-lightweight Visual-Inertial Odometry with Noise Robust Test-time Adaptation

Jinho Park, Se Young Chun, Mingoo Seok

ECCV 2024arXiv:2409.13106
#2380

Unsupervised Representation Learning by Balanced Self Attention Matching

Daniel Shalam, Simon Korman

ECCV 2024arXiv:2408.02014
#2381

Identity-Consistent Diffusion Network for Grading Knee Osteoarthritis Progression in Radiographic Imaging

Wenhua Wu, Kun Hu, Wenxi Yue et al.

ECCV 2024arXiv:2407.21381
#2382

Caltech Aerial RGB-Thermal Dataset in the Wild

Connor Lee, Matthew Anderson, Nikhil Ranganathan et al.

ECCV 2024arXiv:2403.08997
#2383

Make-Your-3D: Fast and Consistent Subject-Driven 3D Content Generation

Fangfu Liu, Hanyang Wang, Weiliang Chen et al.

ECCV 2024arXiv:2403.09625
#2384

EAGLES: Efficient Accelerated 3D Gaussians with Lightweight EncodingS

Sharath Girish, Kamal Gupta, Abhinav Shrivastava

ECCV 2024arXiv:2312.04564
#2385

Teach CLIP to Develop a Number Sense for Ordinal Regression

Yao DU, Qiang Zhai, Weihang Dai et al.

ECCV 2024arXiv:2408.03574
#2386

Thinking Outside the BBox: Unconstrained Generative Object Compositing

Gemma Canet Tarrés, Zhe Lin, Zhifei Zhang et al.

ECCV 2024arXiv:2409.04559
#2387

Compact 3D Scene Representation via Self-Organizing Gaussian Grids

Wieland Morgenstern, Florian Barthel, Anna Hilsmann et al.

ECCV 2024arXiv:2312.13299