Most Cited ECCV "semantic diversity" Papers

2,387 papers found • Page 12 of 12

Filters:Most Cited ECCV semantic diversity Clear all

Conference

AAAI 2025 (3,028)COLM 2025 (418)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NEURIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,558)oral (1,594)spotlight (1,421)highlight (975)

#2201

Plain-Det: A Plain Multi-Dataset Object Detector

cheng Shi, yuchen zhu, Sibei Yang

ECCV 2024arXiv:2407.10083

#2202

Lost in Translation: Modern Neural Networks Still Struggle With Small Realistic Image Transformations

Ofir Shifman, Yair Weiss

ECCV 2024arXiv:2404.07153

#2203

m&m’s: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks

Zixian Ma, Weikai Huang, Jieyu Zhang et al.

ECCV 2024arXiv:2403.11085

#2204

SENC: Handling Self-collision in Neural Cloth Simulation

Zhouyingcheng Liao, Sinan Wang, Taku Komura

ECCV 2024arXiv:2407.12479

#2205

SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding

Baoxiong Jia, Yixin Chen, Huangyue Yu et al.

ECCV 2024arXiv:2401.09340

#2206

Convex Relaxations for Manifold-Valued Markov Random Fields with Approximation Guarantees

Robin Kenis, Emanuel Laude, Panagiotis Patrinos

ECCV 2024

#2207

ReLoo: Reconstructing Humans Dressed in Loose Garments from Monocular Video in the Wild

Chen Guo, Tianjian Jiang, Manuel Kaufmann et al.

ECCV 2024arXiv:2409.15269

#2208

Controlling the World by Sleight of Hand

Sruthi Sudhakar, Ruoshi Liu, Basile Van Hoorick et al.

ECCV 2024arXiv:2408.07147

#2209

Omni-Recon: Harnessing Image-based Rendering for General-Purpose Neural Radiance Fields

Yonggan Fu, Huaizhi Qu, Zhifan Ye et al.

ECCV 2024arXiv:2403.11131

#2210

Gated Temporal Diffusion for Stochastic Long-term Dense Anticipation

Olga Zatsarynna, Emad Bahrami, Yazan Abu Farha et al.

ECCV 2024arXiv:2407.11954

#2211

Scalar Function Topology Divergence: Comparing Topology of 3D Objects

Ilya Trofimov, Daria Voronkova, Eduard Tulchinskii et al.

ECCV 2024arXiv:2407.08364

#2212

Pseudo-Labelling Should Be Aware of Disguising Channel Activations

Changrui Chen, Kurt Debattista, Jungong Han

ECCV 2024

#2213

See and Think: Embodied Agent in Virtual Environment

Zhonghan Zhao, Xuan Wang, Wenhao Chai et al.

ECCV 2024arXiv:2311.15209

#2214

ScanReason: Empowering 3D Visual Grounding with Reasoning Capabilities

CHENMING ZHU, Tai Wang, Wenwei Zhang et al.

ECCV 2024arXiv:2407.01525

#2215

Salience-Based Adaptive Masking: Revisiting Token Dynamics for Enhanced Pre-training

Hyesong Choi, Hyejin Park, Kwang Moo Yi et al.

ECCV 2024arXiv:2404.08327

#2216

Causal Subgraphs and Information Bottlenecks: Redefining OOD Robustness in Graph Neural Networks

Weizhi An, Wenliang Zhong, Feng Jiang et al.

ECCV 2024

#2217

4D Contrastive Superflows are Dense 3D Representation Learners

Xiang Xu, Lingdong Kong, Hui Shuai et al.

ECCV 2024arXiv:2407.06190

#2218

PISR: Polarimetric Neural Implicit Surface Reconstruction for Textureless and Specular Objects

Guangcheng Chen, Yicheng He, Li He et al.

ECCV 2024arXiv:2409.14331

#2219

Reconstruction and Simulation of Elastic Objects with Spring-Mass 3D Gaussians

Licheng Zhong, Hong-Xing Yu, Jiajun Wu et al.

ECCV 2024arXiv:2403.09434

#2220

Revisiting Domain-Adaptive Object Detection in Adverse Weather by the Generation and Composition of High-Quality Pseudo-Labels

Rui Zhao, Huibin Yan, Shuoyao Wang

ECCV 2024

#2221

GTPT: Group-based Token Pruning Transformer for Efficient Human Pose Estimation

Haonan Wang, Jie Liu, Jie Tang et al.

ECCV 2024arXiv:2407.10756

#2222

Energy-induced Explicit quantification for Multi-modality MRI fusion

Xiaoming Qi, Yuan Zhang, Tong Wang et al.

ECCV 2024

#2223

LoA-Trans: Enhancing Visual Grounding by Location-Aware Transformers

Ziling Huang, Shin’ichi Satoh

ECCV 2024

#2224

Self-supervised Shape Completion via Involution and Implicit Correspondences

Mengya Liu, Ajad Chhatkuli, Janis Postels et al.

ECCV 2024arXiv:2409.15939

#2225

3D Single-object Tracking in Point Clouds with High Temporal Variation

Qiao Wu, Kun Sun, Pei An et al.

ECCV 2024arXiv:2408.02049

#2226

RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation

Luis Li, Hubert P. H. Shum, Toby P Breckon

ECCV 2024arXiv:2407.10159

#2227

Defect Spectrum: A Granular Look of Large-scale Defect Datasets with Rich Semantics

Shuai Yang, ZhiFei Chen, Pengguang Chen et al.

ECCV 2024arXiv:2310.17316

#2228

Robust Fitting on a Gate Quantum Computer

Frances Yang, Michele Sasdelli, Tat-Jun Chin

ECCV 2024arXiv:2409.02006

#2229

Correspondences of the Third Kind: Camera Pose Estimation from Object Reflection

Kohei Yamashita, Vincent Lepetit, Ko Nishino

ECCV 2024arXiv:2312.04527

#2230

Shapefusion: 3D localized human diffusion models

Rolandos Alexandros Potamias, Michael Tarasiou, Stylianos Ploumpis et al.

ECCV 2024

#2231

Rejection Sampling IMLE: Designing Priors for Better Few-Shot Image Synthesis

Chirag Vashist, Shichong Peng, Ke Li

ECCV 2024arXiv:2409.17439

#2232

Occluded Gait Recognition with Mixture of Experts: An Action Detection Perspective

Panjian Huang, Yunjie Peng, Saihui Hou et al.

ECCV 2024

#2233

Semi-Supervised Teacher-Reference-Student Architecture for Action Quality Assessment

Wulian Yun, Mengshi Qi, Fei Peng et al.

ECCV 2024arXiv:2407.19675

#2234

3D Congealing: 3D-Aware Image Alignment in the Wild

Yunzhi Zhang, Zizhang Li, Amit Raj et al.

ECCV 2024arXiv:2404.02125

#2235

GlobalPointer: Large-Scale Plane Adjustment with Bi-Convex Relaxation

Bangyan Liao, Zhenjun Zhao, Lu Chen et al.

ECCV 2024arXiv:2407.13537

#2236

Superpixel-informed Implicit Neural Representation for Multi-Dimensional Data

Jiayi Li, Xi-Le Zhao, Jian-Li Wang et al.

ECCV 2024arXiv:2411.11356

#2237

SceneScript: Reconstructing Scenes With An Autoregressive Structured Language Model

Armen Avetisyan, Christopher Xie, Henry Howard-Jenkins et al.

ECCV 2024arXiv:2403.13064

#2238

Unsupervised Exposure Correction

Ruodai Cui, Li Niu, Guosheng Hu

ECCV 2024arXiv:2507.17252

#2239

Implicit Filtering for Learning Neural Signed Distance Functions from 3D Point Clouds

Shengtao Li, Ge Gao, Yudong Liu et al.

ECCV 2024arXiv:2407.13342

#2240

MMBENCH: Is Your Multi-Modal Model an All-around Player?

Yuan Liu, Haodong Duan, Yuanhan Zhang et al.

ECCV 2024arXiv:2307.06281

#2241

HyTAS: A Hyperspectral Image Transformer Architecture Search Benchmark and Analysis

Fangqin Zhou, Mert Kilickaya, Joaquin Vanschoren et al.

ECCV 2024arXiv:2407.16269

#2242

LiDAR-Event Stereo Fusion with Hallucinations

Luca Bartolomei, Matteo Poggi, Andrea Conti et al.

ECCV 2024arXiv:2408.04633

#2243

Dual-Path Adversarial Lifting for Domain Shift Correction in Online Test-time Adaptation

Yushun Tang, Shuoshuo Chen, Zhihe Lu et al.

ECCV 2024arXiv:2408.13983

#2244

Rethinking and Improving Visual Prompt Selection for In-Context Learning Segmentation Framework

Wei Suo, Lanqing Lai, Mengyang Sun et al.

ECCV 2024

#2245

Cross-Input Certified Training for Universal Perturbations

Changming Xu, Gagandeep Singh

ECCV 2024arXiv:2405.09176

#2246

Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation

Juncheng Ma, Peiwen Sun, Yaoting Wang et al.

ECCV 2024arXiv:2407.11820

#2247

QUAR-VLA: Vision-Language-Action Model for Quadruped Robots

Pengxiang Ding, Han Zhao, Wenjie Zhang et al.

ECCV 2024arXiv:2312.14457

#2248

ShareGPT4V: Improving Large Multi-Modal Models with Better Captions

Lin Chen, Jinsong Li, Xiaoyi Dong et al.

ECCV 2024arXiv:2311.12793

#2249

Textual-Visual Logic Challenge: Understanding and Reasoning in Text-to-Image Generation

Peixi Xiong, Michael A Kozuch, Nilesh Jain

ECCV 2024

#2250

When and How do negative prompts take effect?

Yuanhao Ban, Ruochen Wang, Tianyi Zhou et al.

ECCV 2024

#2251

Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation

Yunhao Gou, Kai Chen, Zhili LIU et al.

ECCV 2024arXiv:2403.09572

#2252

Training A Small Emotional Vision Language Model for Visual Art Comprehension

Jing Zhang, Liang Zheng, Meng Wang et al.

ECCV 2024arXiv:2403.11150

#2253

Spectral Subsurface Scattering for Material Classification

Haejoon Lee, Aswin C. Sankaranarayanan

ECCV 2024

#2254

LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model

Runhui Huang, Kaixin Cai, Jianhua Han et al.

ECCV 2024arXiv:2403.11929

#2255

RANRAC: Robust Neural Scene Representations via Random Ray Consensus

Benno Buschmann, Andreea Dogaru, Elmar Eisemann et al.

ECCV 2024arXiv:2312.09780

#2256

COD: Learning Conditional Invariant Representation for Domain Adaptation Regression

Hao-Ran Yang, Chuan-Xian Ren, You-Wei Luo

ECCV 2024arXiv:2408.06638

#2257

Few-shot Defect Image Generation based on Consistency Modeling

Qingfeng Shi, Jing Wei, Fei Shen et al.

ECCV 2024arXiv:2408.00372

#2258

CLIP-DPO: Vision-Language Models as a Source of Preference for Fixing Hallucinations in LVLMs

Yassine Ouali, Adrian Bulat, Brais Martinez et al.

ECCV 2024arXiv:2408.10433

#2259

WAVE: Warping DDIM Inversion Features for Zero-shot Text-to-Video Editing

Yutang Feng, Sicheng Gao, Yuxiang Bao et al.

ECCV 2024

#2260

Spiking Wavelet Transformer

Yuetong Fang, Ziqing Wang, Lingfeng Zhang et al.

ECCV 2024arXiv:2403.11138

#2261

Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label Classifier

Prantik Howlader, Srijan Das, Hieu Le et al.

ECCV 2024arXiv:2407.04036

#2262

AdversariaLeak: External Information Leakage Attack Using Adversarial Samples on Face Recognition Systems

Roye Katzav, Amit Giloni, Edita Grolman et al.

ECCV 2024

#2263

Get Your Embedding Space in Order: Domain-Adaptive Regression for Forest Monitoring

Sizhuo Li, Dimitri Gominski, Martin Brandt et al.

ECCV 2024arXiv:2405.00514

#2264

MONTRAGE: Monitoring Training for Attribution of Generative Diffusion Models

Jonathan Brokman, Omer Hofman, Roman Vainshtein et al.

ECCV 2024

#2265

Curved Diffusion: A Generative Model With Optical Geometry Control

Andrey Voynov, Amir Hertz, Moab Arar et al.

ECCV 2024arXiv:2311.17609

#2266

How Far Can a 1-Pixel Camera Go? Solving Vision Tasks using Photoreceptors and Computationally Designed Visual Morphology

Andrei Atanov, Rishubh Singh, Jiawei Fu et al.

ECCV 2024

#2267

MLPHand: Real Time Multi-View 3D Hand Reconstruction via MLP Modeling

Jian Yang, Jiakun Li, Guoming Li et al.

ECCV 2024

#2268

Can Textual Semantics Mitigate Sounding Object Segmentation Preference?

Yaoting Wang, Peiwen Sun, Yuanchao Li et al.

ECCV 2024arXiv:2407.10947

#2269

DreamScene: 3D Gaussian-based Text-to-3D Scene Generation via Formation Pattern Sampling

Haoran Li, Haolin Shi, Wenli Zhang et al.

ECCV 2024arXiv:2404.03575

#2270

AnimateMe: 4D Facial Expressions via Diffusion Models

Dimitrios Gerogiannis, Foivos Paraperas Papantoniou, Rolandos Alexandros Potamias et al.

ECCV 2024arXiv:2403.17213

#2271

A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment

Tianhe Wu, Kede Ma, Jie Liang et al.

ECCV 2024arXiv:2403.10854

#2272

Lost and Found: Overcoming Detector Failures in Online Multi-Object Tracking

Lorenzo Vaquero, Yihong XU, Xavier Alameda-Pineda et al.

ECCV 2024arXiv:2407.10151

#2273

LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis

Kevin Xie, Tianshi Cao, Jonathan P Lorraine et al.

ECCV 2024arXiv:2403.15385

#2274

Continual Learning and Unknown Object Discovery in 3D Scenes via Self-Distillation

Mohamed El Amine Boudjoghra, Jean Lahoud, Salman Khan et al.

ECCV 2024

#2275

TAG: Text Prompt Augmentation for Zero-Shot Out-of-Distribution Detection

Xixi Liu, Christopher Zach

ECCV 2024

#2276

iNeMo: Incremental Neural Mesh Models for Robust Class-Incremental Learning

Tom Fischer, Yaoyao Liu, Artur Jesslen et al.

ECCV 2024arXiv:2407.09271

#2277

Pose Guided Fine-Grained Sign Language Video Generation

Tongkai Shi, Lianyu Hu, Fanhua Shang et al.

ECCV 2024

#2278

Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition

Sergio Izquierdo, Javier Civera

ECCV 2024arXiv:2407.02422

#2279

DoubleTake: Geometry Guided Depth Estimation

Mohamed Sayed, Filippo Aleotti, Jamie Watson et al.

ECCV 2024arXiv:2406.18387

#2280

Oulu Remote-photoplethysmography Physical Domain Attacks Database (ORPDAD)

Marko Savic, Guoying Zhao

ECCV 2024

#2281

SeA: Semantic Adversarial Augmentation for Last Layer Features from Unsupervised Representation Learning

Qi Qian, Yuanhong Xu, JUHUA HU

ECCV 2024arXiv:2408.13351

#2282

Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance

Donghoon Ahn, Hyoungwon Cho, Jaewon Min et al.

ECCV 2024arXiv:2403.17377

#2283

Leveraging Representations from Intermediate Encoder-blocks for Synthetic Image Detection

Christos Koutlis, Symeon Papadopoulos

ECCV 2024arXiv:2402.19091

#2284

LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models

Yabin Zhang, Wenjie Zhu, Chenhang He et al.

ECCV 2024arXiv:2407.08966

#2285

Learning Neural Deformation Representation for 4D Dynamic Shape Generation

Gyojin Han, Jiwan Hur, Jaehyun Choi et al.

ECCV 2024

#2286

3D Reconstruction of Objects in Hands without Real World 3D Supervision

Aditya Prakash, Matthew Chang, Matthew Jin et al.

ECCV 2024arXiv:2305.03036

#2287

Chains of Diffusion Models

Yanheng Wei, Lianghua Huang, Zhi-Fan Wu et al.

ECCV 2024

#2288

To Supervise or Not to Supervise: Understanding and Addressing the Key Challenges of Point Cloud Transfer Learning

Souhail Hadgi, Lei Li, Maks Ovsjanikov

ECCV 2024arXiv:2403.17869

#2289

Robust Nearest Neighbors for Source-Free Domain Adaptation under Class Distribution Shift

Antonio Tejero-de-Pablos, Riku Togashi, Mayu Otani et al.

ECCV 2024

#2290

Optimization-based Uncertainty Attribution Via Learning Informative Perturbations

Hanjing Wang, Bashirul Azam Biswas, Qiang Ji

ECCV 2024

#2291

Physics-informed Knowledge Transfer for Underwater Monocular Depth Estimation

Jinghe Yang, Mingming Gong, Ye Pu

ECCV 2024

#2292

Learning Equilibrium Transformation for Gamut Expansion and Color Restoration

JUN XIAO, Changjian Shui, Zhi-Song Liu et al.

ECCV 2024

#2293

SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs

Yang Miao, Francis Engelmann, Olga Vysotska et al.

ECCV 2024arXiv:2404.00469

#2294

GGRt: Towards Generalizable 3D Gaussians without Pose Priors in Real-Time

Hao Li, Yuanyuan Gao, Dingwen Zhang et al.

ECCV 2024

#2295

A Diffusion Model for Simulation Ready Coronary Anatomy with Morpho-skeletal Control

Karim Kadry, Shreya Gupta, Jonas Sogbadji et al.

ECCV 2024arXiv:2407.15631

#2296

Weighted Ensemble Models Are Strong Continual Learners

Imad Eddine Marouf, Subhankar Roy, Enzo Tartaglione et al.

ECCV 2024arXiv:2312.08977

#2297

LPViT: Low-Power Semi-structured Pruning for Vision Transformers

KAIXIN Xu, Zhe Wang, Chunyun Chen et al.

ECCV 2024arXiv:2407.02068

#2298

Towards Open-World Object-based Anomaly Detection via Self-Supervised Outlier Synthesis

Brian Isaac Medina, Yona Falinie Abdul Gaus, Neelanjan Bhowmik et al.

ECCV 2024arXiv:2407.15763

#2299

Evaluating the Adversarial Robustness of Semantic Segmentation: Trying Harder Pays Off

Levente Ferenc Halmosi, Bálint Mohos, Márk Jelasity

ECCV 2024arXiv:2407.09150

#2300

HoloADMM: High-Quality Holographic Complex Field Recovery

Mazen Mel, Paul Springer, Pietro Zanuttigh et al.

ECCV 2024

#2301

AdaDiff: Accelerating Diffusion Models through Step-Wise Adaptive Computation

Shengkun Tang, Yaqing Wang, Caiwen Ding et al.

ECCV 2024arXiv:2309.17074

#2302

FedHide: Federated Learning by Hiding in the Neighbors

Hyunsin Park, Sungrack Yun

ECCV 2024arXiv:2409.07808

#2303

Towards Image Ambient Lighting Normalization

Florin-Alexandru Vasluianu, Tim Seizinger, Zongwei Wu et al.

ECCV 2024arXiv:2403.18730

#2304

CadVLM: Bridging Language and Vision in the Generation of Parametric CAD Sketches

Sifan Wu, Amir Hosein Khasahmadi, Mor Katz et al.

ECCV 2024arXiv:2409.17457

#2305

DreamReward: Aligning Human Preference in Text-to-3D Generation

junliang ye, Fangfu Liu, Qixiu Li et al.

ECCV 2024

#2306

InfoNorm: Mutual Information Shaping of Normals for Sparse-View Reconstruction

Xulong Wang, Siyan Dong, Youyi Zheng et al.

ECCV 2024arXiv:2407.12661

#2307

SG-NeRF: Neural Surface Reconstruction with Scene Graph Optimization

Yiyang Chen, Siyan Dong, Xulong Wang et al.

ECCV 2024arXiv:2407.12667

#2308

Early Anticipation of Driving Maneuvers

Abdul Wasi Lone, Shankar Gangisetty, Shyam Nandan et al.

ECCV 2024

#2309

High-Quality Mesh Blendshape Generation from Face Videos via Neural Inverse Rendering

Xin Ming, Jiawei Li, Jingwang Ling et al.

ECCV 2024arXiv:2401.08398

#2310

Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression

Animesh Sinha, Bo Sun, Anmol Kalia et al.

ECCV 2024arXiv:2311.10794

#2311

Object-Aware Query Perturbation for Cross-Modal Image-Text Retrieval

Naoya Sogi, Takashi Shibata, Makoto Terao

ECCV 2024arXiv:2407.12346

#2312

Ex2Eg-MAE: A Framework for Adaptation of Exocentric Video Masked Autoencoders for Egocentric Social Role Understanding

Minh Tran, Yelin Kim, Che-Chun Su et al.

ECCV 2024

#2313

Easing 3D Pattern Reasoning with Side-view Features for Semantic Scene Completion

Linxi Huan, Mingyue Dong, Linwei Yue et al.

ECCV 2024

#2314

Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning

Thanh Thong Nguyen, Yi Bin, Xiaobao Wu et al.

ECCV 2024arXiv:2407.03788

#2315

Adaptive Multi-head Contrastive Learning

Lei Wang, Piotr Koniusz, Tom Gedeon et al.

ECCV 2024arXiv:2310.05615

#2316

Lost in Translation: Latent Concept Misalignment in Text-to-Image Diffusion Models

Juntu Zhao, Junyu Deng, Yixin Ye et al.

ECCV 2024arXiv:2408.00230

#2317

Contextual Correspondence Matters: Bidirectional Graph Matching for Video Summarization

yunzuo zhang, Yameng Liu

ECCV 2024

#2318

GRiT: A Generative Region-to-text Transformer for Object Understanding

Jialian Wu, Jianfeng Wang, Zhengyuan Yang et al.

ECCV 2024arXiv:2212.00280

#2319

LRSLAM: Low-rank Representation of Signed Distance Fields in Dense Visual SLAM System

Hongbeen Park, Minjeong Park, Giljoo Nam et al.

ECCV 2024arXiv:2506.10567

#2320

Learning Representation for Multitask Learning through Self-Supervised Auxiliary Learning

Seokwon Shin, Hyungrok Do, Youngdoo Son

ECCV 2024

#2321

Generalizing to Unseen Domains via Text-guided Augmentation

Daiqing Qi, Handong Zhao, Aidong Zhang et al.

ECCV 2024

#2322

BAGS: Blur Agnostic Gaussian Splatting through Multi-Scale Kernel Modeling

Cheng Peng, Yutao Tang, Yifan Zhou et al.

ECCV 2024arXiv:2403.04926

#2323

SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning

Bac Nguyen, Stefan Uhlich, Fabien Cardinaux et al.

ECCV 2024arXiv:2407.03036

#2324

DPA-Net: Structured 3D Abstraction from Sparse Views via Differentiable Primitive Assembly

Fenggen Yu, Yiming Qian, Xu Zhang et al.

ECCV 2024arXiv:2404.00875

#2325

An Empirical Study and Analysis of Text-to-Image Generation Using Large Language Model-Powered Textual Representation

Zhiyu Tan, Mengping Yang, Luozheng Qin et al.

ECCV 2024arXiv:2405.12914

#2326

Information Bottleneck Based Data Correction in Continual Learning

Shuai Chen, mingyi zhang, Junge Zhang et al.

ECCV 2024

#2327

Forbes: Face Obfuscation Rendering via Backpropagation Refinement Scheme

Jintae Kim, Seungwon Yang, Seong-Gyun Jeong et al.

ECCV 2024arXiv:2407.14170

#2328

Generalizable Symbolic Optimizer Learning

Xiaotian Song, Peng Zeng, Yanan Sun et al.

ECCV 2024

#2329

Scene-Conditional 3D Object Stylization and Composition

Jinghao Zhou, Tomas Jakab, Philip Torr et al.

ECCV 2024arXiv:2312.12419

#2330

Fast Point Cloud Geometry Compression with Context-based Residual Coding and INR-based Refinement

Hao Xu, Xi Zhang, Xiaolin Wu

ECCV 2024arXiv:2408.02966

#2331

On the Vulnerability of Skip Connections to Model Inversion Attacks

Jun Hao Koh, Sy-Tuyen Ho, Ngoc-Bao Nguyen et al.

ECCV 2024arXiv:2409.01696

#2332

Unsupervised Variational Translator for Bridging Image Restoration and High-Level Vision Tasks

Jiawei Wu, Zhi Jin

ECCV 2024arXiv:2408.08149

#2333

Reinforcement Learning via Auxillary Task Distillation

Abhinav Narayan Harish, Larry Heck, Josiah P Hanna et al.

ECCV 2024

#2334

Dual-Rain: Video Rain Removal using Assertive and Gentle Teachers

Tingting Chen, Beibei Lin, Yeying Jin et al.

ECCV 2024

#2335

Similarity of Neural Architectures using Adversarial Attack Transferability

Jaehui Hwang, Dongyoon Han, Byeongho Heo et al.

ECCV 2024arXiv:2210.11407

#2336

Plug and Play: A Representation Enhanced Domain Adapter for Collaborative Perception

TIANYOU LUO, Quan Yuan, Yuchen Xia et al.

ECCV 2024

#2337

Follow the Rules: Reasoning for Video Anomaly Detection with Large Language Models

Yuchen Yang, Kwonjoon Lee, Behzad Dariush et al.

ECCV 2024arXiv:2407.10299

#2338

ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation

Mengcheng Lan, Chaofeng Chen, Yiping Ke et al.

ECCV 2024arXiv:2408.04883

#2339

Robustness Preserving Fine-tuning using Neuron Importance

Guangrui Li, Rahul Duggal, Aaditya Singh et al.

ECCV 2024

#2340

A Riemannian Approach for Spatiotemporal Analysis and Generation of 4D Tree-shaped Structures

Tahmina Khanam, Mohammed Bennamoun, Guan Wang et al.

ECCV 2024arXiv:2408.12443

#2341

CLAMP-ViT: Contrastive Data-Free Learning for Adaptive Post-Training Quantization of ViTs

Akshat Ramachandran, Souvik Kundu, Tushar Krishna

ECCV 2024arXiv:2407.05266

#2342

Towards Robust Event-based Networks for Nighttime via Unpaired Day-to-Night Event Translation

Yuhwan Jeong, Hoonhee Cho, Kuk-Jin Yoon

ECCV 2024arXiv:2407.10703

#2343

E.T. the Exceptional Trajectory: Text-to-camera-trajectory generation with character awareness

Robin Courant, Nicolas Dufour, Xi WANG et al.

ECCV 2024arXiv:2407.01516

#2344

Motion Keyframe Interpolation for Any Human Skeleton using Point Cloud-based Human Motion Data Homogenisation

Clinton Mo, Kun Hu, Chengjiang Long et al.

ECCV 2024

#2345

Leveraging Text Localization for Scene Text Removal via Text-aware Masked Image Modeling

Zixiao Wang, Hongtao Xie, YuXin Wang et al.

ECCV 2024arXiv:2409.13431

#2346

Improving Hyperbolic Representations via Gromov-Wasserstein Regularization

yifei Yang, Wonjun Lee, Dongmian Zou et al.

ECCV 2024arXiv:2407.10495

#2347

SPARO: Selective Attention for Robust and Compositional Transformer Encodings for Vision

Ankit Vani, Bac Nguyen, Samuel Lavoie et al.

ECCV 2024arXiv:2404.15721

#2348

On the Topology Awareness and Generalization Performance of Graph Neural Networks

Junwei Su, Chuan Wu

ECCV 2024arXiv:2403.04482

#2349

Dense Hand-Object(HO) GraspNet with Full Grasping Taxonomy and Dynamics

Woojin Cho, Jihyun Lee, Minjae Yi et al.

ECCV 2024arXiv:2409.04033

#2350

MART: MultiscAle Relational Transformer Networks for Multi-agent Trajectory Prediction

Seongju Lee, Junseok Lee, Yeonguk Yu et al.

ECCV 2024arXiv:2407.21635

#2351

Depth-Aware Blind Image Decomposition for Real-World Adverse Weather Recovery

Chao Wang, Zhedong Zheng, Ruijie Quan et al.

ECCV 2024

#2352

DreamSampler: Unifying Diffusion Sampling and Score Distillation for Image Manipulation

Jeongsol Kim, Geon Yeong Park, Jong Chul Ye

ECCV 2024arXiv:2403.11415

#2353

Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and Retention

Zuyao Chen, Jinlin Wu, Zhen Lei et al.

ECCV 2024arXiv:2311.10988

#2354

Improving 3D Semi-supervised Learning by Effectively Utilizing All Unlabelled Data

Sneha Paul, Zachary Patterson, Nizar Bouguila

ECCV 2024arXiv:2409.13977

#2355

PreciseControl: Enhancing Text-To-Image Diffusion Models with Fine-Grained Attribute Control

Rishubh Parihar, Sachidanand VS, Sabariswaran Mani et al.

ECCV 2024arXiv:2408.05083

#2356

HVCLIP: High-dimensional Vector in CLIP for Unsupervised Domain Adaptation

Noranart Vesdapunt, Kah Kuen Fu, Yue Wu et al.

ECCV 2024

#2357

VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks

Xiangxiang Chu, Jianlin Su, Bo Zhang et al.

ECCV 2024arXiv:2403.00522

#2358

SRPose: Two-view Relative Pose Estimation with Sparse Keypoints

Rui Yin, Yulun Zhang, Zherong Pan et al.

ECCV 2024arXiv:2407.08199

#2359

Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models

Xiaoshi Wu, Yiming Hao, Manyuan Zhang et al.

ECCV 2024arXiv:2405.00760

#2360

Delving into Adversarial Robustness on Document Tampering Localization

Huiru Shao, Zhuang Qian, Kaizhu Huang et al.

ECCV 2024

#2361

Towards Dual Transparent Liquid Level Estimation in Biomedical Lab: Dataset, Methods and Practice

Xiayu Wang, Ke Ma, Ruiyun Zhong et al.

ECCV 2024

#2362

WBP: Training-time Backdoor Attacks through Hardware-based Weight Bit Poisoning

Kunbei Cai, Zhenkai Zhang, Qian Lou et al.

ECCV 2024

#2363

COM Kitchens: An Unedited Overhead-view Procedural Videos Dataset a Vision-Language Benchmark

Atsushi Hashimoto, Koki Maeda, Tosho Hirasawa et al.

ECCV 2024

#2364

Efficient Vision Transformers with Partial Attention

Xuan-Thuy Vo, Duy-Linh Nguyen, Adri Priadana et al.

ECCV 2024

#2365

Generalized Coverage for More Robust Low-Budget Active Learning

Wonho Bae, Junhyug Noh, Danica J. Sutherland

ECCV 2024arXiv:2407.12212

#2366

Learning to Distinguish Samples for Generalized Category Discovery

Fengxiang Yang, Pu Nan, Wenjing Li et al.

ECCV 2024

#2367

Kinetic Typography Diffusion Model

Seonmi Park, Inhwan Bae, Seunghyun Shin et al.

ECCV 2024arXiv:2407.10476

#2368

Loc3Diff: Local Diffusion for 3D Human Head Synthesis and Editing

Yushi Lan, Feitong Tan, Qiangeng Xu et al.

ECCV 2024

#2369

TrafficNight : An Aerial Multimodal Benchmark For Nighttime Vehicle Surveillance

Guoxing Zhang, Yiming Liu, xiaoyu yang et al.

ECCV 2024

#2370

POET: Prompt Offset Tuning for Continual Human Action Adaptation

Prachi Garg, Joseph K J, Vineeth N Balasubramanian et al.

ECCV 2024arXiv:2504.18059

#2371

R.A.C.E.: Robust Adversarial Concept Erasure for Secure Text-to-Image Diffusion Model

Changhoon Kim, Kyle Min, Yezhou Yang

ECCV 2024arXiv:2405.16341

#2372

All You Need is Your Voice: Emotional Face Representation with Audio Perspective for Emotional Talking Face Generation

Seongho Kim, Byung Cheol Song

ECCV 2024

#2373

MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning

Vishal Nedungadi, Ankit Kariryaa, Stefan Oehmcke et al.

ECCV 2024arXiv:2405.02771

#2374

BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion

Gwanghyun Kim, Hayeon Kim, Hoigi Seo et al.

ECCV 2024arXiv:2404.04544

#2375

Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time

Sanjoy Chowdhury, Sayan Nag, Subhrajyoti Dasgupta et al.

ECCV 2024arXiv:2407.01851

#2376

DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks

Sarah Jabbour, Gregory Kondas, Ella Kazerooni et al.

ECCV 2024arXiv:2407.14509

#2377

Efficient 3D-Aware Facial Image Editing via Attribute-Specific Prompt Learning

Amandeep Kumar, Muhammad Awais, Sanath Narayan et al.

ECCV 2024arXiv:2406.04413

#2378

Embracing Events and Frames with Hierarchical Feature Refinement Network for Object Detection

Hu Cao, Zehua Zhang, Yan Xia et al.

ECCV 2024arXiv:2407.12582

#2379

UL-VIO: Ultra-lightweight Visual-Inertial Odometry with Noise Robust Test-time Adaptation

Jinho Park, Se Young Chun, Mingoo Seok

ECCV 2024arXiv:2409.13106

#2380

Unsupervised Representation Learning by Balanced Self Attention Matching

Daniel Shalam, Simon Korman

ECCV 2024arXiv:2408.02014

#2381

Identity-Consistent Diffusion Network for Grading Knee Osteoarthritis Progression in Radiographic Imaging

Wenhua Wu, Kun Hu, Wenxi Yue et al.

ECCV 2024arXiv:2407.21381

#2382

Caltech Aerial RGB-Thermal Dataset in the Wild

Connor Lee, Matthew Anderson, Nikhil Ranganathan et al.

ECCV 2024arXiv:2403.08997

#2383

Make-Your-3D: Fast and Consistent Subject-Driven 3D Content Generation

Fangfu Liu, Hanyang Wang, Weiliang Chen et al.

ECCV 2024arXiv:2403.09625

#2384

EAGLES: Efficient Accelerated 3D Gaussians with Lightweight EncodingS

Sharath Girish, Kamal Gupta, Abhinav Shrivastava

ECCV 2024arXiv:2312.04564

#2385

Teach CLIP to Develop a Number Sense for Ordinal Regression

Yao DU, Qiang Zhai, Weihang Dai et al.

ECCV 2024arXiv:2408.03574

#2386

Thinking Outside the BBox: Unconstrained Generative Object Compositing

Gemma Canet Tarrés, Zhe Lin, Zhifei Zhang et al.

ECCV 2024arXiv:2409.04559

#2387

Compact 3D Scene Representation via Self-Organizing Gaussian Grids

Wieland Morgenstern, Florian Barthel, Anna Hilsmann et al.

ECCV 2024arXiv:2312.13299

← Previous

1...10 11 12