Most Cited ECCV "capsule networks" Papers

2,387 papers found • Page 8 of 12

Filters:Most Cited ECCV capsule networks Clear all

Conference

AAAI 2025 (3,028)COLM 2025 (418)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NEURIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,558)oral (1,594)spotlight (1,421)highlight (975)

#1401

VISAGE: Video Instance Segmentation with Appearance-Guided Enhancement

Hanjung Kim, Jaehyun Kang, Miran Heo et al.

ECCV 2024arXiv:2312.04885

citations

#1402

MeshVPR: Citywide Visual Place Recognition Using 3D Meshes

Gabriele Berton, Lorenz Junglas, Riccardo Zaccone et al.

ECCV 2024arXiv:2406.02776

citations

#1403

AvatarPose: Avatar-guided 3D Pose Estimation of Close Human Interaction from Sparse Multi-view Videos

Feichi Lu, Zijian Dong, Jie Song et al.

ECCV 2024arXiv:2408.02110

citations

#1404

Temporal Residual Jacobians for Rig-free Motion Transfer

Sanjeev Muralikrishnan, Niladri Shekhar Dutt, Siddhartha Chaudhuri et al.

ECCV 2024arXiv:2407.14958

citations

#1405

CONDA: Condensed Deep Association Learning for Co-Salient Object Detection.

Long Li, Nian Liu, Dingwen Zhang et al.

ECCV 2024arXiv:2409.01021

citations

#1406

Self-Supervised Audio-Visual Soundscape Stylization

Tingle Li, Renhao Wang, Po-Yao Huang et al.

ECCV 2024arXiv:2409.14340

citations

#1407

Affective Visual Dialog: A Large-Scale Benchmark for Emotional Reasoning Based on Visually Grounded Conversations

KILICHBEK HAYDAROV, Xiaoqian Shen, Avinash Madasu et al.

ECCV 2024arXiv:2308.16349

citations

#1408

Domain Generalization of 3D Object Detection by Density-Resampling

Shuangzhi Li, Lei Ma, Xingyu Li

ECCV 2024arXiv:2311.10845

citations

#1409

From Fake to Real: Pretraining on Balanced Synthetic Images to Prevent Spurious Correlations in Image Recognition

Maan Qraitem, Kate Saenko, Bryan Plummer

ECCV 2024arXiv:2308.04553

citations

#1410

SemReg: Semantics Constrained Point Cloud Registration

Sheldon Fung, Xuequan Lu, Dasith de Silva Edirimuni et al.

ECCV 2024

citations

#1411

OneTrack: Demystifying the Conflict Between Detection and Tracking in End-to-End 3D Trackers

Qitai Wang, Jiawei He, Yuntao Chen et al.

ECCV 2024

citations

#1412

MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection

Youngmin Oh, Hyung-Il Kim, Seong Tae Kim et al.

ECCV 2024arXiv:2407.16448

citations

#1413

Agglomerative Token Clustering

Joakim Bruslund Haurum, Sergio Escalera, Graham W. Taylor et al.

ECCV 2024arXiv:2409.11923

citations

#1414

Fast Encoding and Decoding for Implicit Video Representation

Hao Chen, Saining Xie, Ser-Nam Lim et al.

ECCV 2024arXiv:2409.19429

citations

#1415

Learn to Memorize and to Forget: A Continual Learning Perspective of Dynamic SLAM

Baicheng Li, Zike Yan, Dong Wu et al.

ECCV 2024arXiv:2407.13338

citations

#1416

The Devil is in the Statistics: Mitigating and Exploiting Statistics Difference for Generalizable Semi-supervised Medical Image Segmentation

Muyang Qiu, Jian Zhang, Lei Qi et al.

ECCV 2024arXiv:2407.11356

citations

#1417

Generalized Coverage for More Robust Low-Budget Active Learning

Wonho Bae, Junhyug Noh, Danica J. Sutherland

ECCV 2024arXiv:2407.12212

citations

#1418

Enhancing Tampered Text Detection through Frequency Feature Fusion and Decomposition

Zhongxi Chen, Shen Chen, Taiping Yao et al.

ECCV 2024

citations

#1419

Any Target Can be Offense: Adversarial Example Generation via Generalized Latent Infection

Youheng Sun, Shengming Yuan, Xuanhan Wang et al.

ECCV 2024arXiv:2407.12292

citations

#1420

Appearance-based Refinement for Object-Centric Motion Segmentation

Junyu Xie, Weidi Xie, Andrew ZISSERMAN

ECCV 2024arXiv:2312.11463

citations

#1421

Vision-Language Dual-Pattern Matching for Out-of-Distribution Detection

Zihan Zhang, Zhuo Xu, Xiang Xiang

ECCV 2024

citations

#1422

Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment

Yuxiao Chen, Kai Li, Wentao Bao et al.

ECCV 2024arXiv:2409.16145

citations

#1423

Modelling Competitive Behaviors in Autonomous Driving Under Generative World Model

Guanren Qiao, Guiliang Liu, Guorui Quan et al.

ECCV 2024

citations

#1424

DySeT: a Dynamic Masked Self-distillation Approach for Robust Trajectory Prediction

MOZHGAN POURKESHAVARZ, Arielle Zhang, Amir Rasouli

ECCV 2024

citations

#1425

Semantically Guided Representation Learning For Action Anticipation

Anxhelo Diko, Danilo Avola, Bardh Prenkaj et al.

ECCV 2024arXiv:2407.02309

citations

#1426

Rawformer: Unpaired Raw-to-Raw Translation for Learnable Camera ISPs

Georgy Perevozchikov, Nancy Mehta, Mahmoud Afifi et al.

ECCV 2024arXiv:2404.10700

citations

#1427

Understanding Physical Dynamics with Counterfactual World Modeling

Rahul Mysore Venkatesh, Honglin Chen, Kevin Feigelis et al.

ECCV 2024arXiv:2312.06721

citations

#1428

Spherical World-Locking for Audio-Visual Localization in Egocentric Videos

Heeseung Yun, Ruohan Gao, Ishwarya Ananthabhotla et al.

ECCV 2024arXiv:2408.05364

citations

#1429

Improving Domain Generalization in Self-Supervised Monocular Depth Estimation via Stabilized Adversarial Training

Yuanqi Yao, Gang Wu, Kui Jiang et al.

ECCV 2024arXiv:2411.02149

citations

#1430

PoseEmbroider: Towards a 3D, Visual, Semantic-aware Human Pose Representation

Ginger Delmas, Philippe Weinzaepfel, Francesc Moreno et al.

ECCV 2024arXiv:2409.06535

citations

#1431

Shedding More Light on Robust Classifiers under the lens of Energy-based Models

Mujtaba Hussain Mirza, Maria Rosaria Briglia, Senad Beadini et al.

ECCV 2024arXiv:2407.06315

citations

#1432

Generating 3D House Wireframes with Semantics

Xueqi Ma, Yilin Liu, Wenjun Zhou et al.

ECCV 2024arXiv:2407.12267

citations

#1433

Token Compensator: Altering Inference Cost of Vision Transformer without Re-Tuning

Shibo Jie, Yehui Tang, Jianyuan Guo et al.

ECCV 2024arXiv:2408.06798

citations

#1434

Synergy of Sight and Semantics: Visual Intention Understanding with CLIP

Qu Yang, Mang Ye, Dacheng Tao

ECCV 2024

citations

#1435

SCPNet: Unsupervised Cross-modal Homography Estimation via Intra-modal Self-supervised Learning

Runmin Zhang, Jun Ma, Lun Luo et al.

ECCV 2024arXiv:2407.08148

citations

#1436

Synchronous Diffusion for Unsupervised Smooth Non-Rigid 3D Shape Matching

Dongliang Cao, Zorah Laehner, Florian Bernard

ECCV 2024arXiv:2407.08244

citations

#1437

De-confounded Gaze Estimation

Ziyang Liang, Yiwei Bao, Feng Lu

ECCV 2024

citations

#1438

GenQ: Quantization in Low Data Regimes with Generative Synthetic Data

YUHANG LI, Youngeun Kim, Donghyun Lee et al.

ECCV 2024arXiv:2312.05272

citations

#1439

Parameter-Efficient and Memory-Efficient Tuning for Vision Transformer: A Disentangled Approach

Taolin Zhang, Jiawang Bai, Zhihe Lu et al.

ECCV 2024arXiv:2407.06964

citations

#1440

MAP-ADAPT: Real-Time Quality-Adaptive Semantic 3D Maps

Jianhao Zheng, Daniel Barath, Marc Pollefeys et al.

ECCV 2024arXiv:2406.05849

citations

#1441

MedRAT: Unpaired Medical Report Generation via Auxiliary Tasks

Elad Hirsch, Gefen Dawidowicz, Ayellet Tal

ECCV 2024arXiv:2407.03919

citations

#1442

DreamView: Injecting View-specific Text Guidance into Text-to-3D Generation

Junkai Yan, Yipeng Gao, Qize Yang et al.

ECCV 2024arXiv:2404.06119

citations

#1443

Scene-aware Human Motion Forecasting via Mutual Distance Prediction

Chaoyue Xing, Wei Mao, Miaomiao LIU

ECCV 2024arXiv:2310.00615

citations

#1444

PDiscoFormer: Relaxing Part Discovery Constraints with Vision Transformers

Ananthu Aniraj, Cassio F. Dantas, Dino Ienco et al.

ECCV 2024arXiv:2407.04538

citations

#1445

CMD: A Cross Mechanism Domain Adaptation Dataset for 3D Object Detection

Jinhao Deng, Wei Ye, Hai Wu et al.

ECCV 2024

citations

#1446

Bidirectional Uncertainty-Based Active Learning for Open-Set Annotation

ChenChen Zong, Ye-Wen Wang, Kun-Peng Ning et al.

ECCV 2024arXiv:2402.15198

citations

#1447

EAFormer: Scene Text Segmentation with Edge-Aware Transformers

Haiyang Yu, Teng Fu, Bin Li et al.

ECCV 2024arXiv:2407.17020

citations

#1448

ProDepth: Boosting Self-Supervised Multi-Frame Monocular Depth with Probabilistic Fusion

Sungmin Woo, Wonjoon Lee, Woo Jin Kim et al.

ECCV 2024arXiv:2407.09303

citations

#1449

X-InstructBLIP: A Framework for Aligning Image, 3D, Audio, Video to LLMs and its Emergent Cross-modal Reasoning

Artemis Panagopoulou, Le Xue, Ning Yu et al.

ECCV 2024

citations

#1450

ADMap: Anti-disturbance Framework for Vectorized HD Map Construction

Haotian Hu, Fanyi Wang, Yaonong Wang et al.

ECCV 2024

citations

#1451

This Probably Looks Exactly Like That: An Invertible Prototypical Network

Zachariah Carmichael, Timothy Redgrave, Daniel Gonzalez Cedre et al.

ECCV 2024arXiv:2407.12200

citations

#1452

3D-GOI: 3D GAN Omni-Inversion for Multifaceted and Multi-object Editing

Haoran Li, Long Ma, Haolin Shi et al.

ECCV 2024arXiv:2311.12050

citations

#1453

Edge-Guided Fusion and Motion Augmentation for Event-Image Stereo

Fengan Zhao, Qianang Zhou, Junlin Xiong

ECCV 2024

citations

#1454

Pose-Aware Self-Supervised Learning with Viewpoint Trajectory Regularization

Jiayun Wang, Yubei Chen, Stella Yu

ECCV 2024arXiv:2403.14973

citations

#1455

ML-SemReg: Boosting Point Cloud Registration with Multi-level Semantic Consistency

Shaocheng Yan, Pengcheng Shi, Jiayuan Li

ECCV 2024arXiv:2407.09862

citations

#1456

Operational Open-Set Recognition and PostMax Refinement

Steve Cruz, Ryan Rabinowitz, Manuel Günther et al.

ECCV 2024

citations

#1457

Rethinking LiDAR Domain Generalization: Single Source as Multiple Density Domains

Jaeyeul Kim, Jungwan Woo, Jeonghoon Kim et al.

ECCV 2024arXiv:2312.12098

citations

#1458

DECOLLAGE: 3D Detailization by Controllable, Localized, and Learned Geometry Enhancement

Qimin Chen, Zhiqin Chen, Vladimir Kim et al.

ECCV 2024arXiv:2409.06129

citations

#1459

DA-BEV: Unsupervised Domain Adaptation for Bird's Eye View Perception

Kai Jiang, Jiaxing Huang, Weiying Xie et al.

ECCV 2024arXiv:2401.08687

citations

#1460

Investigating Style Similarity in Diffusion Models

Gowthami Somepalli, Anubhav Anubhav, Kamal Gupta et al.

ECCV 2024

citations

#1461

Occlusion-Aware Seamless Segmentation

Yihong Cao, Jiaming Zhang, Hao Shi et al.

ECCV 2024arXiv:2407.02182

citations

#1462

Augmented Neural Fine-tuning for Efficient Backdoor Purification

Md Nazmul Karim, Abdullah Al Arafat, Umar Khalid et al.

ECCV 2024arXiv:2407.10052

citations

#1463

Scalable Group Choreography via Variational Phase Manifold Learning

Nhat Le, Khoa Do, Xuan Bui et al.

ECCV 2024arXiv:2407.18839

citations

#1464

Bi-directional Contextual Attention for 3D Dense Captioning

Minjung Kim, Hyung Suk Lim, Soonyoung Lee et al.

ECCV 2024arXiv:2408.06662

citations

#1465

Topo4D: Topology-Preserving Gaussian Splatting for High-Fidelity 4D Head Capture

Xuanchen Li, Yuhao Cheng, Xingyu Ren et al.

ECCV 2024arXiv:2406.00440

citations

#1466

Towards High-Quality 3D Motion Transfer with Realistic Apparel Animation

Rong Wang, Wei Mao, Changsheng Lu et al.

ECCV 2024arXiv:2407.11266

citations

#1467

DEVIAS: Learning Disentangled Video Representations of Action and Scene

Kyungho Bae, Youngrae Kim, Geo Ahn et al.

ECCV 2024arXiv:2312.00826

citations

#1468

Noise-assisted Prompt Learning for Image Forgery Detection and Localization

Dong Li, Jiaying Zhu, Xueyang Fu et al.

ECCV 2024

citations

#1469

Finding NeMo: Negative-mined Mosaic Augmentation for Referring Image Segmentation

Seongsu Ha, Chaeyun Kim, Donghwa Kim et al.

ECCV 2024arXiv:2411.01494

citations

#1470

LMT-GP: Combined Latent Mean-Teacher and Gaussian Process for Semi-supervised Low-light Image Enhancement

Ye Yu, Fengxin Chen, Jun Yu et al.

ECCV 2024arXiv:2408.16235

citations

#1471

Viewpoint textual inversion: discovering scene representations and 3D view control in 2D diffusion models

James Burgess, Kuan-Chieh Wang, Serena Yeung-Levy

ECCV 2024arXiv:2309.07986

citations

#1472

SPIN: Hierarchical Segmentation with Subpart Granularity in Natural Images

josh myers-dean, Jarek T Reynolds, Brian Price et al.

ECCV 2024arXiv:2407.09686

citations

#1473

Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression

Animesh Sinha, Bo Sun, Anmol Kalia et al.

ECCV 2024arXiv:2311.10794

citations

#1474

High-Quality Mesh Blendshape Generation from Face Videos via Neural Inverse Rendering

Xin Ming, Jiawei Li, Jingwang Ling et al.

ECCV 2024arXiv:2401.08398

citations

#1475

Delving Deep into Engagement Prediction of Short Videos

dasong Li, Wenjie Li, Baili Lu et al.

ECCV 2024arXiv:2410.00289

citations

#1476

Quanta Video Restoration

PRATEEK CHENNURI, Yiheng Chi, Enze Jiang et al.

ECCV 2024arXiv:2410.14994

citations

#1477

Adapt without Forgetting: Distill Proximity from Dual Teachers in Vision-Language Models

MENGYU ZHENG, Yehui Tang, Zhiwei Hao et al.

ECCV 2024

citations

#1478

Towards Open-World Object-based Anomaly Detection via Self-Supervised Outlier Synthesis

Brian Isaac Medina, Yona Falinie Abdul Gaus, Neelanjan Bhowmik et al.

ECCV 2024arXiv:2407.15763

citations

#1479

Efficient Bias Mitigation Without Privileged Information

Mateo Espinosa Zarlenga, Sankaranarayanan, Jerone Andrews et al.

ECCV 2024arXiv:2409.17691

citations

#1480

WaSt-3D: Wasserstein-2 Distance for Scene-to-Scene Stylization on 3D Gaussians

Dmytro Kotovenko, Olga Grebenkova, Nikolaos Sarafianos et al.

ECCV 2024arXiv:2409.17917

citations

#1481

Watching it in Dark: A Target-aware Representation Learning Framework for High-Level Vision Tasks in Low Illumination

Yunan LI, Yihao Zhang, Shoude Li et al.

ECCV 2024

citations

#1482

Risk-Aware Self-Consistent Imitation Learning for Trajectory Planning in Autonomous Driving

Yixuan Fan, Ya-Li Li, Shengjin Wang

ECCV 2024

citations

#1483

Co-synthesis of Histopathology Nuclei Image-Label Pairs using a Context-Conditioned Joint Diffusion Model

Seonghui Min, Hyun-Jic Oh, Won-Ki Jeong

ECCV 2024arXiv:2407.14434

citations

#1484

Two-Stage Active Learning for Efficient Temporal Action Segmentation

Yuhao Su, Ehsan Elhamifar

ECCV 2024

citations

#1485

Adaptive Multi-task Learning for Few-shot Object Detection

Yan Ren, Yanling Li, Wai-Kin Adams Kong

ECCV 2024

citations

#1486

Temporal Residual Guided Diffusion Framework for Event-Driven Video Reconstruction

Lin Zhu, Yunlong Zheng, Yijun Zhang et al.

ECCV 2024arXiv:2407.10636

citations

#1487

Improving image synthesis with diffusion-negative sampling

Alakh Desai, Nuno Vasconcelos

ECCV 2024arXiv:2411.05473

citations

#1488

LLMCO4MR: LLMs-aided Neural Combinatorial Optimization for Ancient Manuscript Restoration from Fragments with Case Studies on Dunhuang

Yuqing Zhang, Hangqi Li, Shengyu Zhang et al.

ECCV 2024

citations

#1489

LiDAR-Event Stereo Fusion with Hallucinations

Luca Bartolomei, Matteo Poggi, Andrea Conti et al.

ECCV 2024arXiv:2408.04633

citations

#1490

HARIVO: Harnessing Text-to-Image Models for Video Generation

Mingi Kwon, Seoung Wug Oh, Yang Zhou et al.

ECCV 2024arXiv:2410.07763

citations

#1491

AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation

Jiannan Ge, Lingxi Xie, Hongtao Xie et al.

ECCV 2024arXiv:2404.05667

citations

#1492

Weak-to-Strong Compositional Learning from Generative Models for Language-based Object Detection

Kwanyong Park, Kuniaki Saito, Donghyun Kim

ECCV 2024arXiv:2407.15296

citations

#1493

Event Trojan: Asynchronous Event-based Backdoor Attacks

Ruofei Wang, Qing Guo, Haoliang Li et al.

ECCV 2024arXiv:2407.06838

citations

#1494

VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding

Ofir Abramovich, Niv Nayman, Sharon Fogel et al.

ECCV 2024arXiv:2407.12594

citations

#1495

STSP: Spatial-Temporal Subspace Projection for Video Class-incremental Learning

Hao CHENG, SIYUAN YANG, Chong Wang et al.

ECCV 2024

citations

#1496

SPVLoc: Semantic Panoramic Viewport Matching for 6D Camera Localization in Unseen Environments

Niklas Gard, Anna Hilsmann, Peter Eisert

ECCV 2024arXiv:2404.10527

citations

#1497

Efficient Diffusion-Driven Corruption Editor for Test-Time Adaptation

Yeongtak Oh, Jonghyun Lee, Jooyoung Choi et al.

ECCV 2024arXiv:2403.10911

citations

#1498

PRET: Planning with Directed Fidelity Trajectory for Vision and Language Navigation

Renjie Lu, Jing-Ke Meng, WEISHI ZHENG

ECCV 2024arXiv:2407.11487

citations

#1499

Long-range Turbulence Mitigation: A Large-scale Dataset and A Coarse-to-fine Framework

Shengqi Xu, Run Sun, Yi Chang et al.

ECCV 2024arXiv:2407.08377

citations

#1500

Versatile Incremental Learning: Towards Class and Domain-Agnostic Incremental Learning

Minyeong Park, Jae-Ho Lee, Gyeong-Moon Park

ECCV 2024arXiv:2409.10956

citations

#1501

PointRegGPT: Boosting 3D Point Cloud Registration using Generative Point-Cloud Pairs for Training

SUYI CHEN, Hao Xu, Haipeng Li et al.

ECCV 2024arXiv:2407.14054

citations

#1502

FastCAD: Real-Time CAD Retrieval and Alignment from Scans and Videos

Florian Langer, Jihong Ju, Georgi Dikov et al.

ECCV 2024arXiv:2403.15161

citations

#1503

MERLiN: Single-Shot Material Estimation and Relighting for Photometric Stereo

Ashish Tiwari, Satoshi Ikehata, Shanmuganathan Raman

ECCV 2024arXiv:2409.00674

citations

#1504

E3M: Zero-Shot Spatio-Temporal Video Grounding with Expectation-Maximization Multimodal Modulation

Peijun Bao, Zihao Shao, Wenhan Yang et al.

ECCV 2024

citations

#1505

Learning to Enhance Aperture Phasor Field for Non-Line-of-Sight Imaging

In Cho, Hyunbo Shim, Seon Joo Kim

ECCV 2024arXiv:2407.18574

citations

#1506

Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas

Fabio Quattrini, Vittorio Pippi, Silvia Cascianelli et al.

ECCV 2024arXiv:2408.15660

citations

#1507

UDA-Bench: Revisiting Common Assumptions in Unsupervised Domain Adaptation Using a Standardized Framework

Tarun Kalluri, Sreyas Ravichandran, Manmohan Chandraker

ECCV 2024arXiv:2409.15264

citations

#1508

Hierarchical Conditioning of Diffusion Models Using Tree-of-Life for Studying Species Evolution

Mridul Khurana, Arka Daw, M. Maruf et al.

ECCV 2024arXiv:2408.00160

citations

#1509

Constructing Concept-based Models to Mitigate Spurious Correlations with Minimal Human Effort

Jeeyung Kim, Ze Wang, Qiang Qiu

ECCV 2024arXiv:2407.08947

citations

#1510

Tight and Efficient Upper Bound on Spectral Norm of Convolutional Layers

Ekaterina Grishina, Mikhail Gorbunov, Maxim Rakhuba

ECCV 2024arXiv:2409.11859

citations

#1511

AlignDiff: Aligning Diffusion Models for General Few-Shot Segmentation

Ri-Zhao Qiu, Yu-Xiong Wang, Kris Hauser

ECCV 2024

citations

#1512

Geometry Fidelity for Spherical Images

Anders Christensen, Nooshin Mojab, Khushman Patel et al.

ECCV 2024arXiv:2407.18207

citations

#1513

DHR: Dual Features-Driven Hierarchical Rebalancing in Inter- and Intra-Class Regions for Weakly-Supervised Semantic Segmentation

Sanghyun Jo, Fei Pan, In-Jae Yu et al.

ECCV 2024arXiv:2404.00380

citations

#1514

VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition

Ahmad Khaliq, Ming Xu, Stephen Hausler et al.

ECCV 2024arXiv:2409.19293

citations

#1515

Better Regression Makes Better Test-time Adaptive 3D Object Detection

Jiakang Yuan, Bo Zhang, Kaixiong Gong et al.

ECCV 2024

citations

#1516

Test-Time Stain Adaptation with Diffusion Models for Histopathology Image Classification

Cheng-Chang Tsai, Yuan-Chih Chen, Chun-Shien Lu

ECCV 2024

citations

#1517

Discovering Novel Actions from Open World Egocentric Videos with Object-Grounded Visual Commonsense Reasoning

Sanjoy Kundu, Shubham Trehan, Sathyanarayanan Aakur

ECCV 2024arXiv:2305.16602

citations

#1518

Six-Point Method for Multi-Camera Systems with Reduced Solution Space

Banglei Guan, Ji Zhao, Laurent Kneip

ECCV 2024arXiv:2402.18066

citations

#1519

Interleaving One-Class and Weakly-Supervised Models with Adaptive Thresholding for Unsupervised Video Anomaly Detection

Yongwei Nie, Hao Huang, Chengjiang Long et al.

ECCV 2024arXiv:2401.13551

citations

#1520

LiDAR-based All-weather 3D Object Detection via Prompting and Distilling 4D Radar

Yujeong Chae, HYEONSEONG KIM, Changgyoon Oh et al.

ECCV 2024

citations

#1521

Pseudo-keypoint RKHS Learning for Self-supervised 6DoF Pose Estimation

Yangzheng Wu, Michael Alan Greenspan

ECCV 2024arXiv:2311.09500

citations

#1522

Domain-Adaptive 2D Human Pose Estimation via Dual Teachers in Extremely Low-Light Conditions

Yihao Ai, Yifei Qi, Bo Wang et al.

ECCV 2024arXiv:2407.15451

citations

#1523

Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation

Yuwen Pan, Rui Sun, Naisong Luo et al.

ECCV 2024arXiv:2408.13838

citations

#1524

Smoothness, Synthesis, and Sampling: Re-thinking Unsupervised Multi-View Stereo with DIV Loss

Alex Rich, Noah Stier, Pradeep Sen et al.

ECCV 2024

citations

#1525

RS-NeRF: Neural Radiance Fields from Rolling Shutter Images

Muyao Niu, Tong Chen, Yifan Zhan et al.

ECCV 2024arXiv:2407.10267

citations

#1526

MasterWeaver: Taming Editability and Face Identity for Personalized Text-to-Image Generation

Yuxiang WEI, Zhilong Ji, Jinfeng Bai et al.

ECCV 2024arXiv:2405.05806

citations

#1527

Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object Detection

Zhili Chen, Shuangjie Xu, Maosheng Ye et al.

ECCV 2024arXiv:2407.15354

citations

#1528

Fine-Grained Scene Graph Generation via Sample-Level Bias Prediction

Yansheng Li, Tingzhu Wang, Kang Wu et al.

ECCV 2024arXiv:2407.19259

citations

#1529

DiffPMAE: Diffusion Masked Autoencoders for Point Cloud Reconstruction

YANLONG LI, Chamara Madarasingha, Kanchana Thilakarathna

ECCV 2024arXiv:2312.03298

citations

#1530

SCOMatch: Alleviating Overtrusting in Open-set Semi-supervised Learning

ZERUN WANG, Liuyu Xiang, Lang Huang et al.

ECCV 2024arXiv:2409.17512

citations

#1531

Spike-Temporal Latent Representation for Energy-Efficient Event-to-Video Reconstruction

Jianxiong Tang, Jian-Huang Lai, Lingxiao Yang et al.

ECCV 2024

citations

#1532

Personalized Privacy Protection Mask Against Unauthorized Facial Recognition

Ka Ho Chow, Sihao Hu, Tiansheng Huang et al.

ECCV 2024arXiv:2407.13975

citations

#1533

Reprojection Errors as Prompts for Efficient Scene Coordinate Regression

Ting-Ru Liu, Hsuan-Kung Yang, Jou-Min Liu et al.

ECCV 2024arXiv:2409.04178

citations

#1534

Feature Diversification and Adaptation for Federated Domain Generalization

Seunghan Yang, Seokeon Choi, Hyunsin Park et al.

ECCV 2024arXiv:2407.08245

citations

#1535

Fundamental Matrix Estimation Using Relative Depths

Yaqing Ding, Václav Vávra, Snehal Bhayani et al.

ECCV 2024

citations

#1536

Are Synthetic Data Useful for Egocentric Hand-Object Interaction Detection?

Rosario Leonardi, Antonino Furnari, Francesco Ragusa et al.

ECCV 2024arXiv:2312.02672

citations

#1537

Self-Cooperation Knowledge Distillation for Novel Class Discovery

Yuzheng Wang, Zhaoyu Chen, Dingkang Yang et al.

ECCV 2024arXiv:2407.01930

citations

#1538

A Diffusion Model for Simulation Ready Coronary Anatomy with Morpho-skeletal Control

Karim Kadry, Shreya Gupta, Jonas Sogbadji et al.

ECCV 2024arXiv:2407.15631

citations

#1539

Adapt2Reward: Adapting Video-Language Models to Generalizable Robotic Rewards via Failure Prompts

Yanting Yang, Minghao Chen, Qibo Qiu et al.

ECCV 2024arXiv:2407.14872

citations

#1540

AddMe: Zero-shot Group-photo Synthesis by Inserting People into Scenes

Dongxu Yue, Maomao Li, Yunfei Liu et al.

ECCV 2024

citations

#1541

Rejection Sampling IMLE: Designing Priors for Better Few-Shot Image Synthesis

Chirag Vashist, Shichong Peng, Ke Li

ECCV 2024arXiv:2409.17439

citations

#1542

Online Continuous Generalized Category Discovery

Keon-Hee Park, Hakyung Lee, Kyungwoo Song et al.

ECCV 2024arXiv:2408.13492

citations

#1543

Instance-dependent Noisy-label Learning with Graphical Model Based Noise-rate Estimation

Arpit Garg, Cuong Cao Nguyen, RAFAEL FELIX et al.

ECCV 2024arXiv:2305.19486

citations

#1544

Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning

Jihai Zhang, Xiang Lan, Xiaoye Qu et al.

ECCV 2024arXiv:2402.11816

citations

#1545

cDP-MIL: Robust Multiple Instance Learning via Cascaded Dirichlet Process

Yihang Chen, TSAI HOR CHAN, Guosheng Yin et al.

ECCV 2024arXiv:2407.11448

citations

#1546

RaFE: Generative Radiance Fields Restoration

Zhongkai Wu, Ziyu Wan, Jing Zhang et al.

ECCV 2024arXiv:2404.03654

citations

#1547

DECap: Towards Generalized Explicit Caption Editing via Diffusion Mechanism

Zhen Wang, Xinyun Jiang, Jun Xiao et al.

ECCV 2024arXiv:2311.14920

citations

#1548

SelfGeo: Self-supervised and Geodesic-consistent Estimation of Keypoints on Deformable Shapes

Mohammad Zohaib, Luca Cosmo, Alessio Del Bue

ECCV 2024arXiv:2408.02291

citations

#1549

On the Evaluation Consistency of Attribution-based Explanations

Jiarui Duan, Haoling Li, Haofei Zhang et al.

ECCV 2024arXiv:2407.19471

citations

#1550

Open-World Dynamic Prompt and Continual Visual Representation Learning

Youngeun Kim, Jun Fang, Qin Zhang et al.

ECCV 2024arXiv:2409.05312

citations

#1551

Text-Anchored Score Composition: Tackling Condition Misalignment in Text-to-Image Diffusion Models

Luozhou Wang, Guibao Shen, Wenhang Ge et al.

ECCV 2024arXiv:2306.14408

citations

#1552

ProSub: Probabilistic Open-Set Semi-Supervised Learning with Subspace-Based Out-of-Distribution Detection

Erik Wallin, Lennart Svensson, Fredrik Kahl et al.

ECCV 2024arXiv:2407.11735

citations

#1553

Learning Dual-Level Deformable Implicit Representation for Real-World Scale Arbitrary Super-Resolution

Zhiheng Li, Muheng Li, Jixuan Fan et al.

ECCV 2024arXiv:2403.10925

citations

#1554

Open Vocabulary Multi-Label Video Classification

Rohit Gupta, Mamshad Nayeem Rizve, Jayakrishnan Unnikrishnan et al.

ECCV 2024arXiv:2407.09073

citations

#1555

Pre-trained Visual Dynamics Representations for Efficient Policy Learning

Hao Luo, Bohan Zhou, Zongqing Lu

ECCV 2024arXiv:2411.03169

citations

#1556

FTBC: Forward Temporal Bias Correction for Optimizing ANN-SNN Conversion

Xiaofeng Wu, Velibor Bojkovic, Bin Gu et al.

ECCV 2024arXiv:2403.18388

citations

#1557

A Fair Ranking and New Model for Panoptic Scene Graph Generation

Julian Lorenz, Alexander Pest, Daniel Kienzle et al.

ECCV 2024arXiv:2407.09216

citations

#1558

EgoPoseFormer: A Simple Baseline for Stereo Egocentric 3D Human Pose Estimation

Chenhongyi Yang, Anastasia Tkach, Shreyas Hampali et al.

ECCV 2024arXiv:2403.18080

citations

#1559

Boost Your NeRF: A Model-Agnostic Mixture of Experts Framework for High Quality and Efficient Rendering

Francesco Di Sario, Riccardo Renzulli, Marco Grangetto et al.

ECCV 2024arXiv:2407.10389

citations

#1560

Semi-Supervised Teacher-Reference-Student Architecture for Action Quality Assessment

Wulian Yun, Mengshi Qi, Fei Peng et al.

ECCV 2024arXiv:2407.19675

citations

#1561

Learning from the Web: Language Drives Weakly-Supervised Incremental Learning for Semantic Segmentation

Chang Liu, Giulia Rizzoli, Pietro Zanuttigh et al.

ECCV 2024arXiv:2407.13363

citations

#1562

FRI-Net: Floorplan Reconstruction via Room-wise Implicit Representation

Honghao Xu, Juzhan Xu, Zeyu Huang et al.

ECCV 2024arXiv:2407.10687

citations

#1563

Adaptive Multi-modal Fusion of Spatially Variant Kernel Refinement with Diffusion Model for Blind Image Super-Resolution

Junxiong Lin, Yan Wang, Zeng Tao et al.

ECCV 2024arXiv:2403.05808

citations

#1564

Towards Stable 3D Object Detection

Jiabao Wang, Qiang Meng, Guochao Liu et al.

ECCV 2024arXiv:2407.04305

citations

#1565

LiveHPS++: Robust and Coherent Motion Capture in Dynamic Free Environment

Yiming Ren, Xiao Han, Yichen Yao et al.

ECCV 2024arXiv:2407.09833

citations

#1566

VSViG: Real-time Video-based Seizure Detection via Skeleton-based Spatiotemporal ViG

Yankun Xu, Junzhe Wang, Yun-Hsuan Chen et al.

ECCV 2024arXiv:2311.14775

citations

#1567

GlobalPointer: Large-Scale Plane Adjustment with Bi-Convex Relaxation

Bangyan Liao, Zhenjun Zhao, Lu Chen et al.

ECCV 2024arXiv:2407.13537

citations

#1568

Bucketed Ranking-based Losses for Efficient Training of Object Detectors

Feyza Yavuz, Baris Can Cam, Adnan Harun Dogan et al.

ECCV 2024arXiv:2407.14204

citations

#1569

HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization

Sakib Reza, Yuexi Zhang, Mohsen Moghaddam et al.

ECCV 2024arXiv:2408.06437

citations

#1570

DATENeRF: Depth-Aware Text-based Editing of NeRFs

Sara Rojas Martinez, Julien Philip, Kai Zhang et al.

ECCV 2024arXiv:2404.04526

citations

#1571

Revisiting Calibration of Wide-Angle Radially Symmetric Cameras

Andrea Porfiri Dal Cin, Francesco Azzoni, Giacomo Boracchi et al.

ECCV 2024

citations

#1572

Event-based Mosaicing Bundle Adjustment

Shuang Guo, Guillermo Gallego

ECCV 2024arXiv:2409.07365

citations

#1573

SuperFedNAS: Cost-Efficient Federated Neural Architecture Search for On-Device Inference

Alind Khare, Animesh Agrawal, Aditya Annavajjala et al.

ECCV 2024arXiv:2301.10879

citations

#1574

Local and Global Flatness for Federated Domain Generalization

Hao Yan, Yuhong Guo

ECCV 2024

citations

#1575

TPA3D: Triplane Attention for Fast Text-to-3D Generation

Bin-Shih Wu, HONG-EN CHEN, Sheng-Yu Huang et al.

ECCV 2024arXiv:2312.02647

citations

#1576

EgoBody3M: Egocentric Body Tracking on a VR Headset using a Diverse Dataset

Amy Zhao, Chengcheng Tang, Lezi Wang et al.

ECCV 2024

citations

#1577

Correspondence-Free SE(3) Point Cloud Registration in RKHS via Unsupervised Equivariant Learning

Ray Zhang, Zheming Zhou, Min Sun et al.

ECCV 2024arXiv:2407.20223

citations

#1578

Unified Medical Image Pre-training in Language-Guided Common Semantic Space

Xiaoxuan He, Yifan Yang, Xinyang Jiang et al.

ECCV 2024arXiv:2311.14851

citations

#1579

Understanding Multi-compositional learning in Vision and Language models via Category Theory

Sotirios Panagiotis Takis Chytas, Hyunwoo J. Kim, Vikas Singh

ECCV 2024

citations

#1580

Comprehensive Attribution: Inherently Explainable Vision Model with Feature Detector

Xianren Zhang, Dongwon Lee, Suhang Wang

ECCV 2024arXiv:2407.19308

citations

#1581

Spatial-Temporal Multi-level Association for Video Object Segmentation

Deshui Miao, Xin Li, Zhenyu He et al.

ECCV 2024arXiv:2404.06265

citations

#1582

GAReT: Cross-view Video Geolocalization with Adapters and Auto-Regressive Transformers

Manu S Pillai, Mamshad Nayeem Rizve, Shah Mubarak

ECCV 2024arXiv:2408.02840

citations

#1583

View-Consistent Hierarchical 3D Segmentation Using Ultrametric Feature Fields

Haodi He, Colton Stearns, Adam Harley et al.

ECCV 2024arXiv:2405.19678

citations

#1584

Efficient Depth-Guided Urban View Synthesis

sheng miao, Jiaxin Huang, Dongfeng Bai et al.

ECCV 2024arXiv:2407.12395

citations

#1585

Efficient Training with Denoised Neural Weights

Yifan Gong, Zheng Zhan, Yanyu Li et al.

ECCV 2024arXiv:2407.11966

citations

#1586

OpenKD: Opening Prompt Diversity for Zero- and Few-shot Keypoint Detection

Changsheng Lu, Zheyuan Liu, Piotr Koniusz

ECCV 2024arXiv:2409.19899

citations

#1587

SkyMask: Attack-agnostic Robust Federated Learning with Fine-grained Learnable Masks

Peishen Yan, Hao Wang, Tao Song et al.

ECCV 2024arXiv:2312.12484

citations

#1588

FREST: Feature RESToration for Semantic Segmentation under Multiple Adverse Conditions

Sohyun Lee, Namyup Kim, Sungyeon Kim et al.

ECCV 2024arXiv:2407.13437

citations

#1589

Seeing Faces in Things: A Model and Dataset for Pareidolia

Mark T Hamilton, Simon Stent, Vasha G DuTell et al.

ECCV 2024arXiv:2409.16143

citations

#1590

Mahalanobis Distance-based Multi-view Optimal Transport for Multi-view Crowd Localization

Qi Zhang, Kaiyi Zhang, Antoni Chan et al.

ECCV 2024arXiv:2409.01726

citations

#1591

Region-Aware Sequence-to-Sequence Learning for Hyperspectral Denoising

JiaHua Xiao, Yang Liu, Xing Wei

ECCV 2024

citations

#1592

Reliable Spatial-Temporal Voxels For Multi-Modal Test-Time Adaptation

Haozhi Cao, Yuecong Xu, Jianfei Yang et al.

ECCV 2024

citations

#1593

Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs

Jeongkee Lim, Yusung Kim

ECCV 2024arXiv:2408.02261

citations

#1594

OvSW: Overcoming Silent Weights for Accurate Binary Neural Networks

JINGYANG XIANG, Zuohui Chen, Siqi Li et al.

ECCV 2024arXiv:2407.05257

citations

#1595

Using My Artistic Style? You Must Obtain My Authorization

Xiuli Bi, Haowei Liu, Weisheng Li et al.

ECCV 2024

citations

#1596

RPBG: Towards Robust Neural Point-based Graphics in the Wild

Qingtian Zhu, Zizhuang Wei, Zhongtian Zheng et al.

ECCV 2024arXiv:2405.05663

citations

#1597

Surface-Centric Modeling for High-Fidelity Generalizable Neural Surface Reconstruction

Rui Peng, Shihe Shen, Kaiqiang Xiong et al.

ECCV 2024arXiv:2409.03634

citations

#1598

SCAPE: A Simple and Strong Category-Agnostic Pose Estimator

Yujia Liang, Zixuan Ye, Wenze Liu et al.

ECCV 2024arXiv:2407.13483

citations

#1599

Minimalist Vision with Freeform Pixels

Jeremy Klotz, Shree Nayar

ECCV 2024arXiv:2501.00142

citations

#1600

SpecFormer: Guarding Vision Transformer Robustness via Maximum Singular Value Penalization

Xixu Hu, Runkai Zheng, Jindong Wang et al.

ECCV 2024arXiv:2402.03317

citations

← Previous

1...6 7 8 9 10...12