Most Cited ECCV "autoregressive emulation" Papers

2,387 papers found • Page 9 of 12

Filters:Most Cited ECCV autoregressive emulation Clear all

Conference

AAAI 2025 (3,028)COLM 2025 (418)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NEURIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,558)oral (1,594)spotlight (1,421)highlight (975)

#1601

Collaborative Control for Geometry-Conditioned PBR Image Generation

Shimon Vainer, Mark Boss, Mathias Parger et al.

ECCV 2024posterarXiv:2402.05919

#1602

Open-set Domain Adaptation via Joint Error based Multi-class Positive and Unlabeled Learning

Dexuan Zhang, Thomas Westfechtel, Tatsuya Harada

ECCV 2024poster

#1603

Look Around and Learn: Self-Training Object Detection by Exploration

Gianluca Scarpellini, Stefano Rosa, Pietro Morerio et al.

ECCV 2024posterarXiv:2302.03566

#1604

Co-synthesis of Histopathology Nuclei Image-Label Pairs using a Context-Conditioned Joint Diffusion Model

Seonghui Min, Hyun-Jic Oh, Won-Ki Jeong

ECCV 2024posterarXiv:2407.14434

#1605

On the Vulnerability of Skip Connections to Model Inversion Attacks

Jun Hao Koh, Sy-Tuyen Ho, Ngoc-Bao Nguyen et al.

ECCV 2024posterarXiv:2409.01696

#1606

Adaptive Human Trajectory Prediction via Latent Corridors

Neerja Thakkar, Karttikeya Mangalam, Andrea Bajcsy et al.

ECCV 2024posterarXiv:2312.06653

#1607

Generalizable Symbolic Optimizer Learning

Xiaotian Song, Peng Zeng, Yanan Sun et al.

ECCV 2024poster

#1608

FreestyleRet: Retrieving Images from Style-Diversified Queries

Hao Li, Yanhao Jia, Peng Jin et al.

ECCV 2024posterarXiv:2312.02428

#1609

AEDNet: Adaptive Embedding and Multiview-Aware Disentanglement for Point Cloud Completion

Zhiheng Fu, Longguang Wang, Lian Xu et al.

ECCV 2024poster

#1610

Efficient Bias Mitigation Without Privileged Information

Mateo Espinosa Zarlenga, Sankaranarayanan, Jerone Andrews et al.

ECCV 2024posterarXiv:2409.17691

#1611

Towards Open-Ended Visual Recognition with Large Language Models

Qihang Yu, Xiaohui Shen, Liang-Chieh Chen

ECCV 2024posterarXiv:2311.08400

#1612

RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models

Bowen Zhang, Yiji Cheng, Chunyu Wang et al.

ECCV 2024posterarXiv:2407.06938

#1613

IRGen: Generative Modeling for Image Retrieval

Yidan Zhang, Ting Zhang, DONG CHEN et al.

ECCV 2024posterarXiv:2303.10126

#1614

LayeredFlow: A Real-World Benchmark for Non-Lambertian Multi-Layer Optical Flow

Hongyu Wen, Erich Liang, Jia Deng

ECCV 2024posterarXiv:2409.05688

#1615

Adaptive Parametric Activation

Konstantinos P Alexandridis, Jiankang Deng, Anh Nguyen et al.

ECCV 2024poster

#1616

Towards Multi-modal Transformers in Federated Learning

Guangyu Sun, Matias Mendieta, Aritra Dutta et al.

ECCV 2024posterarXiv:2404.12467

#1617

GENIXER: Empowering Multimodal Large Language Models as a Powerful Data Generator

Hengyuan Zhao, Pan Zhou, Mike Zheng Shou

ECCV 2024posterarXiv:2312.06731

#1618

FisherRF: Active View Selection and Mapping with Radiance Fields using Fisher Information

Wen Jiang, BOSHU LEI, Kostas Daniilidis

ECCV 2024poster

#1619

Soft Shadow Diffusion (SSD): Physics-inspired Learning for 3D Computational Periscopy

Fadlullah Raji, John Murray-Bruce

ECCV 2024posterarXiv:2601.12257

#1620

Learning 3D-aware GANs from Unposed Images with Template Feature Field

XINYA CHEN, Hanlei Guo, Yanrui Bin et al.

ECCV 2024posterarXiv:2404.05705

#1621

The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation

Yi Yao, Chan-Feng Hsu, Jhe-Hao Lin et al.

ECCV 2024posterarXiv:2407.12579

#1622

Getting it Right: Improving Spatial Consistency in Text-to-Image Models

Agneet Chatterjee, Gabriela Ben Melech Stan, Estelle Guez Aflalo et al.

ECCV 2024posterarXiv:2404.01197

#1623

CIC-BART-SSA: : Controllable Image Captioning with Structured Semantic Augmentation

Kalliopi Basioti, Mohamed A Abdelsalam, Federico Fancellu et al.

ECCV 2024posterarXiv:2407.11393

#1624

MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection

Kuo Wang, Lechao Cheng, Weikai Chen et al.

ECCV 2024posterarXiv:2407.21465

#1625

Learning to Robustly Reconstruct Dynamic Scenes from Low-light Spike Streams

Liwen Hu, gang ding, Mianzhi Liu et al.

ECCV 2024poster

#1626

Restoring Images in Adverse Weather Conditions via Histogram Transformer

Shangquan Sun, Wenqi Ren, Xinwei Gao et al.

ECCV 2024posterarXiv:2407.10172

#1627

COIN: Control-Inpainting Diffusion Prior for Human and Camera Motion Estimation

Jiefeng Li, Ye Yuan, Davis Rempe et al.

ECCV 2024posterarXiv:2408.16426

#1628

Resilience of Entropy Model in Distributed Neural Networks

Milin Zhang, Mohammad Abdi, Shahriar Rifat et al.

ECCV 2024posterarXiv:2403.00942

#1629

Analytic-Splatting: Anti-Aliased 3D Gaussian Splatting via Analytic Integration

Zhihao Liang, Qi Zhang, WENBO HU et al.

ECCV 2024posterarXiv:2403.11056

#1630

Generalizable Facial Expression Recognition

Yuhang Zhang, Xiuqi Zheng, Chenyi Liang et al.

ECCV 2024posterarXiv:2408.10614

#1631

Invertible Neural Warp for NeRF

Shin-Fang Chng, Ravi Garg, Hemanth Saratchandran et al.

ECCV 2024posterarXiv:2407.12354

#1632

LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents

Shilong Liu, Hao Cheng, Haotian Liu et al.

ECCV 2024posterarXiv:2311.05437

#1633

Efficient Frequency-Domain Image Deraining with Contrastive Regularization

Ning Gao, xingyu jiang, Xiuhui Zhang et al.

ECCV 2024poster

#1634

Align before Collaborate: Mitigating Feature Misalignment for Robust Multi-Agent Perception

Dingkang Yang, Ke Li, Dongling Xiao et al.

ECCV 2024poster

#1635

MambaIR: A Simple Baseline for Image Restoration with State-Space Model

Hang Guo, Jinmin Li, Tao Dai et al.

ECCV 2024posterarXiv:2402.15648

#1636

I Can't Believe It's Not Scene Flow!

Ishan Khatri, Kyle Vedder, Neehar Peri et al.

ECCV 2024posterarXiv:2403.04739

#1637

Bi-directional Contextual Attention for 3D Dense Captioning

Minjung Kim, Hyung Suk Lim, Soonyoung Lee et al.

ECCV 2024posterarXiv:2408.06662

#1638

Scalable Group Choreography via Variational Phase Manifold Learning

Nhat Le, Khoa Do, Xuan Bui et al.

ECCV 2024posterarXiv:2407.18839

#1639

RS-NeRF: Neural Radiance Fields from Rolling Shutter Images

Muyao Niu, Tong Chen, Yifan Zhan et al.

ECCV 2024posterarXiv:2407.10267

#1640

Retrieval Robust to Object Motion Blur

Rong Zou, Marc Pollefeys, Denys Rozumnyi

ECCV 2024posterarXiv:2404.18025

#1641

Binomial Self-compensation for Motion Error in Dynamic 3D Scanning

Geyou Zhang, Ce Zhu, Kai Liu

ECCV 2024posterarXiv:2404.06693

#1642

Free-Viewpoint Video of Outdoor Sports Using a Drone

Zhengdong Hong

ECCV 2024poster

#1643

Blind image deblurring with noise-robust kernel estimation

Chanseok Lee, Jeongsol Kim, Seungmin Lee et al.

ECCV 2024poster

#1644

How Video Meetings Change Your Expression

Sumit Sarin, Utkarsh Mall, Purva Tendulkar et al.

ECCV 2024posterarXiv:2406.00955

#1645

An Empirical Study and Analysis of Text-to-Image Generation Using Large Language Model-Powered Textual Representation

Zhiyu Tan, Mengping Yang, Luozheng Qin et al.

ECCV 2024posterarXiv:2405.12914

#1646

LetsMap: Unsupervised Representation Learning for Label-Efficient Semantic BEV Mapping

Nikhil Gosala, Kürsat Petek, B Ravi Kiran et al.

ECCV 2024poster

#1647

AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shield Prompting

Yu Wang, Xiaogeng Liu, Yu Li et al.

ECCV 2024posterarXiv:2403.09513

#1648

Regulating Model Reliance on Non-Robust Features by Smoothing Input Marginal Density

Peiyu Yang, Naveed Akhtar, Shah Mubarak et al.

ECCV 2024posterarXiv:2407.04370

#1649

ColorMAE: Exploring data-independent masking strategies in Masked AutoEncoders

Carlos Hinojosa, Shuming Liu, Bernard Ghanem

ECCV 2024posterarXiv:2407.13036

#1650

Train Till You Drop: Towards Stable and Robust Source-free Unsupervised 3D Domain Adaptation

Bjoern Michele, Alexandre Boulch, Tuan Hung Vu et al.

ECCV 2024posterarXiv:2409.04409

#1651

VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding

Yue Fan, Xiaojian Ma, Rujie Wu et al.

ECCV 2024posterarXiv:2403.11481

#1652

Audio-driven Talking Face Generation with Stabilized Synchronization Loss

Dogucan Yaman, Fevziye Irem Eyiokur Yaman, Leonard Bärmann et al.

ECCV 2024posterarXiv:2307.09368

#1653

G2fR: Frequency Regularization in Grid-based Feature Encoding Neural Radiance Fields

Shuxiang Xie, Shuyi Zhou, Ken Sakurada et al.

ECCV 2024poster

#1654

Eliminating Feature Ambiguity for Few-Shot Segmentation

Qianxiong Xu, Guosheng Lin, Chen Change Loy et al.

ECCV 2024posterarXiv:2407.09842

#1655

PreLAR: World Model Pre-training with Learnable Action Representation

Lixuan Zhang, Meina Kan, Shiguang Shan et al.

ECCV 2024poster

#1656

SHINE: Saliency-aware HIerarchical NEgative Ranking for Compositional Temporal Grounding

Zixu Cheng, Yujiang Pu, Shaogang Gong et al.

ECCV 2024posterarXiv:2407.05118

#1657

SkyMask: Attack-agnostic Robust Federated Learning with Fine-grained Learnable Masks

Peishen Yan, Hao Wang, Tao Song et al.

ECCV 2024posterarXiv:2312.12484

#1658

Elevating All Zero-Shot Sketch-Based Image Retrieval Through Multimodal Prompt Learning

Mainak Singha, Ankit Jha, Divyam Gupta et al.

ECCV 2024posterarXiv:2407.04207

#1659

Learn to Preserve and Diversify: Parameter-Efficient Group with Orthogonal Regularization for Domain Generalization

Jiajun Hu, Jian Zhang, Lei Qi et al.

ECCV 2024posterarXiv:2407.15085

#1660

Learning Representation for Multitask Learning through Self-Supervised Auxiliary Learning

Seokwon Shin, Hyungrok Do, Youngdoo Son

ECCV 2024poster

#1661

SemTrack: A Large-scale Dataset for Semantic Tracking in the Wild

Pengfei Wang, Xiaofei Hui, Jing Wu et al.

ECCV 2024poster

#1662

Fully Sparse 3D Occupancy Prediction

Haisong Liu, Yang Chen, Haiguang Wang et al.

ECCV 2024posterarXiv:2312.17118

#1663

Rethinking Deep Unrolled Model for Accelerated MRI Reconstruction

Bingyu Xin, Meng Ye, Leon Axel et al.

ECCV 2024poster

#1664

EAFormer: Scene Text Segmentation with Edge-Aware Transformers

Haiyang Yu, Teng Fu, Bin Li et al.

ECCV 2024posterarXiv:2407.17020

#1665

Zero-Shot Detection of AI-Generated Images

Davide Cozzolino, GIovanni Poggi, Matthias Niessner et al.

ECCV 2024posterarXiv:2409.15875

#1666

Augmented Neural Fine-tuning for Efficient Backdoor Purification

Md Nazmul Karim, Abdullah Al Arafat, Umar Khalid et al.

ECCV 2024posterarXiv:2407.10052

#1667

MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo

Tianqi Liu, Guangcong Wang, Shoukang Hu et al.

ECCV 2024posterarXiv:2405.12218

#1668

Efficient and Versatile Robust Fine-Tuning of Zero-shot Models

Sungyeon Kim, Boseung Jeong, Donghyun Kim et al.

ECCV 2024posterarXiv:2408.05749

#1669

T-MAE: Temporal Masked Autoencoders for Point Cloud Representation Learning

Weijie Wei, Fatemeh Karimi Nejadasl, Theo Gevers et al.

ECCV 2024posterarXiv:2312.10217

#1670

MARs: Multi-view Attention Regularizations for Patch-based Feature Recognition of Space Terrain

Timothy Chase, Karthik Dantu

ECCV 2024posterarXiv:2410.05182

#1671

Improving Intervention Efficacy via Concept Realignment in Concept Bottleneck Models

Nishad Singhi, Jae Myung Kim, Karsten Roth et al.

ECCV 2024posterarXiv:2405.01531

#1672

G3R: Gradient Guided Generalizable Reconstruction

Yun Chen, Jingkang Wang, Ze Yang et al.

ECCV 2024posterarXiv:2409.19405

#1673

Gaze Target Detection Based on Head-Local-Global Coordination

Yaokun Yang, Feng Lu

ECCV 2024poster

#1674

An Economic Framework for 6-DoF Grasp Detection

Xiao-Ming Wu, Jia-Feng Cai, Jian-Jian Jiang et al.

ECCV 2024posterarXiv:2407.08366

#1675

Uni3DL: A Unified Model for 3D Vision-Language Understanding

Xiang Li, Jian Ding, Zhaoyang Chen et al.

ECCV 2024poster

#1676

Rethinking Image Super Resolution from Training Data Perspectives

Go Ohtani, Ryu Tadokoro, Ryosuke Yamada et al.

ECCV 2024poster

#1677

Progressive Classifier and Feature Extractor Adaptation for Unsupervised Domain Adaptation on Point Clouds

Zicheng Wang, Zhen Zhao, Yiming Wu et al.

ECCV 2024posterarXiv:2311.16474

#1678

SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoders

Sheng-Wei Li, Zi-Xiang Wei, Wei-Jie Jack Chen et al.

ECCV 2024posterarXiv:2407.13460

#1679

SparseSSP: 3D Subcellular Structure Prediction from Sparse-View Transmitted Light Images

Jintu Zheng, Yi Ding, Qizhe Liu et al.

ECCV 2024posterarXiv:2407.02159

#1680

Human Hair Reconstruction with Strand-Aligned 3D Gaussians

Egor Zakharov, Vanessa Sklyarova, Michael J. Black et al.

ECCV 2024posterarXiv:2409.14778

#1681

Grid-Attention: Enhancing Computational Efficiency of Large Vision Models without Fine-Tuning

Pengyu Li, Biao Wang, Tianchu Guo et al.

ECCV 2024poster

#1682

General and Task-Oriented Video Segmentation

Mu Chen, Liulei Li, Wenguan Wang et al.

ECCV 2024posterarXiv:2407.06540

#1683

MemBN: Robust Test-Time Adaptation via Batch Norm with Statistics Memory

Juwon Kang, Nayeong Kim, Jungseul Ok et al.

ECCV 2024poster

#1684

StereoGlue: Joint Feature Matching and Robust Estimation

Daniel Barath, Dmytro Mishkin, Luca Cavalli et al.

ECCV 2024poster

#1685

Scaling Backwards: Minimal Synthetic Pre-training?

Ryo Nakamura, Ryu Tadokoro, Ryosuke Yamada et al.

ECCV 2024posterarXiv:2408.00677

#1686

Enhanced Motion Forecasting with Visual Relation Reasoning

Sungjune Kim, Hadam Baek, Seunggwan Lee et al.

ECCV 2024poster

#1687

Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation

Omer Dahary, Or Patashnik, Kfir Aberman et al.

ECCV 2024posterarXiv:2403.16990

#1688

Unleashing the Power of Prompt-driven Nucleus Instance Segmentation

Zhongyi Shui, Yunlong Zhang, Kai Yao et al.

ECCV 2024posterarXiv:2311.15939

#1689

Domain-adaptive Video Deblurring via Test-time Blurring

Jin-Ting He, Fu-Jen Tsai, Jia-Hao Wu et al.

ECCV 2024posterarXiv:2407.09059

#1690

GAURA: Generalizable Approach for Unified Restoration and Rendering of Arbitrary Views

Vinayak Gupta, Rongali Simhachala Venkata Girish, Mukund Varma T et al.

ECCV 2024posterarXiv:2407.08221

#1691

Camera Height Doesn't Change: Unsupervised Training for Metric Monocular Road-Scene Depth Estimation

Genki Kinoshita, Ko Nishino

ECCV 2024posterarXiv:2312.04530

#1692

MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model

Muyao Niu, Xiaodong Cun, Xintao Wang et al.

ECCV 2024posterarXiv:2405.20222

#1693

GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing

Jing Wu, Jiawang Bian, Xinghui Li et al.

ECCV 2024posterarXiv:2403.08733

#1694

Think before Placement: Common Sense Enhanced Transformer for Object Placement

Yaxuan Qin, Jiayu Xu, Ruiping Wang et al.

ECCV 2024poster

#1695

Label-anticipated Event Disentanglement for Audio-Visual Video Parsing

Jinxing Zhou, Dan Guo, Yuxin Mao et al.

ECCV 2024posterarXiv:2407.08126

#1696

OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models

Kong Zhe, Yong Zhang, Tianyu Yang et al.

ECCV 2024posterarXiv:2403.10983

#1697

WaSt-3D: Wasserstein-2 Distance for Scene-to-Scene Stylization on 3D Gaussians

Dmytro Kotovenko, Olga Grebenkova, Nikolaos Sarafianos et al.

ECCV 2024posterarXiv:2409.17917

#1698

SCP-Diff: Spatial-Categorical Joint Prior for Diffusion Based Semantic Image Synthesis

Huan-ang Gao, Mingju Gao, Jiaju Li et al.

ECCV 2024posterarXiv:2403.09638

#1699

GiT: Towards Generalist Vision Transformer through Universal Language Interface

Haiyang Wang, Hao Tang, Li Jiang et al.

ECCV 2024posterarXiv:2403.09394

#1700

Object-Aware NIR-to-Visible Translation

Yunyi Gao, Lin Gu, Qiankun Liu et al.

ECCV 2024poster

#1701

Integer-Valued Training and Spike-driven Inference Spiking Neural Network for High-performance and Energy-efficient Object Detection

Xinhao Luo, Man Yao, Yuhong Chou et al.

ECCV 2024posterarXiv:2407.20708

#1702

DreamDrone: Text-to-Image Diffusion Models are Zero-shot Perpetual View Generators

Hanyang Kong, Dongze Lian, Michael Bi Mi et al.

ECCV 2024posterarXiv:2312.08746

#1703

SpaceJAM: a Lightweight and Regularization-free Method for Fast Joint Alignment of Images

Nir Barel, Ron Aharon Shapira Weber, Nir Mualem et al.

ECCV 2024posterarXiv:2407.11850

#1704

Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos

Akshay Paruchuri, Samuel Ehrenstein, Shuxian Wang et al.

ECCV 2024posterarXiv:2403.17915

#1705

Multi-modal Relation Distillation for Unified 3D Representation Learning

Huiqun Wang, Yiping Bao, Panwang Pan et al.

ECCV 2024posterarXiv:2407.14007

#1706

Diffusion-Generated Pseudo-Observations for High-Quality Sparse-View Reconstruction

Xinhang Liu, Jiaben Chen, Shiu-Hong Kao et al.

ECCV 2024posterarXiv:2305.15171

#1707

TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks

Jinjie Mai, Wenxuan Zhu, Sara Rojas Martinez et al.

ECCV 2024posterarXiv:2408.10739

#1708

DSA: Discriminative Scatter Analysis for Early Smoke Segmentation

Lujian Yao, Haitao Zhao, Jingchao Peng et al.

ECCV 2024poster

#1709

Segment3D: Learning Fine-Grained Class-Agnostic 3D Segmentation without Manual Labels

Rui Huang, Songyou Peng, Ayca Takmaz et al.

ECCV 2024posterarXiv:2312.17232

#1710

Efficient Training of Spiking Neural Networks with Multi-Parallel Implicit Stream Architecture

Zhigao Cao, Meng Li, Xiashuang Wang et al.

ECCV 2024poster

#1711

GarmentAligner: Text-to-Garment Generation via Retrieval-augmented Multi-level Corrections

Shiyue Zhang, Zheng Chong, Xujie Zhang et al.

ECCV 2024posterarXiv:2408.12352

#1712

Continuous SO(3) Equivariant Convolution for 3D Point Cloud Analysis

Jaein Kim, HEE BIN YOO, Dong-Sig Han et al.

ECCV 2024poster

#1713

Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation

Seung Hyun Lee, Yinxiao Li, Junjie Ke et al.

ECCV 2024posterarXiv:2401.05675

#1714

FlexAttention for Efficient High-Resolution Vision-Language Models

Junyan Li, Delin Chen, Tianle Cai et al.

ECCV 2024posterarXiv:2407.20228

#1715

EGIC: Enhanced Low-Bit-Rate Generative Image Compression Guided by Semantic Segmentation

Nikolai Körber, Eduard Kromer, Andreas Siebert et al.

ECCV 2024posterarXiv:2309.03244

#1716

AutoEval-Video: An Automatic Benchmark for Assessing Large Vision Language Models in Open-Ended Video Question Answering

Xiuyuan Chen, Yuan Lin, Yuchen Zhang et al.

ECCV 2024posterarXiv:2311.14906

#1717

MedRAT: Unpaired Medical Report Generation via Auxiliary Tasks

Elad Hirsch, Gefen Dawidowicz, Ayellet Tal

ECCV 2024posterarXiv:2407.03919

#1718

Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection

Xingyu Peng, Yan Bai, Chen Gao et al.

ECCV 2024posterarXiv:2407.08931

#1719

Pathology-knowledge Enhanced Multi-instance Prompt Learning for Few-shot Whole Slide Image Classification

Linhao Qu, Dingkang Yang, Dan Huang et al.

ECCV 2024posterarXiv:2407.10814

#1720

Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models

Longxiang Tang, Zhuotao Tian, Kai Li et al.

ECCV 2024posterarXiv:2407.05342

#1721

U-COPE: Taking a Further Step to Universal 9D Category-level Object Pose Estimation

li zhang, Weiqing Meng, Yan Zhong et al.

ECCV 2024poster

#1722

DCDM: Diffusion-Conditioned-Diffusion Model for Scene Text Image Super-Resolution

Shrey Singh, Prateek Keserwani, Masakazu Iwamura et al.

ECCV 2024poster

#1723

Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning

Thanh Thong Nguyen, Yi Bin, Xiaobao Wu et al.

ECCV 2024posterarXiv:2407.03788

#1724

Integrating Markov Blanket Discovery into Causal Representation Learning for Domain Generalization

Naiyu Yin, Hanjing Wang, Yue Yu et al.

ECCV 2024poster

#1725

A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties

Junfei Xiao, Ziqi Zhou, Wenxuan Li et al.

ECCV 2024posterarXiv:2312.13764

#1726

Plain-Det: A Plain Multi-Dataset Object Detector

cheng Shi, yuchen zhu, Sibei Yang

ECCV 2024posterarXiv:2407.10083

#1727

Ex2Eg-MAE: A Framework for Adaptation of Exocentric Video Masked Autoencoders for Egocentric Social Role Understanding

Minh Tran, Yelin Kim, Che-Chun Su et al.

ECCV 2024poster

#1728

Towards High-Quality 3D Motion Transfer with Realistic Apparel Animation

Rong Wang, Wei Mao, Changsheng Lu et al.

ECCV 2024posterarXiv:2407.11266

#1729

iMatching: Imperative Correspondence Learning

Chen Wang, Dasong Gao, Yun-Jou Lin et al.

ECCV 2024poster

#1730

OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web

Raghav Kapoor, Yash Parag Butala, Melisa A Russak et al.

ECCV 2024posterarXiv:2402.17553

#1731

MTKD: Multi-Teacher Knowledge Distillation for Image Super-Resolution

Yuxuan Jiang, Chen Feng, Fan Zhang et al.

ECCV 2024posterarXiv:2404.09571

#1732

ReMatching: Low-Resolution Representations for Scalable Shape Correspondence

Filippo Maggioli, Daniele Baieri, Emanuele Rodola et al.

ECCV 2024posterarXiv:2305.09274

#1733

ReLoo: Reconstructing Humans Dressed in Loose Garments from Monocular Video in the Wild

Chen Guo, Tianjian Jiang, Manuel Kaufmann et al.

ECCV 2024posterarXiv:2409.15269

#1734

Omni-Recon: Harnessing Image-based Rendering for General-Purpose Neural Radiance Fields

Yonggan Fu, Huaizhi Qu, Zhifan Ye et al.

ECCV 2024posterarXiv:2403.11131

#1735

Syn-to-Real Domain Adaptation for Point Cloud Completion via Part-based Approach

Yunseo Yang, Jihun Kim, Kuk-Jin Yoon

ECCV 2024poster

#1736

ScanReason: Empowering 3D Visual Grounding with Reasoning Capabilities

CHENMING ZHU, Tai Wang, Wenwei Zhang et al.

ECCV 2024posterarXiv:2407.01525

#1737

TrajPrompt: Aligning Color Trajectory with Vision-Language Representations

Li-Wu Tsao, Hao-Tang Tsui, Yu-Rou Tuan et al.

ECCV 2024poster

#1738

Salience-Based Adaptive Masking: Revisiting Token Dynamics for Enhanced Pre-training

Hyesong Choi, Hyejin Park, Kwang Moo Yi et al.

ECCV 2024posterarXiv:2404.08327

#1739

Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects

Zicong Fan, Takehiko Ohkawa, Linlin Yang et al.

ECCV 2024posterarXiv:2403.16428

#1740

Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors

Jae Joong Lee, Bosheng Li, Sara Beery et al.

ECCV 2024posterarXiv:2407.10330

#1741

LoA-Trans: Enhancing Visual Grounding by Location-Aware Transformers

Ziling Huang, Shin’ichi Satoh

ECCV 2024poster

#1742

DomainFusion: Generalizing To Unseen Domains with Latent Diffusion Models

Yuyang Huang, Yabo Chen, Yuchen Liu et al.

ECCV 2024poster

#1743

3D Single-object Tracking in Point Clouds with High Temporal Variation

Qiao Wu, Kun Sun, Pei An et al.

ECCV 2024posterarXiv:2408.02049

#1744

Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion Models

Yasi Zhang, Peiyu Yu, Ying Nian Wu

ECCV 2024posterarXiv:2404.07389

#1745

RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation

Luis Li, Hubert P. H. Shum, Toby P Breckon

ECCV 2024posterarXiv:2407.10159

#1746

Rejection Sampling IMLE: Designing Priors for Better Few-Shot Image Synthesis

Chirag Vashist, Shichong Peng, Ke Li

ECCV 2024posterarXiv:2409.17439

#1747

Solving Motion Planning Tasks with a Scalable Generative Model

Yihan Hu, Siqi Chai, Zhening Yang et al.

ECCV 2024posterarXiv:2407.02797

#1748

SceneScript: Reconstructing Scenes With An Autoregressive Structured Language Model

Armen Avetisyan, Christopher Xie, Henry Howard-Jenkins et al.

ECCV 2024posterarXiv:2403.13064

#1749

Implicit Filtering for Learning Neural Signed Distance Functions from 3D Point Clouds

Shengtao Li, Ge Gao, Yudong Liu et al.

ECCV 2024posterarXiv:2407.13342

#1750

PLOT: Text-based Person Search with Part Slot Attention for Corresponding Part Discovery

Jicheol Park, Dongwon Kim, Boseung Jeong et al.

ECCV 2024posterarXiv:2409.13475

#1751

Finding Meaning in Points: Weakly Supervised Semantic Segmentation for Event Cameras

Hoonhee Cho, Sung-Hoon Yoon, Hyeokjun Kweon et al.

ECCV 2024posterarXiv:2407.11216

#1752

LiDAR-Event Stereo Fusion with Hallucinations

Luca Bartolomei, Matteo Poggi, Andrea Conti et al.

ECCV 2024posterarXiv:2408.04633

#1753

Tensorial template matching for fast cross-correlation with rotations and its application for tomography

Antonio Martinez-Sanchez, Ulrike Homberg, J. M. Almira et al.

ECCV 2024posterarXiv:2408.02398

#1754

Cross-Input Certified Training for Universal Perturbations

Changming Xu, Gagandeep Singh

ECCV 2024posterarXiv:2405.09176

#1755

Motion-Guided Latent Diffusion for Temporally Consistent Real-world Video Super-resolution

Xi Yang, Chenhang He, Jianqi Ma et al.

ECCV 2024posterarXiv:2312.00853

#1756

SAFARI: Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation

Sayan Nag, Koustava Goswami, Srikrishna Karanam

ECCV 2024poster

#1757

BlazeBVD: Make Scale-Time Equalization Great Again for Blind Video Deflickering

Xinmin Qiu, Congying Han, Zicheng Zhang et al.

ECCV 2024posterarXiv:2403.06243

#1758

A Closer Look at GAN Priors: Exploiting Intermediate Features for Enhanced Model Inversion Attacks

Yixiang Qiu, Hao Fang, Hongyao Yu et al.

ECCV 2024posterarXiv:2407.13863

#1759

ShareGPT4V: Improving Large Multi-Modal Models with Better Captions

Lin Chen, Jinsong Li, Xiaoyi Dong et al.

ECCV 2024posterarXiv:2311.12793

#1760

Text2Place: Affordance-aware Text Guided Human Placement

Rishubh Parihar, Harsh Gupta, Sachidanand VS et al.

ECCV 2024posterarXiv:2407.15446

#1761

Adaptive Multi-task Learning for Few-shot Object Detection

Yan Ren, Yanling Li, Wai-Kin Adams Kong

ECCV 2024poster

#1762

Textual-Visual Logic Challenge: Understanding and Reasoning in Text-to-Image Generation

Peixi Xiong, Michael A Kozuch, Nilesh Jain

ECCV 2024poster

#1763

Spectral Subsurface Scattering for Material Classification

Haejoon Lee, Aswin C. Sankaranarayanan

ECCV 2024poster

#1764

Multimodal Cross-Domain Few-Shot Learning for Egocentric Action Recognition

Masashi Hatano, Ryo Hachiuma, Ryo Fujii et al.

ECCV 2024posterarXiv:2405.19917

#1765

AdvDiff: Generating Unrestricted Adversarial Examples using Diffusion Models

Xuelong Dai, Kaisheng Liang, Bin Xiao

ECCV 2024posterarXiv:2307.12499

#1766

Merlin: Empowering Multimodal LLMs with Foresight Minds

En Yu, liang zhao, YANA WEI et al.

ECCV 2024poster

#1767

HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects

Xintao Lv, Liang Xu, Yichao Yan et al.

ECCV 2024posterarXiv:2407.12371

#1768

High-Resolution and Few-shot View Synthesis from Asymmetric Dual-lens Inputs

Ruikang Xu, Mingde Yao, Yue Li et al.

ECCV 2024poster

#1769

DiffPMAE: Diffusion Masked Autoencoders for Point Cloud Reconstruction

YANLONG LI, Chamara Madarasingha, Kanchana Thilakarathna

ECCV 2024posterarXiv:2312.03298

#1770

BRAVE: Broadening the visual encoding of vision-language models

Oguzhan Fatih Kar, Alessio Tonioni, Petra Poklukar et al.

ECCV 2024posterarXiv:2404.07204

#1771

GroupDiff: Diffusion-based Group Portrait Editing

Yuming Jiang, Nanxuan Zhao, Qing Liu et al.

ECCV 2024posterarXiv:2409.14379

#1772

BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos

Pilhyeon Lee, Hyeran Byun

ECCV 2024posterarXiv:2312.00083

#1773

Disentangling Masked Autoencoders for Unsupervised Domain Generalization

An Zhang, Han Wang, Xiang Wang et al.

ECCV 2024posterarXiv:2407.07544

#1774

Geospecific View Generation - Geometry-Context Aware High-resolution Ground View Inference from Satellite Views

Ningli Xu, Rongjun Qin

ECCV 2024posterarXiv:2407.08061

#1775

Co-Student: Collaborating Strong and Weak Students for Sparsely Annotated Object Detection

Lianjun Wu, Jiangxiao Han, Zengqiang Zheng et al.

ECCV 2024poster

#1776

ProMerge: Prompt and Merge for Unsupervised Instance Segmentation

Dylan Li, Gyungin Shin

ECCV 2024posterarXiv:2409.18961

#1777

GLARE: Low Light Image Enhancement via Generative Latent Feature based Codebook Retrieval

Han Zhou, Wei Dong, Xiaohong Liu et al.

ECCV 2024posterarXiv:2407.12431

#1778

Learning to Generate Conditional Tri-plane for 3D-aware Expression Controllable Portrait Animation

Taekyung Ki, Dongchan Min, Gyeongsu Chae

ECCV 2024posterarXiv:2404.00636

#1779

4D Contrastive Superflows are Dense 3D Representation Learners

Xiang Xu, Lingdong Kong, Hui Shuai et al.

ECCV 2024posterarXiv:2407.06190

#1780

Panel-Specific Degradation Representation for Raw Under-Display Camera Image Restoration

Youngjin Oh, Keuntek Lee, Jooyoung Lee et al.

ECCV 2024poster

#1781

Diffusion-Guided Weakly Supervised Semantic Segmentation

Sung-Hoon Yoon, Hoyong Kwon, Jaeseok Jeong et al.

ECCV 2024poster

#1782

On the Approximation Risk of Few-Shot Class-Incremental Learning

Xuan Wang, Zhong Ji, Xiyao Liu et al.

ECCV 2024poster

#1783

Online Vectorized HD Map Construction using Geometry

Zhixin Zhang, Yiyuan Zhang, Xiaohan Ding et al.

ECCV 2024posterarXiv:2312.03341

#1784

Self-Supervised Underwater Caustics Removal and Descattering via Deep Monocular SLAM

Jonathan Sauder, Devis TUIA

ECCV 2024poster

#1785

Click-Gaussian: Interactive Segmentation to Any 3D Gaussians

Seokhun Choi, Hyeonseop Song, Jaechul Kim et al.

ECCV 2024posterarXiv:2407.11793

#1786

AdversariaLeak: External Information Leakage Attack Using Adversarial Samples on Face Recognition Systems

Roye Katzav, Amit Giloni, Edita Grolman et al.

ECCV 2024poster

#1787

MANIKIN: Biomechanically Accurate Neural Inverse Kinematics for Human Motion Estimation

Jiaxi Jiang, Paul Streli, Xuejing Luo et al.

ECCV 2024poster

#1788

Disentangled Generation and Aggregation for Robust Radiance Fields

Shihe Shen, Huachen Gao, Wangze Xu et al.

ECCV 2024posterarXiv:2409.15715

#1789

Momentum Auxiliary Network for Supervised Local Learning

Junhao Su, Changpeng Cai, Feiyu Zhu et al.

ECCV 2024posterarXiv:2407.05623

#1790

JointDreamer: Ensuring Geometry Consistency and Text Congruence in Text-to-3D Generation via Joint Score Distillation

ChenHan Jiang, Yihan Zeng, Tianyang Hu et al.

ECCV 2024posterarXiv:2407.12291

#1791

Implicit Neural Models to Extract Heart Rate from Video

Pradyumna Chari, Anirudh Bindiganavale Harish, Adnan Armouti et al.

ECCV 2024poster

#1792

Occupancy as Set of Points

Yiang Shi, Tianheng Cheng, Qian Zhang et al.

ECCV 2024posterarXiv:2407.04049

#1793

Cocktail Universal Adversarial Attack on Deep Neural Networks

Shaoxin Li, Xiaofeng Liao, Xin Che et al.

ECCV 2024poster

#1794

FLAT: Flux-aware Imperceptible Adversarial Attacks on 3D Point Clouds

Keke Tang, Lujie Huang, Weilong Peng et al.

ECCV 2024poster

#1795

Progressive Proxy Anchor Propagation for Unsupervised Semantic Segmentation

Hyun Seok Seong, WonJun Moon, SuBeen Lee et al.

ECCV 2024posterarXiv:2407.12463

#1796

AdaDiff: Accelerating Diffusion Models through Step-Wise Adaptive Computation

Shengkun Tang, Yaqing Wang, Caiwen Ding et al.

ECCV 2024posterarXiv:2309.17074

#1797

SCAPE: A Simple and Strong Category-Agnostic Pose Estimator

Yujia Liang, Zixuan Ye, Wenze Liu et al.

ECCV 2024posterarXiv:2407.13483

#1798

FreeMotion: MoCap-Free Human Motion Synthesis with Multimodal Large Language Models

Zhikai Zhang, Yitang Li, Haofeng Huang et al.

ECCV 2024posterarXiv:2406.10740

#1799

Norma: A Noise Robust Memory-Augmented Framework for Whole Slide Image Classification

Yu Bai, Bo Zhang, Zheng Zhang et al.

ECCV 2024poster

#1800

Controllable Contextualized Image Captioning: Directing the Visual Narrative through User-Defined Highlights

Shunqi Mao, Chaoyi Zhang, Hang Su et al.

ECCV 2024posterarXiv:2407.11449

← Previous

1...7 8 9 10 11 12