Most Cited ECCV "autoregressive emulation" Papers

2,387 papers found • Page 9 of 12

#1601

Collaborative Control for Geometry-Conditioned PBR Image Generation

Shimon Vainer, Mark Boss, Mathias Parger et al.

ECCV 2024 • poster • arXiv:2402.05919
#1602

Open-set Domain Adaptation via Joint Error based Multi-class Positive and Unlabeled Learning

Dexuan Zhang, Thomas Westfechtel, Tatsuya Harada

ECCV 2024 • poster
#1603

Look Around and Learn: Self-Training Object Detection by Exploration

Gianluca Scarpellini, Stefano Rosa, Pietro Morerio et al.

ECCV 2024 • poster • arXiv:2302.03566
#1604

Co-synthesis of Histopathology Nuclei Image-Label Pairs using a Context-Conditioned Joint Diffusion Model

Seonghui Min, Hyun-Jic Oh, Won-Ki Jeong

ECCV 2024 • poster • arXiv:2407.14434
#1605

On the Vulnerability of Skip Connections to Model Inversion Attacks

Jun Hao Koh, Sy-Tuyen Ho, Ngoc-Bao Nguyen et al.

ECCV 2024 • poster • arXiv:2409.01696
#1606

Adaptive Human Trajectory Prediction via Latent Corridors

Neerja Thakkar, Karttikeya Mangalam, Andrea Bajcsy et al.

ECCV 2024 • poster • arXiv:2312.06653
#1607

Generalizable Symbolic Optimizer Learning

Xiaotian Song, Peng Zeng, Yanan Sun et al.

ECCV 2024 • poster
#1608

FreestyleRet: Retrieving Images from Style-Diversified Queries

Hao Li, Yanhao Jia, Peng Jin et al.

ECCV 2024 • poster • arXiv:2312.02428
#1609

AEDNet: Adaptive Embedding and Multiview-Aware Disentanglement for Point Cloud Completion

Zhiheng Fu, Longguang Wang, Lian Xu et al.

ECCV 2024 • poster
#1610

Efficient Bias Mitigation Without Privileged Information

Mateo Espinosa Zarlenga, Sankaranarayanan, Jerone Andrews et al.

ECCV 2024 • poster • arXiv:2409.17691
#1611

Towards Open-Ended Visual Recognition with Large Language Models

Qihang Yu, Xiaohui Shen, Liang-Chieh Chen

ECCV 2024 • poster • arXiv:2311.08400
#1612

RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models

Bowen Zhang, Yiji Cheng, Chunyu Wang et al.

ECCV 2024 • poster • arXiv:2407.06938
#1613

IRGen: Generative Modeling for Image Retrieval

Yidan Zhang, Ting Zhang, Dong Chen et al.

ECCV 2024 • poster • arXiv:2303.10126
#1614

LayeredFlow: A Real-World Benchmark for Non-Lambertian Multi-Layer Optical Flow

Hongyu Wen, Erich Liang, Jia Deng

ECCV 2024 • poster • arXiv:2409.05688
#1615

Adaptive Parametric Activation

Konstantinos P Alexandridis, Jiankang Deng, Anh Nguyen et al.

ECCV 2024 • poster
#1616

Towards Multi-modal Transformers in Federated Learning

Guangyu Sun, Matias Mendieta, Aritra Dutta et al.

ECCV 2024 • poster • arXiv:2404.12467
#1617

GENIXER: Empowering Multimodal Large Language Models as a Powerful Data Generator

Hengyuan Zhao, Pan Zhou, Mike Zheng Shou

ECCV 2024 • poster • arXiv:2312.06731
#1618

FisherRF: Active View Selection and Mapping with Radiance Fields using Fisher Information

Wen Jiang, Boshu Lei, Kostas Daniilidis

ECCV 2024 • poster
#1619

Soft Shadow Diffusion (SSD): Physics-inspired Learning for 3D Computational Periscopy

Fadlullah Raji, John Murray-Bruce

ECCV 2024 • poster • arXiv:2601.12257
#1620

Learning 3D-aware GANs from Unposed Images with Template Feature Field

Xinya Chen, Hanlei Guo, Yanrui Bin et al.

ECCV 2024 • poster • arXiv:2404.05705
#1621

The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation

Yi Yao, Chan-Feng Hsu, Jhe-Hao Lin et al.

ECCV 2024 • poster • arXiv:2407.12579
#1622

Getting it Right: Improving Spatial Consistency in Text-to-Image Models

Agneet Chatterjee, Gabriela Ben Melech Stan, Estelle Guez Aflalo et al.

ECCV 2024 • poster • arXiv:2404.01197
#1623

CIC-BART-SSA: Controllable Image Captioning with Structured Semantic Augmentation

Kalliopi Basioti, Mohamed A Abdelsalam, Federico Fancellu et al.

ECCV 2024 • poster • arXiv:2407.11393
#1624

MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection

Kuo Wang, Lechao Cheng, Weikai Chen et al.

ECCV 2024 • poster • arXiv:2407.21465
#1625

Learning to Robustly Reconstruct Dynamic Scenes from Low-light Spike Streams

Liwen Hu, Gang Ding, Mianzhi Liu et al.

ECCV 2024 • poster
#1626

Restoring Images in Adverse Weather Conditions via Histogram Transformer

Shangquan Sun, Wenqi Ren, Xinwei Gao et al.

ECCV 2024 • poster • arXiv:2407.10172
#1627

COIN: Control-Inpainting Diffusion Prior for Human and Camera Motion Estimation

Jiefeng Li, Ye Yuan, Davis Rempe et al.

ECCV 2024 • poster • arXiv:2408.16426
#1628

Resilience of Entropy Model in Distributed Neural Networks

Milin Zhang, Mohammad Abdi, Shahriar Rifat et al.

ECCV 2024 • poster • arXiv:2403.00942
#1629

Analytic-Splatting: Anti-Aliased 3D Gaussian Splatting via Analytic Integration

Zhihao Liang, Qi Zhang, Wenbo Hu et al.

ECCV 2024 • poster • arXiv:2403.11056
#1630

Generalizable Facial Expression Recognition

Yuhang Zhang, Xiuqi Zheng, Chenyi Liang et al.

ECCV 2024 • poster • arXiv:2408.10614
#1631

Invertible Neural Warp for NeRF

Shin-Fang Chng, Ravi Garg, Hemanth Saratchandran et al.

ECCV 2024 • poster • arXiv:2407.12354
#1632

LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents

Shilong Liu, Hao Cheng, Haotian Liu et al.

ECCV 2024 • poster • arXiv:2311.05437
#1633

Efficient Frequency-Domain Image Deraining with Contrastive Regularization

Ning Gao, Xingyu Jiang, Xiuhui Zhang et al.

ECCV 2024 • poster
#1634

Align before Collaborate: Mitigating Feature Misalignment for Robust Multi-Agent Perception

Dingkang Yang, Ke Li, Dongling Xiao et al.

ECCV 2024 • poster
#1635

MambaIR: A Simple Baseline for Image Restoration with State-Space Model

Hang Guo, Jinmin Li, Tao Dai et al.

ECCV 2024 • poster • arXiv:2402.15648
#1636

I Can't Believe It's Not Scene Flow!

Ishan Khatri, Kyle Vedder, Neehar Peri et al.

ECCV 2024 • poster • arXiv:2403.04739
#1637

Bi-directional Contextual Attention for 3D Dense Captioning

Minjung Kim, Hyung Suk Lim, Soonyoung Lee et al.

ECCV 2024 • poster • arXiv:2408.06662
#1638

Scalable Group Choreography via Variational Phase Manifold Learning

Nhat Le, Khoa Do, Xuan Bui et al.

ECCV 2024 • poster • arXiv:2407.18839
#1639

RS-NeRF: Neural Radiance Fields from Rolling Shutter Images

Muyao Niu, Tong Chen, Yifan Zhan et al.

ECCV 2024 • poster • arXiv:2407.10267
#1640

Retrieval Robust to Object Motion Blur

Rong Zou, Marc Pollefeys, Denys Rozumnyi

ECCV 2024 • poster • arXiv:2404.18025
#1641

Binomial Self-compensation for Motion Error in Dynamic 3D Scanning

Geyou Zhang, Ce Zhu, Kai Liu

ECCV 2024 • poster • arXiv:2404.06693
#1642

Free-Viewpoint Video of Outdoor Sports Using a Drone

Zhengdong Hong

ECCV 2024 • poster
#1643

Blind image deblurring with noise-robust kernel estimation

Chanseok Lee, Jeongsol Kim, Seungmin Lee et al.

ECCV 2024 • poster
#1644

How Video Meetings Change Your Expression

Sumit Sarin, Utkarsh Mall, Purva Tendulkar et al.

ECCV 2024 • poster • arXiv:2406.00955
#1645

An Empirical Study and Analysis of Text-to-Image Generation Using Large Language Model-Powered Textual Representation

Zhiyu Tan, Mengping Yang, Luozheng Qin et al.

ECCV 2024 • poster • arXiv:2405.12914
#1646

LetsMap: Unsupervised Representation Learning for Label-Efficient Semantic BEV Mapping

Nikhil Gosala, Kürsat Petek, B Ravi Kiran et al.

ECCV 2024 • poster
#1647

AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shield Prompting

Yu Wang, Xiaogeng Liu, Yu Li et al.

ECCV 2024 • poster • arXiv:2403.09513
#1648

Regulating Model Reliance on Non-Robust Features by Smoothing Input Marginal Density

Peiyu Yang, Naveed Akhtar, Shah Mubarak et al.

ECCV 2024 • poster • arXiv:2407.04370
#1649

ColorMAE: Exploring data-independent masking strategies in Masked AutoEncoders

Carlos Hinojosa, Shuming Liu, Bernard Ghanem

ECCV 2024 • poster • arXiv:2407.13036
#1650

Train Till You Drop: Towards Stable and Robust Source-free Unsupervised 3D Domain Adaptation

Bjoern Michele, Alexandre Boulch, Tuan Hung Vu et al.

ECCV 2024 • poster • arXiv:2409.04409
#1651

VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding

Yue Fan, Xiaojian Ma, Rujie Wu et al.

ECCV 2024 • poster • arXiv:2403.11481
#1652

Audio-driven Talking Face Generation with Stabilized Synchronization Loss

Dogucan Yaman, Fevziye Irem Eyiokur Yaman, Leonard Bärmann et al.

ECCV 2024 • poster • arXiv:2307.09368
#1653

G2fR: Frequency Regularization in Grid-based Feature Encoding Neural Radiance Fields

Shuxiang Xie, Shuyi Zhou, Ken Sakurada et al.

ECCV 2024 • poster
#1654

Eliminating Feature Ambiguity for Few-Shot Segmentation

Qianxiong Xu, Guosheng Lin, Chen Change Loy et al.

ECCV 2024 • poster • arXiv:2407.09842
#1655

PreLAR: World Model Pre-training with Learnable Action Representation

Lixuan Zhang, Meina Kan, Shiguang Shan et al.

ECCV 2024 • poster
#1656

SHINE: Saliency-aware HIerarchical NEgative Ranking for Compositional Temporal Grounding

Zixu Cheng, Yujiang Pu, Shaogang Gong et al.

ECCV 2024 • poster • arXiv:2407.05118
#1657

SkyMask: Attack-agnostic Robust Federated Learning with Fine-grained Learnable Masks

Peishen Yan, Hao Wang, Tao Song et al.

ECCV 2024 • poster • arXiv:2312.12484
#1658

Elevating All Zero-Shot Sketch-Based Image Retrieval Through Multimodal Prompt Learning

Mainak Singha, Ankit Jha, Divyam Gupta et al.

ECCV 2024 • poster • arXiv:2407.04207
#1659

Learn to Preserve and Diversify: Parameter-Efficient Group with Orthogonal Regularization for Domain Generalization

Jiajun Hu, Jian Zhang, Lei Qi et al.

ECCV 2024 • poster • arXiv:2407.15085
#1660

Learning Representation for Multitask Learning through Self-Supervised Auxiliary Learning

Seokwon Shin, Hyungrok Do, Youngdoo Son

ECCV 2024 • poster
#1661

SemTrack: A Large-scale Dataset for Semantic Tracking in the Wild

Pengfei Wang, Xiaofei Hui, Jing Wu et al.

ECCV 2024 • poster
#1662

Fully Sparse 3D Occupancy Prediction

Haisong Liu, Yang Chen, Haiguang Wang et al.

ECCV 2024 • poster • arXiv:2312.17118
#1663

Rethinking Deep Unrolled Model for Accelerated MRI Reconstruction

Bingyu Xin, Meng Ye, Leon Axel et al.

ECCV 2024 • poster
#1664

EAFormer: Scene Text Segmentation with Edge-Aware Transformers

Haiyang Yu, Teng Fu, Bin Li et al.

ECCV 2024 • poster • arXiv:2407.17020
#1665

Zero-Shot Detection of AI-Generated Images

Davide Cozzolino, Giovanni Poggi, Matthias Niessner et al.

ECCV 2024 • poster • arXiv:2409.15875
#1666

Augmented Neural Fine-tuning for Efficient Backdoor Purification

Md Nazmul Karim, Abdullah Al Arafat, Umar Khalid et al.

ECCV 2024 • poster • arXiv:2407.10052
#1667

MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo

Tianqi Liu, Guangcong Wang, Shoukang Hu et al.

ECCV 2024 • poster • arXiv:2405.12218
#1668

Efficient and Versatile Robust Fine-Tuning of Zero-shot Models

Sungyeon Kim, Boseung Jeong, Donghyun Kim et al.

ECCV 2024 • poster • arXiv:2408.05749
#1669

T-MAE: Temporal Masked Autoencoders for Point Cloud Representation Learning

Weijie Wei, Fatemeh Karimi Nejadasl, Theo Gevers et al.

ECCV 2024 • poster • arXiv:2312.10217
#1670

MARs: Multi-view Attention Regularizations for Patch-based Feature Recognition of Space Terrain

Timothy Chase, Karthik Dantu

ECCV 2024 • poster • arXiv:2410.05182
#1671

Improving Intervention Efficacy via Concept Realignment in Concept Bottleneck Models

Nishad Singhi, Jae Myung Kim, Karsten Roth et al.

ECCV 2024 • poster • arXiv:2405.01531
#1672

G3R: Gradient Guided Generalizable Reconstruction

Yun Chen, Jingkang Wang, Ze Yang et al.

ECCV 2024 • poster • arXiv:2409.19405
#1673

Gaze Target Detection Based on Head-Local-Global Coordination

Yaokun Yang, Feng Lu

ECCV 2024 • poster
#1674

An Economic Framework for 6-DoF Grasp Detection

Xiao-Ming Wu, Jia-Feng Cai, Jian-Jian Jiang et al.

ECCV 2024 • poster • arXiv:2407.08366
#1675

Uni3DL: A Unified Model for 3D Vision-Language Understanding

Xiang Li, Jian Ding, Zhaoyang Chen et al.

ECCV 2024 • poster
#1676

Rethinking Image Super Resolution from Training Data Perspectives

Go Ohtani, Ryu Tadokoro, Ryosuke Yamada et al.

ECCV 2024 • poster
#1677

Progressive Classifier and Feature Extractor Adaptation for Unsupervised Domain Adaptation on Point Clouds

Zicheng Wang, Zhen Zhao, Yiming Wu et al.

ECCV 2024 • poster • arXiv:2311.16474
#1678

SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoders

Sheng-Wei Li, Zi-Xiang Wei, Wei-Jie Jack Chen et al.

ECCV 2024 • poster • arXiv:2407.13460
#1679

SparseSSP: 3D Subcellular Structure Prediction from Sparse-View Transmitted Light Images

Jintu Zheng, Yi Ding, Qizhe Liu et al.

ECCV 2024 • poster • arXiv:2407.02159
#1680

Human Hair Reconstruction with Strand-Aligned 3D Gaussians

Egor Zakharov, Vanessa Sklyarova, Michael J. Black et al.

ECCV 2024 • poster • arXiv:2409.14778
#1681

Grid-Attention: Enhancing Computational Efficiency of Large Vision Models without Fine-Tuning

Pengyu Li, Biao Wang, Tianchu Guo et al.

ECCV 2024 • poster
#1682

General and Task-Oriented Video Segmentation

Mu Chen, Liulei Li, Wenguan Wang et al.

ECCV 2024 • poster • arXiv:2407.06540
#1683

MemBN: Robust Test-Time Adaptation via Batch Norm with Statistics Memory

Juwon Kang, Nayeong Kim, Jungseul Ok et al.

ECCV 2024 • poster
#1684

StereoGlue: Joint Feature Matching and Robust Estimation

Daniel Barath, Dmytro Mishkin, Luca Cavalli et al.

ECCV 2024 • poster
#1685

Scaling Backwards: Minimal Synthetic Pre-training?

Ryo Nakamura, Ryu Tadokoro, Ryosuke Yamada et al.

ECCV 2024 • poster • arXiv:2408.00677
#1686

Enhanced Motion Forecasting with Visual Relation Reasoning

Sungjune Kim, Hadam Baek, Seunggwan Lee et al.

ECCV 2024 • poster
#1687

Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation

Omer Dahary, Or Patashnik, Kfir Aberman et al.

ECCV 2024 • poster • arXiv:2403.16990
#1688

Unleashing the Power of Prompt-driven Nucleus Instance Segmentation

Zhongyi Shui, Yunlong Zhang, Kai Yao et al.

ECCV 2024 • poster • arXiv:2311.15939
#1689

Domain-adaptive Video Deblurring via Test-time Blurring

Jin-Ting He, Fu-Jen Tsai, Jia-Hao Wu et al.

ECCV 2024 • poster • arXiv:2407.09059
#1690

GAURA: Generalizable Approach for Unified Restoration and Rendering of Arbitrary Views

Vinayak Gupta, Rongali Simhachala Venkata Girish, Mukund Varma T et al.

ECCV 2024 • poster • arXiv:2407.08221
#1691

Camera Height Doesn't Change: Unsupervised Training for Metric Monocular Road-Scene Depth Estimation

Genki Kinoshita, Ko Nishino

ECCV 2024 • poster • arXiv:2312.04530
#1692

MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model

Muyao Niu, Xiaodong Cun, Xintao Wang et al.

ECCV 2024 • poster • arXiv:2405.20222
#1693

GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing

Jing Wu, Jiawang Bian, Xinghui Li et al.

ECCV 2024 • poster • arXiv:2403.08733
#1694

Think before Placement: Common Sense Enhanced Transformer for Object Placement

Yaxuan Qin, Jiayu Xu, Ruiping Wang et al.

ECCV 2024 • poster
#1695

Label-anticipated Event Disentanglement for Audio-Visual Video Parsing

Jinxing Zhou, Dan Guo, Yuxin Mao et al.

ECCV 2024 • poster • arXiv:2407.08126
#1696

OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models

Kong Zhe, Yong Zhang, Tianyu Yang et al.

ECCV 2024 • poster • arXiv:2403.10983
#1697

WaSt-3D: Wasserstein-2 Distance for Scene-to-Scene Stylization on 3D Gaussians

Dmytro Kotovenko, Olga Grebenkova, Nikolaos Sarafianos et al.

ECCV 2024 • poster • arXiv:2409.17917
#1698

SCP-Diff: Spatial-Categorical Joint Prior for Diffusion Based Semantic Image Synthesis

Huan-ang Gao, Mingju Gao, Jiaju Li et al.

ECCV 2024 • poster • arXiv:2403.09638
#1699

GiT: Towards Generalist Vision Transformer through Universal Language Interface

Haiyang Wang, Hao Tang, Li Jiang et al.

ECCV 2024 • poster • arXiv:2403.09394
#1700

Object-Aware NIR-to-Visible Translation

Yunyi Gao, Lin Gu, Qiankun Liu et al.

ECCV 2024 • poster
#1701

Integer-Valued Training and Spike-driven Inference Spiking Neural Network for High-performance and Energy-efficient Object Detection

Xinhao Luo, Man Yao, Yuhong Chou et al.

ECCV 2024 • poster • arXiv:2407.20708
#1702

DreamDrone: Text-to-Image Diffusion Models are Zero-shot Perpetual View Generators

Hanyang Kong, Dongze Lian, Michael Bi Mi et al.

ECCV 2024 • poster • arXiv:2312.08746
#1703

SpaceJAM: a Lightweight and Regularization-free Method for Fast Joint Alignment of Images

Nir Barel, Ron Aharon Shapira Weber, Nir Mualem et al.

ECCV 2024 • poster • arXiv:2407.11850
#1704

Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos

Akshay Paruchuri, Samuel Ehrenstein, Shuxian Wang et al.

ECCV 2024 • poster • arXiv:2403.17915
#1705

Multi-modal Relation Distillation for Unified 3D Representation Learning

Huiqun Wang, Yiping Bao, Panwang Pan et al.

ECCV 2024 • poster • arXiv:2407.14007
#1706

Diffusion-Generated Pseudo-Observations for High-Quality Sparse-View Reconstruction

Xinhang Liu, Jiaben Chen, Shiu-Hong Kao et al.

ECCV 2024 • poster • arXiv:2305.15171
#1707

TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks

Jinjie Mai, Wenxuan Zhu, Sara Rojas Martinez et al.

ECCV 2024 • poster • arXiv:2408.10739
#1708

DSA: Discriminative Scatter Analysis for Early Smoke Segmentation

Lujian Yao, Haitao Zhao, Jingchao Peng et al.

ECCV 2024 • poster
#1709

Segment3D: Learning Fine-Grained Class-Agnostic 3D Segmentation without Manual Labels

Rui Huang, Songyou Peng, Ayca Takmaz et al.

ECCV 2024 • poster • arXiv:2312.17232
#1710

Efficient Training of Spiking Neural Networks with Multi-Parallel Implicit Stream Architecture

Zhigao Cao, Meng Li, Xiashuang Wang et al.

ECCV 2024 • poster
#1711

GarmentAligner: Text-to-Garment Generation via Retrieval-augmented Multi-level Corrections

Shiyue Zhang, Zheng Chong, Xujie Zhang et al.

ECCV 2024 • poster • arXiv:2408.12352
#1712

Continuous SO(3) Equivariant Convolution for 3D Point Cloud Analysis

Jaein Kim, Hee Bin Yoo, Dong-Sig Han et al.

ECCV 2024 • poster
#1713

Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation

Seung Hyun Lee, Yinxiao Li, Junjie Ke et al.

ECCV 2024 • poster • arXiv:2401.05675
#1714

FlexAttention for Efficient High-Resolution Vision-Language Models

Junyan Li, Delin Chen, Tianle Cai et al.

ECCV 2024 • poster • arXiv:2407.20228
#1715

EGIC: Enhanced Low-Bit-Rate Generative Image Compression Guided by Semantic Segmentation

Nikolai Körber, Eduard Kromer, Andreas Siebert et al.

ECCV 2024 • poster • arXiv:2309.03244
#1716

AutoEval-Video: An Automatic Benchmark for Assessing Large Vision Language Models in Open-Ended Video Question Answering

Xiuyuan Chen, Yuan Lin, Yuchen Zhang et al.

ECCV 2024 • poster • arXiv:2311.14906
#1717

MedRAT: Unpaired Medical Report Generation via Auxiliary Tasks

Elad Hirsch, Gefen Dawidowicz, Ayellet Tal

ECCV 2024 • poster • arXiv:2407.03919
#1718

Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection

Xingyu Peng, Yan Bai, Chen Gao et al.

ECCV 2024 • poster • arXiv:2407.08931
#1719

Pathology-knowledge Enhanced Multi-instance Prompt Learning for Few-shot Whole Slide Image Classification

Linhao Qu, Dingkang Yang, Dan Huang et al.

ECCV 2024 • poster • arXiv:2407.10814
#1720

Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models

Longxiang Tang, Zhuotao Tian, Kai Li et al.

ECCV 2024 • poster • arXiv:2407.05342
#1721

U-COPE: Taking a Further Step to Universal 9D Category-level Object Pose Estimation

Li Zhang, Weiqing Meng, Yan Zhong et al.

ECCV 2024 • poster
#1722

DCDM: Diffusion-Conditioned-Diffusion Model for Scene Text Image Super-Resolution

Shrey Singh, Prateek Keserwani, Masakazu Iwamura et al.

ECCV 2024 • poster
#1723

Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning

Thanh Thong Nguyen, Yi Bin, Xiaobao Wu et al.

ECCV 2024 • poster • arXiv:2407.03788
#1724

Integrating Markov Blanket Discovery into Causal Representation Learning for Domain Generalization

Naiyu Yin, Hanjing Wang, Yue Yu et al.

ECCV 2024 • poster
#1725

A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties

Junfei Xiao, Ziqi Zhou, Wenxuan Li et al.

ECCV 2024 • poster • arXiv:2312.13764
#1726

Plain-Det: A Plain Multi-Dataset Object Detector

Cheng Shi, Yuchen Zhu, Sibei Yang

ECCV 2024 • poster • arXiv:2407.10083
#1727

Ex2Eg-MAE: A Framework for Adaptation of Exocentric Video Masked Autoencoders for Egocentric Social Role Understanding

Minh Tran, Yelin Kim, Che-Chun Su et al.

ECCV 2024 • poster
#1728

Towards High-Quality 3D Motion Transfer with Realistic Apparel Animation

Rong Wang, Wei Mao, Changsheng Lu et al.

ECCV 2024 • poster • arXiv:2407.11266
#1729

iMatching: Imperative Correspondence Learning

Chen Wang, Dasong Gao, Yun-Jou Lin et al.

ECCV 2024 • poster
#1730

OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web

Raghav Kapoor, Yash Parag Butala, Melisa A Russak et al.

ECCV 2024 • poster • arXiv:2402.17553
#1731

MTKD: Multi-Teacher Knowledge Distillation for Image Super-Resolution

Yuxuan Jiang, Chen Feng, Fan Zhang et al.

ECCV 2024 • poster • arXiv:2404.09571
#1732

ReMatching: Low-Resolution Representations for Scalable Shape Correspondence

Filippo Maggioli, Daniele Baieri, Emanuele Rodola et al.

ECCV 2024 • poster • arXiv:2305.09274
#1733

ReLoo: Reconstructing Humans Dressed in Loose Garments from Monocular Video in the Wild

Chen Guo, Tianjian Jiang, Manuel Kaufmann et al.

ECCV 2024 • poster • arXiv:2409.15269
#1734

Omni-Recon: Harnessing Image-based Rendering for General-Purpose Neural Radiance Fields

Yonggan Fu, Huaizhi Qu, Zhifan Ye et al.

ECCV 2024 • poster • arXiv:2403.11131
#1735

Syn-to-Real Domain Adaptation for Point Cloud Completion via Part-based Approach

Yunseo Yang, Jihun Kim, Kuk-Jin Yoon

ECCV 2024 • poster
#1736

ScanReason: Empowering 3D Visual Grounding with Reasoning Capabilities

Chenming Zhu, Tai Wang, Wenwei Zhang et al.

ECCV 2024 • poster • arXiv:2407.01525
#1737

TrajPrompt: Aligning Color Trajectory with Vision-Language Representations

Li-Wu Tsao, Hao-Tang Tsui, Yu-Rou Tuan et al.

ECCV 2024 • poster
#1738

Salience-Based Adaptive Masking: Revisiting Token Dynamics for Enhanced Pre-training

Hyesong Choi, Hyejin Park, Kwang Moo Yi et al.

ECCV 2024 • poster • arXiv:2404.08327
#1739

Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects

Zicong Fan, Takehiko Ohkawa, Linlin Yang et al.

ECCV 2024 • poster • arXiv:2403.16428
#1740

Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors

Jae Joong Lee, Bosheng Li, Sara Beery et al.

ECCV 2024 • poster • arXiv:2407.10330
#1741

LoA-Trans: Enhancing Visual Grounding by Location-Aware Transformers

Ziling Huang, Shin’ichi Satoh

ECCV 2024 • poster
#1742

DomainFusion: Generalizing To Unseen Domains with Latent Diffusion Models

Yuyang Huang, Yabo Chen, Yuchen Liu et al.

ECCV 2024 • poster
#1743

3D Single-object Tracking in Point Clouds with High Temporal Variation

Qiao Wu, Kun Sun, Pei An et al.

ECCV 2024 • poster • arXiv:2408.02049
#1744

Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion Models

Yasi Zhang, Peiyu Yu, Ying Nian Wu

ECCV 2024 • poster • arXiv:2404.07389
#1745

RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation

Luis Li, Hubert P. H. Shum, Toby P Breckon

ECCV 2024 • poster • arXiv:2407.10159
#1746

Rejection Sampling IMLE: Designing Priors for Better Few-Shot Image Synthesis

Chirag Vashist, Shichong Peng, Ke Li

ECCV 2024 • poster • arXiv:2409.17439
#1747

Solving Motion Planning Tasks with a Scalable Generative Model

Yihan Hu, Siqi Chai, Zhening Yang et al.

ECCV 2024 • poster • arXiv:2407.02797
#1748

SceneScript: Reconstructing Scenes With An Autoregressive Structured Language Model

Armen Avetisyan, Christopher Xie, Henry Howard-Jenkins et al.

ECCV 2024 • poster • arXiv:2403.13064
#1749

Implicit Filtering for Learning Neural Signed Distance Functions from 3D Point Clouds

Shengtao Li, Ge Gao, Yudong Liu et al.

ECCV 2024 • poster • arXiv:2407.13342
#1750

PLOT: Text-based Person Search with Part Slot Attention for Corresponding Part Discovery

Jicheol Park, Dongwon Kim, Boseung Jeong et al.

ECCV 2024 • poster • arXiv:2409.13475
#1751

Finding Meaning in Points: Weakly Supervised Semantic Segmentation for Event Cameras

Hoonhee Cho, Sung-Hoon Yoon, Hyeokjun Kweon et al.

ECCV 2024 • poster • arXiv:2407.11216
#1752

LiDAR-Event Stereo Fusion with Hallucinations

Luca Bartolomei, Matteo Poggi, Andrea Conti et al.

ECCV 2024 • poster • arXiv:2408.04633
#1753

Tensorial template matching for fast cross-correlation with rotations and its application for tomography

Antonio Martinez-Sanchez, Ulrike Homberg, J. M. Almira et al.

ECCV 2024 • poster • arXiv:2408.02398
#1754

Cross-Input Certified Training for Universal Perturbations

Changming Xu, Gagandeep Singh

ECCV 2024 • poster • arXiv:2405.09176
#1755

Motion-Guided Latent Diffusion for Temporally Consistent Real-world Video Super-resolution

Xi Yang, Chenhang He, Jianqi Ma et al.

ECCV 2024 • poster • arXiv:2312.00853
#1756

SAFARI: Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation

Sayan Nag, Koustava Goswami, Srikrishna Karanam

ECCV 2024 • poster
#1757

BlazeBVD: Make Scale-Time Equalization Great Again for Blind Video Deflickering

Xinmin Qiu, Congying Han, Zicheng Zhang et al.

ECCV 2024 • poster • arXiv:2403.06243
#1758

A Closer Look at GAN Priors: Exploiting Intermediate Features for Enhanced Model Inversion Attacks

Yixiang Qiu, Hao Fang, Hongyao Yu et al.

ECCV 2024 • poster • arXiv:2407.13863
#1759

ShareGPT4V: Improving Large Multi-Modal Models with Better Captions

Lin Chen, Jinsong Li, Xiaoyi Dong et al.

ECCV 2024 • poster • arXiv:2311.12793
#1760

Text2Place: Affordance-aware Text Guided Human Placement

Rishubh Parihar, Harsh Gupta, Sachidanand VS et al.

ECCV 2024 • poster • arXiv:2407.15446
#1761

Adaptive Multi-task Learning for Few-shot Object Detection

Yan Ren, Yanling Li, Wai-Kin Adams Kong

ECCV 2024 • poster
#1762

Textual-Visual Logic Challenge: Understanding and Reasoning in Text-to-Image Generation

Peixi Xiong, Michael A Kozuch, Nilesh Jain

ECCV 2024 • poster
#1763

Spectral Subsurface Scattering for Material Classification

Haejoon Lee, Aswin C. Sankaranarayanan

ECCV 2024 • poster
#1764

Multimodal Cross-Domain Few-Shot Learning for Egocentric Action Recognition

Masashi Hatano, Ryo Hachiuma, Ryo Fujii et al.

ECCV 2024 • poster • arXiv:2405.19917
#1765

AdvDiff: Generating Unrestricted Adversarial Examples using Diffusion Models

Xuelong Dai, Kaisheng Liang, Bin Xiao

ECCV 2024 • poster • arXiv:2307.12499
#1766

Merlin: Empowering Multimodal LLMs with Foresight Minds

En Yu, Liang Zhao, Yana Wei et al.

ECCV 2024 • poster
#1767

HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects

Xintao Lv, Liang Xu, Yichao Yan et al.

ECCV 2024 • poster • arXiv:2407.12371
#1768

High-Resolution and Few-shot View Synthesis from Asymmetric Dual-lens Inputs

Ruikang Xu, Mingde Yao, Yue Li et al.

ECCV 2024 • poster
#1769

DiffPMAE: Diffusion Masked Autoencoders for Point Cloud Reconstruction

Yanlong Li, Chamara Madarasingha, Kanchana Thilakarathna

ECCV 2024 • poster • arXiv:2312.03298
#1770

BRAVE: Broadening the visual encoding of vision-language models

Oguzhan Fatih Kar, Alessio Tonioni, Petra Poklukar et al.

ECCV 2024 • poster • arXiv:2404.07204
#1771

GroupDiff: Diffusion-based Group Portrait Editing

Yuming Jiang, Nanxuan Zhao, Qing Liu et al.

ECCV 2024 • poster • arXiv:2409.14379
#1772

BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos

Pilhyeon Lee, Hyeran Byun

ECCV 2024 • poster • arXiv:2312.00083
#1773

Disentangling Masked Autoencoders for Unsupervised Domain Generalization

An Zhang, Han Wang, Xiang Wang et al.

ECCV 2024 • poster • arXiv:2407.07544
#1774

Geospecific View Generation - Geometry-Context Aware High-resolution Ground View Inference from Satellite Views

Ningli Xu, Rongjun Qin

ECCV 2024 • poster • arXiv:2407.08061
#1775

Co-Student: Collaborating Strong and Weak Students for Sparsely Annotated Object Detection

Lianjun Wu, Jiangxiao Han, Zengqiang Zheng et al.

ECCV 2024 • poster
#1776

ProMerge: Prompt and Merge for Unsupervised Instance Segmentation

Dylan Li, Gyungin Shin

ECCV 2024 • poster • arXiv:2409.18961
#1777

GLARE: Low Light Image Enhancement via Generative Latent Feature based Codebook Retrieval

Han Zhou, Wei Dong, Xiaohong Liu et al.

ECCV 2024 • poster • arXiv:2407.12431
#1778

Learning to Generate Conditional Tri-plane for 3D-aware Expression Controllable Portrait Animation

Taekyung Ki, Dongchan Min, Gyeongsu Chae

ECCV 2024 • poster • arXiv:2404.00636
#1779

4D Contrastive Superflows are Dense 3D Representation Learners

Xiang Xu, Lingdong Kong, Hui Shuai et al.

ECCV 2024 • poster • arXiv:2407.06190
#1780

Panel-Specific Degradation Representation for Raw Under-Display Camera Image Restoration

Youngjin Oh, Keuntek Lee, Jooyoung Lee et al.

ECCV 2024 • poster
#1781

Diffusion-Guided Weakly Supervised Semantic Segmentation

Sung-Hoon Yoon, Hoyong Kwon, Jaeseok Jeong et al.

ECCV 2024 • poster
#1782

On the Approximation Risk of Few-Shot Class-Incremental Learning

Xuan Wang, Zhong Ji, Xiyao Liu et al.

ECCV 2024 • poster
#1783

Online Vectorized HD Map Construction using Geometry

Zhixin Zhang, Yiyuan Zhang, Xiaohan Ding et al.

ECCV 2024 • poster • arXiv:2312.03341
#1784

Self-Supervised Underwater Caustics Removal and Descattering via Deep Monocular SLAM

Jonathan Sauder, Devis Tuia

ECCV 2024 • poster
#1785

Click-Gaussian: Interactive Segmentation to Any 3D Gaussians

Seokhun Choi, Hyeonseop Song, Jaechul Kim et al.

ECCV 2024 • poster • arXiv:2407.11793
#1786

AdversariaLeak: External Information Leakage Attack Using Adversarial Samples on Face Recognition Systems

Roye Katzav, Amit Giloni, Edita Grolman et al.

ECCV 2024 • poster
#1787

MANIKIN: Biomechanically Accurate Neural Inverse Kinematics for Human Motion Estimation

Jiaxi Jiang, Paul Streli, Xuejing Luo et al.

ECCV 2024 • poster
#1788

Disentangled Generation and Aggregation for Robust Radiance Fields

Shihe Shen, Huachen Gao, Wangze Xu et al.

ECCV 2024 • poster • arXiv:2409.15715
#1789

Momentum Auxiliary Network for Supervised Local Learning

Junhao Su, Changpeng Cai, Feiyu Zhu et al.

ECCV 2024 • poster • arXiv:2407.05623
#1790

JointDreamer: Ensuring Geometry Consistency and Text Congruence in Text-to-3D Generation via Joint Score Distillation

ChenHan Jiang, Yihan Zeng, Tianyang Hu et al.

ECCV 2024 • poster • arXiv:2407.12291
#1791

Implicit Neural Models to Extract Heart Rate from Video

Pradyumna Chari, Anirudh Bindiganavale Harish, Adnan Armouti et al.

ECCV 2024 • poster
#1792

Occupancy as Set of Points

Yiang Shi, Tianheng Cheng, Qian Zhang et al.

ECCV 2024 • poster • arXiv:2407.04049
#1793

Cocktail Universal Adversarial Attack on Deep Neural Networks

Shaoxin Li, Xiaofeng Liao, Xin Che et al.

ECCV 2024 • poster
#1794

FLAT: Flux-aware Imperceptible Adversarial Attacks on 3D Point Clouds

Keke Tang, Lujie Huang, Weilong Peng et al.

ECCV 2024 • poster
#1795

Progressive Proxy Anchor Propagation for Unsupervised Semantic Segmentation

Hyun Seok Seong, WonJun Moon, SuBeen Lee et al.

ECCV 2024 • poster • arXiv:2407.12463
#1796

AdaDiff: Accelerating Diffusion Models through Step-Wise Adaptive Computation

Shengkun Tang, Yaqing Wang, Caiwen Ding et al.

ECCV 2024 • poster • arXiv:2309.17074
#1797

SCAPE: A Simple and Strong Category-Agnostic Pose Estimator

Yujia Liang, Zixuan Ye, Wenze Liu et al.

ECCV 2024 • poster • arXiv:2407.13483
#1798

FreeMotion: MoCap-Free Human Motion Synthesis with Multimodal Large Language Models

Zhikai Zhang, Yitang Li, Haofeng Huang et al.

ECCV 2024 • poster • arXiv:2406.10740
#1799

Norma: A Noise Robust Memory-Augmented Framework for Whole Slide Image Classification

Yu Bai, Bo Zhang, Zheng Zhang et al.

ECCV 2024 • poster
#1800

Controllable Contextualized Image Captioning: Directing the Visual Narrative through User-Defined Highlights

Shunqi Mao, Chaoyi Zhang, Hang Su et al.

ECCV 2024 • poster • arXiv:2407.11449