Most Cited 2024 &quot;diagram analysis&quot; Papers

ECCV 2024posterarXiv:2403.04899

#3002

Towards Scene Graph Anticipation

Rohith Peddi, Saksham Singh, Saurabh . et al.

ECCV 2024posterarXiv:2403.19160

#3003

Within the Dynamic Context: Inertia-aware 3D Human Modeling with Pose Sequence

Yutong Chen, Yifan Zhan, Zhihang Zhong et al.

ECCV 2024posterarXiv:2401.04339

#3004

Memory-Efficient Fine-Tuning for Quantized Diffusion Model

Hyogon Ryu, Seohyun Lim, Hyunjung Shim

#3005

Relational Matching for Weakly Semi-Supervised Oriented Object Detection

Wenhao Wu, Hau San Wong, Si Wu et al.

#3006

General Point Model Pretraining with Autoencoding and Autoregressive

Zhe Li, Zhangyang Gao, Cheng Tan et al.

ICLR 2024posterarXiv:2401.10632

#3007

Interventional Fairness on Partially Known Causal Graphs: A Constrained Optimization Approach

Aoqi Zuo, yiqing li, Susan Wei et al.

ECCV 2024posterarXiv:2408.08671

#3008

Towards Physical World Backdoor Attacks against Skeleton Action Recognition

Qichen Zheng, Yi Yu, SIYUAN YANG et al.

ECCV 2024posterarXiv:2407.08209

#3009

Enriching Information and Preserving Semantic Consistency in Expanding Curvilinear Object Segmentation Datasets

Qin Lei, Jiang Zhong, Qizhu Dai

ECCV 2024posterarXiv:2403.14270

#3010

Scene-Graph ViT: End-to-End Open-Vocabulary Visual Relationship Detection

Tim Salzmann, Markus Ryll, Alex Bewley et al.

CVPR 2024posterarXiv:2302.04871

#3011

In-N-Out: Faithful 3D GAN Inversion with Volumetric Decomposition for Face Editing

Yiran Xu, Zhixin Shu, Cameron Smith et al.

#3012

Cross Initialization for Face Personalization of Text-to-Image Models

Lianyu Pang, Jian Yin, Haoran Xie et al.

ECCV 2024posterarXiv:2407.11288

#3013

Zero-Shot Adaptation for Approximate Posterior Sampling of Diffusion Models in Inverse Problems

Yasar Utku Alcalar, Mehmet Akcakaya

CVPR 2024highlightarXiv:2302.09585

#3014

StreamingFlow: Streaming Occupancy Forecasting with Asynchronous Multi-modal Data Streams via Neural Ordinary Differential Equation

Yining Shi, Kun JIANG, Ke Wang et al.

CVPR 2024highlightarXiv:2405.06216

#3015

Event-based Structure-from-Orbit

Ethan Elms, Yasir Latif, Tae Ha Park et al.

ECCV 2024posterarXiv:2403.12038

#3016

Zero-Shot Image Feature Consensus with Deep Functional Maps

Xinle Cheng, Congyue Deng, Adam Harley et al.

CVPR 2024posterarXiv:2403.19235

#3017

DreamSalon: A Staged Diffusion Framework for Preserving Identity-Context in Editable Face Generation

Haonan Lin

AAAI 2024paperarXiv:2401.16687

#3018

Revisiting Gradient Pruning: A Dual Realization for Defending against Gradient Attacks

Lulu Xue, Shengshan Hu, Ruizhi Zhao et al.

CVPR 2024posterarXiv:2404.19294

#3019

Masked Spatial Propagation Network for Sparsity-Adaptive Depth Refinement

Jinyoung Jun, Jae-Han Lee, Chang-Su Kim

CVPR 2024posterarXiv:2404.11139

#3020

GeoReF: Geometric Alignment Across Shape Variation for Category-level Object Pose Refinement

Linfang Zheng, Tze Ho Elden Tse, Chen Wang et al.

#3021

Learning Personalized Causally Invariant Representations for Heterogeneous Federated Clients

Xueyang Tang, Song Guo, Jie ZHANG et al.

ICLR 2024poster

#3022

A Dynamic Learning Method towards Realistic Compositional Zero-Shot Learning

Xiaoming Hu, Zilei Wang

#3023

Prompting Future Driven Diffusion Model for Hand Motion Prediction

Bowen Tang, Kaihao Zhang, Wenhan Luo et al.

#3024

Auto-DAS: Automated Proxy Discovery for Training-free Distillation-aware Architecture Search

Haosen SUN, Lujun Li, Peijie Dong et al.

#3025

QPEN: Quantum Projection and Quantum Entanglement Enhanced Network for Cross-Lingual Aspect-Based Sentiment Analysis

Xingqiang Zhao, Hai Wan, Kunxun Qi

ECCV 2024posterarXiv:2407.05594

#3026

SLIM: Spuriousness Mitigation with Minimal Human Annotations

Xiwei Xuan, Ziquan Deng, Hsuan-Tien Lin et al.

CVPR 2024posterarXiv:2404.01686

#3027

JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments

Duy Tho Le, Chenhui Gou, Stavya Datta et al.

CVPR 2024posterarXiv:2405.19902

#3028

Learning Discriminative Dynamics with Label Corruption for Noisy Label Detection

Suyeon Kim, Dongha Lee, SeongKu Kang et al.

ECCV 2024posterarXiv:2404.01889

#3029

RAVE: Residual Vector Embedding for CLIP-Guided Backlit Image Enhancement

Tatiana Gaintseva, Martin Benning, Greg Slabaugh

#3030

D4-VTON: Dynamic Semantics Disentangling for Differential Diffusion based Virtual Try-On

Zhaotong Yang, Zicheng Jiang, Xinzhe Li et al.

CVPR 2024posterarXiv:2404.04848

#3031

Task-Aware Encoder Control for Deep Video Compression

Xingtong Ge, Jixiang Luo, XINJIE ZHANG et al.

CVPR 2024posterarXiv:2406.06813

#3032

Stable Neighbor Denoising for Source-free Domain Adaptive Segmentation

Dong Zhao, Shuang Wang, Qi Zang et al.

AAAI 2024paperarXiv:2305.11476

#3033

Learning Diverse Risk Preferences in Population-Based Self-Play

Yuhua Jiang, Qihan Liu, Xiaoteng Ma et al.

ECCV 2024posterarXiv:2404.07988

#3034

Parameterized Quasi-Physical Simulators for Dexterous Manipulations Transfer

Xueyi Liu, Kangbo Lyu, jieqiong zhang et al.

AAAI 2024paperarXiv:2302.13543

#3035

BLiRF: Bandlimited Radiance Fields for Dynamic Scene Modeling

Sameera Ramasinghe, Violetta Shevchenko, Gil Avraham et al.

#3036

IPRemover: A Generative Model Inversion Attack against Deep Neural Network Fingerprinting and Watermarking

Wei Zong, Yang-Wai Chow, Willy Susilo et al.

AAAI 2024paperarXiv:2401.10211

#3037

Improving PTM Site Prediction by Coupling of Multi-Granularity Structure and Multi-Scale Sequence Representation

Zhengyi Li, Menglu Li, Lida Zhu et al.

CVPR 2024posterarXiv:2404.05661

#3038

Automatic Controllable Colorization via Imagination

Xiaoyan Cong, Yue Wu, Qifeng Chen et al.

ECCV 2024posterarXiv:2405.14582

#3039

PoseCrafter: One-Shot Personalized Video Synthesis Following Flexible Pose Control

Yong Zhong, Min Zhao, Zebin You et al.

CVPR 2024posterarXiv:2404.00254

#3040

Clustering for Protein Representation Learning

Ruijie Quan, Wenguan Wang, Fan Ma et al.

#3041

Cross-Dimension Affinity Distillation for 3D EM Neuron Segmentation

Xiaoyu Liu, Miaomiao Cai, Yinda Chen et al.

AAAI 2024paperarXiv:2303.13077

#3042

An Efficient Knowledge Transfer Strategy for Spiking Neural Networks from Static to Event Domain

Xiang He, Dongcheng Zhao, Yang Li et al.

CVPR 2024posterarXiv:2403.04245

#3043

A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition

Yusheng Dai, HangChen, Jun Du et al.

AAAI 2024paperarXiv:2406.07967

#3044

Better than Random: Reliable NLG Human Evaluation with Constrained Active Sampling

Jie Ruan, Xiao Pu, Mingqi Gao et al.

ECCV 2024posterarXiv:2404.16828

#3045

Made to Order: Discovering monotonic temporal changes via self-supervised video ordering

Charig Yang, Weidi Xie, Andrew ZISSERMAN

ICLR 2024posterarXiv:2306.03857

#3046

Learning with a Mole: Transferable latent spatial representations for navigation without reconstruction

Guillaume Bono, Leonid Antsfeld, Assem Sadek et al.

AAAI 2024paperarXiv:2312.07122

#3047

Neural Reasoning about Agents’ Goals, Preferences, and Actions

Matteo Bortoletto, Lei Shi, Andreas Bulling

CVPR 2024posterarXiv:2405.10286

#3048

FFF: Fixing Flawed Foundations in Contrastive Pre-Training Results in Very Strong Vision-Language Models

Adrian Bulat, Yassine Ouali, Georgios Tzimiropoulos

#3049

Gaze from Origin: Learning for Generalized Gaze Estimation by Embedding the Gaze Frontalization Process

Mingjie Xu, Feng Lu

AAAI 2024paperarXiv:2403.05093

#3050

Spectrum Translation for Refinement of Image Generation (STIG) Based on Contrastive Learning and Spectral Filter Profile

Seokjun Lee, Seung-Won Jung, Hyunseok Seo

AAAI 2024paperarXiv:2310.04884

#3051

Regret Analysis of Repeated Delegated Choice

Suho Shin, Keivan Rezaei, Mohammad Hajiaghayi et al.

AAAI 2024paperarXiv:2401.09067

#3052

Towards Continual Learning Desiderata via HSIC-Bottleneck Orthogonalization and Equiangular Embedding

Depeng Li, Tianqi Wang, Junwei Chen et al.

AAAI 2024paperarXiv:2312.11545

#3053

Robust Communicative Multi-Agent Reinforcement Learning with Active Defense

Lebin Yu, Yunbo Qiu, Quanming Yao et al.

#3054

RecurrentBEV: A Long-term Temporal Fusion Framework for Multi-view 3D Detection

Ming Chang, Xishan Zhang, Rui Zhang et al.

AAAI 2024paperarXiv:2402.00084

#3055

EPSD: Early Pruning with Self-Distillation for Efficient Model Compression

Dong Chen, Ning Liu, Yichen Zhu et al.

AAAI 2024paperarXiv:2312.11091

#3056

Colored Noise in PPO: Improved Exploration and Performance through Correlated Action Sampling

Jakob Hollenstein, Georg Martius, Justus Piater

#3057

Decoding Global Preferences: Temporal and Cooperative Dependency Modeling in Multi-Agent Preference-Based Reinforcement Learning

Tianchen Zhu, Yue Qiu, Haoyi Zhou et al.

AAAI 2024paperarXiv:2305.08372

#3058

Hierarchical Aligned Multimodal Learning for NER on Tweet Posts

Peipei Liu, Hong Li, Yimo Ren et al.

AAAI 2024paperarXiv:2402.02097

#3059

Settling Decentralized Multi-Agent Coordinated Exploration by Novelty Sharing

Haobin Jiang, Ziluo Ding, Zongqing Lu

ICLR 2024oralarXiv:2311.01450

#3060

DreamSmooth: Improving Model-based Reinforcement Learning via Reward Smoothing

Vint Lee, Pieter Abbeel, Youngwoon Lee

ECCV 2024posterarXiv:2409.03755

#3061

DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic Compensation

Wenliang Zhao, Haolin Wang, Jie Zhou et al.

CVPR 2024posterarXiv:2403.13548

#3062

Diversity-aware Channel Pruning for StyleGAN Compression

Jiwoo Chung, Sangeek Hyun, Sang-Heon Shim et al.

AAAI 2024paperarXiv:2305.14024

#3063

Improved Metric Distortion via Threshold Approvals

Elliot Anshelevich, Aris Filos-Ratsikas, Christopher Jerrett et al.

AAAI 2024paperarXiv:2412.03825

#3064

Residual Hyperbolic Graph Convolution Networks

Yangkai Xue, Jindou Dai, Zhipeng Lu et al.

CVPR 2024posterarXiv:2404.00842

#3065

An N-Point Linear Solver for Line and Motion Estimation with Event Cameras

Ling Gao, Daniel Gehrig, Hang Su et al.

ECCV 2024posterarXiv:2409.15801

#3066

DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation

Soojin Jang, JungMin Yun, JuneHyoung Kwon et al.

AAAI 2024paperarXiv:2305.06741

#3067

IVP-VAE: Modeling EHR Time Series with Initial Value Problem Solvers

Jingge Xiao, Leonie Basso, Wolfgang Nejdl et al.

AAAI 2024paperarXiv:2403.00225

#3068

Robust Policy Learning via Offline Skill Diffusion

Woo Kyung Kim, Minjong Yoo, Honguk Woo

ECCV 2024posterarXiv:2305.18381

#3069

Distill Gold from Massive Ores: Bi-level Data Pruning towards Efficient Dataset Distillation

YUE XU, Yong-Lu Li, Kaitong Cui et al.

#3070

RePOSE: 3D Human Pose Estimation via Spatio-Temporal Depth Relational Consistency

Ziming Sun, Yuan Liang, Zejun Ma et al.

AAAI 2024paperarXiv:2312.15971

#3071

Graph Context Transformation Learning for Progressive Correspondence Pruning

Junwen Guo, Guobao Xiao, Shiping Wang et al.

ECCV 2024posterarXiv:2407.15396

#3072

Semantic Diversity-aware Prototype-based Learning for Unbiased Scene Graph Generation

Jaehyeong Jeon, Kibum Kim, Kanghoon Yoon et al.

AAAI 2024paperarXiv:2312.09783

#3073

Keep the Faith: Faithful Explanations in Convolutional Neural Networks for Case-Based Reasoning

Tom Nuno Wolf, Fabian Bongratz, Anne-Marie Rickmann et al.

#3074

Spherical Pseudo-Cylindrical Representation for Omnidirectional Image Super-resolution

Qing Cai, Mu Li, Dongwei Ren et al.

#3075

Bottom-Up Domain Prompt Tuning for Generalized Face Anti-Spoofing

SI-QI LIU, Qirui Wang, Pong Chi Yuen

ECCV 2024posterarXiv:2311.17086

#3076

PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation

Jian Ma, Chen Chen, Qingsong Xie et al.

ICLR 2024posterarXiv:2309.17189

#3077

RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation

Samuel Pegg, Kai Li, Xiaolin Hu

AAAI 2024paperarXiv:2312.08504

#3078

1/2-Approximate MMS Allocation for Separable Piecewise Linear Concave Valuations

Chandra Chekuri, Pooja Kulkarni, Rucha Kulkarni et al.

#3079

An Explainable Vision Question Answer Model via Diffusion Chain-of-Thought

Chunhao LU, Qiang Lu, Jake Luo

CVPR 2024posterarXiv:2403.19326

#3080

MedBN: Robust Test-Time Adaptation against Malicious Test Samples

Hyejin Park, Jeongyeon Hwang, Sunung Mun et al.

ICLR 2024posterarXiv:2310.17256

#3081

fairret: a Framework for Differentiable Fairness Regularization Terms

Maarten Buyl, MaryBeth Defrance, Tijl De Bie

CVPR 2024posterarXiv:2310.17154

#3082

Deep Imbalanced Regression via Hierarchical Classification Adjustment

Haipeng Xiong, Angela Yao

#3083

Backdoor Adjustment via Group Adaptation for Debiased Coupon Recommendations

Junpeng Fang, Gongduo Zhang, Qing Cui et al.

AAAI 2024paperarXiv:2309.03581

#3084

Interactive Hyperparameter Optimization in Multi-Objective Problems via Preference Learning

Joseph Giovanelli, Alexander Tornede, Tanja Tornede et al.

#3085

Diversity-Authenticity Co-constrained Stylization for Federated Domain Generalization in Person Re-identification

Fengxiang Yang, Zhun Zhong, Zhiming Luo et al.

ICLR 2024posterarXiv:2311.04640

#3086

Object-Centric Learning with Slot Mixture Module

Daniil Kirilenko, Vitaliy Vorobyov, Aleksey Kovalev et al.

#3087

Shape from Heat Conduction

Sriram Narayanan, Mani Ramanagopal, Mark Sheinin et al.

ECCV 2024posterarXiv:2501.02771

#3088

WorldPose: A World Cup Dataset for Global 3D Human Pose Estimation

Tianjian Jiang, Johsan Billingham, Sebastian Müksch et al.

ECCV 2024posterarXiv:2402.17986

#3089

PolyOculus: Simultaneous Multi-view Image-based Novel View Synthesis

Jason Yu, Tristan Aumentado-Armstrong, Fereshteh Forghani et al.

#3090

CatmullRom Splines-Based Regression for Image Forgery Localization

Li Zhang, Mingliang Xu, Dong Li et al.

#3091

On Harmonizing Implicit Subpopulations

Feng Hong, Jiangchao Yao, YUEMING LYU et al.

ICLR 2024poster

#3092

Text2City: One-Stage Text-Driven Urban Layout Regeneration

Yiming Qin, Nanxuan Zhao, Bin Sheng et al.

ICLR 2024posterarXiv:2401.09787

#3093

Querying Easily Flip-flopped Samples for Deep Active Learning

Seong Jin Cho, Gwangsu Kim, Junghyun Lee et al.

#3094

TTT-MIM: Test-Time Training with Masked Image Modeling for Denoising Distribution Shifts

Youssef Mansour, Xuyang Zhong, Serdar Caglar et al.

ECCV 2024posterarXiv:2305.03713

#3095

Avatar Fingerprinting for Authorized Use of Synthetic Talking-Head Videos

Ekta Prashnani, Koki Nagano, Shalini De Mello et al.

AAAI 2024paperarXiv:2402.12184

#3096

Colorizing Monochromatic Radiance Fields

Yean Cheng, Renjie Wan, Shuchen Weng et al.

ECCV 2024posterarXiv:2406.12452

#3097

Insect Identification in the Wild: The AMI Dataset

Aditya Jain, Fagner Cunha, Michael J Bunsen et al.

AAAI 2024paperarXiv:2312.10608

#3098

Robust 3D Tracking with Quality-Aware Shape Completion

Jingwen Zhang, Zikun Zhou, Guangming Lu et al.

ECCV 2024posterarXiv:2407.05578

#3099

FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance

Jiedong Zhuang, Jiaqi Hu, Lianrui Mu et al.

ECCV 2024posterarXiv:2409.18953

#3100

UniCal: Unified Neural Sensor Calibration

Ze Yang, George G Chen, Haowei Zhang et al.

#3101

Knowledge Enhanced Representation Learning for Drug Discovery

Thanh Lam Hoang, Marco Luca Sbodio, Marcos Martinez et al.

#3102

Adversarially Robust Few-shot Learning via Parameter Co-distillation of Similarity and Class Concept Learners

Junhao Dong, Piotr Koniusz, Junxi Chen et al.

ECCV 2024posterarXiv:2311.18435

#3103

Layered Rendering Diffusion Model for Controllable Zero-Shot Image Synthesis

Zipeng Qi, Guoxi Huang, Chenyang Liu et al.

AAAI 2024paperarXiv:2308.08644

#3104

Generalized Bradley-Terry Models for Score Estimation from Paired Comparisons

Julien Fageot, Lê-Nguyên Hoang, Oscar Villemaud et al.

ECCV 2024posterarXiv:2408.02226

#3105

ProCreate, Don't Reproduce! Propulsive Energy Diffusion for Creative Generation

Jack Lu, Ryan Teehan, Mengye Ren

AAAI 2024paperarXiv:2312.11260

#3106

Leveraging Normalization Layer in Adapters with Progressive Learning and Adaptive Distillation for Cross-Domain Few-Shot Learning

YongJin Yang, Taehyeon Kim, Se-Young Yun

AAAI 2024paperarXiv:2312.07241

#3107

Reachability of Fair Allocations via Sequential Exchanges

Ayumi Igarashi, Naoyuki Kamiyama, Warut Suksompong et al.

#3108

Bridging the Semantic Latent Space between Brain and Machine: Similarity Is All You Need

Jiaxuan Chen, Yu Qi, Yueming Wang et al.

#3109

SEER: Backdoor Detection for Vision-Language Models through Searching Target Text and Image Trigger Jointly

Liuwan Zhu, Rui Ning, Jiang Li et al.

ECCV 2024posterarXiv:2312.00114

#3110

Un-EVIMO: Unsupervised Event-based Independent Motion Segmentation

Ziyun Wang, Jinyuan Guo, Kostas Daniilidis

ECCV 2024posterarXiv:2408.00361

#3111

High-Precision Self-Supervised Monocular Depth Estimation with Rich-Resource Prior

Shen Jianbing, Wencheng Han

ECCV 2024posterarXiv:2407.14972

#3112

ARoFace: Alignment Robustness to Improve Low-quality Face Recognition

Mohammad Saeed Ebrahimi Saadabadi, Sahar Rahimi Malakshan, Ali Dabouei et al.

AAAI 2024paperarXiv:2312.02684

#3113

DeepPointMap: Advancing LiDAR SLAM with Unified Neural Descriptors

Xiaze Zhang, Ziheng Ding, Qi Jing et al.

ECCV 2024posterarXiv:2410.09802

#3114

EBDM: Exemplar-guided Image Translation with Brownian-bridge Diffusion Models

Lee Eungbean, Somi Jeong, Kwanghoon Sohn

ICLR 2024posterarXiv:2311.08384

#3115

Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees

Yifei Zhou, Ayush Sekhari, Yuda Song et al.

ECCV 2024posterarXiv:2407.07564

#3116

Trainable Highly-expressive Activation Functions

Irit Chelly, Shahaf Finder, Shira Ifergane et al.

ECCV 2024posterarXiv:2409.08518

#3117

Anytime Continual Learning for Open Vocabulary Classification

Zhen Zhu, Yiming Gong, Derek Hoiem

ECCV 2024posterarXiv:2406.10740

#3118

FreeMotion: MoCap-Free Human Motion Synthesis with Multimodal Large Language Models

Zhikai Zhang, Yitang Li, Haofeng Huang et al.

ICLR 2024posterarXiv:2402.06706

#3119

CoRe-GD: A Hierarchical Framework for Scalable Graph Visualization with GNNs

Florian Grötschla, Joël Mathys, Róbert Veres et al.

ECCV 2024posterarXiv:2407.10632

#3120

Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model

Zhening Liu, XINJIE ZHANG, Jiawei Shao et al.

ECCV 2024posterarXiv:2312.11587

#3121

Relightable Neural Actor with Intrinsic Decomposition and Pose Control

Diogo Carbonera Luvizon, Vladislav Golyanik, Adam Kortylewski et al.

#3122

Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable Repainting

Junwu Zhang, Zhenyu Tang, Yatian Pang et al.

AAAI 2024paperarXiv:2312.14590

#3123

SIG: Speaker Identification in Literature via Prompt-Based Generation

Zhenlin Su, Liyan Xu, Jin Xu et al.

AAAI 2024paperarXiv:2312.11972

#3124

Expressive Forecasting of 3D Whole-Body Human Motions

Pengxiang Ding, Qiongjie Cui, Haofan Wang et al.

ICLR 2024posterarXiv:2306.00740

#3125

On the Limitations of Temperature Scaling for Distributions with Overlaps

Muthu Chidambaram, Rong Ge

#3126

Improving Knowledge Distillation via Regularizing Feature Direction and Norm

Yuzhu Wang, Lechao Cheng, Manni Duan et al.

ECCV 2024posterarXiv:2403.11107

#3127

Self-supervised co-salient object detection via feature correspondences at multiple scales

Souradeep Chakraborty, Dimitris Samaras

ECCV 2024posterarXiv:2407.13545

#3128

DiffuX2CT: Diffusion Learning to Reconstruct CT Images from Biplanar X-Rays

Baochang Zhang, Zhi Qiao, Runkun Liu et al.

ECCV 2024posterarXiv:2408.03282

#3129

AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level Retrieval

Pavel Suma, Giorgos Kordopatis-Zilos, Ahmet Iscen et al.

ICLR 2024posterarXiv:2308.06703

#3130

Understanding the robustness difference between stochastic gradient descent and adaptive gradient methods

Avery Ma, Yangchen Pan, Amir-massoud Farahmand

ECCV 2024posterarXiv:2403.12800

#3131

Learning Neural Volumetric Pose Features for Camera Localization

Jingyu Lin, Jiaqi Gu, Bojian Wu et al.

ECCV 2024posterarXiv:2312.05525

#3132

You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception

Sheng Jin, Shuhuai Li, Tong Li et al.

AAAI 2024paperarXiv:2312.10797

#3133

Large-Scale Multi-Robot Coverage Path Planning via Local Search

Jingtao Tang, Hang Ma

#3134

Uncertainty-aware Graph-based Hyperspectral Image Classification

Linlin Yu, Yifei Lou, Feng Chen

ICLR 2024poster

AAAI 2024paperarXiv:2401.13157

#3135

Time-Aware Knowledge Representations of Dynamic Objects with Multidimensional Persistence

Baris Coskunuzer, Ignacio Segovia-Dominguez, Yuzhou Chen et al.

ICLR 2024posterarXiv:2401.11098

#3136

Neural Auto-designer for Enhanced Quantum Kernels

Cong Lei, Yuxuan Du, Peng Mi et al.

ICLR 2024oralarXiv:2401.08328

#3137

Un-Mixing Test-Time Normalization Statistics: Combatting Label Temporal Correlation

Devavrat Tomar, Guillaume Vray, Jean-Philippe Thiran et al.

ECCV 2024posterarXiv:2404.03042

#3138

AWOL: Analysis WithOut synthesis using Language

Silvia Zuffi, Michael J. Black

ECCV 2024posterarXiv:2408.02788

#3139

GazeXplain: Learning to Predict Natural Language Explanations of Visual Scanpaths

Xianyu Chen, Ming Jiang, Qi Zhao

ECCV 2024posterarXiv:2405.05552

#3140

Bidirectional Progressive Transformer for Interaction Intention Anticipation

Zichen Zhang, Hongchen Luo, Wei Zhai et al.

ECCV 2024posterarXiv:2409.11235

#3141

SLAck: Semantic, Location, and Appearance Aware Open-Vocabulary Tracking

Siyuan Li, Lei Ke, Yung-Hsu Yang et al.

ICLR 2024spotlightarXiv:2402.09872

#3142

Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Community

Arman Isajanyan, Artur Shatveryan, David Kocharian et al.

ECCV 2024posterarXiv:2407.13095

#3143

Audio-visual Generalized Zero-shot Learning the Easy Way

Shentong Mo, Pedro Morgado

ECCV 2024posterarXiv:2407.07003

#3144

Learning to Complement and to Defer to Multiple Users

Zheng Zhang, Wenjie Ai, Kevin Wells et al.

ICLR 2024posterarXiv:2312.03248

#3145

Customizable Combination of Parameter-Efficient Modules for Multi-Task Learning

Haowen Wang, Tao Sun, Congyun Jin et al.

#3146

Image-adaptive 3D Lookup Tables for Real-time Image Enhancement with Bilateral Grids

Wontae Kim, Nam Ik Cho

#3147

Gradient-Aware for Class-Imbalanced Semi-supervised Medical Image Segmentation

Wenbo Qi, Jiafei Wu, S. C. Chan

ECCV 2024posterarXiv:2407.06315

#3148

Shedding More Light on Robust Classifiers under the lens of Energy-based Models

Mujtaba Hussain Mirza, Maria Rosaria Briglia, Senad Beadini et al.

CVPR 2024posterarXiv:2405.20729

#3149

Extreme Point Supervised Instance Segmentation

Hyeonjun Lee, Sehyun Hwang, Suha Kwak

ICLR 2024posterarXiv:2402.14430

#3150

Robust Training of Federated Models with Extremely Label Deficiency

Yonggang Zhang, Zhiqin Yang, Xinmei Tian et al.

CVPR 2024posterarXiv:2405.19833

#3151

KITRO: Refining Human Mesh by 2D Clues and Kinematic-tree Rotation

Fengyuan Yang, Kerui Gu, Angela Yao

ECCV 2024posterarXiv:2311.08843

#3152

Personalized Video Relighting With an At-Home Light Stage

Jun Myeong Choi, Max Christman, Roni Sengupta

ECCV 2024posterarXiv:2403.14628

#3153

Zero-Shot Multi-Object Scene Completion

Shun Iwase, Katherine Liu, Vitor Guizilini et al.

ECCV 2024posterarXiv:2407.06984

#3154

Category-level Object Detection, Pose Estimation and Reconstruction from Stereo Images

Chuanrui Zhang, Yonggen Ling, Minglei Lu et al.

#3155

Querying as Prompt: Parameter-Efficient Learning for Multimodal Language Model

Tian Liang, Jing Huang, Ming Kong et al.

CVPR 2024posterarXiv:2404.01342

#3156

DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model

Lirui Zhao, Yue Yang, Kaipeng Zhang et al.

ECCV 2024posterarXiv:2408.02672

#3157

Latent-INR: A Flexible Framework for Implicit Representations of Videos with Discriminative Semantics

Shishira R Maiya, Anubhav Anubhav, Matthew Gwilliam et al.

ICLR 2024posterarXiv:2305.16242

#3158

Two-timescale Extragradient for Finding Local Minimax Points

Jiseok Chae, Kyuwon Kim, Donghwan Kim

ECCV 2024posterarXiv:2410.00905

#3159

Removing Distributional Discrepancies in Captions Improves Image-Text Alignment

Mu Cai, Haotian Liu, Yuheng Li et al.

ECCV 2024posterarXiv:2407.11356

#3160

The Devil is in the Statistics: Mitigating and Exploiting Statistics Difference for Generalizable Semi-supervised Medical Image Segmentation

Muyang Qiu, Jian Zhang, Lei Qi et al.

ECCV 2024posterarXiv:2404.02514

#3161

High-Fidelity and Transferable NeRF Editing by Frequency Decomposition

YISHENG HE, Weihao Yuan, Siyu Zhu et al.

ECCV 2024posterarXiv:2409.06535

#3162

PoseEmbroider: Towards a 3D, Visual, Semantic-aware Human Pose Representation

Ginger Delmas, Philippe Weinzaepfel, Francesc Moreno et al.

CVPR 2024posterarXiv:2404.01591

#3163

Language Model Guided Interpretable Video Action Reasoning

Ning Wang, Guangming Zhu, Hongsheng Li et al.

CVPR 2024posterarXiv:2403.08262

#3164

BiTT: Bi-directional Texture Reconstruction of Interacting Two Hands from a Single Image

Minje Kim, Tae-Kyun Kim

ECCV 2024posterarXiv:2407.04461

#3165

VCD-Texture: Variance Alignment based 3D-2D Co-Denoising for Text-Guided Texturing

Shang Liu, Chaohui Yu, Chenjie Cao et al.

ECCV 2024posterarXiv:2407.16125

#3166

Diffusion Prior-Based Amortized Variational Inference for Noisy Inverse Problems

Sojin Lee, Dogyun Park, Inho Kong et al.

ECCV 2024posterarXiv:2408.02110

#3167

AvatarPose: Avatar-guided 3D Pose Estimation of Close Human Interaction from Sparse Multi-view Videos

Feichi Lu, Zijian Dong, Jie Song et al.

#3168

OneTrack: Demystifying the Conflict Between Detection and Tracking in End-to-End 3D Trackers

Qitai Wang, Jiawei He, Yuntao Chen et al.

CVPR 2024posterarXiv:2308.15692

#3169

Intriguing Properties of Diffusion Models: An Empirical Study of the Natural Attack Capability in Text-to-Image Generative Models

Takami Sato, Justin Yue, Nanze Chen et al.

ECCV 2024posterarXiv:2312.04885

#3170

VISAGE: Video Instance Segmentation with Appearance-Guided Enhancement

Hanjung Kim, Jaehyun Kang, Miran Heo et al.

ECCV 2024posterarXiv:2409.07307

#3171

Data Augmentation via Latent Diffusion for Saliency Prediction

Bahar Aydemir, Deblina Bhattacharjee, Tong Zhang et al.

CVPR 2024posterarXiv:2403.02041

#3172

A Generative Approach for Wikipedia-Scale Visual Entity Recognition

Mathilde Caron, Ahmet Iscen, Alireza Fathi et al.

CVPR 2024posterarXiv:2404.01123

#3173

CLIPtone: Unsupervised Learning for Text-based Image Tone Adjustment

Hyeongmin Lee, Kyoungkook Kang, Jungseul Ok et al.

CVPR 2024posterarXiv:2411.15673

#3174

Semantic Shield: Defending Vision-Language Models Against Backdooring and Poisoning via Fine-grained Knowledge Alignment

Alvi Md Ishmam, Chris Thomas

ECCV 2024posterarXiv:2404.06154

#3175

Concise Plane Arrangements for Low-Poly Surface and Volume Modelling

Raphael Sulzer, Florent Lafarge

ECCV 2024posterarXiv:2408.00759

#3176

Text-Guided Video Masked Autoencoder

David Fan, Jue Wang, Shuai Liao et al.

ECCV 2024posterarXiv:2409.01021

#3177

CONDA: Condensed Deep Association Learning for Co-Salient Object Detection.

Long Li, Nian Liu, Dingwen Zhang et al.

#3178

Frontier-enhanced Topological Memory with Improved Exploration Awareness for Embodied Visual Navigation

Xinru Cui, Qiming Liu, Zhe Liu et al.

ECCV 2024posterarXiv:2309.07322

#3179

NePhi: Neural Deformation Fields for Approximately Diffeomorphic Medical Image Registration

Lin Tian, Thomas H Greer, Raul San Jose Estepar et al.

CVPR 2024highlightarXiv:2403.04303

#3180

LORS: Low-rank Residual Structure for Parameter-Efficient Network Stacking

Jialin Li, Qiang Nie, Weifu Fu et al.

ECCV 2024posterarXiv:2404.12867

#3181

FipTR: A Simple yet Effective Transformer Framework for Future Instance Prediction in Autonomous Driving

Xingtai Gui, Tengteng Huang, Haonan Shao et al.

#3182

DySeT: a Dynamic Masked Self-distillation Approach for Robust Trajectory Prediction

MOZHGAN POURKESHAVARZ, Arielle Zhang, Amir Rasouli

ECCV 2024posterarXiv:2409.03424

#3183

Weight Conditioning for Smooth Optimization of Neural Networks

Hemanth Saratchandran, Thomas X Wang, Simon Lucey

#3184

Vision-Language Dual-Pattern Matching for Out-of-Distribution Detection

Zihan Zhang, Zhuo Xu, Xiang Xiang

CVPR 2024posterarXiv:2406.11129

#3185

Neural Lineage

Runpeng Yu, Xinchao Wang

ECCV 2024posterarXiv:2312.07231

#3186

Fast Training of Diffusion Transformer with Extreme Masking for 3D Point Clouds Generation

Shentong Mo, Enze Xie, Yue Wu et al.

ECCV 2024posterarXiv:2311.17101

#3187

A high-quality robust diffusion framework for corrupted dataset

Quan Dao, Binh Ta, Tung Pham et al.

ECCV 2024posterarXiv:2312.06721

#3188

Understanding Physical Dynamics with Counterfactual World Modeling

Rahul Mysore Venkatesh, Honglin Chen, Kevin Feigelis et al.

ECCV 2024posterarXiv:2407.09367

#3189

Reshaping the Online Data Buffering and Organizing Mechanism for Continual Test-Time Adaptation

Zhilin Zhu, Xiaopeng Hong, Zhiheng Ma et al.

ECCV 2024posterarXiv:2411.02149

#3190

Improving Domain Generalization in Self-Supervised Monocular Depth Estimation via Stabilized Adversarial Training

Yuanqi Yao, Gang Wu, Kui Jiang et al.

CVPR 2024posterarXiv:2404.03183

#3191

BodyMAP - Jointly Predicting Body Mesh and 3D Applied Pressure Map for People in Bed

Abhishek Tandon, Anujraaj Goyal, Henry M. Clever et al.

#3192

Implicit Motion Function

Yue Gao, Jiahao Li, Lei Chu et al.

AAAI 2024paperarXiv:2404.00603

#3193

Weak Distribution Detectors Lead to Stronger Generalizability of Vision-Language Prompt Tuning

Kun Ding, Haojian Zhang, Qiang Yu et al.

AAAI 2024paperarXiv:2401.15987

#3194

Hand-Centric Motion Refinement for 3D Hand-Object Interaction via Hierarchical Spatial-Temporal Modeling

Yuze Hao, Jianrong Zhang, Tao Zhuo et al.

AAAI 2024paperarXiv:2303.01141

#3195

DeepSaDe: Learning Neural Networks That Guarantee Domain Constraint Satisfaction

Kshitij Goyal, Sebastijan Dumancic, Hendrik Blockeel

AAAI 2024paperarXiv:2312.13596

#3196

Anchoring Path for Inductive Relation Prediction in Knowledge Graphs

Zhixiang Su, Di Wang, Chunyan Miao et al.

ECCV 2024posterarXiv:2401.06826

#3197

Direct Distillation between Different Domains

Jialiang Tang, Shuo Chen, Gang Niu et al.

ECCV 2024posterarXiv:2403.14772

#3198

Improving Robustness to Model Inversion Attacks via Sparse Coding Architectures

Sayanton Vhaduri Dibbo, Adam Breuer, Juston Moore et al.

ECCV 2024posterarXiv:2409.14340

#3199

Self-Supervised Audio-Visual Soundscape Stylization

Tingle Li, Renhao Wang, Po-Yao Huang et al.

AAAI 2024paperarXiv:2402.17978

#3200

Imagine, Initialize, and Explore: An Effective Exploration Method in Multi-Agent Reinforcement Learning

Zeyang Liu, Lipeng Wan, Xinrui Yang et al.