Most Cited 2024 "human grasp pose" Papers

12,324 papers found • Page 16 of 62

#3001

Settling Decentralized Multi-Agent Coordinated Exploration by Novelty Sharing

Haobin Jiang, Ziluo Ding, Zongqing Lu

AAAI 2024 • paper • arXiv:2402.02097
8
citations
#3002

Decoding Global Preferences: Temporal and Cooperative Dependency Modeling in Multi-Agent Preference-Based Reinforcement Learning

Tianchen Zhu, Yue Qiu, Haoyi Zhou et al.

AAAI 2024 • paper
8
citations
#3003

Keep the Faith: Faithful Explanations in Convolutional Neural Networks for Case-Based Reasoning

Tom Nuno Wolf, Fabian Bongratz, Anne-Marie Rickmann et al.

AAAI 2024 • paper • arXiv:2312.09783
8
citations
#3004

Colored Noise in PPO: Improved Exploration and Performance through Correlated Action Sampling

Jakob Hollenstein, Georg Martius, Justus Piater

AAAI 2024 • paper • arXiv:2312.11091
8
citations
#3005

Hierarchical Aligned Multimodal Learning for NER on Tweet Posts

Peipei Liu, Hong Li, Yimo Ren et al.

AAAI 2024 • paper • arXiv:2305.08372
8
citations
#3006

DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic Compensation

Wenliang Zhao, Haolin Wang, Jie Zhou et al.

ECCV 2024 • arXiv:2409.03755
8
citations
#3007

PPEA-Depth: Progressive Parameter-Efficient Adaptation for Self-Supervised Monocular Depth Estimation

Yue-Jiang Dong, Yuan-Chen Guo, Ying-Tian Liu et al.

AAAI 2024 • paper • arXiv:2312.13066
8
citations
#3008

DeCoTR: Enhancing Depth Completion with 2D and 3D Attentions

Yunxiao Shi, Manish Singh, Hong Cai et al.

CVPR 2024 • arXiv:2403.12202
8
citations
#3009

DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation

Soojin Jang, JungMin Yun, JuneHyoung Kwon et al.

ECCV 2024 • arXiv:2409.15801
8
citations
#3010

Robust Policy Learning via Offline Skill Diffusion

Woo Kyung Kim, Minjong Yoo, Honguk Woo

AAAI 2024 • paper • arXiv:2403.00225
8
citations
#3011

Distill Gold from Massive Ores: Bi-level Data Pruning towards Efficient Dataset Distillation

Yue Xu, Yong-Lu Li, Kaitong Cui et al.

ECCV 2024 • arXiv:2305.18381
8
citations
#3012

Residual Hyperbolic Graph Convolution Networks

Yangkai Xue, Jindou Dai, Zhipeng Lu et al.

AAAI 2024 • paper • arXiv:2412.03825
8
citations
#3013

SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores

Zhiyu Mei, Wei Fu, Jiaxuan Gao et al.

ICLR 2024 • arXiv:2306.16688
8
citations
#3014

RePOSE: 3D Human Pose Estimation via Spatio-Temporal Depth Relational Consistency

Ziming Sun, Yuan Liang, Zejun Ma et al.

ECCV 2024
8
citations
#3015

Improved Metric Distortion via Threshold Approvals

Elliot Anshelevich, Aris Filos-Ratsikas, Christopher Jerrett et al.

AAAI 2024 • paper • arXiv:2305.14024
8
citations
#3016

Semantic Diversity-aware Prototype-based Learning for Unbiased Scene Graph Generation

Jaehyeong Jeon, Kibum Kim, Kanghoon Yoon et al.

ECCV 2024 • arXiv:2407.15396
8
citations
#3017

Graph Context Transformation Learning for Progressive Correspondence Pruning

Junwen Guo, Guobao Xiao, Shiping Wang et al.

AAAI 2024 • paper • arXiv:2312.15971
8
citations
#3018

IVP-VAE: Modeling EHR Time Series with Initial Value Problem Solvers

Jingge Xiao, Leonie Basso, Wolfgang Nejdl et al.

AAAI 2024 • paper • arXiv:2305.06741
8
citations
#3019

Spherical Pseudo-Cylindrical Representation for Omnidirectional Image Super-resolution

Qing Cai, Mu Li, Dongwei Ren et al.

AAAI 2024 • paper
8
citations
#3020

Bottom-Up Domain Prompt Tuning for Generalized Face Anti-Spoofing

Si-Qi Liu, Qirui Wang, Pong Chi Yuen

ECCV 2024
8
citations
#3021

In-N-Out: Faithful 3D GAN Inversion with Volumetric Decomposition for Face Editing

Yiran Xu, Zhixin Shu, Cameron Smith et al.

CVPR 2024 • arXiv:2302.04871
8
citations
#3022

PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation

Jian Ma, Chen Chen, Qingsong Xie et al.

ECCV 2024 • arXiv:2311.17086
8
citations
#3023

Relational Matching for Weakly Semi-Supervised Oriented Object Detection

Wenhao Wu, Hau San Wong, Si Wu et al.

CVPR 2024
8
citations
#3024

An Explainable Vision Question Answer Model via Diffusion Chain-of-Thought

Chunhao Lu, Qiang Lu, Jake Luo

ECCV 2024
8
citations
#3025

Interventional Fairness on Partially Known Causal Graphs: A Constrained Optimization Approach

Aoqi Zuo, Yiqing Li, Susan Wei et al.

ICLR 2024 • arXiv:2401.10632
8
citations
#3026

GeoReF: Geometric Alignment Across Shape Variation for Category-level Object Pose Refinement

Linfang Zheng, Tze Ho Elden Tse, Chen Wang et al.

CVPR 2024 • arXiv:2404.11139
8
citations
#3027

DreamSalon: A Staged Diffusion Framework for Preserving Identity-Context in Editable Face Generation

Haonan Lin

CVPR 2024 • arXiv:2403.19235
8
citations
#3028

1/2-Approximate MMS Allocation for Separable Piecewise Linear Concave Valuations

Chandra Chekuri, Pooja Kulkarni, Rucha Kulkarni et al.

AAAI 2024 • paper • arXiv:2312.08504
8
citations
#3029

Backdoor Adjustment via Group Adaptation for Debiased Coupon Recommendations

Junpeng Fang, Gongduo Zhang, Qing Cui et al.

AAAI 2024 • paper
8
citations
#3030

Learning Personalized Causally Invariant Representations for Heterogeneous Federated Clients

Xueyang Tang, Song Guo, Jie Zhang et al.

ICLR 2024
8
citations
#3031

Diversity-Authenticity Co-constrained Stylization for Federated Domain Generalization in Person Re-identification

Fengxiang Yang, Zhun Zhong, Zhiming Luo et al.

AAAI 2024 • paper
8
citations
#3032

Shape from Heat Conduction

Sriram Narayanan, Mani Ramanagopal, Mark Sheinin et al.

ECCV 2024
8
citations
#3033

WorldPose: A World Cup Dataset for Global 3D Human Pose Estimation

Tianjian Jiang, Johsan Billingham, Sebastian Müksch et al.

ECCV 2024 • arXiv:2501.02771
8
citations
#3034

PolyOculus: Simultaneous Multi-view Image-based Novel View Synthesis

Jason Yu, Tristan Aumentado-Armstrong, Fereshteh Forghani et al.

ECCV 2024 • arXiv:2402.17986
8
citations
#3035

FFF: Fixing Flawed Foundations in Contrastive Pre-Training Results in Very Strong Vision-Language Models

Adrian Bulat, Yassine Ouali, Georgios Tzimiropoulos

CVPR 2024 • arXiv:2405.10286
8
citations
#3036

CatmullRom Splines-Based Regression for Image Forgery Localization

Li Zhang, Mingliang Xu, Dong Li et al.

AAAI 2024 • paper
8
citations
#3037

A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition

Yusheng Dai, Hang Chen, Jun Du et al.

CVPR 2024 • arXiv:2403.04245
8
citations
#3038

TTT-MIM: Test-Time Training with Masked Image Modeling for Denoising Distribution Shifts

Youssef Mansour, Xuyang Zhong, Serdar Caglar et al.

ECCV 2024
8
citations
#3039

Text2City: One-Stage Text-Driven Urban Layout Regeneration

Yiming Qin, Nanxuan Zhao, Bin Sheng et al.

AAAI 2024 • paper
8
citations
#3040

Avatar Fingerprinting for Authorized Use of Synthetic Talking-Head Videos

Ekta Prashnani, Koki Nagano, Shalini De Mello et al.

ECCV 2024 • arXiv:2305.03713
8
citations
#3041

Colorizing Monochromatic Radiance Fields

Yean Cheng, Renjie Wan, Shuchen Weng et al.

AAAI 2024 • paper • arXiv:2402.12184
8
citations
#3042

Insect Identification in the Wild: The AMI Dataset

Aditya Jain, Fagner Cunha, Michael J Bunsen et al.

ECCV 2024 • arXiv:2406.12452
8
citations
#3043

Robust 3D Tracking with Quality-Aware Shape Completion

Jingwen Zhang, Zikun Zhou, Guangming Lu et al.

AAAI 2024 • paper • arXiv:2312.10608
8
citations
#3044

Learning with a Mole: Transferable latent spatial representations for navigation without reconstruction

Guillaume Bono, Leonid Antsfeld, Assem Sadek et al.

ICLR 2024 • arXiv:2306.03857
8
citations
#3045

FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance

Jiedong Zhuang, Jiaqi Hu, Lianrui Mu et al.

ECCV 2024 • arXiv:2407.05578
8
citations
#3046

UniCal: Unified Neural Sensor Calibration

Ze Yang, George G Chen, Haowei Zhang et al.

ECCV 2024 • arXiv:2409.18953
8
citations
#3047

Knowledge Enhanced Representation Learning for Drug Discovery

Thanh Lam Hoang, Marco Luca Sbodio, Marcos Martinez et al.

AAAI 2024 • paper
8
citations
#3048

Layered Rendering Diffusion Model for Controllable Zero-Shot Image Synthesis

Zipeng Qi, Guoxi Huang, Chenyang Liu et al.

ECCV 2024 • arXiv:2311.18435
8
citations
#3049

ProCreate, Don't Reproduce! Propulsive Energy Diffusion for Creative Generation

Jack Lu, Ryan Teehan, Mengye Ren

ECCV 2024 • arXiv:2408.02226
8
citations
#3050

Generalized Bradley-Terry Models for Score Estimation from Paired Comparisons

Julien Fageot, Lê-Nguyên Hoang, Oscar Villemaud et al.

AAAI 2024 • paper • arXiv:2308.08644
8
citations
#3051

Leveraging Normalization Layer in Adapters with Progressive Learning and Adaptive Distillation for Cross-Domain Few-Shot Learning

YongJin Yang, Taehyeon Kim, Se-Young Yun

AAAI 2024 • paper • arXiv:2312.11260
8
citations
#3052

Automatic Controllable Colorization via Imagination

Xiaoyan Cong, Yue Wu, Qifeng Chen et al.

CVPR 2024 • arXiv:2404.05661
8
citations
#3053

Un-EVIMO: Unsupervised Event-based Independent Motion Segmentation

Ziyun Wang, Jinyuan Guo, Kostas Daniilidis

ECCV 2024 • arXiv:2312.00114
8
citations
#3054

High-Precision Self-Supervised Monocular Depth Estimation with Rich-Resource Prior

Shen Jianbing, Wencheng Han

ECCV 2024 • arXiv:2408.00361
8
citations
#3055

ARoFace: Alignment Robustness to Improve Low-quality Face Recognition

Mohammad Saeed Ebrahimi Saadabadi, Sahar Rahimi Malakshan, Ali Dabouei et al.

ECCV 2024 • arXiv:2407.14972
8
citations
#3056

Bridging the Semantic Latent Space between Brain and Machine: Similarity Is All You Need

Jiaxuan Chen, Yu Qi, Yueming Wang et al.

AAAI 2024 • paper
8
citations
#3057

EBDM: Exemplar-guided Image Translation with Brownian-bridge Diffusion Models

Lee Eungbean, Somi Jeong, Kwanghoon Sohn

ECCV 2024 • arXiv:2410.09802
8
citations
#3058

Reachability of Fair Allocations via Sequential Exchanges

Ayumi Igarashi, Naoyuki Kamiyama, Warut Suksompong et al.

AAAI 2024 • paper • arXiv:2312.07241
8
citations
#3059

SEER: Backdoor Detection for Vision-Language Models through Searching Target Text and Image Trigger Jointly

Liuwan Zhu, Rui Ning, Jiang Li et al.

AAAI 2024 • paper
8
citations
#3060

Trainable Highly-expressive Activation Functions

Irit Chelly, Shahaf Finder, Shira Ifergane et al.

ECCV 2024 • arXiv:2407.07564
8
citations
#3061

An N-Point Linear Solver for Line and Motion Estimation with Event Cameras

Ling Gao, Daniel Gehrig, Hang Su et al.

CVPR 2024 • arXiv:2404.00842
8
citations
#3062

Anytime Continual Learning for Open Vocabulary Classification

Zhen Zhu, Yiming Gong, Derek Hoiem

ECCV 2024 • arXiv:2409.08518
8
citations
#3063

FreeMotion: MoCap-Free Human Motion Synthesis with Multimodal Large Language Models

Zhikai Zhang, Yitang Li, Haofeng Huang et al.

ECCV 2024 • arXiv:2406.10740
8
citations
#3064

DreamSmooth: Improving Model-based Reinforcement Learning via Reward Smoothing

Vint Lee, Pieter Abbeel, Youngwoon Lee

ICLR 2024 • oral • arXiv:2311.01450
8
citations
#3065

Diversity-aware Channel Pruning for StyleGAN Compression

Jiwoo Chung, Sangeek Hyun, Sang-Heon Shim et al.

CVPR 2024 • arXiv:2403.13548
8
citations
#3066

Expressive Forecasting of 3D Whole-Body Human Motions

Pengxiang Ding, Qiongjie Cui, Haofan Wang et al.

AAAI 2024 • paper • arXiv:2312.11972
8
citations
#3067

DeepPointMap: Advancing LiDAR SLAM with Unified Neural Descriptors

Xiaze Zhang, Ziheng Ding, Qi Jing et al.

AAAI 2024 • paper • arXiv:2312.02684
8
citations
#3068

fairret: a Framework for Differentiable Fairness Regularization Terms

Maarten Buyl, MaryBeth Defrance, Tijl De Bie

ICLR 2024 • arXiv:2310.17256
8
citations
#3069

Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model

Zhening Liu, Xinjie Zhang, Jiawei Shao et al.

ECCV 2024 • arXiv:2407.10632
8
citations
#3070

MedBN: Robust Test-Time Adaptation against Malicious Test Samples

Hyejin Park, Jeongyeon Hwang, Sunung Mun et al.

CVPR 2024 • arXiv:2403.19326
8
citations
#3071

Relightable Neural Actor with Intrinsic Decomposition and Pose Control

Diogo Carbonera Luvizon, Vladislav Golyanik, Adam Kortylewski et al.

ECCV 2024 • arXiv:2312.11587
8
citations
#3072

Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable Repainting

Junwu Zhang, Zhenyu Tang, Yatian Pang et al.

ECCV 2024
8
citations
#3073

Improving Knowledge Distillation via Regularizing Feature Direction and Norm

Yuzhu Wang, Lechao Cheng, Manni Duan et al.

ECCV 2024
8
citations
#3074

Self-supervised co-salient object detection via feature correspondences at multiple scales

Souradeep Chakraborty, Dimitris Samaras

ECCV 2024 • arXiv:2403.11107
8
citations
#3075

DiffuX2CT: Diffusion Learning to Reconstruct CT Images from Biplanar X-Rays

Baochang Zhang, Zhi Qiao, Runkun Liu et al.

ECCV 2024 • arXiv:2407.13545
8
citations
#3076

AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level Retrieval

Pavel Suma, Giorgos Kordopatis-Zilos, Ahmet Iscen et al.

ECCV 2024 • arXiv:2408.03282
8
citations
#3077

SIG: Speaker Identification in Literature via Prompt-Based Generation

Zhenlin Su, Liyan Xu, Jin Xu et al.

AAAI 2024 • paper • arXiv:2312.14590
8
citations
#3078

Learning Neural Volumetric Pose Features for Camera Localization

Jingyu Lin, Jiaqi Gu, Bojian Wu et al.

ECCV 2024 • arXiv:2403.12800
8
citations
#3079

RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation

Samuel Pegg, Kai Li, Xiaolin Hu

ICLR 2024 • arXiv:2309.17189
8
citations
#3080

You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception

Sheng Jin, Shuhuai Li, Tong Li et al.

ECCV 2024 • arXiv:2312.05525
8
citations
#3081

Large-Scale Multi-Robot Coverage Path Planning via Local Search

Jingtao Tang, Hang Ma

AAAI 2024 • paper • arXiv:2312.10797
8
citations
#3082

Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees

Yifei Zhou, Ayush Sekhari, Yuda Song et al.

ICLR 2024 • arXiv:2311.08384
8
citations
#3083

AWOL: Analysis WithOut synthesis using Language

Silvia Zuffi, Michael J. Black

ECCV 2024 • arXiv:2404.03042
8
citations
#3084

GazeXplain: Learning to Predict Natural Language Explanations of Visual Scanpaths

Xianyu Chen, Ming Jiang, Qi Zhao

ECCV 2024 • arXiv:2408.02788
8
citations
#3085

Bidirectional Progressive Transformer for Interaction Intention Anticipation

Zichen Zhang, Hongchen Luo, Wei Zhai et al.

ECCV 2024 • arXiv:2405.05552
8
citations
#3086

SLAck: Semantic, Location, and Appearance Aware Open-Vocabulary Tracking

Siyuan Li, Lei Ke, Yung-Hsu Yang et al.

ECCV 2024 • arXiv:2409.11235
8
citations
#3087

DrS: Learning Reusable Dense Rewards for Multi-Stage Tasks

Tongzhou Mu, Minghua Liu, Hao Su

ICLR 2024 • arXiv:2404.16779
7
citations
#3088

Audio-visual Generalized Zero-shot Learning the Easy Way

Shentong Mo, Pedro Morgado

ECCV 2024 • arXiv:2407.13095
7
citations
#3089

Learning to Complement and to Defer to Multiple Users

Zheng Zhang, Wenjie Ai, Kevin Wells et al.

ECCV 2024 • arXiv:2407.07003
7
citations
#3090

Towards Backward-Compatible Continual Learning of Image Compression

Zhihao Duan, Ming Lu, Justin Yang et al.

CVPR 2024 • arXiv:2402.18862
7
citations
#3091

Faithful and Efficient Explanations for Neural Networks via Neural Tangent Kernel Surrogate Models

Andrew Engel, Zhichao Wang, Natalie Frank et al.

ICLR 2024 • spotlight • arXiv:2305.14585
7
citations
#3092

Image-adaptive 3D Lookup Tables for Real-time Image Enhancement with Bilateral Grids

Wontae Kim, Nam Ik Cho

ECCV 2024
7
citations
#3093

Gradient-Aware for Class-Imbalanced Semi-supervised Medical Image Segmentation

Wenbo Qi, Jiafei Wu, S. C. Chan

ECCV 2024
7
citations
#3094

Shedding More Light on Robust Classifiers under the lens of Energy-based Models

Mujtaba Hussain Mirza, Maria Rosaria Briglia, Senad Beadini et al.

ECCV 2024 • arXiv:2407.06315
7
citations
#3095

PixelRNN: In-pixel Recurrent Neural Networks for End-to-end–optimized Perception with Neural Sensors

Haley So, Laurie Bose, Piotr Dudek et al.

CVPR 2024 • arXiv:2304.05440
7
citations
#3096

BrainWash: A Poisoning Attack to Forget in Continual Learning

Ali Abbasi, Parsa Nooralinejad, Hamed Pirsiavash et al.

CVPR 2024 • arXiv:2311.11995
7
citations
#3097

Personalized Video Relighting With an At-Home Light Stage

Jun Myeong Choi, Max Christman, Roni Sengupta

ECCV 2024 • arXiv:2311.08843
7
citations
#3098

Zero-Shot Multi-Object Scene Completion

Shun Iwase, Katherine Liu, Vitor Guizilini et al.

ECCV 2024 • arXiv:2403.14628
7
citations
#3099

Diffeomorphic Mesh Deformation via Efficient Optimal Transport for Cortical Surface Reconstruction

Thanh-Tung Le, Khai Nguyen, Shanlin Sun et al.

ICLR 2024 • arXiv:2305.17555
7
citations
#3100

Category-level Object Detection, Pose Estimation and Reconstruction from Stereo Images

Chuanrui Zhang, Yonggen Ling, Minglei Lu et al.

ECCV 2024 • arXiv:2407.06984
7
citations
#3101

VertiBench: Advancing Feature Distribution Diversity in Vertical Federated Learning Benchmarks

Zhaomin Wu, Junyi Hou, Bingsheng He

ICLR 2024 • arXiv:2307.02040
7
citations
#3102

PFStorer: Personalized Face Restoration and Super-Resolution

Tuomas Varanka, Tapani Toivonen, Soumya Tripathy et al.

CVPR 2024 • arXiv:2403.08436
7
citations
#3103

Latent-INR: A Flexible Framework for Implicit Representations of Videos with Discriminative Semantics

Shishira R Maiya, Anubhav Anubhav, Matthew Gwilliam et al.

ECCV 2024 • arXiv:2408.02672
7
citations
#3104

Removing Distributional Discrepancies in Captions Improves Image-Text Alignment

Mu Cai, Haotian Liu, Yuheng Li et al.

ECCV 2024 • arXiv:2410.00905
7
citations
#3105

KITRO: Refining Human Mesh by 2D Clues and Kinematic-tree Rotation

Fengyuan Yang, Kerui Gu, Angela Yao

CVPR 2024 • arXiv:2405.19833
7
citations
#3106

The Devil is in the Statistics: Mitigating and Exploiting Statistics Difference for Generalizable Semi-supervised Medical Image Segmentation

Muyang Qiu, Jian Zhang, Lei Qi et al.

ECCV 2024 • arXiv:2407.11356
7
citations
#3107

High-Fidelity and Transferable NeRF Editing by Frequency Decomposition

Yisheng He, Weihao Yuan, Siyu Zhu et al.

ECCV 2024 • arXiv:2404.02514
7
citations
#3108

Querying as Prompt: Parameter-Efficient Learning for Multimodal Language Model

Tian Liang, Jing Huang, Ming Kong et al.

CVPR 2024
7
citations
#3109

PoseEmbroider: Towards a 3D, Visual, Semantic-aware Human Pose Representation

Ginger Delmas, Philippe Weinzaepfel, Francesc Moreno et al.

ECCV 2024 • arXiv:2409.06535
7
citations
#3110

L-MAGIC: Language Model Assisted Generation of Images with Coherence

Zhipeng Cai, Matthias Mueller, Reiner Birkl et al.

CVPR 2024 • arXiv:2406.01843
7
citations
#3111

VCD-Texture: Variance Alignment based 3D-2D Co-Denoising for Text-Guided Texturing

Shang Liu, Chaohui Yu, Chenjie Cao et al.

ECCV 2024 • arXiv:2407.04461
7
citations
#3112

Diffusion Prior-Based Amortized Variational Inference for Noisy Inverse Problems

Sojin Lee, Dogyun Park, Inho Kong et al.

ECCV 2024 • arXiv:2407.16125
7
citations
#3113

AvatarPose: Avatar-guided 3D Pose Estimation of Close Human Interaction from Sparse Multi-view Videos

Feichi Lu, Zijian Dong, Jie Song et al.

ECCV 2024 • arXiv:2408.02110
7
citations
#3114

OneTrack: Demystifying the Conflict Between Detection and Tracking in End-to-End 3D Trackers

Qitai Wang, Jiawei He, Yuntao Chen et al.

ECCV 2024
7
citations
#3115

VISAGE: Video Instance Segmentation with Appearance-Guided Enhancement

Hanjung Kim, Jaehyun Kang, Miran Heo et al.

ECCV 2024 • arXiv:2312.04885
7
citations
#3116

Language Model Guided Interpretable Video Action Reasoning

Ning Wang, Guangming Zhu, Hongsheng Li et al.

CVPR 2024 • arXiv:2404.01591
7
citations
#3117

BiTT: Bi-directional Texture Reconstruction of Interacting Two Hands from a Single Image

Minje Kim, Tae-Kyun Kim

CVPR 2024 • arXiv:2403.08262
7
citations
#3118

Data Augmentation via Latent Diffusion for Saliency Prediction

Bahar Aydemir, Deblina Bhattacharjee, Tong Zhang et al.

ECCV 2024 • arXiv:2409.07307
7
citations
#3119

Unsupervised Deep Unrolling Networks for Phase Unwrapping

Zhile Chen, Yuhui Quan, Hui Ji

CVPR 2024
7
citations
#3120

Intriguing Properties of Diffusion Models: An Empirical Study of the Natural Attack Capability in Text-to-Image Generative Models

Takami Sato, Justin Yue, Nanze Chen et al.

CVPR 2024 • arXiv:2308.15692
7
citations
#3121

Concise Plane Arrangements for Low-Poly Surface and Volume Modelling

Raphael Sulzer, Florent Lafarge

ECCV 2024 • arXiv:2404.06154
7
citations
#3122

Text-Guided Video Masked Autoencoder

David Fan, Jue Wang, Shuai Liao et al.

ECCV 2024 • arXiv:2408.00759
7
citations
#3123

CONDA: Condensed Deep Association Learning for Co-Salient Object Detection

Long Li, Nian Liu, Dingwen Zhang et al.

ECCV 2024 • arXiv:2409.01021
7
citations
#3124

Frontier-enhanced Topological Memory with Improved Exploration Awareness for Embodied Visual Navigation

Xinru Cui, Qiming Liu, Zhe Liu et al.

ECCV 2024
7
citations
#3125

Implicit Neural Representations and the Algebra of Complex Wavelets

T Mitchell Roddenberry, Vishwanath Saragadam, Maarten V de Hoop et al.

ICLR 2024 • arXiv:2310.00545
7
citations
#3126

NePhi: Neural Deformation Fields for Approximately Diffeomorphic Medical Image Registration

Lin Tian, Thomas H Greer, Raul San Jose Estepar et al.

ECCV 2024 • arXiv:2309.07322
7
citations
#3127

FipTR: A Simple yet Effective Transformer Framework for Future Instance Prediction in Autonomous Driving

Xingtai Gui, Tengteng Huang, Haonan Shao et al.

ECCV 2024 • arXiv:2404.12867
7
citations
#3128

DySeT: a Dynamic Masked Self-distillation Approach for Robust Trajectory Prediction

Mozhgan Pourkeshavarz, Arielle Zhang, Amir Rasouli

ECCV 2024
7
citations
#3129

LEAD: Exploring Logit Space Evolution for Model Selection

Zixuan Hu, Xiaotong Li, Shixiang Tang et al.

CVPR 2024 • arXiv:2507.14559
7
citations
#3130

A Generative Approach for Wikipedia-Scale Visual Entity Recognition

Mathilde Caron, Ahmet Iscen, Alireza Fathi et al.

CVPR 2024 • arXiv:2403.02041
7
citations
#3131

CLIPtone: Unsupervised Learning for Text-based Image Tone Adjustment

Hyeongmin Lee, Kyoungkook Kang, Jungseul Ok et al.

CVPR 2024 • arXiv:2404.01123
7
citations
#3132

STREAM: Spatio-TempoRal Evaluation and Analysis Metric for Video Generative Models

Pum Jun Kim, Seojun Kim, Jaejun Yoo

ICLR 2024 • oral • arXiv:2403.09669
7
citations
#3133

Weight Conditioning for Smooth Optimization of Neural Networks

Hemanth Saratchandran, Thomas X Wang, Simon Lucey

ECCV 2024 • arXiv:2409.03424
7
citations
#3134

Vision-Language Dual-Pattern Matching for Out-of-Distribution Detection

Zihan Zhang, Zhuo Xu, Xiang Xiang

ECCV 2024
7
citations
#3135

Fast Training of Diffusion Transformer with Extreme Masking for 3D Point Clouds Generation

Shentong Mo, Enze Xie, Yue Wu et al.

ECCV 2024 • arXiv:2312.07231
7
citations
#3136

Accelerated Convergence of Stochastic Heavy Ball Method under Anisotropic Gradient Noise

Rui Pan, Yuxing Liu, Xiaoyu Wang et al.

ICLR 2024 • arXiv:2312.14567
7
citations
#3137

A high-quality robust diffusion framework for corrupted dataset

Quan Dao, Binh Ta, Tung Pham et al.

ECCV 2024 • arXiv:2311.17101
7
citations
#3138

Understanding Physical Dynamics with Counterfactual World Modeling

Rahul Mysore Venkatesh, Honglin Chen, Kevin Feigelis et al.

ECCV 2024 • arXiv:2312.06721
7
citations
#3139

Reshaping the Online Data Buffering and Organizing Mechanism for Continual Test-Time Adaptation

Zhilin Zhu, Xiaopeng Hong, Zhiheng Ma et al.

ECCV 2024 • arXiv:2407.09367
7
citations
#3140

Improving Domain Generalization in Self-Supervised Monocular Depth Estimation via Stabilized Adversarial Training

Yuanqi Yao, Gang Wu, Kui Jiang et al.

ECCV 2024 • arXiv:2411.02149
7
citations
#3141

Identifying Policy Gradient Subspaces

Jan Schneider, Pierre Schumacher, Simon Guist et al.

ICLR 2024 • arXiv:2401.06604
7
citations
#3142

Weak Distribution Detectors Lead to Stronger Generalizability of Vision-Language Prompt Tuning

Kun Ding, Haojian Zhang, Qiang Yu et al.

AAAI 2024 • paper • arXiv:2404.00603
7
citations
#3143

Hand-Centric Motion Refinement for 3D Hand-Object Interaction via Hierarchical Spatial-Temporal Modeling

Yuze Hao, Jianrong Zhang, Tao Zhuo et al.

AAAI 2024 • paper • arXiv:2401.15987
7
citations
#3144

DeepSaDe: Learning Neural Networks That Guarantee Domain Constraint Satisfaction

Kshitij Goyal, Sebastijan Dumancic, Hendrik Blockeel

AAAI 2024 • paper • arXiv:2303.01141
7
citations
#3145

Anchoring Path for Inductive Relation Prediction in Knowledge Graphs

Zhixiang Su, Di Wang, Chunyan Miao et al.

AAAI 2024 • paper • arXiv:2312.13596
7
citations
#3146

BodyMAP - Jointly Predicting Body Mesh and 3D Applied Pressure Map for People in Bed

Abhishek Tandon, Anujraaj Goyal, Henry M. Clever et al.

CVPR 2024 • arXiv:2404.03183
7
citations
#3147

Direct Distillation between Different Domains

Jialiang Tang, Shuo Chen, Gang Niu et al.

ECCV 2024 • arXiv:2401.06826
7
citations
#3148

Improving Robustness to Model Inversion Attacks via Sparse Coding Architectures

Sayanton Vhaduri Dibbo, Adam Breuer, Juston Moore et al.

ECCV 2024 • arXiv:2403.14772
7
citations
#3149

Self-Supervised Audio-Visual Soundscape Stylization

Tingle Li, Renhao Wang, Po-Yao Huang et al.

ECCV 2024 • arXiv:2409.14340
7
citations
#3150

Imagine, Initialize, and Explore: An Effective Exploration Method in Multi-Agent Reinforcement Learning

Zeyang Liu, Lipeng Wan, Xinrui Yang et al.

AAAI 2024 • paper • arXiv:2402.17978
7
citations
#3151

Multiscale Vision Transformers Meet Bipartite Matching for Efficient Single-stage Action Localization

Ioanna Ntinou, Enrique Sanchez, Georgios Tzimiropoulos

CVPR 2024 • arXiv:2312.17686
7
citations
#3152

Discretization-Induced Dirichlet Posterior for Robust Uncertainty Quantification on Regression

Xuanlong Yu, Gianni Franchi, Jindong Gu et al.

AAAI 2024 • paper • arXiv:2308.09065
7
citations
#3153

Camera-LiDAR Cross-modality Gait Recognition

Wenxuan Guo, Yingping Liang, Zhiyu Pan et al.

ECCV 2024 • arXiv:2407.02038
7
citations
#3154

On Computing Makespan-Optimal Solutions for Generalized Sliding-Tile Puzzles

Marcus Gozon, Jingjin Yu

AAAI 2024 • paper • arXiv:2312.10887
7
citations
#3155

Artist-Friendly Relightable and Animatable Neural Heads

Yingyan Xu, Prashanth Chandran, Sebastian Weiss et al.

CVPR 2024 • arXiv:2312.03420
7
citations
#3156

Customizable Combination of Parameter-Efficient Modules for Multi-Task Learning

Haowen Wang, Tao Sun, Congyun Jin et al.

ICLR 2024 • arXiv:2312.03248
7
citations
#3157

Single View Refractive Index Tomography with Neural Fields

Brandon Zhao, Aviad Levis, Liam Connor et al.

CVPR 2024 • arXiv:2309.04437
7
citations
#3158

Stable Neighbor Denoising for Source-free Domain Adaptive Segmentation

Dong Zhao, Shuang Wang, Qi Zang et al.

CVPR 2024 • arXiv:2406.06813
7
citations
#3159

Fun with Flags: Robust Principal Directions via Flag Manifolds

Tolga Birdal, Nathan Mankovich

CVPR 2024 • arXiv:2401.04071
7
citations
#3160

Learn to Memorize and to Forget: A Continual Learning Perspective of Dynamic SLAM

Baicheng Li, Zike Yan, Dong Wu et al.

ECCV 2024 • arXiv:2407.13338
7
citations
#3161

Spear and Shield: Adversarial Attacks and Defense Methods for Model-Based Link Prediction on Continuous-Time Dynamic Graphs

Dongjin Lee, Juho Lee, Kijung Shin

AAAI 2024 • paper • arXiv:2308.10779
7
citations
#3162

Generating 3D House Wireframes with Semantics

Xueqi Ma, Yilin Liu, Wenjun Zhou et al.

ECCV 2024 • arXiv:2407.12267
7
citations
#3163

Two-timescale Extragradient for Finding Local Minimax Points

Jiseok Chae, Kyuwon Kim, Donghwan Kim

ICLR 2024 • arXiv:2305.16242
7
citations
#3164

Robust Training of Federated Models with Extremely Label Deficiency

Yonggang Zhang, Zhiqin Yang, Xinmei Tian et al.

ICLR 2024 • arXiv:2402.14430
7
citations
#3165

Any Target Can be Offense: Adversarial Example Generation via Generalized Latent Infection

Youheng Sun, Shengming Yuan, Xuanhan Wang et al.

ECCV 2024 • arXiv:2407.12292
7
citations
#3166

Detours for Navigating Instructional Videos

Kumar Ashutosh, Zihui Xue, Tushar Nagarajan et al.

CVPR 2024 • highlight • arXiv:2401.01823
7
citations
#3167

Extreme Point Supervised Instance Segmentation

Hyeonjun Lee, Sehyun Hwang, Suha Kwak

CVPR 2024 • arXiv:2405.20729
7
citations
#3168

Spatial-Semantic Collaborative Cropping for User Generated Content

Yukun Su, Yiwen Cao, Jingliang Deng et al.

AAAI 2024 • paper • arXiv:2401.08086
7
citations
#3169

Affective Visual Dialog: A Large-Scale Benchmark for Emotional Reasoning Based on Visually Grounded Conversations

Kilichbek Haydarov, Xiaoqian Shen, Avinash Madasu et al.

ECCV 2024 • arXiv:2308.16349
7
citations
#3170

PPIDSG: A Privacy-Preserving Image Distribution Sharing Scheme with GAN in Federated Learning

Yuting Ma, Yuanzhi Yao, Xiaohua Xu

AAAI 2024 • paper • arXiv:2312.10380
7
citations
#3171

WaveMo: Learning Wavefront Modulations to See Through Scattering

Mingyang Xie, Haiyun Guo, Brandon Y. Feng et al.

CVPR 2024 • arXiv:2404.07985
7
citations
#3172

Inverse Weight-Balancing for Deep Long-Tailed Learning

Wenqi Dang, Zhou Yang, Weisheng Dong et al.

AAAI 2024 • paper
7
citations
#3173

DiffCD: A Symmetric Differentiable Chamfer Distance for Neural Implicit Surface Fitting

Linus Härenstam-Nielsen, Lu Sang, Abhishek Saroha et al.

ECCV 2024 • arXiv:2407.17058
7
citations
#3174

Physics-Aware Hand-Object Interaction Denoising

Haowen Luo, Yunze Liu, Li Yi

CVPR 2024 • arXiv:2405.11481
7
citations
#3175

Efficient Stitchable Task Adaptation

Haoyu He, Zizheng Pan, Jing Liu et al.

CVPR 2024 • arXiv:2311.17352
7
citations
#3176

PARIS3D: Reasoning-based 3D Part Segmentation Using Large Multimodal Model

Amrin Kareem, Jean Lahoud, Hisham Cholakkal

ECCV 2024 • arXiv:2404.03836
7
citations
#3177

Complete Neural Networks for Complete Euclidean Graphs

Snir Hordan, Tal Amir, Nadav Dym et al.

AAAI 2024 • paper • arXiv:2301.13821
7
citations
#3178

LRS: Enhancing Adversarial Transferability through Lipschitz Regularized Surrogate

Tao Wu, Tie Luo, D. C. Wunsch

AAAI 2024 • paper • arXiv:2312.13118
7
citations
#3179

Faithful Model Explanations through Energy-Constrained Conformal Counterfactuals

Patrick Altmeyer, Mojtaba Farmanbar, Arie Van Deursen et al.

AAAI 2024 • paper • arXiv:2312.10648
7
citations
#3180

DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model

Lirui Zhao, Yue Yang, Kaipeng Zhang et al.

CVPR 2024 • arXiv:2404.01342
7
citations
#3181

DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors

Zizheng Yan, Jiapeng Zhou, Fanpeng Meng et al.

ECCV 2024 • arXiv:2407.16260
7
citations
#3182

Catch-Up Mix: Catch-Up Class for Struggling Filters in CNN

Minsoo Kang, Minkoo Kang, Suhyun Kim

AAAI 2024 • paper • arXiv:2401.13193
7
citations
#3183

NestE: Modeling Nested Relational Structures for Knowledge Graph Reasoning

Bo Xiong, Mojtaba Nayyeri, Linhao Luo et al.

AAAI 2024 • paper • arXiv:2312.09219
7
citations
#3184

Improving Zero-Shot Generalization for CLIP with Variational Adapter

Ziqian Lu, Fengli Shen, Mushui Liu et al.

ECCV 2024
7
citations
#3185

LORS: Low-rank Residual Structure for Parameter-Efficient Network Stacking

Jialin Li, Qiang Nie, Weifu Fu et al.

CVPR 2024 • highlight • arXiv:2403.04303
7
citations
#3186

DAG-Aware Variational Autoencoder for Social Propagation Graph Generation

Dongpeng Hou, Chao Gao, Xuelong Li et al.

AAAI 2024 • paper
7
citations
#3187

Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation

Junsung Lee, Minsoo Kang, Bohyung Han

ECCV 2024 • arXiv:2409.08077
7
citations
#3188

MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection

Youngmin Oh, Hyung-Il Kim, Seong Tae Kim et al.

ECCV 2024 • arXiv:2407.16448
7
citations
#3189

SemReg: Semantics Constrained Point Cloud Registration

Sheldon Fung, Xuequan Lu, Dasith de Silva Edirimuni et al.

ECCV 2024
7
citations
#3190

Semantic Shield: Defending Vision-Language Models Against Backdooring and Poisoning via Fine-grained Knowledge Alignment

Alvi Md Ishmam, Chris Thomas

CVPR 2024 • arXiv:2411.15673
7
citations
#3191

Polarization Wavefront Lidar: Learning Large Scene Reconstruction from Polarized Wavefronts

Dominik Scheuble, Chenyang Lei, Mario Bijelic et al.

CVPR 2024 • arXiv:2406.03461
7
citations
#3192

Probabilistic Neural Circuits

Pedro Zuidberg Dos Martires

AAAI 2024 • paper • arXiv:2403.06235
7
citations
#3193

Neural Lineage

Runpeng Yu, Xinchao Wang

CVPR 2024 • arXiv:2406.11129
7
citations
#3194

Accelerating the Global Aggregation of Local Explanations

Alon Mor, Yonatan Belinkov, Benny Kimelfeld

AAAI 2024 • paper • arXiv:2312.07991
7
citations
#3195

Identification of Causal Structure with Latent Variables Based on Higher Order Cumulants

Wei Chen, Zhiyi Huang, Ruichu Cai et al.

AAAI 2024 • paper • arXiv:2312.11934
7
citations
#3196

FrePolad: Frequency-Rectified Point Latent Diffusion for Point Cloud Generation

Chenliang Zhou, Fangcheng Zhong, Param Hanji et al.

ECCV 2024 • arXiv:2311.12090
7
citations
#3197

Exploiting Polarized Material Cues for Robust Car Detection

Wen Dong, Haiyang Mei, Ziqi Wei et al.

AAAI 2024 • paper • arXiv:2401.02606
7
citations
#3198

Dialogue for Prompting: A Policy-Gradient-Based Discrete Prompt Generation for Few-Shot Learning

Chengzhengxu Li, Xiaoming Liu, Yichen Wang et al.

AAAI 2024 • paper • arXiv:2308.07272
7
citations
#3199

Semantic Human Mesh Reconstruction with Textures

Xiaoyu Zhan, Jianxin Yang, Yuanqi Li et al.

CVPR 2024 • arXiv:2403.02561
7
citations
#3200

BlinkVision: A Benchmark for Optical Flow, Scene Flow and Point Tracking Estimation using RGB Frames and Events

Yijin Li, Yichen Shen, Zhaoyang Huang et al.

ECCV 2024 • arXiv:2410.20451
7
citations