Most Cited 2024 "animation dataset" Papers

12,324 papers found • Page 26 of 62

#5001

Functional Diffusion

Biao Zhang, Peter Wonka

CVPR 2024arXiv:2311.15435
14
citations
#5002

Approximating Nash Equilibria in Normal-Form Games via Stochastic Optimization

Ian Gemp, Luke Marris, Georgios Piliouras

ICLR 2024arXiv:2310.06689
14
citations
#5003

BOtied: Multi-objective Bayesian optimization with tied multivariate ranks

Ji Won Park, Natasa Tagasovska, Michael Maser et al.

ICML 2024arXiv:2306.00344
14
citations
#5004

Unified Entropy Optimization for Open-Set Test-Time Adaptation

Zhengqing Gao, Xu-Yao Zhang, Cheng-Lin Liu

CVPR 2024arXiv:2404.06065
14
citations
#5005

Masks, Signs, And Learning Rate Rewinding

Advait Gadhikar, Rebekka Burkholz

ICLR 2024spotlightarXiv:2402.19262
14
citations
#5006

SEABO: A Simple Search-Based Method for Offline Imitation Learning

Jiafei Lyu, Xiaoteng Ma, Le Wan et al.

ICLR 2024arXiv:2402.03807
14
citations
#5007

Cycle-Consistency Learning for Captioning and Grounding

Ning Wang, Jiajun Deng, Mingbo Jia

AAAI 2024paperarXiv:2312.15162
14
citations
#5008

Grounding Language Plans in Demonstrations Through Counterfactual Perturbations

Yanwei Wang, Johnson (Tsun-Hsuan) Wang, Jiayuan Mao et al.

ICLR 2024spotlightarXiv:2403.17124
14
citations
#5009

ProCC: Progressive Cross-Primitive Compatibility for Open-World Compositional Zero-Shot Learning

Fushuo Huo, Wenchao Xu, Song Guo et al.

AAAI 2024paperarXiv:2211.12417
14
citations
#5010

GRIDS: Grouped Multiple-Degradation Restoration with Image Degradation Similarity

Shuo Cao, Yihao Liu, Wenlong Zhang et al.

ECCV 2024arXiv:2407.12273
14
citations
#5011

Kalman-Inspired Feature Propagation for Video Face Super-Resolution

Ruicheng Feng, Chongyi Li, Chen Change Loy

ECCV 2024arXiv:2408.05205
14
citations
#5012

The Effect of Intrinsic Dataset Properties on Generalization: Unraveling Learning Differences Between Natural and Medical Images

Nicholas Konz, Maciej Mazurowski

ICLR 2024arXiv:2401.08865
14
citations
#5013

Sequential Fusion Based Multi-Granularity Consistency for Space-Time Transformer Tracking

Kun Hu, Wenjing Yang, Wanrong Huang et al.

AAAI 2024paper
14
citations
#5014

Trust Region Methods for Nonconvex Stochastic Optimization beyond Lipschitz Smoothness

Chenghan Xie, Chenxi Li, Chuwen Zhang et al.

AAAI 2024paperarXiv:2310.17319
14
citations
#5015

Calibrated One Round Federated Learning with Bayesian Inference in the Predictive Space

Mohsin Hasan, Guojun Zhang, Kaiyang Guo et al.

AAAI 2024paperarXiv:2312.09817
14
citations
#5016

USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation

Xiaoqi Wang, Wenbin He, Xiwei Xuan et al.

CVPR 2024arXiv:2406.05271
14
citations
#5017

CoBIT: A Contrastive Bi-directional Image-Text Generation Model

Haoxuan You, Xiaoyue Guo, Zhecan Wang et al.

ICLR 2024arXiv:2303.13455
14
citations
#5018

MAMBA: an Effective World Model Approach for Meta-Reinforcement Learning

Zohar Rimon, Tom Jurgenson, Orr Krupnik et al.

ICLR 2024arXiv:2403.09859
14
citations
#5019

Are Models Biased on Text without Gender-related Language?

Catarina Belém, Preethi Seshadri, Yasaman Razeghi et al.

ICLR 2024arXiv:2405.00588
14
citations
#5020

Positional Knowledge is All You Need: Position-induced Transformer (PiT) for Operator Learning

Junfeng CHEN, Kailiang Wu

ICML 2024arXiv:2405.09285
14
citations
#5021

KITAB: Evaluating LLMs on Constraint Satisfaction for Information Retrieval

Marah I Abdin, Suriya Gunasekar, Varun Chandrasekaran et al.

ICLR 2024arXiv:2310.15511
14
citations
#5022

Transport meets Variational Inference: Controlled Monte Carlo Diffusions

Francisco Vargas, Shreyas Padhy, Denis Blessing et al.

ICLR 2024arXiv:2307.01050
14
citations
#5023

BENO: Boundary-embedded Neural Operators for Elliptic PDEs

Haixin Wang, Jiaxin Li, Anubhav Dwivedi et al.

ICLR 2024arXiv:2401.09323
14
citations
#5024

SportsHHI: A Dataset for Human-Human Interaction Detection in Sports Videos

Tao Wu, Runyu He, Gangshan Wu et al.

CVPR 2024arXiv:2404.04565
14
citations
#5025

UnionFormer: Unified-Learning Transformer with Multi-View Representation for Image Manipulation Detection and Localization

Shuaibo Li, Wei Ma, Jianwei Guo et al.

CVPR 2024
14
citations
#5026

Multi-modal Gaussian Process Variational Autoencoders for Neural and Behavioral Data

Rabia Gondur, Usama Bin Sikandar, Evan Schaffer et al.

ICLR 2024oralarXiv:2310.03111
14
citations
#5027

Towards Fair Graph Federated Learning via Incentive Mechanisms

12794 Chenglu Pan, Jiarong Xu, Yue Yu et al.

AAAI 2024paperarXiv:2312.13306
14
citations
#5028

SIGMA: Sinkhorn-Guided Masked Video Modeling

Mohammadreza Salehi, Michael Dorkenwald, Fida Mohammad Thoker et al.

ECCV 2024arXiv:2407.15447
14
citations
#5029

The Surprising Effectiveness of Skip-Tuning in Diffusion Sampling

Jiajun Ma, Shuchen Xue, Tianyang Hu et al.

ICML 2024arXiv:2402.15170
14
citations
#5030

AGS: Affordable and Generalizable Substitute Training for Transferable Adversarial Attack

Ruikui Wang, Yuanfang Guo, Yunhong Wang

AAAI 2024paper
14
citations
#5031

A Neural Framework for Generalized Causal Sensitivity Analysis

Dennis Frauen, Fergus Imrie, Alicia Curth et al.

ICLR 2024arXiv:2311.16026
14
citations
#5032

Size-invariance Matters: Rethinking Metrics and Losses for Imbalanced Multi-object Salient Object Detection

Feiran Li, Qianqian Xu, Shilong Bao et al.

ICML 2024spotlightarXiv:2405.09782
14
citations
#5033

Mutual-Modality Adversarial Attack with Semantic Perturbation

Jingwen Ye, Ruonan Yu, Songhua Liu et al.

AAAI 2024paperarXiv:2312.12768
14
citations
#5034

Tackling Vision Language Tasks through Learning Inner Monologues

Diji Yang, Kezhen Chen, Jinmeng Rao et al.

AAAI 2024paperarXiv:2308.09970
14
citations
#5035

Rethinking Decision Transformer via Hierarchical Reinforcement Learning

Yi Ma, Jianye Hao, Hebin Liang et al.

ICML 2024arXiv:2311.00267
14
citations
#5036

ProTeCt: Prompt Tuning for Taxonomic Open Set Classification

Tz-Ying Wu, Chih-Hui Ho, Nuno Vasconcelos

CVPR 2024arXiv:2306.02240
14
citations
#5037

X-Pose: Detecting Any Keypoints

Jie Yang, AILING ZENG, Ruimao Zhang et al.

ECCV 2024arXiv:2310.08530
14
citations
#5038

Provable Reward-Agnostic Preference-Based Reinforcement Learning

Wenhao Zhan, Masatoshi Uehara, Wen Sun et al.

ICLR 2024spotlightarXiv:2305.18505
14
citations
#5039

Free-Editor: Zero-shot Text-driven 3D Scene Editing

Md Nazmul Karim, Hasan Iqbal, Umar Khalid et al.

ECCV 2024arXiv:2312.13663
14
citations
#5040

CLIP-Guided Federated Learning on Heterogeneity and Long-Tailed Data

Jiangming Shi, Shanshan Zheng, Xiangbo Yin et al.

AAAI 2024paperarXiv:2312.08648
14
citations
#5041

Pre-training with Random Orthogonal Projection Image Modeling

Maryam Haghighat, Peyman Moghadam, Shaheer Mohamed et al.

ICLR 2024spotlightarXiv:2310.18737
14
citations
#5042

PICTURE: PhotorealistIC virtual Try-on from UnconstRained dEsigns

Shuliang Ning, Duomin Wang, Yipeng Qin et al.

CVPR 2024arXiv:2312.04534
14
citations
#5043

Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation

Haruka Kiyohara, Ren Kishimoto, Kosuke Kawakami et al.

ICLR 2024arXiv:2311.18207
14
citations
#5044

Progressive Feature Self-Reinforcement for Weakly Supervised Semantic Segmentation

Jingxuan He, Lechao Cheng, Chaowei Fang et al.

AAAI 2024paperarXiv:2312.08916
14
citations
#5045

SyFormer: Structure-Guided Synergism Transformer for Large-Portion Image Inpainting

Jie Wu, Yuchao Feng, Honghui Xu et al.

AAAI 2024paper
14
citations
#5046

One Forward is Enough for Neural Network Training via Likelihood Ratio Method

Jinyang Jiang, Zeliang Zhang, Chenliang Xu et al.

ICLR 2024arXiv:2305.08960
14
citations
#5047

NoiseDiffusion: Correcting Noise for Image Interpolation with Diffusion Models beyond Spherical Linear Interpolation

Pengfei Zheng, Yonggang Zhang, Zhen Fang et al.

ICLR 2024spotlightarXiv:2403.08840
14
citations
#5048

Perturbation-Invariant Adversarial Training for Neural Ranking Models: Improving the Effectiveness-Robustness Trade-Off

Yuansan Liu, Ruqing Zhang, Mingkun Zhang et al.

AAAI 2024paperarXiv:2312.10329
14
citations
#5049

Enhancing Adversarial Robustness in SNNs with Sparse Gradients

Yujia Liu, Tong Bu, Ding Jianhao et al.

ICML 2024arXiv:2405.20355
14
citations
#5050

Arrows of Time for Large Language Models

Vassilis Papadopoulos, Jérémie Wenger, Clement Hongler

ICML 2024arXiv:2401.17505
14
citations
#5051

Adaptive Compressed Sensing with Diffusion-Based Posterior Sampling

Noam Elata, Tomer Michaeli, Michael Elad

ECCV 2024arXiv:2407.08256
14
citations
#5052

Dynamic Feature Pruning and Consolidation for Occluded Person Re-identification

YuTeng Ye, Hang Zhou, Jiale Cai et al.

AAAI 2024paperarXiv:2211.14742
14
citations
#5053

CMG-Net: Robust Normal Estimation for Point Clouds via Chamfer Normal Distance and Multi-Scale Geometry

Yingrui Wu, Mingyang Zhao, Keqiang Li et al.

AAAI 2024paperarXiv:2312.09154
14
citations
#5054

Attention Meets Post-hoc Interpretability: A Mathematical Perspective

Gianluigi Lopardo, Frederic Precioso, Damien Garreau

ICML 2024arXiv:2402.03485
14
citations
#5055

ASWT-SGNN: Adaptive Spectral Wavelet Transform-Based Self-Supervised Graph Neural Network

Ruyue Liu, Rong Yin, Yong Liu et al.

AAAI 2024paperarXiv:2312.05736
14
citations
#5056

DexFuncGrasp: A Robotic Dexterous Functional Grasp Dataset Constructed from a Cost-Effective Real-Simulation Annotation System

Jinglue Hang, Xiangbo Lin, Tianqiang Zhu et al.

AAAI 2024paper
14
citations
#5057

AttnZero: Efficient Attention Discovery for Vision Transformers

Lujun Li, Zimian Wei, Peijie Dong et al.

ECCV 2024
14
citations
#5058

Exploiting Auxiliary Caption for Video Grounding

Hongxiang Li, Meng Cao, Xuxin Cheng et al.

AAAI 2024paperarXiv:2301.05997
14
citations
#5059

HONGAT: Graph Attention Networks in the Presence of High-Order Neighbors

Heng-Kai Zhang, Yi-Ge Zhang, Zhi Zhou et al.

AAAI 2024paper
14
citations
#5060

Referring Atomic Video Action Recognition

Kunyu Peng, Jia Fu, Kailun Yang et al.

ECCV 2024arXiv:2407.01872
14
citations
#5061

Where am I? Scene Retrieval with Language

Jiaqi Chen, Daniel Barath, Iro Armeni et al.

ECCV 2024arXiv:2404.14565
14
citations
#5062

MEND: Meta Demonstration Distillation for Efficient and Effective In-Context Learning

Yichuan Li, Xiyao Ma, Sixing Lu et al.

ICLR 2024arXiv:2403.06914
14
citations
#5063

MoVideo: Motion-Aware Video Generation with Diffusion Models

Jingyun Liang, Yuchen Fan, Kai Zhang et al.

ECCV 2024arXiv:2311.11325
14
citations
#5064

Expand-and-Cluster: Parameter Recovery of Neural Networks

Flavio Martinelli, Berfin Simsek, Wulfram Gerstner et al.

ICML 2024arXiv:2304.12794
14
citations
#5065

Finding Visual Task Vectors

Alberto Hojel, Yutong Bai, Trevor Darrell et al.

ECCV 2024arXiv:2404.05729
14
citations
#5066

Vision-Language Action Knowledge Learning for Semantic-Aware Action Quality Assessment

Huangbiao Xu, Xiao Ke, Yuezhou Li et al.

ECCV 2024
14
citations
#5067

A Good Learner can Teach Better: Teacher-Student Collaborative Knowledge Distillation

Ayan Sengupta, Shantanu Dixit, Md Shad Akhtar et al.

ICLR 2024
14
citations
#5068

Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL

Fangwei Zhong, Kui Wu, Hai Ci et al.

ECCV 2024arXiv:2404.09857
14
citations
#5069

Bounding the Expected Robustness of Graph Neural Networks Subject to Node Feature Attacks

Yassine ABBAHADDOU, Sofiane ENNADIR, Johannes Lutzeyer et al.

ICLR 2024arXiv:2404.17947
14
citations
#5070

Parrot Captions Teach CLIP to Spot Text

Yiqi Lin, Conghui He, Alex Jinpeng Wang et al.

ECCV 2024arXiv:2312.14232
14
citations
#5071

RoDUS: Robust Decomposition of Static and Dynamic Elements in Urban Scenes

Thang-Anh-Quan Nguyen, Luis G Roldao Jimenez, Nathan Piasco et al.

ECCV 2024arXiv:2403.09419
14
citations
#5072

Explaining Reinforcement Learning Agents through Counterfactual Action Outcomes

Yotam Amitai, Yael Friedler, Ofra Amir

AAAI 2024paperarXiv:2312.11118
14
citations
#5073

How to Escape Sharp Minima with Random Perturbations

Kwangjun Ahn, Ali Jadbabaie, Suvrit Sra

ICML 2024arXiv:2305.15659
14
citations
#5074

Neural Directional Encoding for Efficient and Accurate View-Dependent Appearance Modeling

Liwen Wu, Sai Bi, Zexiang Xu et al.

CVPR 2024highlightarXiv:2405.14847
14
citations
#5075

ReCoRe: Regularized Contrastive Representation Learning of World Model

Rudra P, K. Poudel, Harit Pandya et al.

CVPR 2024arXiv:2312.09056
14
citations
#5076

Robust Multimodal Learning via Representation Decoupling

Shicai Wei, Yang Luo, Yuji Wang et al.

ECCV 2024arXiv:2407.04458
14
citations
#5077

Market-GAN: Adding Control to Financial Market Data Generation with Semantic Context

Haochong Xia, Shuo Sun, Xinrun Wang et al.

AAAI 2024paperarXiv:2309.07708
14
citations
#5078

Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors

Jae Joong Lee, Bosheng Li, Sara Beery et al.

ECCV 2024arXiv:2407.10330
14
citations
#5079

DeepPolar: Inventing Nonlinear Large-Kernel Polar Codes via Deep Learning

Ashwin Hebbar, Sravan Kumar Ankireddy, Hyeji Kim et al.

ICML 2024arXiv:2402.08864
14
citations
#5080

OCAI: Improving Optical Flow Estimation by Occlusion and Consistency Aware Interpolation

Jisoo Jeong, Hong Cai, Risheek Garrepalli et al.

CVPR 2024arXiv:2403.18092
14
citations
#5081

Online GNN Evaluation Under Test-time Graph Distribution Shifts

Xin Zheng, Dongjin Song, Qingsong Wen et al.

ICLR 2024spotlightarXiv:2403.09953
14
citations
#5082

Adversarial Backdoor Attack by Naturalistic Data Poisoning on Trajectory Prediction in Autonomous Driving

Mozhgan Pourkeshavarz, Mohammad Sabokrou, Amir Rasouli

CVPR 2024arXiv:2306.15755
14
citations
#5083

Soft Robust MDPs and Risk-Sensitive MDPs: Equivalence, Policy Gradient, and Sample Complexity

Runyu Zhang, Yang Hu, Na Li

ICLR 2024arXiv:2306.11626
14
citations
#5084

RadSimReal: Bridging the Gap Between Synthetic and Real Data in Radar Object Detection With Simulation

Oded Bialer, Yuval Haitman

CVPR 2024arXiv:2404.18150
14
citations
#5085

HyperFields: Towards Zero-Shot Generation of NeRFs from Text

Sudarshan Babu, Richard Liu, Zi Yu Zhou et al.

ICML 2024arXiv:2310.17075
14
citations
#5086

SVDinsTN: A Tensor Network Paradigm for Efficient Structure Search from Regularized Modeling Perspective

Yu-Bang Zheng, Xile Zhao, Junhua Zeng et al.

CVPR 2024highlightarXiv:2305.14912
14
citations
#5087

Multi-scale Dynamic and Hierarchical Relationship Modeling for Facial Action Units Recognition

Zihan Wang, Siyang Song, Cheng Luo et al.

CVPR 2024arXiv:2404.06443
14
citations
#5088

UniHuman: A Unified Model For Editing Human Images in the Wild

Nannan Li, Qing Liu, Krishna Kumar Singh et al.

CVPR 2024arXiv:2312.14985
14
citations
#5089

3D Neural Edge Reconstruction

Lei Li, Songyou Peng, Zehao Yu et al.

CVPR 2024arXiv:2405.19295
14
citations
#5090

Hyperbolic Learning with Synthetic Captions for Open-World Detection

Fanjie Kong, Yanbei Chen, Jiarui Cai et al.

CVPR 2024arXiv:2404.05016
14
citations
#5091

TriSampler: A Better Negative Sampling Principle for Dense Retrieval

Zhen Yang, Zhou Shao, Yuxiao Dong et al.

AAAI 2024paperarXiv:2402.11855
14
citations
#5092

Inspecting Prediction Confidence for Detecting Black-Box Backdoor Attacks

Tong Wang, Yuan Yao, Feng Xu et al.

AAAI 2024paper
14
citations
#5093

Deep Structural Knowledge Exploitation and Synergy for Estimating Node Importance Value on Heterogeneous Information Networks

Yankai Chen, Yixiang Fang, Qiongyan Wang et al.

AAAI 2024paperarXiv:2402.12411
14
citations
#5094

Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments

Djamahl Etchegaray, Zi Helen Huang, Tatsuya Harada et al.

ECCV 2024arXiv:2403.13556
14
citations
#5095

3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation

Zihao Xiao, Longlong Jing, Shangxuan Wu et al.

ECCV 2024arXiv:2401.02402
14
citations
#5096

Closely Interactive Human Reconstruction with Proxemics and Physics-Guided Adaption

Buzhen Huang, Chen Li, Chongyang Xu et al.

CVPR 2024arXiv:2404.11291
14
citations
#5097

ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic Environments

Taewoong Kim, Cheolhong Min, Byeonghwi Kim et al.

ECCV 2024arXiv:2407.18550
14
citations
#5098

EntAugment: Entropy-Driven Adaptive Data Augmentation Framework for Image Classification

Suorong Yang, Furao Shen, Jian Zhao

ECCV 2024arXiv:2409.06290
14
citations
#5099

IMMA: Immunizing text-to-image Models against Malicious Adaptation

Amber Yijia Zheng, Raymond Yeh

ECCV 2024arXiv:2311.18815
14
citations
#5100

Principled Preferential Bayesian Optimization

Wenjie Xu, Wenbin Wang, Yuning Jiang et al.

ICML 2024arXiv:2402.05367
14
citations
#5101

CNN Kernels Can Be the Best Shapelets

Eric Qu, Yansen Wang, Xufang Luo et al.

ICLR 2024
14
citations
#5102

Dual-Window Multiscale Transformer for Hyperspectral Snapshot Compressive Imaging

Fulin Luo, Xi Chen, Xiuwen Gong et al.

AAAI 2024paper
14
citations
#5103

Neural-Symbolic Recursive Machine for Systematic Generalization

Qing Li, Yixin Zhu, Yitao Liang et al.

ICLR 2024arXiv:2210.01603
14
citations
#5104

Region-Based Representations Revisited

Michal Shlapentokh-Rothman, Ansel Blume, Yao Xiao et al.

CVPR 2024arXiv:2402.02352
14
citations
#5105

Novel Class Discovery for Ultra-Fine-Grained Visual Categorization

Qi Jia, Yaqi Cai, Qi Jia et al.

CVPR 2024highlightarXiv:2405.06283
14
citations
#5106

SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models

Xudong LU, Aojun Zhou, Yuhui Xu et al.

ICML 2024arXiv:2405.16057
14
citations
#5107

Navigate Beyond Shortcuts: Debiased Learning Through the Lens of Neural Collapse

Yining Wang, Junjie Sun, Chenyue Wang et al.

CVPR 2024highlightarXiv:2405.05587
14
citations
#5108

Two Heads Are Better Than One: Boosting Graph Sparse Training via Semantic and Topological Awareness

Guibin Zhang, Yanwei Yue, kun wang et al.

ICML 2024arXiv:2402.01242
14
citations
#5109

HEAL-SWIN: A Vision Transformer On The Sphere

Oscar Carlsson, Jan E. Gerken, Hampus Linander et al.

CVPR 2024arXiv:2307.07313
14
citations
#5110

LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation

Archana Swaminathan, Anubhav Anubhav, Kamal Gupta et al.

ECCV 2024arXiv:2409.06703
14
citations
#5111

LASIL: Learner-Aware Supervised Imitation Learning For Long-term Microscopic Traffic Simulation

Ke Guo, Zhenwei Miao, Wei Jing et al.

CVPR 2024arXiv:2403.17601
14
citations
#5112

Privately Aligning Language Models with Reinforcement Learning

Fan Wu, Huseyin Inan, Arturs Backurs et al.

ICLR 2024arXiv:2310.16960
14
citations
#5113

Robust Distillation via Untargeted and Targeted Intermediate Adversarial Samples

Junhao Dong, Piotr Koniusz, Junxi Chen et al.

CVPR 2024
14
citations
#5114

WWW: A Unified Framework for Explaining What Where and Why of Neural Networks by Interpretation of Neuron Concepts

Yong Hyun Ahn, Hyeon Bae Kim, Seong Tae Kim

CVPR 2024arXiv:2402.18956
14
citations
#5115

Groupwise Query Specialization and Quality-Aware Multi-Assignment for Transformer-based Visual Relationship Detection

Jongha Kim, Jihwan Park, Jinyoung Park et al.

CVPR 2024arXiv:2403.17709
14
citations
#5116

LAFS: Landmark-based Facial Self-supervised Learning for Face Recognition

Zhonglin Sun, Chen Feng, Ioannis Patras et al.

CVPR 2024arXiv:2403.08161
14
citations
#5117

LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer

Ning Yu, Chia-Chih Chen, Zeyuan Chen et al.

ECCV 2024arXiv:2212.09877
14
citations
#5118

Bootstrapping Autonomous Driving Radars with Self-Supervised Learning

Yiduo Hao, Sohrab Madani, Junfeng Guan et al.

CVPR 2024arXiv:2312.04519
14
citations
#5119

Towards Robust Image Stitching: An Adaptive Resistance Learning against Compatible Attacks

Zhiying Jiang, Xingyuan Li, Jinyuan Liu et al.

AAAI 2024paperarXiv:2402.15959
14
citations
#5120

L2B: Learning to Bootstrap Robust Models for Combating Label Noise

Yuyin Zhou, Xianhang li, Fengze Liu et al.

CVPR 2024arXiv:2202.04291
14
citations
#5121

Learning Structure-from-Motion with Graph Attention Networks

Lucas Brynte, José Pedro Iglesias, Carl Olsson et al.

CVPR 2024arXiv:2308.15984
14
citations
#5122

MultiPly: Reconstruction of Multiple People from Monocular Video in the Wild

Zeren Jiang, Chen Guo, Manuel Kaufmann et al.

CVPR 2024arXiv:2406.01595
14
citations
#5123

MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos

Jielin Qiu, Jiacheng Zhu, William Han et al.

CVPR 2024highlightarXiv:2306.04216
14
citations
#5124

Blur2Blur: Blur Conversion for Unsupervised Image Deblurring on Unknown Domains

Bang-Dang Pham, Phong Tran, Anh Tran et al.

CVPR 2024arXiv:2403.16205
14
citations
#5125

ControlCap: Controllable Region-level Captioning

Yuzhong Zhao, Liu Yue, Zonghao Guo et al.

ECCV 2024arXiv:2401.17910
14
citations
#5126

Improving Gradient-Guided Nested Sampling for Posterior Inference

Pablo Lemos, Nikolay Malkin, Will Handley et al.

ICML 2024arXiv:2312.03911
14
citations
#5127

CDPNet: Cross-Modal Dual Phases Network for Point Cloud Completion

Zhenjiang Du, Jiale Dou, Zhitao Liu et al.

AAAI 2024paper
14
citations
#5128

SPD-DDPM: Denoising Diffusion Probabilistic Models in the Symmetric Positive Definite Space

Yunchen Li, Zhou Yu, Gaoqi He et al.

AAAI 2024paperarXiv:2312.08200
14
citations
#5129

In2SET: Intra-Inter Similarity Exploiting Transformer for Dual-Camera Compressive Hyperspectral Imaging

Xin Wang, Lizhi Wang, Xiangtian Ma et al.

CVPR 2024arXiv:2312.13319
14
citations
#5130

Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation

Jingyun Wang, Guoliang Kang

CVPR 2024arXiv:2408.06747
14
citations
#5131

Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models

Shitian Zhao, Zhuowan Li, YadongLu et al.

CVPR 2024highlightarXiv:2312.06685
14
citations
#5132

HUMOS: Human Motion Model Conditioned on Body Shape

Shashank Tripathi, Omid Taheri, Christoph Lassner et al.

ECCV 2024arXiv:2409.03944
14
citations
#5133

SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation

Danni Yang, Jiayi Ji, Yiwei Ma et al.

ICML 2024arXiv:2406.01451
14
citations
#5134

Learning Intra-view and Cross-view Geometric Knowledge for Stereo Matching

Rui Gong, Weide Liu, ZAIWANG GU et al.

CVPR 2024arXiv:2402.19270
14
citations
#5135

Controlling the World by Sleight of Hand

Sruthi Sudhakar, Ruoshi Liu, Basile Van Hoorick et al.

ECCV 2024arXiv:2408.07147
14
citations
#5136

Misalignment-Robust Frequency Distribution Loss for Image Transformation

Zhangkai Ni, Juncheng Wu, Zian Wang et al.

CVPR 2024arXiv:2402.18192
14
citations
#5137

SketchINR: A First Look into Sketches as Implicit Neural Representations

Hmrishav Bandyopadhyay, Ayan Kumar Bhunia, Pinaki Nath Chowdhury et al.

CVPR 2024arXiv:2403.09344
14
citations
#5138

Prompt-guided Precise Audio Editing with Diffusion Models

Manjie Xu, Chenxing Li, Duzhen Zhang et al.

ICML 2024arXiv:2406.04350
14
citations
#5139

Position: Benchmarking is Limited in Reinforcement Learning Research

Scott Jordan, Adam White, Bruno da Silva et al.

ICML 2024arXiv:2406.16241
14
citations
#5140

X-3D: Explicit 3D Structure Modeling for Point Cloud Recognition

Shuofeng Sun, Yongming Rao, Jiwen Lu et al.

CVPR 2024arXiv:2404.15010
14
citations
#5141

Accelerating Data Generation for Neural Operators via Krylov Subspace Recycling

Hong Wang, Zhongkai Hao, Jie Wang et al.

ICLR 2024spotlightarXiv:2401.09516
14
citations
#5142

ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction

Shaozhe Hao, Kai Han, Zhengyao Lv et al.

ECCV 2024arXiv:2407.07077
14
citations
#5143

DGD: Dynamic 3D Gaussians Distillation

Isaac Labe, Noam Issachar, Itai Lang et al.

ECCV 2024arXiv:2405.19321
14
citations
#5144

Policy Learning for Balancing Short-Term and Long-Term Rewards

Peng Wu, Ziyu Shen, Feng Xie et al.

ICML 2024arXiv:2405.03329
14
citations
#5145

UniCode : Learning a Unified Codebook for Multimodal Large Language Models

Sipeng Zheng, Bohan Zhou, Yicheng Feng et al.

ECCV 2024arXiv:2403.09072
14
citations
#5146

Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning

Fan-Ming Luo, Tian Xu, Xingchen Cao et al.

ICLR 2024spotlightarXiv:2310.05422
14
citations
#5147

Learning Representations on the Unit Sphere: Investigating Angular Gaussian and Von Mises-Fisher Distributions for Online Continual Learning

Nicolas Michel, Giovanni Chierchia, Romain Negrel et al.

AAAI 2024paperarXiv:2306.03364
14
citations
#5148

Backdoor Secrets Unveiled: Identifying Backdoor Data with Optimized Scaled Prediction Consistency

Soumyadeep Pal, Yuguang Yao, Ren Wang et al.

ICLR 2024arXiv:2403.10717
14
citations
#5149

LimeAttack: Local Explainable Method for Textual Hard-Label Adversarial Attack

AAAI 2024paperarXiv:2308.00319
14
citations
#5150

Neural Contractive Dynamical Systems

Hadi Beik Mohammadi, Søren Hauberg, Georgios Arvanitidis et al.

ICLR 2024spotlightarXiv:2401.09352
14
citations
#5151

Slice3D: Multi-Slice Occlusion-Revealing Single View 3D Reconstruction

Yizhi Wang, Wallace Lira, Wenqi Wang et al.

CVPR 2024
14
citations
#5152

MoSAR: Monocular Semi-Supervised Model for Avatar Reconstruction using Differentiable Shading

Abdallah Dib, Luiz Gustavo Hafemann, Emeline Got et al.

CVPR 2024arXiv:2312.13091
14
citations
#5153

Learning Explicit Contact for Implicit Reconstruction of Hand-Held Objects from Monocular Images

Junxing Hu, Hongwen Zhang, Zerui Chen et al.

AAAI 2024paperarXiv:2305.20089
14
citations
#5154

Adaptive Federated Learning with Auto-Tuned Clients

Junhyung Lyle Kim, Mohammad Taha Toghani, Cesar Uribe et al.

ICLR 2024arXiv:2306.11201
14
citations
#5155

Memory-based Adapters for Online 3D Scene Perception

Xiuwei Xu, Chong Xia, Ziwei Wang et al.

CVPR 2024arXiv:2403.06974
14
citations
#5156

Federated Optimization with Doubly Regularized Drift Correction

Xiaowen Jiang, Anton Rodomanov, Sebastian Stich

ICML 2024arXiv:2404.08447
14
citations
#5157

Free-VSC: Free Semantics from Visual Foundation Models for Unsupervised Video Semantic Compression

Yuan Tian, Guo Lu, Guangtao Zhai

ECCV 2024arXiv:2409.11718
14
citations
#5158

Open-Vocabulary Calibration for Fine-tuned CLIP

Shuoyuan Wang, Jindong Wang, Guoqing Wang et al.

ICML 2024arXiv:2402.04655
14
citations
#5159

PointNeRF++: A multi-scale, point-based Neural Radiance Field

Weiwei Sun, Eduard Trulls, Yang-Che Tseng et al.

ECCV 2024arXiv:2312.02362
14
citations
#5160

Protein Multimer Structure Prediction via Prompt Learning

Ziqi Gao, Xiangguo SUN, Zijing Liu et al.

ICLR 2024arXiv:2402.18813
14
citations
#5161

M2Doc: A Multi-Modal Fusion Approach for Document Layout Analysis

Ning Zhang, Hiuyi Cheng, Jiayu Chen et al.

AAAI 2024paper
14
citations
#5162

Dual-Prior Augmented Decoding Network for Long Tail Distribution in HOI Detection

Jiayi Gao, Kongming Liang, Tao Wei et al.

AAAI 2024paper
14
citations
#5163

Reinforcement Learning Meets Visual Odometry

Nico Messikommer, Giovanni Cioffi, Mathias Gehrig et al.

ECCV 2024arXiv:2407.15626
14
citations
#5164

Iterative Ensemble Training with Anti-Gradient Control for Mitigating Memorization in Diffusion Models

Xiao Liu, Xiaoliu Guan, Yu Wu et al.

ECCV 2024arXiv:2407.15328
14
citations
#5165

SeaBird: Segmentation in Bird’s View with Dice Loss Improves Monocular 3D Detection of Large Objects

Abhinav Kumar, Yuliang Guo, Xinyu Huang et al.

CVPR 2024arXiv:2403.20318
14
citations
#5166

DOS: Diverse Outlier Sampling for Out-of-Distribution Detection

Wenyu Jiang, Hao Cheng, MingCai Chen et al.

ICLR 2024arXiv:2306.02031
14
citations
#5167

DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data

Hanrong Ye, Dan Xu

CVPR 2024arXiv:2403.15389
14
citations
#5168

Cell Graph Transformer for Nuclei Classification

Wei Lou, Guanbin Li, Xiang Wan et al.

AAAI 2024paperarXiv:2402.12946
14
citations
#5169

Explaining CLIP's Performance Disparities on Data from Blind/Low Vision Users

Daniela Massiceti, Camilla Longden, Agnieszka Słowik et al.

CVPR 2024arXiv:2311.17315
14
citations
#5170

Norface: Improving Facial Expression Analysis by Identity Normalization

Hanwei Liu, Rudong An, Zhimeng Zhang et al.

ECCV 2024arXiv:2407.15617
14
citations
#5171

DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning

Jianxiong Li, Jinliang Zheng, Yinan Zheng et al.

ICML 2024oralarXiv:2402.18137
14
citations
#5172

Long-term Temporal Context Gathering for Neural Video Compression

Linfeng Qi, Zhaoyang Jia, Jiahao Li et al.

ECCV 2024
14
citations
#5173

Rating-Based Reinforcement Learning

Devin White, Mingkang Wu, Ellen Novoseller et al.

AAAI 2024paperarXiv:2307.16348
14
citations
#5174

Generalizable Facial Expression Recognition

Yuhang Zhang, Xiuqi Zheng, Chenyi Liang et al.

ECCV 2024arXiv:2408.10614
14
citations
#5175

Self-supervised Pocket Pretraining via Protein Fragment-Surroundings Alignment

Bowen Gao, Yinjun JIA, Yuanle Mo et al.

ICLR 2024arXiv:2310.07229
14
citations
#5176

BayOTIDE: Bayesian Online Multivariate Time Series Imputation with Functional Decomposition

Shikai Fang, Qingsong Wen, Yingtao Luo et al.

ICML 2024oralarXiv:2308.14906
14
citations
#5177

Neural Volumetric World Models for Autonomous Driving

Zanming Huang, Jimuyang Zhang, Eshed Ohn-Bar

ECCV 2024
14
citations
#5178

GraCo: Granularity-Controllable Interactive Segmentation

Yian Zhao, Kehan Li, Zesen Cheng et al.

CVPR 2024highlightarXiv:2405.00587
14
citations
#5179

MoEAD: A Parameter-efficient Model for Multi-class Anomaly Detection

Shiyuan Meng, Wenchao Meng, Qihang Zhou et al.

ECCV 2024
14
citations
#5180

Compound Text-Guided Prompt Tuning via Image-Adaptive Cues

Hao Tan, Jun Li, Yizhuang Zhou et al.

AAAI 2024paperarXiv:2312.06401
14
citations
#5181

Multi-Sender Persuasion: A Computational Perspective

Safwan Hossain, Tonghan Wang, Tao Lin et al.

ICML 2024arXiv:2402.04971
14
citations
#5182

Foster Adaptivity and Balance in Learning with Noisy Labels

Mengmeng Sheng, Zeren Sun, Tao Chen et al.

ECCV 2024arXiv:2407.02778
14
citations
#5183

CrossLoco: Human Motion Driven Control of Legged Robots via Guided Unsupervised Reinforcement Learning

Tianyu Li, Hyunyoung Jung, Matthew Gombolay et al.

ICLR 2024arXiv:2309.17046
14
citations
#5184

Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object Detection

QIJIE MO, Yipeng Gao, Shenghao Fu et al.

ECCV 2024arXiv:2407.11499
14
citations
#5185

SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoders

Sheng-Wei Li, Zi-Xiang Wei, Wei-Jie Jack Chen et al.

ECCV 2024arXiv:2407.13460
14
citations
#5186

FineMatch: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction

Hang Hua, Jing Shi, Kushal Kafle et al.

ECCV 2024arXiv:2404.14715
14
citations
#5187

Score Models for Offline Goal-Conditioned Reinforcement Learning

Harshit Sikchi, Rohan Chitnis, Ahmed Touati et al.

ICLR 2024arXiv:2311.02013
14
citations
#5188

PlatoNeRF: 3D Reconstruction in Plato's Cave via Single-View Two-Bounce Lidar

Tzofi Klinghoffer, Xiaoyu Xiang, Siddharth Somasundaram et al.

CVPR 2024arXiv:2312.14239
14
citations
#5189

Estimating Noisy Class Posterior with Part-level Labels for Noisy Label Learning

Rui Zhao, Bin Shi, Jianfei Ruan et al.

CVPR 2024arXiv:2405.05714
14
citations
#5190

Discovering Bias in Latent Space: An Unsupervised Debiasing Approach

Dyah Adila, Shuai Zhang, Boran Han et al.

ICML 2024arXiv:2406.03631
14
citations
#5191

HandBooster: Boosting 3D Hand-Mesh Reconstruction by Conditional Synthesis and Sampling of Hand-Object Interactions

Hao Xu, Li Haipeng, Yinqiao Wang et al.

CVPR 2024arXiv:2403.18575
14
citations
#5192

Neural Active Learning Beyond Bandits

Yikun Ban, Ishika Agarwal, Ziwei Wu et al.

ICLR 2024arXiv:2404.12522
14
citations
#5193

Can We Evaluate Domain Adaptation Models Without Target-Domain Labels?

JIANFEI YANG, Hanjie Qian, Yuecong Xu et al.

ICLR 2024arXiv:2305.18712
14
citations
#5194

Verifying message-passing neural networks via topology-based bounds tightening

Christopher Hojny, Shiqiang Zhang, Juan Campos et al.

ICML 2024arXiv:2402.13937
14
citations
#5195

RLVF: Learning from Verbal Feedback without Overgeneralization

Moritz Stephan, Alexander Khazatsky, Eric Mitchell et al.

ICML 2024arXiv:2402.10893
14
citations
#5196

Grounding Language Models for Visual Entity Recognition

Zilin Xiao, Ming Gong, Paola Cascante-Bonilla et al.

ECCV 2024arXiv:2402.18695
13
citations
#5197

Fed-QSSL: A Framework for Personalized Federated Learning under Bitwidth and Data Heterogeneity

Yiyue Chen, Haris Vikalo, Chianing Wang

AAAI 2024paperarXiv:2312.13380
13
citations
#5198

Retrieval-Augmented Score Distillation for Text-to-3D Generation

Junyoung Seo, Susung Hong, Wooseok Jang et al.

ICML 2024arXiv:2402.02972
13
citations
#5199

In-context Exploration-Exploitation for Reinforcement Learning

Zhenwen Dai, Federico Tomasi, Sina Ghiassian

ICLR 2024arXiv:2403.06826
13
citations
#5200

PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects

Junyi Li, Junfeng Wu, Weizhi Zhao et al.

ECCV 2024arXiv:2407.16696
13
citations