Most Cited 2024 "animation dataset" Papers

ICLR 2024arXiv:2310.06689

#5002

Approximating Nash Equilibria in Normal-Form Games via Stochastic Optimization

Ian Gemp, Luke Marris, Georgios Piliouras

ICML 2024arXiv:2306.00344

#5003

BOtied: Multi-objective Bayesian optimization with tied multivariate ranks

Ji Won Park, Natasa Tagasovska, Michael Maser et al.

CVPR 2024arXiv:2404.06065

#5004

Unified Entropy Optimization for Open-Set Test-Time Adaptation

Zhengqing Gao, Xu-Yao Zhang, Cheng-Lin Liu

ICLR 2024spotlightarXiv:2402.19262

#5005

Masks, Signs, And Learning Rate Rewinding

Advait Gadhikar, Rebekka Burkholz

ICLR 2024arXiv:2402.03807

#5006

SEABO: A Simple Search-Based Method for Offline Imitation Learning

Jiafei Lyu, Xiaoteng Ma, Le Wan et al.

AAAI 2024paperarXiv:2312.15162

#5007

Cycle-Consistency Learning for Captioning and Grounding

Ning Wang, Jiajun Deng, Mingbo Jia

ICLR 2024spotlightarXiv:2403.17124

#5008

Grounding Language Plans in Demonstrations Through Counterfactual Perturbations

Yanwei Wang, Johnson (Tsun-Hsuan) Wang, Jiayuan Mao et al.

AAAI 2024paperarXiv:2211.12417

#5009

ProCC: Progressive Cross-Primitive Compatibility for Open-World Compositional Zero-Shot Learning

Fushuo Huo, Wenchao Xu, Song Guo et al.

ECCV 2024arXiv:2407.12273

#5010

GRIDS: Grouped Multiple-Degradation Restoration with Image Degradation Similarity

Shuo Cao, Yihao Liu, Wenlong Zhang et al.

ECCV 2024arXiv:2408.05205

#5011

Kalman-Inspired Feature Propagation for Video Face Super-Resolution

Ruicheng Feng, Chongyi Li, Chen Change Loy

ICLR 2024arXiv:2401.08865

#5012

The Effect of Intrinsic Dataset Properties on Generalization: Unraveling Learning Differences Between Natural and Medical Images

Nicholas Konz, Maciej Mazurowski

#5013

Sequential Fusion Based Multi-Granularity Consistency for Space-Time Transformer Tracking

Kun Hu, Wenjing Yang, Wanrong Huang et al.

AAAI 2024paperarXiv:2310.17319

#5014

Trust Region Methods for Nonconvex Stochastic Optimization beyond Lipschitz Smoothness

Chenghan Xie, Chenxi Li, Chuwen Zhang et al.

AAAI 2024paperarXiv:2312.09817

#5015

Calibrated One Round Federated Learning with Bayesian Inference in the Predictive Space

Mohsin Hasan, Guojun Zhang, Kaiyang Guo et al.

CVPR 2024arXiv:2406.05271

#5016

USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation

Xiaoqi Wang, Wenbin He, Xiwei Xuan et al.

ICLR 2024arXiv:2303.13455

#5017

CoBIT: A Contrastive Bi-directional Image-Text Generation Model

Haoxuan You, Xiaoyue Guo, Zhecan Wang et al.

ICLR 2024arXiv:2403.09859

#5018

MAMBA: an Effective World Model Approach for Meta-Reinforcement Learning

Zohar Rimon, Tom Jurgenson, Orr Krupnik et al.

ICLR 2024arXiv:2405.00588

#5019

Are Models Biased on Text without Gender-related Language?

Catarina Belém, Preethi Seshadri, Yasaman Razeghi et al.

ICML 2024arXiv:2405.09285

#5020

Positional Knowledge is All You Need: Position-induced Transformer (PiT) for Operator Learning

Junfeng CHEN, Kailiang Wu

ICLR 2024arXiv:2310.15511

#5021

KITAB: Evaluating LLMs on Constraint Satisfaction for Information Retrieval

Marah I Abdin, Suriya Gunasekar, Varun Chandrasekaran et al.

ICLR 2024arXiv:2307.01050

#5022

Transport meets Variational Inference: Controlled Monte Carlo Diffusions

Francisco Vargas, Shreyas Padhy, Denis Blessing et al.

ICLR 2024arXiv:2401.09323

#5023

BENO: Boundary-embedded Neural Operators for Elliptic PDEs

Haixin Wang, Jiaxin Li, Anubhav Dwivedi et al.

CVPR 2024arXiv:2404.04565

#5024

SportsHHI: A Dataset for Human-Human Interaction Detection in Sports Videos

Tao Wu, Runyu He, Gangshan Wu et al.

#5025

UnionFormer: Unified-Learning Transformer with Multi-View Representation for Image Manipulation Detection and Localization

Shuaibo Li, Wei Ma, Jianwei Guo et al.

CVPR 2024

ICLR 2024oralarXiv:2310.03111

#5026

Multi-modal Gaussian Process Variational Autoencoders for Neural and Behavioral Data

Rabia Gondur, Usama Bin Sikandar, Evan Schaffer et al.

AAAI 2024paperarXiv:2312.13306

#5027

Towards Fair Graph Federated Learning via Incentive Mechanisms

Chenglu Pan, Jiarong Xu, Yue Yu et al.

ECCV 2024arXiv:2407.15447

#5028

SIGMA: Sinkhorn-Guided Masked Video Modeling

Mohammadreza Salehi, Michael Dorkenwald, Fida Mohammad Thoker et al.

ICML 2024arXiv:2402.15170

#5029

The Surprising Effectiveness of Skip-Tuning in Diffusion Sampling

Jiajun Ma, Shuchen Xue, Tianyang Hu et al.

#5030

AGS: Affordable and Generalizable Substitute Training for Transferable Adversarial Attack

Ruikui Wang, Yuanfang Guo, Yunhong Wang

ICLR 2024arXiv:2311.16026

#5031

A Neural Framework for Generalized Causal Sensitivity Analysis

Dennis Frauen, Fergus Imrie, Alicia Curth et al.

ICML 2024spotlightarXiv:2405.09782

#5032

Size-invariance Matters: Rethinking Metrics and Losses for Imbalanced Multi-object Salient Object Detection

Feiran Li, Qianqian Xu, Shilong Bao et al.

AAAI 2024paperarXiv:2312.12768

#5033

Mutual-Modality Adversarial Attack with Semantic Perturbation

Jingwen Ye, Ruonan Yu, Songhua Liu et al.

AAAI 2024paperarXiv:2308.09970

#5034

Tackling Vision Language Tasks through Learning Inner Monologues

Diji Yang, Kezhen Chen, Jinmeng Rao et al.

ICML 2024arXiv:2311.00267

#5035

Rethinking Decision Transformer via Hierarchical Reinforcement Learning

Yi Ma, Jianye Hao, Hebin Liang et al.

CVPR 2024arXiv:2306.02240

#5036

ProTeCt: Prompt Tuning for Taxonomic Open Set Classification

Tz-Ying Wu, Chih-Hui Ho, Nuno Vasconcelos

ECCV 2024arXiv:2310.08530

#5037

X-Pose: Detecting Any Keypoints

Jie Yang, AILING ZENG, Ruimao Zhang et al.

ICLR 2024spotlightarXiv:2305.18505

#5038

Provable Reward-Agnostic Preference-Based Reinforcement Learning

Wenhao Zhan, Masatoshi Uehara, Wen Sun et al.

ECCV 2024arXiv:2312.13663

#5039

Free-Editor: Zero-shot Text-driven 3D Scene Editing

Md Nazmul Karim, Hasan Iqbal, Umar Khalid et al.

AAAI 2024paperarXiv:2312.08648

#5040

CLIP-Guided Federated Learning on Heterogeneity and Long-Tailed Data

Jiangming Shi, Shanshan Zheng, Xiangbo Yin et al.

ICLR 2024spotlightarXiv:2310.18737

#5041

Pre-training with Random Orthogonal Projection Image Modeling

Maryam Haghighat, Peyman Moghadam, Shaheer Mohamed et al.

CVPR 2024arXiv:2312.04534

#5042

PICTURE: PhotorealistIC virtual Try-on from UnconstRained dEsigns

Shuliang Ning, Duomin Wang, Yipeng Qin et al.

ICLR 2024arXiv:2311.18207

#5043

Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation

Haruka Kiyohara, Ren Kishimoto, Kosuke Kawakami et al.

AAAI 2024paperarXiv:2312.08916

#5044

Progressive Feature Self-Reinforcement for Weakly Supervised Semantic Segmentation

Jingxuan He, Lechao Cheng, Chaowei Fang et al.

#5045

SyFormer: Structure-Guided Synergism Transformer for Large-Portion Image Inpainting

Jie Wu, Yuchao Feng, Honghui Xu et al.

ICLR 2024arXiv:2305.08960

#5046

One Forward is Enough for Neural Network Training via Likelihood Ratio Method

Jinyang Jiang, Zeliang Zhang, Chenliang Xu et al.

ICLR 2024spotlightarXiv:2403.08840

#5047

NoiseDiffusion: Correcting Noise for Image Interpolation with Diffusion Models beyond Spherical Linear Interpolation

Pengfei Zheng, Yonggang Zhang, Zhen Fang et al.

AAAI 2024paperarXiv:2312.10329

#5048

Perturbation-Invariant Adversarial Training for Neural Ranking Models: Improving the Effectiveness-Robustness Trade-Off

Yuansan Liu, Ruqing Zhang, Mingkun Zhang et al.

ICML 2024arXiv:2405.20355

#5049

Enhancing Adversarial Robustness in SNNs with Sparse Gradients

Yujia Liu, Tong Bu, Ding Jianhao et al.

ICML 2024arXiv:2401.17505

#5050

Arrows of Time for Large Language Models

Vassilis Papadopoulos, Jérémie Wenger, Clement Hongler

ECCV 2024arXiv:2407.08256

#5051

Adaptive Compressed Sensing with Diffusion-Based Posterior Sampling

Noam Elata, Tomer Michaeli, Michael Elad

AAAI 2024paperarXiv:2211.14742

#5052

Dynamic Feature Pruning and Consolidation for Occluded Person Re-identification

YuTeng Ye, Hang Zhou, Jiale Cai et al.

AAAI 2024paperarXiv:2312.09154

#5053

CMG-Net: Robust Normal Estimation for Point Clouds via Chamfer Normal Distance and Multi-Scale Geometry

Yingrui Wu, Mingyang Zhao, Keqiang Li et al.

ICML 2024arXiv:2402.03485

#5054

Attention Meets Post-hoc Interpretability: A Mathematical Perspective

Gianluigi Lopardo, Frederic Precioso, Damien Garreau

AAAI 2024paperarXiv:2312.05736

#5055

ASWT-SGNN: Adaptive Spectral Wavelet Transform-Based Self-Supervised Graph Neural Network

Ruyue Liu, Rong Yin, Yong Liu et al.

#5056

DexFuncGrasp: A Robotic Dexterous Functional Grasp Dataset Constructed from a Cost-Effective Real-Simulation Annotation System

Jinglue Hang, Xiangbo Lin, Tianqiang Zhu et al.

#5057

AttnZero: Efficient Attention Discovery for Vision Transformers

Lujun Li, Zimian Wei, Peijie Dong et al.

AAAI 2024paperarXiv:2301.05997

#5058

Exploiting Auxiliary Caption for Video Grounding

Hongxiang Li, Meng Cao, Xuxin Cheng et al.

#5059

HONGAT: Graph Attention Networks in the Presence of High-Order Neighbors

Heng-Kai Zhang, Yi-Ge Zhang, Zhi Zhou et al.

ECCV 2024arXiv:2407.01872

#5060

Referring Atomic Video Action Recognition

Kunyu Peng, Jia Fu, Kailun Yang et al.

ECCV 2024arXiv:2404.14565

#5061

Where am I? Scene Retrieval with Language

Jiaqi Chen, Daniel Barath, Iro Armeni et al.

ICLR 2024arXiv:2403.06914

#5062

MEND: Meta Demonstration Distillation for Efficient and Effective In-Context Learning

Yichuan Li, Xiyao Ma, Sixing Lu et al.

ECCV 2024arXiv:2311.11325

#5063

MoVideo: Motion-Aware Video Generation with Diffusion Models

Jingyun Liang, Yuchen Fan, Kai Zhang et al.

ICML 2024arXiv:2304.12794

#5064

Expand-and-Cluster: Parameter Recovery of Neural Networks

Flavio Martinelli, Berfin Simsek, Wulfram Gerstner et al.

ECCV 2024arXiv:2404.05729

#5065

Finding Visual Task Vectors

Alberto Hojel, Yutong Bai, Trevor Darrell et al.

#5066

Vision-Language Action Knowledge Learning for Semantic-Aware Action Quality Assessment

Huangbiao Xu, Xiao Ke, Yuezhou Li et al.

#5067

A Good Learner can Teach Better: Teacher-Student Collaborative Knowledge Distillation

Ayan Sengupta, Shantanu Dixit, Md Shad Akhtar et al.

ICLR 2024

ECCV 2024arXiv:2404.09857

#5068

Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL

Fangwei Zhong, Kui Wu, Hai Ci et al.

ICLR 2024arXiv:2404.17947

#5069

Bounding the Expected Robustness of Graph Neural Networks Subject to Node Feature Attacks

Yassine ABBAHADDOU, Sofiane ENNADIR, Johannes Lutzeyer et al.

ECCV 2024arXiv:2312.14232

#5070

Parrot Captions Teach CLIP to Spot Text

Yiqi Lin, Conghui He, Alex Jinpeng Wang et al.

ECCV 2024arXiv:2403.09419

#5071

RoDUS: Robust Decomposition of Static and Dynamic Elements in Urban Scenes

Thang-Anh-Quan Nguyen, Luis G Roldao Jimenez, Nathan Piasco et al.

AAAI 2024paperarXiv:2312.11118

#5072

Explaining Reinforcement Learning Agents through Counterfactual Action Outcomes

Yotam Amitai, Yael Friedler, Ofra Amir

ICML 2024arXiv:2305.15659

#5073

How to Escape Sharp Minima with Random Perturbations

Kwangjun Ahn, Ali Jadbabaie, Suvrit Sra

CVPR 2024highlightarXiv:2405.14847

#5074

Neural Directional Encoding for Efficient and Accurate View-Dependent Appearance Modeling

Liwen Wu, Sai Bi, Zexiang Xu et al.

CVPR 2024arXiv:2312.09056

#5075

ReCoRe: Regularized Contrastive Representation Learning of World Model

Rudra P. K. Poudel, Harit Pandya et al.

ECCV 2024arXiv:2407.04458

#5076

Robust Multimodal Learning via Representation Decoupling

Shicai Wei, Yang Luo, Yuji Wang et al.

AAAI 2024paperarXiv:2309.07708

#5077

Market-GAN: Adding Control to Financial Market Data Generation with Semantic Context

Haochong Xia, Shuo Sun, Xinrun Wang et al.

ECCV 2024arXiv:2407.10330

#5078

Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors

Jae Joong Lee, Bosheng Li, Sara Beery et al.

ICML 2024arXiv:2402.08864

#5079

DeepPolar: Inventing Nonlinear Large-Kernel Polar Codes via Deep Learning

Ashwin Hebbar, Sravan Kumar Ankireddy, Hyeji Kim et al.

CVPR 2024arXiv:2403.18092

#5080

OCAI: Improving Optical Flow Estimation by Occlusion and Consistency Aware Interpolation

Jisoo Jeong, Hong Cai, Risheek Garrepalli et al.

ICLR 2024spotlightarXiv:2403.09953

#5081

Online GNN Evaluation Under Test-time Graph Distribution Shifts

Xin Zheng, Dongjin Song, Qingsong Wen et al.

CVPR 2024arXiv:2306.15755

#5082

Adversarial Backdoor Attack by Naturalistic Data Poisoning on Trajectory Prediction in Autonomous Driving

Mozhgan Pourkeshavarz, Mohammad Sabokrou, Amir Rasouli

ICLR 2024arXiv:2306.11626

#5083

Soft Robust MDPs and Risk-Sensitive MDPs: Equivalence, Policy Gradient, and Sample Complexity

Runyu Zhang, Yang Hu, Na Li

CVPR 2024arXiv:2404.18150

#5084

RadSimReal: Bridging the Gap Between Synthetic and Real Data in Radar Object Detection With Simulation

Oded Bialer, Yuval Haitman

ICML 2024arXiv:2310.17075

#5085

HyperFields: Towards Zero-Shot Generation of NeRFs from Text

Sudarshan Babu, Richard Liu, Zi Yu Zhou et al.

CVPR 2024highlightarXiv:2305.14912

#5086

SVDinsTN: A Tensor Network Paradigm for Efficient Structure Search from Regularized Modeling Perspective

Yu-Bang Zheng, Xile Zhao, Junhua Zeng et al.

CVPR 2024arXiv:2404.06443

#5087

Multi-scale Dynamic and Hierarchical Relationship Modeling for Facial Action Units Recognition

Zihan Wang, Siyang Song, Cheng Luo et al.

CVPR 2024arXiv:2312.14985

#5088

UniHuman: A Unified Model For Editing Human Images in the Wild

Nannan Li, Qing Liu, Krishna Kumar Singh et al.

CVPR 2024arXiv:2405.19295

#5089

3D Neural Edge Reconstruction

Lei Li, Songyou Peng, Zehao Yu et al.

CVPR 2024arXiv:2404.05016

#5090

Hyperbolic Learning with Synthetic Captions for Open-World Detection

Fanjie Kong, Yanbei Chen, Jiarui Cai et al.

AAAI 2024paperarXiv:2402.11855

#5091

TriSampler: A Better Negative Sampling Principle for Dense Retrieval

Zhen Yang, Zhou Shao, Yuxiao Dong et al.

#5092

Inspecting Prediction Confidence for Detecting Black-Box Backdoor Attacks

Tong Wang, Yuan Yao, Feng Xu et al.

AAAI 2024paperarXiv:2402.12411

#5093

Deep Structural Knowledge Exploitation and Synergy for Estimating Node Importance Value on Heterogeneous Information Networks

Yankai Chen, Yixiang Fang, Qiongyan Wang et al.

ECCV 2024arXiv:2403.13556

#5094

Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments

Djamahl Etchegaray, Zi Helen Huang, Tatsuya Harada et al.

ECCV 2024arXiv:2401.02402

#5095

3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation

Zihao Xiao, Longlong Jing, Shangxuan Wu et al.

CVPR 2024arXiv:2404.11291

#5096

Closely Interactive Human Reconstruction with Proxemics and Physics-Guided Adaption

Buzhen Huang, Chen Li, Chongyang Xu et al.

ECCV 2024arXiv:2407.18550

#5097

ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic Environments

Taewoong Kim, Cheolhong Min, Byeonghwi Kim et al.

ECCV 2024arXiv:2409.06290

#5098

EntAugment: Entropy-Driven Adaptive Data Augmentation Framework for Image Classification

Suorong Yang, Furao Shen, Jian Zhao

ECCV 2024arXiv:2311.18815

#5099

IMMA: Immunizing text-to-image Models against Malicious Adaptation

Amber Yijia Zheng, Raymond Yeh

ICML 2024arXiv:2402.05367

#5100

Principled Preferential Bayesian Optimization

Wenjie Xu, Wenbin Wang, Yuning Jiang et al.

#5101

CNN Kernels Can Be the Best Shapelets

Eric Qu, Yansen Wang, Xufang Luo et al.

ICLR 2024

#5102

Dual-Window Multiscale Transformer for Hyperspectral Snapshot Compressive Imaging

Fulin Luo, Xi Chen, Xiuwen Gong et al.

ICLR 2024arXiv:2210.01603

#5103

Neural-Symbolic Recursive Machine for Systematic Generalization

Qing Li, Yixin Zhu, Yitao Liang et al.

CVPR 2024arXiv:2402.02352

#5104

Region-Based Representations Revisited

Michal Shlapentokh-Rothman, Ansel Blume, Yao Xiao et al.

CVPR 2024highlightarXiv:2405.06283

#5105

Novel Class Discovery for Ultra-Fine-Grained Visual Categorization

Qi Jia, Yaqi Cai, Qi Jia et al.

ICML 2024arXiv:2405.16057

#5106

SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models

Xudong LU, Aojun Zhou, Yuhui Xu et al.

CVPR 2024highlightarXiv:2405.05587

#5107

Navigate Beyond Shortcuts: Debiased Learning Through the Lens of Neural Collapse

Yining Wang, Junjie Sun, Chenyue Wang et al.

ICML 2024arXiv:2402.01242

#5108

Two Heads Are Better Than One: Boosting Graph Sparse Training via Semantic and Topological Awareness

Guibin Zhang, Yanwei Yue, Kun Wang et al.

CVPR 2024arXiv:2307.07313

#5109

HEAL-SWIN: A Vision Transformer On The Sphere

Oscar Carlsson, Jan E. Gerken, Hampus Linander et al.

ECCV 2024arXiv:2409.06703

#5110

LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation

Archana Swaminathan, Anubhav Anubhav, Kamal Gupta et al.

CVPR 2024arXiv:2403.17601

#5111

LASIL: Learner-Aware Supervised Imitation Learning For Long-term Microscopic Traffic Simulation

Ke Guo, Zhenwei Miao, Wei Jing et al.

ICLR 2024arXiv:2310.16960

#5112

Privately Aligning Language Models with Reinforcement Learning

Fan Wu, Huseyin Inan, Arturs Backurs et al.

#5113

Robust Distillation via Untargeted and Targeted Intermediate Adversarial Samples

Junhao Dong, Piotr Koniusz, Junxi Chen et al.

CVPR 2024

CVPR 2024arXiv:2402.18956

#5114

WWW: A Unified Framework for Explaining What Where and Why of Neural Networks by Interpretation of Neuron Concepts

Yong Hyun Ahn, Hyeon Bae Kim, Seong Tae Kim

CVPR 2024arXiv:2403.17709

#5115

Groupwise Query Specialization and Quality-Aware Multi-Assignment for Transformer-based Visual Relationship Detection

Jongha Kim, Jihwan Park, Jinyoung Park et al.

CVPR 2024arXiv:2403.08161

#5116

LAFS: Landmark-based Facial Self-supervised Learning for Face Recognition

Zhonglin Sun, Chen Feng, Ioannis Patras et al.

ECCV 2024arXiv:2212.09877

#5117

LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer

Ning Yu, Chia-Chih Chen, Zeyuan Chen et al.

CVPR 2024arXiv:2312.04519

#5118

Bootstrapping Autonomous Driving Radars with Self-Supervised Learning

Yiduo Hao, Sohrab Madani, Junfeng Guan et al.

AAAI 2024paperarXiv:2402.15959

#5119

Towards Robust Image Stitching: An Adaptive Resistance Learning against Compatible Attacks

Zhiying Jiang, Xingyuan Li, Jinyuan Liu et al.

CVPR 2024arXiv:2202.04291

#5120

L2B: Learning to Bootstrap Robust Models for Combating Label Noise

Yuyin Zhou, Xianhang Li, Fengze Liu et al.

CVPR 2024arXiv:2308.15984

#5121

Learning Structure-from-Motion with Graph Attention Networks

Lucas Brynte, José Pedro Iglesias, Carl Olsson et al.

CVPR 2024arXiv:2406.01595

#5122

MultiPly: Reconstruction of Multiple People from Monocular Video in the Wild

Zeren Jiang, Chen Guo, Manuel Kaufmann et al.

CVPR 2024highlightarXiv:2306.04216

#5123

MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos

Jielin Qiu, Jiacheng Zhu, William Han et al.

CVPR 2024arXiv:2403.16205

#5124

Blur2Blur: Blur Conversion for Unsupervised Image Deblurring on Unknown Domains

Bang-Dang Pham, Phong Tran, Anh Tran et al.

ECCV 2024arXiv:2401.17910

#5125

ControlCap: Controllable Region-level Captioning

Yuzhong Zhao, Liu Yue, Zonghao Guo et al.

ICML 2024arXiv:2312.03911

#5126

Improving Gradient-Guided Nested Sampling for Posterior Inference

Pablo Lemos, Nikolay Malkin, Will Handley et al.

#5127

CDPNet: Cross-Modal Dual Phases Network for Point Cloud Completion

Zhenjiang Du, Jiale Dou, Zhitao Liu et al.

AAAI 2024paperarXiv:2312.08200

#5128

SPD-DDPM: Denoising Diffusion Probabilistic Models in the Symmetric Positive Definite Space

Yunchen Li, Zhou Yu, Gaoqi He et al.

CVPR 2024arXiv:2312.13319

#5129

In2SET: Intra-Inter Similarity Exploiting Transformer for Dual-Camera Compressive Hyperspectral Imaging

Xin Wang, Lizhi Wang, Xiangtian Ma et al.

CVPR 2024arXiv:2408.06747

#5130

Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation

Jingyun Wang, Guoliang Kang

CVPR 2024highlightarXiv:2312.06685

#5131

Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models

Shitian Zhao, Zhuowan Li, Yadong Lu et al.

ECCV 2024arXiv:2409.03944

#5132

HUMOS: Human Motion Model Conditioned on Body Shape

Shashank Tripathi, Omid Taheri, Christoph Lassner et al.

ICML 2024arXiv:2406.01451

#5133

SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation

Danni Yang, Jiayi Ji, Yiwei Ma et al.

CVPR 2024arXiv:2402.19270

#5134

Learning Intra-view and Cross-view Geometric Knowledge for Stereo Matching

Rui Gong, Weide Liu, ZAIWANG GU et al.

ECCV 2024arXiv:2408.07147

#5135

Controlling the World by Sleight of Hand

Sruthi Sudhakar, Ruoshi Liu, Basile Van Hoorick et al.

CVPR 2024arXiv:2402.18192

#5136

Misalignment-Robust Frequency Distribution Loss for Image Transformation

Zhangkai Ni, Juncheng Wu, Zian Wang et al.

CVPR 2024arXiv:2403.09344

#5137

SketchINR: A First Look into Sketches as Implicit Neural Representations

Hmrishav Bandyopadhyay, Ayan Kumar Bhunia, Pinaki Nath Chowdhury et al.

ICML 2024arXiv:2406.04350

#5138

Prompt-guided Precise Audio Editing with Diffusion Models

Manjie Xu, Chenxing Li, Duzhen Zhang et al.

ICML 2024arXiv:2406.16241

#5139

Position: Benchmarking is Limited in Reinforcement Learning Research

Scott Jordan, Adam White, Bruno da Silva et al.

CVPR 2024arXiv:2404.15010

#5140

X-3D: Explicit 3D Structure Modeling for Point Cloud Recognition

Shuofeng Sun, Yongming Rao, Jiwen Lu et al.

ICLR 2024spotlightarXiv:2401.09516

#5141

Accelerating Data Generation for Neural Operators via Krylov Subspace Recycling

Hong Wang, Zhongkai Hao, Jie Wang et al.

ECCV 2024arXiv:2407.07077

#5142

ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction

Shaozhe Hao, Kai Han, Zhengyao Lv et al.

ECCV 2024arXiv:2405.19321

#5143

DGD: Dynamic 3D Gaussians Distillation

Isaac Labe, Noam Issachar, Itai Lang et al.

ICML 2024arXiv:2405.03329

#5144

Policy Learning for Balancing Short-Term and Long-Term Rewards

Peng Wu, Ziyu Shen, Feng Xie et al.

ECCV 2024arXiv:2403.09072

#5145

UniCode: Learning a Unified Codebook for Multimodal Large Language Models

Sipeng Zheng, Bohan Zhou, Yicheng Feng et al.

ICLR 2024spotlightarXiv:2310.05422

#5146

Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning

Fan-Ming Luo, Tian Xu, Xingchen Cao et al.

AAAI 2024paperarXiv:2306.03364

#5147

Learning Representations on the Unit Sphere: Investigating Angular Gaussian and Von Mises-Fisher Distributions for Online Continual Learning

Nicolas Michel, Giovanni Chierchia, Romain Negrel et al.

ICLR 2024arXiv:2403.10717

#5148

Backdoor Secrets Unveiled: Identifying Backdoor Data with Optimized Scaled Prediction Consistency

Soumyadeep Pal, Yuguang Yao, Ren Wang et al.

AAAI 2024paperarXiv:2308.00319

#5149

LimeAttack: Local Explainable Method for Textual Hard-Label Adversarial Attack

ICLR 2024spotlightarXiv:2401.09352

#5150

Neural Contractive Dynamical Systems

Hadi Beik Mohammadi, Søren Hauberg, Georgios Arvanitidis et al.

#5151

Slice3D: Multi-Slice Occlusion-Revealing Single View 3D Reconstruction

Yizhi Wang, Wallace Lira, Wenqi Wang et al.

CVPR 2024

CVPR 2024arXiv:2312.13091

#5152

MoSAR: Monocular Semi-Supervised Model for Avatar Reconstruction using Differentiable Shading

Abdallah Dib, Luiz Gustavo Hafemann, Emeline Got et al.

AAAI 2024paperarXiv:2305.20089

#5153

Learning Explicit Contact for Implicit Reconstruction of Hand-Held Objects from Monocular Images

Junxing Hu, Hongwen Zhang, Zerui Chen et al.

ICLR 2024arXiv:2306.11201

#5154

Adaptive Federated Learning with Auto-Tuned Clients

Junhyung Lyle Kim, Mohammad Taha Toghani, Cesar Uribe et al.

CVPR 2024arXiv:2403.06974

#5155

Memory-based Adapters for Online 3D Scene Perception

Xiuwei Xu, Chong Xia, Ziwei Wang et al.

ICML 2024arXiv:2404.08447

#5156

Federated Optimization with Doubly Regularized Drift Correction

Xiaowen Jiang, Anton Rodomanov, Sebastian Stich

ECCV 2024arXiv:2409.11718

#5157

Free-VSC: Free Semantics from Visual Foundation Models for Unsupervised Video Semantic Compression

Yuan Tian, Guo Lu, Guangtao Zhai

ICML 2024arXiv:2402.04655

#5158

Open-Vocabulary Calibration for Fine-tuned CLIP

Shuoyuan Wang, Jindong Wang, Guoqing Wang et al.

ECCV 2024arXiv:2312.02362

#5159

PointNeRF++: A multi-scale, point-based Neural Radiance Field

Weiwei Sun, Eduard Trulls, Yang-Che Tseng et al.

ICLR 2024arXiv:2402.18813

#5160

Protein Multimer Structure Prediction via Prompt Learning

Ziqi Gao, Xiangguo SUN, Zijing Liu et al.

#5161

M2Doc: A Multi-Modal Fusion Approach for Document Layout Analysis

Ning Zhang, Hiuyi Cheng, Jiayu Chen et al.

#5162

Dual-Prior Augmented Decoding Network for Long Tail Distribution in HOI Detection

Jiayi Gao, Kongming Liang, Tao Wei et al.

ECCV 2024arXiv:2407.15626

#5163

Reinforcement Learning Meets Visual Odometry

Nico Messikommer, Giovanni Cioffi, Mathias Gehrig et al.

ECCV 2024arXiv:2407.15328

#5164

Iterative Ensemble Training with Anti-Gradient Control for Mitigating Memorization in Diffusion Models

Xiao Liu, Xiaoliu Guan, Yu Wu et al.

CVPR 2024arXiv:2403.20318

#5165

SeaBird: Segmentation in Bird’s View with Dice Loss Improves Monocular 3D Detection of Large Objects

Abhinav Kumar, Yuliang Guo, Xinyu Huang et al.

ICLR 2024arXiv:2306.02031

#5166

DOS: Diverse Outlier Sampling for Out-of-Distribution Detection

Wenyu Jiang, Hao Cheng, MingCai Chen et al.

CVPR 2024arXiv:2403.15389

#5167

DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data

Hanrong Ye, Dan Xu

AAAI 2024paperarXiv:2402.12946

#5168

Cell Graph Transformer for Nuclei Classification

Wei Lou, Guanbin Li, Xiang Wan et al.

CVPR 2024arXiv:2311.17315

#5169

Explaining CLIP's Performance Disparities on Data from Blind/Low Vision Users

Daniela Massiceti, Camilla Longden, Agnieszka Słowik et al.

ECCV 2024arXiv:2407.15617

#5170

Norface: Improving Facial Expression Analysis by Identity Normalization

Hanwei Liu, Rudong An, Zhimeng Zhang et al.

ICML 2024oralarXiv:2402.18137

#5171

DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning

Jianxiong Li, Jinliang Zheng, Yinan Zheng et al.

#5172

Long-term Temporal Context Gathering for Neural Video Compression

Linfeng Qi, Zhaoyang Jia, Jiahao Li et al.

AAAI 2024paperarXiv:2307.16348

#5173

Rating-Based Reinforcement Learning

Devin White, Mingkang Wu, Ellen Novoseller et al.

ECCV 2024arXiv:2408.10614

#5174

Generalizable Facial Expression Recognition

Yuhang Zhang, Xiuqi Zheng, Chenyi Liang et al.

ICLR 2024arXiv:2310.07229

#5175

Self-supervised Pocket Pretraining via Protein Fragment-Surroundings Alignment

Bowen Gao, Yinjun JIA, Yuanle Mo et al.

ICML 2024oralarXiv:2308.14906

#5176

BayOTIDE: Bayesian Online Multivariate Time Series Imputation with Functional Decomposition

Shikai Fang, Qingsong Wen, Yingtao Luo et al.

#5177

Neural Volumetric World Models for Autonomous Driving

Zanming Huang, Jimuyang Zhang, Eshed Ohn-Bar

CVPR 2024highlightarXiv:2405.00587

#5178

GraCo: Granularity-Controllable Interactive Segmentation

Yian Zhao, Kehan Li, Zesen Cheng et al.

#5179

MoEAD: A Parameter-efficient Model for Multi-class Anomaly Detection

Shiyuan Meng, Wenchao Meng, Qihang Zhou et al.

AAAI 2024paperarXiv:2312.06401

#5180

Compound Text-Guided Prompt Tuning via Image-Adaptive Cues

Hao Tan, Jun Li, Yizhuang Zhou et al.

ICML 2024arXiv:2402.04971

#5181

Multi-Sender Persuasion: A Computational Perspective

Safwan Hossain, Tonghan Wang, Tao Lin et al.

ECCV 2024arXiv:2407.02778

#5182

Foster Adaptivity and Balance in Learning with Noisy Labels

Mengmeng Sheng, Zeren Sun, Tao Chen et al.

ICLR 2024arXiv:2309.17046

#5183

CrossLoco: Human Motion Driven Control of Legged Robots via Guided Unsupervised Reinforcement Learning

Tianyu Li, Hyunyoung Jung, Matthew Gombolay et al.

ECCV 2024arXiv:2407.11499

#5184

Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object Detection

QIJIE MO, Yipeng Gao, Shenghao Fu et al.

ECCV 2024arXiv:2407.13460

#5185

SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoders

Sheng-Wei Li, Zi-Xiang Wei, Wei-Jie Jack Chen et al.

ECCV 2024arXiv:2404.14715

#5186

FineMatch: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction

Hang Hua, Jing Shi, Kushal Kafle et al.

ICLR 2024arXiv:2311.02013

#5187

Score Models for Offline Goal-Conditioned Reinforcement Learning

Harshit Sikchi, Rohan Chitnis, Ahmed Touati et al.

CVPR 2024arXiv:2312.14239

#5188

PlatoNeRF: 3D Reconstruction in Plato's Cave via Single-View Two-Bounce Lidar

Tzofi Klinghoffer, Xiaoyu Xiang, Siddharth Somasundaram et al.

CVPR 2024arXiv:2405.05714

#5189

Estimating Noisy Class Posterior with Part-level Labels for Noisy Label Learning

Rui Zhao, Bin Shi, Jianfei Ruan et al.

ICML 2024arXiv:2406.03631

#5190

Discovering Bias in Latent Space: An Unsupervised Debiasing Approach

Dyah Adila, Shuai Zhang, Boran Han et al.

CVPR 2024arXiv:2403.18575

#5191

HandBooster: Boosting 3D Hand-Mesh Reconstruction by Conditional Synthesis and Sampling of Hand-Object Interactions

Hao Xu, Li Haipeng, Yinqiao Wang et al.

ICLR 2024arXiv:2404.12522

#5192

Neural Active Learning Beyond Bandits

Yikun Ban, Ishika Agarwal, Ziwei Wu et al.

ICLR 2024arXiv:2305.18712

#5193

Can We Evaluate Domain Adaptation Models Without Target-Domain Labels?

JIANFEI YANG, Hanjie Qian, Yuecong Xu et al.

ICML 2024arXiv:2402.13937

#5194

Verifying message-passing neural networks via topology-based bounds tightening

Christopher Hojny, Shiqiang Zhang, Juan Campos et al.

ICML 2024arXiv:2402.10893

#5195

RLVF: Learning from Verbal Feedback without Overgeneralization

Moritz Stephan, Alexander Khazatsky, Eric Mitchell et al.

ECCV 2024arXiv:2402.18695

#5196

Grounding Language Models for Visual Entity Recognition

Zilin Xiao, Ming Gong, Paola Cascante-Bonilla et al.

AAAI 2024paperarXiv:2312.13380

#5197

Fed-QSSL: A Framework for Personalized Federated Learning under Bitwidth and Data Heterogeneity

Yiyue Chen, Haris Vikalo, Chianing Wang

ICML 2024arXiv:2402.02972

#5198

Retrieval-Augmented Score Distillation for Text-to-3D Generation

Junyoung Seo, Susung Hong, Wooseok Jang et al.

ICLR 2024arXiv:2403.06826

#5199

In-context Exploration-Exploitation for Reinforcement Learning

Zhenwen Dai, Federico Tomasi, Sina Ghiassian

ECCV 2024arXiv:2407.16696

#5200

PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects

Junyi Li, Junfeng Wu, Weizhi Zhao et al.