Most Cited 2024 "causal perspective" Papers

12,324 papers found • Page 19 of 62

#3601

Diffusion Model for Dense Matching

Jisu Nam, Gyuseong Lee, Seonwoo Kim et al.

ICLR 2024arXiv:2305.19094
23
citations
#3602

On the Posterior Distribution in Denoising: Application to Uncertainty Quantification

Hila Manor, Tomer Michaeli

ICLR 2024arXiv:2309.13598
23
citations
#3603

F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions

Jie Yang, Xuesong Niu, Nan Jiang et al.

ECCV 2024arXiv:2407.12435
23
citations
#3604

Generalizable Sleep Staging via Multi-Level Domain Alignment

Jiquan Wang, Sha Zhao, Haiteng Jiang et al.

AAAI 2024paperarXiv:2401.05363
23
citations
#3605

Learning to Reweight for Generalizable Graph Neural Network

Zhengyu Chen, Teng Xiao, Kun Kuang et al.

AAAI 2024paper
23
citations
#3606

Tailoring Self-Rationalizers with Multi-Reward Distillation

Sahana Ramnath, Brihi Joshi, Skyler Hallinan et al.

ICLR 2024arXiv:2311.02805
23
citations
#3607

GiT: Towards Generalist Vision Transformer through Universal Language Interface

Haiyang Wang, Hao Tang, Li Jiang et al.

ECCV 2024arXiv:2403.09394
23
citations
#3608

Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning

Haoqi Yuan, Zhancun Mu, Feiyang Xie et al.

ICLR 2024oral
23
citations
#3609

POPDG: Popular 3D Dance Generation with PopDanceSet

Zhenye Luo, Min Ren, Xuecai Hu et al.

CVPR 2024arXiv:2405.03178
23
citations
#3610

milliFlow: Scene Flow Estimation on mmWave Radar Point Cloud for Human Motion Sensing

Fangqiang Ding, Zhen Luo, Peijun Zhao et al.

ECCV 2024arXiv:2306.17010
23
citations
#3611

ModaVerse: Efficiently Transforming Modalities with LLMs

Xinyu Wang, Bohan Zhuang, Qi Wu

CVPR 2024arXiv:2401.06395
23
citations
#3612

Object-Centric Diffusion for Efficient Video Editing

Kumara Kahatapitiya, Adil Karjauv, Davide Abati et al.

ECCV 2024arXiv:2401.05735
23
citations
#3613

Democratizing Fine-grained Visual Recognition with Large Language Models

Mingxuan Liu, Subhankar Roy, Wenjing Li et al.

ICLR 2024arXiv:2401.13837
23
citations
#3614

Maximum Entropy Heterogeneous-Agent Reinforcement Learning

Jiarong Liu, Yifan Zhong, Siyi Hu et al.

ICLR 2024spotlightarXiv:2306.10715
23
citations
#3615

PanoContext-Former: Panoramic Total Scene Understanding with a Transformer

Yuan Dong, Chuan Fang, Liefeng Bo et al.

CVPR 2024arXiv:2305.12497
23
citations
#3616

Explaining Time Series via Contrastive and Locally Sparse Perturbations

Zichuan Liu, Yingying Zhang, Tianchun Wang et al.

ICLR 2024oralarXiv:2401.08552
23
citations
#3617

Hyperbolic Geometric Latent Diffusion Model for Graph Generation

Xingcheng Fu, Yisen Gao, Yuecen Wei et al.

ICML 2024arXiv:2405.03188
23
citations
#3618

AddressCLIP: Empowering Vision-Language Models for City-wide Image Address Localization

Shixiong Xu, Chenghao Zhang, Lubin Fan et al.

ECCV 2024arXiv:2407.08156
22
citations
#3619

RNAFlow: RNA Structure & Sequence Design via Inverse Folding-Based Flow Matching

Divya Nori, Wengong Jin

ICML 2024arXiv:2405.18768
22
citations
#3620

Semantic-aware SAM for Point-Prompted Instance Segmentation

Zhaoyang Wei, Pengfei Chen, Xuehui Yu et al.

CVPR 2024highlightarXiv:2312.15895
22
citations
#3621

Tool-Augmented Reward Modeling

Lei Li, Yekun Chai, Shuohuan Wang et al.

ICLR 2024spotlightarXiv:2310.01045
22
citations
#3622

3D Hand Pose Estimation in Everyday Egocentric Images

Aditya Prakash, Ruisen Tu, Matthew Chang et al.

ECCV 2024arXiv:2312.06583
22
citations
#3623

Causal Representation Learning Made Identifiable by Grouping of Observational Variables

Hiroshi Morioka, Aapo Hyvarinen

ICML 2024oralarXiv:2310.15709
22
citations
#3624

Utility-Fairness Trade-Offs and How to Find Them

Sepehr Dehdashtian, Bashir Sadeghi, Vishnu Naresh Boddeti

CVPR 2024arXiv:2404.09454
22
citations
#3625

Projecting Molecules into Synthesizable Chemical Spaces

Shitong Luo, Wenhao Gao, Zuofan Wu et al.

ICML 2024arXiv:2406.04628
22
citations
#3626

Active Prompt Learning in Vision Language Models

Jihwan Bang, Sumyeong Ahn, Jae-Gil Lee

CVPR 2024arXiv:2311.11178
22
citations
#3627

Rethinking Momentum Knowledge Distillation in Online Continual Learning

Nicolas Michel, Maorong Wang, Ling Xiao et al.

ICML 2024arXiv:2309.02870
22
citations
#3628

Submodular Reinforcement Learning

Manish Prajapat, Mojmir Mutny, Melanie Zeilinger et al.

ICLR 2024spotlightarXiv:2307.13372
22
citations
#3629

FM-OV3D: Foundation Model-Based Cross-Modal Knowledge Blending for Open-Vocabulary 3D Detection

Dongmei Zhang, Chang Li, Renrui Zhang et al.

AAAI 2024paperarXiv:2312.14465
22
citations
#3630

3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation

Songchun Zhang, Yibo Zhang, Quan Zheng et al.

CVPR 2024arXiv:2403.09439
22
citations
#3631

IRGen: Generative Modeling for Image Retrieval

Yidan Zhang, Ting Zhang, Dong Chen et al.

ECCV 2024arXiv:2303.10126
22
citations
#3632

NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields

Muhammad Zubair Irshad, Sergey Zakharov, Vitor Guizilini et al.

ECCV 2024arXiv:2404.01300
22
citations
#3633

Text2LiDAR: Text-guided LiDAR Point Clouds Generation via Equirectangular Transformer

Yang Wu, Kaihua Zhang, Jianjun Qian et al.

ECCV 2024arXiv:2407.19628
22
citations
#3634

How Private are DP-SGD Implementations?

Lynn Chua, Badih Ghazi, Pritish Kamath et al.

ICML 2024arXiv:2403.17673
22
citations
#3635

A Sparsity Principle for Partially Observable Causal Representation Learning

Danru Xu, Dingling Yao, Sébastien Lachapelle et al.

ICML 2024arXiv:2403.08335
22
citations
#3636

MotionChain: Conversational Motion Controllers via Multimodal Prompts

Biao Jiang, Xin Chen, Chi Zhang et al.

ECCV 2024arXiv:2404.01700
22
citations
#3637

Category-Level Multi-Part Multi-Joint 3D Shape Assembly

Yichen Li, Kaichun Mo, Yueqi Duan et al.

CVPR 2024arXiv:2303.06163
22
citations
#3638

DPZero: Private Fine-Tuning of Language Models without Backpropagation

Liang Zhang, Bingcong Li, Kiran Thekumparampil et al.

ICML 2024arXiv:2310.09639
22
citations
#3639

DiffAIL: Diffusion Adversarial Imitation Learning

Bingzheng Wang, Guoqiang Wu, Teng Pang et al.

AAAI 2024paperarXiv:2312.06348
22
citations
#3640

Convolutional Channel-Wise Competitive Learning for the Forward-Forward Algorithm

Andreas Papachristodoulou, Christos Kyrkou, Stelios Timotheou et al.

AAAI 2024paperarXiv:2312.12668
22
citations
#3641

Contrastive Tuning: A Little Help to Make Masked Autoencoders Forget

Johannes Lehner, Benedikt Alkin, Andreas Fürst et al.

AAAI 2024paperarXiv:2304.10520
22
citations
#3642

Position: Understanding LLMs Requires More Than Statistical Generalization

Patrik Reizinger, Szilvia Ujváry, Anna Mészáros et al.

ICML 2024spotlightarXiv:2405.01964
22
citations
#3643

Mind Marginal Non-Crack Regions: Clustering-Inspired Representation Learning for Crack Segmentation

Zhuangzhuang Chen, Zhuonan Lai, Jie Chen et al.

CVPR 2024
22
citations
#3644

DIFFender: Diffusion-Based Adversarial Defense against Patch Attacks

Caixin Kang, Yinpeng Dong, Zhengyi Wang et al.

ECCV 2024arXiv:2306.09124
22
citations
#3645

HMD-Poser: On-Device Real-time Human Motion Tracking from Scalable Sparse Observations

Peng Dai, Yang Zhang, Tao Liu et al.

CVPR 2024arXiv:2403.03561
22
citations
#3646

Continuous Memory Representation for Anomaly Detection

Joo Chan Lee, Taejune Kim, Eunbyung Park et al.

ECCV 2024arXiv:2402.18293
22
citations
#3647

A Diffusion-Based Pre-training Framework for Crystal Property Prediction

Zixing Song, Ziqiao Meng, Irwin King

AAAI 2024paper
22
citations
#3648

Wikiformer: Pre-training with Structured Information of Wikipedia for Ad-Hoc Retrieval

Weihang Su, Qingyao Ai, Xiangsheng Li et al.

AAAI 2024paperarXiv:2312.10661
22
citations
#3649

An Incremental Unified Framework for Small Defect Inspection

Jiaqi Tang, Hao Lu, Xiaogang Xu et al.

ECCV 2024arXiv:2312.08917
22
citations
#3650

PromptFusion: Decoupling Stability and Plasticity for Continual Learning

Haoran Chen, Zuxuan Wu, Xintong Han et al.

ECCV 2024arXiv:2303.07223
22
citations
#3651

Hybrid-Supervised Dual-Search: Leveraging Automatic Learning for Loss-Free Multi-Exposure Image Fusion

Guanyao Wu, Hongming Fu, Jinyuan Liu et al.

AAAI 2024paperarXiv:2309.01113
22
citations
#3652

Adaptive Rational Activations to Boost Deep Reinforcement Learning

Quentin Delfosse, Patrick Schramowski, Martin Mundt et al.

ICLR 2024spotlightarXiv:2102.09407
22
citations
#3653

SLEDGE: Synthesizing Driving Environments with Generative Models and Rule-Based Traffic

Kashyap Chitta, Daniel Dauner, Andreas Geiger

ECCV 2024arXiv:2403.17933
22
citations
#3654

Converting Transformers to Polynomial Form for Secure Inference Over Homomorphic Encryption

Itamar Zimerman, Moran Baruch, Nir Drucker et al.

ICML 2024arXiv:2311.08610
22
citations
#3655

End-to-End Spatio-Temporal Action Localisation with Video Transformers

Alexey Gritsenko, Xuehan Xiong, Josip Djolonga et al.

CVPR 2024arXiv:2304.12160
22
citations
#3656

Why is SAM Robust to Label Noise?

Christina Baek, J Kolter, Aditi Raghunathan

ICLR 2024arXiv:2405.03676
22
citations
#3657

Generalization in Kernel Regression Under Realistic Assumptions

Daniel Barzilai, Ohad Shamir

ICML 2024spotlightarXiv:2312.15995
22
citations
#3658

IMPUS: Image Morphing with Perceptually-Uniform Sampling Using Diffusion Models

Zhaoyuan Yang, Zhengyang Yu, Zhiwei Xu et al.

ICLR 2024arXiv:2311.06792
22
citations
#3659

LiveHPS: LiDAR-based Scene-level Human Pose and Shape Estimation in Free Environment

Yiming Ren, Xiao Han, Chengfeng Zhao et al.

CVPR 2024highlightarXiv:2402.17171
22
citations
#3660

Debiasing Algorithm through Model Adaptation

Tomasz Limisiewicz, David Mareček, Tomáš Musil

ICLR 2024arXiv:2310.18913
22
citations
#3661

LayoutNUWA: Revealing the Hidden Layout Expertise of Large Language Models

Zecheng Tang, Chenfei Wu et al.

ICLR 2024arXiv:2309.09506
22
citations
#3662

LiDAR: Sensing Linear Probing Performance in Joint Embedding SSL Architectures

Vimal Thilak, Chen Huang, Omid Saremi et al.

ICLR 2024spotlightarXiv:2312.04000
22
citations
#3663

PracticalDG: Perturbation Distillation on Vision-Language Models for Hybrid Domain Generalization

Zining Chen, Weiqiu Wang, Zhicheng Zhao et al.

CVPR 2024arXiv:2404.09011
22
citations
#3664

Time-, Memory- and Parameter-Efficient Visual Adaptation

Otniel-Bogdan Mercea, Alexey Gritsenko, Cordelia Schmid et al.

CVPR 2024highlightarXiv:2402.02887
22
citations
#3665

GridFormer: Point-Grid Transformer for Surface Reconstruction

Shengtao Li, Ge Gao, Yudong Liu et al.

AAAI 2024paperarXiv:2401.02292
22
citations
#3666

VecFusion: Vector Font Generation with Diffusion

Vikas Thamizharasan, Difan Liu, Shantanu Agarwal et al.

CVPR 2024highlightarXiv:2312.10540
22
citations
#3667

Understanding Adam Optimizer via Online Learning of Updates: Adam is FTRL in Disguise

Kwangjun Ahn, Zhiyu Zhang, Yunbum Kook et al.

ICML 2024arXiv:2402.01567
22
citations
#3668

UniGen: A Unified Generative Framework for Retrieval and Question Answering with Large Language Models

Xiaoxi Li, Yujia Zhou, Zhicheng Dou

AAAI 2024paperarXiv:2312.11036
22
citations
#3669

Low-Rank Rescaled Vision Transformer Fine-Tuning: A Residual Design Approach

Wei Dong, Xing Zhang, Bihui Chen et al.

CVPR 2024arXiv:2403.19067
22
citations
#3670

NRDF: Neural Riemannian Distance Fields for Learning Articulated Pose Priors

Yannan He, Garvita Tiwari, Tolga Birdal et al.

CVPR 2024highlightarXiv:2403.03122
22
citations
#3671

Candidate Pseudolabel Learning: Enhancing Vision-Language Models by Prompt Tuning with Unlabeled Data

Jiahan Zhang, Qi Wei, Feng Liu et al.

ICML 2024arXiv:2406.10502
22
citations
#3672

Robust Calibration of Large Vision-Language Adapters

Balamurali Murugesan, Julio Silva-Rodríguez, Ismail Ben Ayed et al.

ECCV 2024arXiv:2407.13588
22
citations
#3673

Learning Reward for Robot Skills Using Large Language Models via Self-Alignment

Yuwei Zeng, Yao Mu, Lin Shao

ICML 2024arXiv:2405.07162
22
citations
#3674

Collaborative Control for Geometry-Conditioned PBR Image Generation

Shimon Vainer, Mark Boss, Mathias Parger et al.

ECCV 2024arXiv:2402.05919
22
citations
#3675

Reliability in Semantic Segmentation: Can We Use Synthetic Data?

Thibaut Loiseau, Tuan Hung Vu, Mickael Chen et al.

ECCV 2024arXiv:2312.09231
22
citations
#3676

Summarize the Past to Predict the Future: Natural Language Descriptions of Context Boost Multimodal Object Interaction Anticipation

Razvan Pasca, Alexey Gavryushin, Muhammad Hamza et al.

CVPR 2024arXiv:2301.09209
22
citations
#3677

Latent Space Symmetry Discovery

Jianke Yang, Nima Dehmamy, Robin Walters et al.

ICML 2024arXiv:2310.00105
22
citations
#3678

Rethinking Few-shot 3D Point Cloud Semantic Segmentation

Zhaochong An, Guolei Sun, Yun Liu et al.

CVPR 2024arXiv:2403.00592
22
citations
#3679

Multimodal Molecular Pretraining via Modality Blending

Qiying Yu, Yudi Zhang, Yuyan Ni et al.

ICLR 2024arXiv:2307.06235
22
citations
#3680

CrowdDiff: Multi-hypothesis Crowd Density Estimation using Diffusion Models

Yasiru Ranasinghe, Nithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara et al.

CVPR 2024arXiv:2303.12790
22
citations
#3681

Targeted Representation Alignment for Open-World Semi-Supervised Learning

Ruixuan Xiao, Lei Feng, Kai Tang et al.

CVPR 2024
22
citations
#3682

On the Provable Advantage of Unsupervised Pretraining

Jiawei Ge, Shange Tang, Jianqing Fan et al.

ICLR 2024spotlightarXiv:2303.01566
22
citations
#3683

Online Zero-Shot Classification with CLIP

Qi Qian, Juhua Hu

ECCV 2024arXiv:2408.13320
22
citations
#3684

How to Overcome Curse-of-Dimensionality for Out-of-Distribution Detection?

Soumya Suvra Ghosal, Yiyou Sun, Yixuan Li

AAAI 2024paperarXiv:2312.14452
22
citations
#3685

Language-guided Image Reflection Separation

Haofeng Zhong, Yuchen Hong, Shuchen Weng et al.

CVPR 2024arXiv:2402.11874
22
citations
#3686

PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors

Tianyuan Yuan, Yucheng Mao, Jiawei Yang et al.

ECCV 2024arXiv:2403.09079
22
citations
#3687

Eliminating Feature Ambiguity for Few-Shot Segmentation

Qianxiong Xu, Guosheng Lin, Chen Change Loy et al.

ECCV 2024arXiv:2407.09842
22
citations
#3688

Position: Rethinking Post-Hoc Search-Based Neural Approaches for Solving Large-Scale Traveling Salesman Problems

Yifan Xia, Xianliang Yang, Zichuan Liu et al.

ICML 2024arXiv:2406.03503
22
citations
#3689

Feature Transportation Improves Graph Neural Networks

Moshe Eliasof, Eldad Haber, Eran Treister

AAAI 2024paperarXiv:2307.16092
22
citations
#3690

ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models

İlker Kesen, Andrea Pedrotti, Mustafa Dogan et al.

ICLR 2024oralarXiv:2311.07022
22
citations
#3691

Towards Explainable Joint Models via Information Theory for Multiple Intent Detection and Slot Filling

Xianwei Zhuang, Xuxin Cheng, Yuexian Zou

AAAI 2024paper
22
citations
#3692

OGNI-DC: Robust Depth Completion with Optimization-Guided Neural Iterations

Yiming Zuo, Jia Deng

ECCV 2024arXiv:2406.11711
22
citations
#3693

Attribution-based Explanations that Provide Recourse Cannot be Robust

Hidde Fokkema, Rianne de Heide, Tim van Erven

ICML 2024arXiv:2205.15834
22
citations
#3694

RadEdit: stress-testing biomedical vision models via diffusion image editing

Fernando Pérez-García, Sam Bond-Taylor, Pedro Sanchez et al.

ECCV 2024arXiv:2312.12865
22
citations
#3695

Be Careful What You Smooth For: Label Smoothing Can Be a Privacy Shield but Also a Catalyst for Model Inversion Attacks

Lukas Struppek, Dominik Hintersdorf, Kristian Kersting

ICLR 2024arXiv:2310.06549
22
citations
#3696

GCNext: Towards the Unity of Graph Convolutions for Human Motion Prediction

Xinshun Wang, Qiongjie Cui, Chen Chen et al.

AAAI 2024paperarXiv:2312.11850
22
citations
#3697

Implicit Event-RGBD Neural SLAM

Delin Qu, Chi Yan, Dong Wang et al.

CVPR 2024highlightarXiv:2311.11013
22
citations
#3698

ASAM: Boosting Segment Anything Model with Adversarial Tuning

Bo Li, Haoke Xiao, Lv Tang

CVPR 2024arXiv:2405.00256
22
citations
#3699

Sketch and Refine: Towards Fast and Accurate Lane Detection

Chao Chen, Jie Liu, Chang Zhou et al.

AAAI 2024paperarXiv:2401.14729
22
citations
#3700

Causal Walk: Debiasing Multi-Hop Fact Verification with Front-Door Adjustment

AAAI 2024paperarXiv:2403.02698
22
citations
#3701

VisionTrap: Vision-Augmented Trajectory Prediction Guided by Textual Descriptions

Seokha Moon, Hyun Woo, Hongbeen Park et al.

ECCV 2024arXiv:2407.12345
22
citations
#3702

Grokking in Linear Estimators -- A Solvable Model that Groks without Understanding

Noam Levi, Alon Beck, Yohai Bar-Sinai

ICLR 2024arXiv:2310.16441
22
citations
#3703

CLIP-Gaze: Towards General Gaze Estimation via Visual-Linguistic Model

Pengwei Yin, Guanzhong Zeng, Jingjing Wang et al.

AAAI 2024paperarXiv:2403.05124
22
citations
#3704

Deep SE(3)-Equivariant Geometric Reasoning for Precise Placement Tasks

Ben Eisner, Yi Yang, Todor Davchev et al.

ICLR 2024arXiv:2404.13478
22
citations
#3705

SelEx: Self-Expertise in Fine-Grained Generalized Category Discovery

Sarah Rastegar, Mohammadreza Salehi, Yuki M Asano et al.

ECCV 2024arXiv:2408.14371
22
citations
#3706

Language-Driven Anchors for Zero-Shot Adversarial Robustness

Xiao Li, Wei Zhang, Yining Liu et al.

CVPR 2024arXiv:2301.13096
22
citations
#3707

MonoHair: High-Fidelity Hair Modeling from a Monocular Video

Keyu Wu, Lingchen Yang, Zhiyi Kuang et al.

CVPR 2024arXiv:2403.18356
22
citations
#3708

Extend Your Own Correspondences: Unsupervised Distant Point Cloud Registration by Progressive Distance Extension

Quan Liu, Hongzi Zhu, Zhenxi Wang et al.

CVPR 2024arXiv:2403.03532
22
citations
#3709

REBAR: Retrieval-Based Reconstruction for Time-series Contrastive Learning

Maxwell Xu, Alexander Moreno, Hui Wei et al.

ICLR 2024arXiv:2311.00519
22
citations
#3710

T-Rep: Representation Learning for Time Series using Time-Embeddings

Archibald Fraikin, Adrien Bennetot, Stephanie Allassonniere

ICLR 2024oralarXiv:2310.04486
22
citations
#3711

Spatio-Temporal Turbulence Mitigation: A Translational Perspective

Xingguang Zhang, Nicholas M Chimitt, Yiheng Chi et al.

CVPR 2024arXiv:2401.04244
22
citations
#3712

Bi-ViT: Pushing the Limit of Vision Transformer Quantization

Yanjing Li, Sheng Xu, Mingbao Lin et al.

AAAI 2024paperarXiv:2305.12354
22
citations
#3713

VideoRF: Rendering Dynamic Radiance Fields as 2D Feature Video Streams

Liao Wang, Kaixin Yao, Chengcheng Guo et al.

CVPR 2024arXiv:2312.01407
22
citations
#3714

Estimating Canopy Height at Scale

Jan Pauls, Max Zimmer, Una Kelly et al.

ICML 2024arXiv:2406.01076
22
citations
#3715

Image Clustering Conditioned on Text Criteria

Sehyun Kwon, Jaden Park, Minkyu Kim et al.

ICLR 2024arXiv:2310.18297
22
citations
#3716

SEED: A Simple and Effective 3D DETR in Point Clouds

Zhe Liu, Jinghua Hou, Xiaoqing Ye et al.

ECCV 2024arXiv:2407.10749
22
citations
#3717

Adaptive Slot Attention: Object Discovery with Dynamic Slot Number

Ke Fan, Zechen Bai, Tianjun Xiao et al.

CVPR 2024arXiv:2406.09196
22
citations
#3718

Re-Dock: Towards Flexible and Realistic Molecular Docking with Diffusion Bridge

Yufei Huang, Odin Zhang, Lirong Wu et al.

ICML 2024spotlightarXiv:2402.11459
22
citations
#3719

LeftRefill: Filling Right Canvas based on Left Reference through Generalized Text-to-Image Diffusion Model

Chenjie Cao, Yunuo Cai, Qiaole Dong et al.

CVPR 2024arXiv:2305.11577
22
citations
#3720

SAMFlow: Eliminating Any Fragmentation in Optical Flow with Segment Anything Model

AAAI 2024paperarXiv:2307.16586
22
citations
#3721

Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion

Xiang Fan, Anand Bhattad, Ranjay Krishna

ECCV 2024arXiv:2403.14617
22
citations
#3722

Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning

Desai Xie, Jiahao Li, Hao Tan et al.

CVPR 2024arXiv:2312.13980
22
citations
#3723

DIM: Dyadic Interaction Modeling for Social Behavior Generation

Minh Tran, Di Chang, Maksim Siniukov et al.

ECCV 2024
22
citations
#3724

Latent Modulated Function for Computational Optimal Continuous Image Representation

Zongyao He, Zhi Jin

CVPR 2024highlightarXiv:2404.16451
22
citations
#3725

ViLA: Efficient Video-Language Alignment for Video Question Answering

Xijun Wang, Junbang Liang, Chun-Kai Wang et al.

ECCV 2024arXiv:2312.08367
22
citations
#3726

CSTA: CNN-based Spatiotemporal Attention for Video Summarization

Jaewon Son, Jaehun Park, Kwangsu Kim

CVPR 2024arXiv:2405.11905
22
citations
#3727

Decoding AI’s Nudge: A Unified Framework to Predict Human Behavior in AI-Assisted Decision Making

Zhuoyan Li, Zhuoran Lu, Ming Yin

AAAI 2024paperarXiv:2401.05840
22
citations
#3728

Mimic: Speaking Style Disentanglement for Speech-Driven 3D Facial Animation

Hui Fu, Zeqing Wang, Ke Gong et al.

AAAI 2024paperarXiv:2312.10877
22
citations
#3729

GENOME: Generative Neuro-Symbolic Visual Reasoning by Growing and Reusing Modules

Zhenfang Chen, Rui Sun, Wenjun Liu et al.

ICLR 2024arXiv:2311.04901
22
citations
#3730

How connectivity structure shapes rich and lazy learning in neural circuits

Yuhan Helena Liu, Aristide Baratin, Jonathan Cornford et al.

ICLR 2024arXiv:2310.08513
22
citations
#3731

Towards Theoretical Understandings of Self-Consuming Generative Models

Shi Fu, Sen Zhang, Yingjie Wang et al.

ICML 2024arXiv:2402.11778
22
citations
#3732

Meaning Representations from Trajectories in Autoregressive Models

Tian Yu Liu, Matthew Trager, Alessandro Achille et al.

ICLR 2024arXiv:2310.18348
22
citations
#3733

Narrative Action Evaluation with Prompt-Guided Multimodal Interaction

Shiyi Zhang, Sule Bai, Guangyi Chen et al.

CVPR 2024arXiv:2404.14471
22
citations
#3734

Boosting of Thoughts: Trial-and-Error Problem Solving with Large Language Models

Sijia Chen, Baochun Li, Di Niu

ICLR 2024arXiv:2402.11140
22
citations
#3735

Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval

Young Kyun Jang, Donghyun Kim, Zihang Meng et al.

CVPR 2024arXiv:2404.15516
22
citations
#3736

Erasing the Bias: Fine-Tuning Foundation Models for Semi-Supervised Learning

Kai Gan, Tong Wei

ICML 2024arXiv:2405.11756
22
citations
#3737

Idempotent Generative Network

Assaf Shocher, Amil Dravid, Yossi Gandelsman et al.

ICLR 2024arXiv:2311.01462
22
citations
#3738

Depth Prompting for Sensor-Agnostic Depth Estimation

Jin-Hwi Park, Chanhwi Jeong, Junoh Lee et al.

CVPR 2024arXiv:2405.11867
22
citations
#3739

Online Continual Learning for Interactive Instruction Following Agents

Byeonghwi Kim, Minhyuk Seo, Jonghyun Choi

ICLR 2024arXiv:2403.07548
22
citations
#3740

HumanRef: Single Image to 3D Human Generation via Reference-Guided Diffusion

Jingbo Zhang, Xiaoyu Li, Qi Zhang et al.

CVPR 2024arXiv:2311.16961
22
citations
#3741

STEER: Assessing the Economic Rationality of Large Language Models

Narun Raman, Taylor Lundy, Samuel Joseph Amouyal et al.

ICML 2024arXiv:2402.09552
22
citations
#3742

COCONut: Modernizing COCO Segmentation

Xueqing Deng, Qihang Yu, Peng Wang et al.

CVPR 2024arXiv:2404.08639
22
citations
#3743

When Semantic Segmentation Meets Frequency Aliasing

Linwei Chen, Lin Gu, Ying Fu

ICLR 2024arXiv:2403.09065
22
citations
#3744

Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech

Szu-Wei Fu, Kuo-Hsuan Hung, Yu Tsao et al.

ICLR 2024arXiv:2402.16321
22
citations
#3745

Optimal Ridge Regularization for Out-of-Distribution Prediction

Pratik Patil, Jin-Hong Du, Ryan Tibshirani

ICML 2024spotlightarXiv:2404.01233
22
citations
#3746

Retrieval-Augmented Open-Vocabulary Object Detection

Jooyeon Kim, Eulrang Cho, Sehyung Kim et al.

CVPR 2024arXiv:2404.05687
22
citations
#3747

TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models

Haomiao Ni, Bernhard Egger, Suhas Lohit et al.

CVPR 2024arXiv:2404.16306
22
citations
#3748

To Grok or not to Grok: Disentangling Generalization and Memorization on Corrupted Algorithmic Datasets

Darshil Doshi, Aritra Das, Tianyu He et al.

ICLR 2024arXiv:2310.13061
22
citations
#3749

Factorized Diffusion: Perceptual Illusions by Noise Decomposition

Daniel Geng, Inbum Park, Andrew Owens

ECCV 2024arXiv:2404.11615
22
citations
#3750

The Generalization Gap in Offline Reinforcement Learning

Ishita Mediratta, Qingfei You, Minqi Jiang et al.

ICLR 2024oralarXiv:2312.05742
22
citations
#3751

Robust-Wide: Robust Watermarking against Instruction-driven Image Editing

Runyi Hu, Jie Zhang, Ting Xu et al.

ECCV 2024arXiv:2402.12688
22
citations
#3752

T-VSL: Text-Guided Visual Sound Source Localization in Mixtures

Tanvir Mahmud, Yapeng Tian, Diana Marculescu

CVPR 2024arXiv:2404.01751
22
citations
#3753

TinyTrain: Resource-Aware Task-Adaptive Sparse Training of DNNs at the Data-Scarce Edge

Young Kwon, Rui Li, Stylianos Venieris et al.

ICML 2024arXiv:2307.09988
22
citations
#3754

TimeSiam: A Pre-Training Framework for Siamese Time-Series Modeling

Jiaxiang Dong, Haixu Wu, Yuxuan Wang et al.

ICML 2024oralarXiv:2402.02475
21
citations
#3755

Forward Learning with Top-Down Feedback: Empirical and Analytical Characterization

Ravi Srinivasan, Francesca Mignacco, Martino Sorbaro et al.

ICLR 2024arXiv:2302.05440
21
citations
#3756

HERGen: Elevating Radiology Report Generation with Longitudinal Data

Fuying Wang, Shenghui Du, Lequan Yu

ECCV 2024arXiv:2407.15158
21
citations
#3757

TeTriRF: Temporal Tri-Plane Radiance Fields for Efficient Free-Viewpoint Video

Minye Wu, Zehao Wang, Georgios Kouros et al.

CVPR 2024arXiv:2312.06713
21
citations
#3758

UVEB: A Large-scale Benchmark and Baseline Towards Real-World Underwater Video Enhancement

Yaofeng Xie, Lingwei Kong, Kai Chen et al.

CVPR 2024arXiv:2404.14542
21
citations
#3759

DreamPropeller: Supercharge Text-to-3D Generation with Parallel Sampling

Linqi Zhou, Andy Shih, Chenlin Meng et al.

CVPR 2024highlightarXiv:2311.17082
21
citations
#3760

Distilling Vision-Language Models on Millions of Videos

Yue Zhao, Long Zhao, Xingyi Zhou et al.

CVPR 2024arXiv:2401.06129
21
citations
#3761

Enhancing the Robustness of Spiking Neural Networks with Stochastic Gating Mechanisms

Jianhao Ding, Zhaofei Yu, Tiejun Huang et al.

AAAI 2024paper
21
citations
#3762

One-Shot Diffusion Mimicker for Handwritten Text Generation

Gang Dai, Yifan Zhang, Quhui Ke et al.

ECCV 2024arXiv:2409.04004
21
citations
#3763

Real-time 3D-aware Portrait Video Relighting

Ziqi Cai, Kaiwen Jiang, Shu-Yu Chen et al.

CVPR 2024highlightarXiv:2410.18355
21
citations
#3764

Multi-Level Neural Scene Graphs for Dynamic Urban Environments

Tobias Fischer, Lorenzo Porzi, Samuel Rota Bulò et al.

CVPR 2024arXiv:2404.00168
21
citations
#3765

ZeST: Zero-Shot Material Transfer from a Single Image

Ta-Ying Cheng, Prafull Sharma, Andrew Markham et al.

ECCV 2024arXiv:2404.06425
21
citations
#3766

VideoMAC: Video Masked Autoencoders Meet ConvNets

Gensheng Pei, Tao Chen, Xiruo Jiang et al.

CVPR 2024arXiv:2402.19082
21
citations
#3767

An Upload-Efficient Scheme for Transferring Knowledge From a Server-Side Pre-trained Generator to Clients in Heterogeneous Federated Learning

Jianqing Zhang, Yang Liu, Yang Hua et al.

CVPR 2024arXiv:2403.15760
21
citations
#3768

Zero-shot Referring Expression Comprehension via Structural Similarity Between Images and Captions

Zeyu Han, Fangrui Zhu, Qianru Lao et al.

CVPR 2024arXiv:2311.17048
21
citations
#3769

Discovering Temporally-Aware Reinforcement Learning Algorithms

Matthew T Jackson, Chris Lu, Louis Kirsch et al.

ICLR 2024oralarXiv:2402.05828
21
citations
#3770

Rethinking Dimensional Rationale in Graph Contrastive Learning from Causal Perspective

Qirui Ji, Jiangmeng Li, Jie Hu et al.

AAAI 2024paperarXiv:2312.10401
21
citations
#3771

Proportional Aggregation of Preferences for Sequential Decision Making

Nikhil Chandak, Shashwat Goel, Dominik Peters

AAAI 2024paperarXiv:2306.14858
21
citations
#3772

Compositional Few-Shot Class-Incremental Learning

Yixiong Zou, Shanghang Zhang, Haichen Zhou et al.

ICML 2024arXiv:2405.17022
21
citations
#3773

Moreau Envelope for Nonconvex Bi-Level Optimization: A Single-Loop and Hessian-Free Solution Strategy

Risheng Liu, Zhu Liu, Wei Yao et al.

ICML 2024arXiv:2405.09927
21
citations
#3774

Few-Shot Character Understanding in Movies as an Assessment to Meta-Learning of Theory-of-Mind

Mo Yu, Qiujing Wang, Shunchi Zhang et al.

ICML 2024arXiv:2211.04684
21
citations
#3775

ZePT: Zero-Shot Pan-Tumor Segmentation via Query-Disentangling and Self-Prompting

Yankai Jiang, Zhongzhen Huang, Rongzhao Zhang et al.

CVPR 2024arXiv:2312.04964
21
citations
#3776

Is Vanilla MLP in Neural Radiance Field Enough for Few-shot View Synthesis?

Hanxin Zhu, Tianyu He, Xin Li et al.

CVPR 2024arXiv:2403.06092
21
citations
#3777

Learning Optimal Contracts: How to Exploit Small Action Spaces

Francesco Bacchiocchi, Matteo Castiglioni, Alberto Marchesi et al.

ICLR 2024arXiv:2309.09801
21
citations
#3778

Improving Plasticity in Online Continual Learning via Collaborative Learning

Maorong Wang, Nicolas Michel, Ling Xiao et al.

CVPR 2024arXiv:2312.00600
21
citations
#3779

Fourier-basis Functions to Bridge Augmentation Gap: Rethinking Frequency Augmentation in Image Classification

Mei Vaish, Shunxin Wang, Nicola Strisciuglio

CVPR 2024arXiv:2403.01944
21
citations
#3780

Conditional Information Bottleneck Approach for Time Series Imputation

MinGyu Choi, Changhee Lee

ICLR 2024oral
21
citations
#3781

MOFDiff: Coarse-grained Diffusion for Metal-Organic Framework Design

Xiang Fu, Tian Xie, Andrew Rosen et al.

ICLR 2024arXiv:2310.10732
21
citations
#3782

Few-Shot Anomaly-Driven Generation for Anomaly Classification and Segmentation

Guan Gui, Bin-Bin Gao, Jun Liu et al.

ECCV 2024arXiv:2505.09263
21
citations
#3783

GRAM: Global Reasoning for Multi-Page VQA

Itshak Blau, Sharon Fogel, Roi Ronen et al.

CVPR 2024arXiv:2401.03411
21
citations
#3784

Unlocking the Potential of Prompt-Tuning in Bridging Generalized and Personalized Federated Learning

Wenlong Deng, Christos Thrampoulidis, Xiaoxiao Li

CVPR 2024arXiv:2310.18285
21
citations
#3785

ZO-AdaMU Optimizer: Adapting Perturbation by the Momentum and Uncertainty in Zeroth-Order Optimization

Shuoran Jiang, Qingcai Chen, Yang Xiang et al.

AAAI 2024paperarXiv:2312.15184
21
citations
#3786

Training-free Video Temporal Grounding using Large-scale Pre-trained Models

Minghang Zheng, Xinhao Cai, Qingchao Chen et al.

ECCV 2024arXiv:2408.16219
21
citations
#3787

IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech Separation

Kai Li, Runxuan Yang, Fuchun Sun et al.

ICML 2024oralarXiv:2308.08143
21
citations
#3788

Purify Unlearnable Examples via Rate-Constrained Variational Autoencoders

Yi Yu, Yufei Wang, Song Xia et al.

ICML 2024arXiv:2405.01460
21
citations
#3789

Pathologies of Predictive Diversity in Deep Ensembles

Geoff Pleiss, Taiga Abe, E. Kelly Buchanan et al.

ICLR 2024arXiv:2302.00704
21
citations
#3790

STAR: Boosting Low-Resource Information Extraction by Structure-to-Text Data Generation with Large Language Models

Mingyu Derek Ma, Xiaoxuan Wang, Po-Nien Kung et al.

AAAI 2024paperarXiv:2305.15090
21
citations
#3791

GOODAT: Towards Test-Time Graph Out-of-Distribution Detection

Luzhi Wang, Di Jin, He Zhang et al.

AAAI 2024paperarXiv:2401.06176
21
citations
#3792

AMEGO: Active Memory from long EGOcentric videos

Gabriele Goletto, Tushar Nagarajan, Giuseppe Averta et al.

ECCV 2024arXiv:2409.10917
21
citations
#3793

Restore Anything with Masks: Leveraging Mask Image Modeling for Blind All-in-One Image Restoration

Chu Jie Qin, Ruiqi Wu, Zikun Liu et al.

ECCV 2024arXiv:2409.19403
21
citations
#3794

Understanding the Effects of Iterative Prompting on Truthfulness

Satyapriya Krishna, Chirag Agarwal, Himabindu Lakkaraju

ICML 2024arXiv:2402.06625
21
citations
#3795

Implicit regularization of deep residual networks towards neural ODEs

Pierre Marion, Yu-Han Wu, Michael Sander et al.

ICLR 2024spotlightarXiv:2309.01213
21
citations
#3796

Question Calibration and Multi-Hop Modeling for Temporal Question Answering

Chao Xue, Di Liang, Pengfei Wang et al.

AAAI 2024paperarXiv:2402.13188
21
citations
#3797

Domain Randomization via Entropy Maximization

Gabriele Tiboni, Pascal Klink, Jan Peters et al.

ICLR 2024arXiv:2311.01885
21
citations
#3798

EDA: Evolving and Distinct Anchors for Multimodal Motion Prediction

Longzhong Lin, Xuewu Lin, Tianwei Lin et al.

AAAI 2024paperarXiv:2312.09501
21
citations
#3799

Quantifying and Analyzing Entity-Level Memorization in Large Language Models

Zhenhong Zhou, Jiuyang Xiang, Chaomeng Chen et al.

AAAI 2024paperarXiv:2308.15727
21
citations
#3800

Navigation Instruction Generation with BEV Perception and Large Language Models

Sheng Fan, Rui Liu, Wenguan Wang et al.

ECCV 2024arXiv:2407.15087
21
citations