Most Cited 2024 "fourier embedding" Papers

12,324 papers found • Page 27 of 62

#5201

PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects

Junyi Li, Junfeng Wu, Weizhi Zhao et al.

ECCV 2024arXiv:2407.16696
13
citations
#5202

Adaptive Proximal Gradient Methods Are Universal Without Approximation

Konstantinos Oikonomidis, Emanuel Laude, Puya Latafat et al.

ICML 2024spotlightarXiv:2402.06271
13
citations
#5203

Full Bayesian Significance Testing via Neural Networks

Zehua Liu, Zimeng Li, Jingyuan Wang et al.

AAAI 2024paper
13
citations
#5204

FADAS: Towards Federated Adaptive Asynchronous Optimization

Yujia Wang, Shiqiang Wang, Songtao Lu et al.

ICML 2024arXiv:2407.18365
13
citations
#5205

DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion

Liao Shen, Tianqi Liu, Huiqiang Sun et al.

ECCV 2024arXiv:2409.09605
13
citations
#5206

Sample-specific Masks for Visual Reprogramming-based Prompting

Chengyi Cai, Zesheng Ye, Lei Feng et al.

ICML 2024spotlightarXiv:2406.03150
13
citations
#5207

ViC-MAE: Self-Supervised Representation Learning from Images and Video with Contrastive Masked Autoencoders

Jefferson Hernandez, Ruben Villegas, Vicente Ordonez

ECCV 2024arXiv:2303.12001
13
citations
#5208

ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer

Jiazhi Guan, Zhiliang Xu, Hang Zhou et al.

ECCV 2024arXiv:2408.03284
13
citations
#5209

PH-Net: Semi-Supervised Breast Lesion Segmentation via Patch-wise Hardness

Siyao Jiang, Huisi Wu, Junyang Chen et al.

CVPR 2024
13
citations
#5210

Federated Causality Learning with Explainable Adaptive Optimization

Dezhi Yang, Xintong He, Jun Wang et al.

AAAI 2024paperarXiv:2312.05540
13
citations
#5211

Scribble Hides Class: Promoting Scribble-Based Weakly-Supervised Semantic Segmentation

Xinliang Zhang, Lei Zhu, Hangzhou He et al.

AAAI 2024paperarXiv:2402.17555
13
citations
#5212

Rethinking the Paradigm of Content Constraints in Unpaired Image-to-Image Translation

Xiuding Cai, Yaoyao Zhu, Dong Miao et al.

AAAI 2024paperarXiv:2211.10867
13
citations
#5213

Locality-Sensitive Hashing-Based Efficient Point Transformer with Applications in High-Energy Physics

Siqi Miao, Zhiyuan Lu, Mia Liu et al.

ICML 2024arXiv:2402.12535
13
citations
#5214

Learning Representations of Satellite Images From Metadata Supervision

Jules Bourcier, Gohar Dashyan, Karteek Alahari et al.

ECCV 2024
13
citations
#5215

FashionERN: Enhance-and-Refine Network for Composed Fashion Image Retrieval

Yanzhe Chen, Huasong Zhong, Xiangteng He et al.

AAAI 2024paper
13
citations
#5216

Superposition Prompting: Improving and Accelerating Retrieval-Augmented Generation

Thomas Merth, Qichen Fu, Mohammad Rastegari et al.

ICML 2024arXiv:2404.06910
13
citations
#5217

M3SOT: Multi-Frame, Multi-Field, Multi-Space 3D Single Object Tracking

Jiaming Liu, Yue Wu, Maoguo Gong et al.

AAAI 2024paperarXiv:2312.06117
13
citations
#5218

Recurrent Early Exits for Federated Learning with Heterogeneous Clients

Royson Lee, Javier Fernandez-Marques, Xu Hu et al.

ICML 2024arXiv:2405.14791
13
citations
#5219

F3Loc: Fusion and Filtering for Floorplan Localization

Changan Chen, Rui Wang, Christoph Vogel et al.

CVPR 2024highlight
13
citations
#5220

OSSCAR: One-Shot Structured Pruning in Vision and Language Models with Combinatorial Optimization

Xiang Meng, Shibal Ibrahim, Kayhan Behdin et al.

ICML 2024arXiv:2403.12983
13
citations
#5221

Multi-Scene Generalized Trajectory Global Graph Solver with Composite Nodes for Multiple Object Tracking

Yan Gao, Haojun Xu, Jie Li et al.

AAAI 2024paperarXiv:2312.08951
13
citations
#5222

MagicEraser: Erasing Any Objects via Semantics-Aware Control

FAN LI, Zixiao Zhang, Yi Huang et al.

ECCV 2024arXiv:2410.10207
13
citations
#5223

Projecting Points to Axes: Oriented Object Detection via Point-Axis Representation

Zeyang Zhao, Qilong Xue, Yifan Bai et al.

ECCV 2024arXiv:2407.08489
13
citations
#5224

Learning Temporal Resolution in Spectrogram for Audio Classification

Haohe Liu, Xubo Liu, Qiuqiang Kong et al.

AAAI 2024paperarXiv:2210.01719
13
citations
#5225

On the Utility of 3D Hand Poses for Action Recognition

Md Salman Shamil, Dibyadip Chatterjee, Fadime Sener et al.

ECCV 2024arXiv:2403.09805
13
citations
#5226

Byzantine-Robust Federated Learning: Impact of Client Subsampling and Local Updates

Youssef Allouah, Sadegh Farhadkhani, Rachid Guerraoui et al.

ICML 2024arXiv:2402.12780
13
citations
#5227

DiffSED: Sound Event Detection with Denoising Diffusion

Swapnil Bhosale, Sauradip Nag, Diptesh Kanojia et al.

AAAI 2024paperarXiv:2308.07293
13
citations
#5228

Brain-ID: Learning Contrast-agnostic Anatomical Representations for Brain Imaging

Peirong Liu, Oula Puonti, Xiaoling Hu et al.

ECCV 2024arXiv:2311.16914
13
citations
#5229

Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing

Xinyu Hu, Pengfei Tang, Simiao Zuo et al.

ICLR 2024arXiv:2310.13855
13
citations
#5230

Coarse-to-Fine Highlighting: Reducing Knowledge Hallucination in Large Language Models

Qitan Lv, Jie Wang, Hanzhu Chen et al.

ICML 2024arXiv:2410.15116
13
citations
#5231

Object Recognition as Next Token Prediction

Kaiyu Yue, Bor-Chun Chen, Jonas Geiping et al.

CVPR 2024highlightarXiv:2312.02142
13
citations
#5232

Chronic Poisoning: Backdoor Attack against Split Learning

Fangchao Yu, Bo Zeng, Kai Zhao et al.

AAAI 2024paper
13
citations
#5233

Generating Enhanced Negatives for Training Language-Based Object Detectors

Shiyu Zhao, Long Zhao, Vijay Kumar BG et al.

CVPR 2024arXiv:2401.00094
13
citations
#5234

Model Inversion Robustness: Can Transfer Learning Help?

Sy-Tuyen Ho, Koh Jun Hao, Keshigeyan Chandrasegaran et al.

CVPR 2024arXiv:2405.05588
13
citations
#5235

UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding

Bowen Shi, Peisen Zhao, Zichen Wang et al.

ECCV 2024arXiv:2401.06397
13
citations
#5236

Retro-fallback: retrosynthetic planning in an uncertain world

Austin Tripp, Krzysztof Maziarz, Sarah Lewis et al.

ICLR 2024arXiv:2310.09270
13
citations
#5237

Enhancing Small Medical Learners with Privacy-preserving Contextual Prompting

xinlu zhang, Shiyang Li, Xianjun Yang et al.

ICLR 2024arXiv:2305.12723
13
citations
#5238

MovingParts: Motion-based 3D Part Discovery in Dynamic Radiance Field

Kaizhi Yang, Xiaoshuai Zhang, Zhiao Huang et al.

ICLR 2024spotlightarXiv:2303.05703
13
citations
#5239

Restoring balance: principled under/oversampling of data for optimal classification

Emanuele Loffredo, Mauro Pastore, Simona Cocco et al.

ICML 2024arXiv:2405.09535
13
citations
#5240

Bayesian Uncertainty for Gradient Aggregation in Multi-Task Learning

Idan Achituve, Idit Diamant, Arnon Netzer et al.

ICML 2024arXiv:2402.04005
13
citations
#5241

FedLoGe: Joint Local and Generic Federated Learning under Long-tailed Data

Zikai Xiao, Zihan Chen, Liyinglan Liu et al.

ICLR 2024arXiv:2401.08977
13
citations
#5242

PAC Prediction Sets Under Label Shift

Wenwen Si, Sangdon Park, Insup Lee et al.

ICLR 2024arXiv:2310.12964
13
citations
#5243

3iGS: Factorised Tensorial Illumination for 3D Gaussian Splatting

Zhe Jun Tang, Tat-Jen Cham

ECCV 2024arXiv:2408.03753
13
citations
#5244

Federated Online Adaptation for Deep Stereo

Matteo Poggi, Fabio Tosi

CVPR 2024arXiv:2405.14873
13
citations
#5245

Task-Driven Causal Feature Distillation: Towards Trustworthy Risk Prediction

Zhixuan Chu, Mengxuan Hu, Qing Cui et al.

AAAI 2024paperarXiv:2312.16113
13
citations
#5246

EvTexture: Event-driven Texture Enhancement for Video Super-Resolution

Dachun Kai, Jiayao Lu, Yueyi Zhang et al.

ICML 2024oralarXiv:2406.13457
13
citations
#5247

ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention

Chenhang He, Ruihuang Li, Guowen Zhang et al.

ECCV 2024arXiv:2401.00912
13
citations
#5248

Reward-Free Curricula for Training Robust World Models

Marc Rigter, Minqi Jiang, Ingmar Posner

ICLR 2024arXiv:2306.09205
13
citations
#5249

Scalable Geometric Fracture Assembly via Co-creation Space among Assemblers

Ruiyuan Zhang, Jiaxiang Liu, Zexi Li et al.

AAAI 2024paperarXiv:2312.12340
13
citations
#5250

Improving Neural Additive Models with Bayesian Principles

Kouroche Bouchiat, Alexander Immer, Hugo Yèche et al.

ICML 2024arXiv:2305.16905
13
citations
#5251

SimCalib: Graph Neural Network Calibration Based on Similarity between Nodes

Boshi Tang, Zhiyong Wu, Xixin Wu et al.

AAAI 2024paperarXiv:2312.11858
13
citations
#5252

Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation

Zhengyuan Xie, Haiquan Lu, Jia-wen Xiao et al.

ECCV 2024arXiv:2407.14142
13
citations
#5253

Boosting 3D Single Object Tracking with 2D Matching Distillation and 3D Pre-training

qiangqiang wu, Yan Xia, Jia Wan et al.

ECCV 2024
13
citations
#5254

BKDSNN: Enhancing the Performance of Learning-based Spiking Neural Networks Training with Blurred Knowledge Distillation

Zekai Xu, Kang You, Qinghai Guo et al.

ECCV 2024arXiv:2407.09083
13
citations
#5255

InstructGIE: Towards Generalizable Image Editing

Zichong Meng, Changdi Yang, Jun Liu et al.

ECCV 2024arXiv:2403.05018
13
citations
#5256

CHAI: Clustered Head Attention for Efficient LLM Inference

Saurabh Agarwal, Bilge Acun, Basil Hosmer et al.

ICML 2024arXiv:2403.08058
13
citations
#5257

Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation

Peng Jin, Hao Li, Zesen Cheng et al.

ECCV 2024arXiv:2407.10528
13
citations
#5258

Missing Modality Prediction for Unpaired Multimodal Learning via Joint Embedding of Unimodal Models

Taesup Kim, Donggeun Kim

ECCV 2024arXiv:2407.12616
13
citations
#5259

Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis

Qi Sun, Hang Zhou, Wengang Zhou et al.

ECCV 2024arXiv:2407.05388
13
citations
#5260

Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image

Pengkun Jiao, Na Zhao, Jingjing Chen et al.

ECCV 2024arXiv:2407.05256
13
citations
#5261

From Posterior Sampling to Meaningful Diversity in Image Restoration

Noa Cohen, Hila Manor, Yuval Bahat et al.

ICLR 2024arXiv:2310.16047
13
citations
#5262

BRUSLEATTACK: A QUERY-EFFICIENT SCORE- BASED BLACK-BOX SPARSE ADVERSARIAL ATTACK

Quoc Viet Vo, Ehsan Abbasnejad, Damith Ranasinghe

ICLR 2024arXiv:2404.05311
13
citations
#5263

Open-Vocabulary Camouflaged Object Segmentation

Youwei Pang, Xiaoqi Zhao, JiaMing Zuo et al.

ECCV 2024arXiv:2311.11241
13
citations
#5264

Geospecific View Generation - Geometry-Context Aware High-resolution Ground View Inference from Satellite Views

Ningli Xu, Rongjun Qin

ECCV 2024arXiv:2407.08061
13
citations
#5265

Mitigating Label Noise through Data Ambiguation

Julian Lienen, Eyke Hüllermeier

AAAI 2024paperarXiv:2305.13764
13
citations
#5266

3D Building Reconstruction from Monocular Remote Sensing Images with Multi-level Supervisions

Weijia Li, Haote Yang, Zhenghao Hu et al.

CVPR 2024arXiv:2404.04823
13
citations
#5267

Realistic Unsupervised CLIP Fine-tuning with Universal Entropy Optimization

Jian Liang, Sheng, Zhengbo Wang et al.

ICML 2024spotlightarXiv:2308.12919
13
citations
#5268

What is Dataset Distillation Learning?

William Yang, Ye Zhu, Zhiwei Deng et al.

ICML 2024arXiv:2406.04284
13
citations
#5269

WildRefer: 3D Object Localization in Large-scale Dynamic Scenes with Multi-modal Visual Data and Natural Language

Zhenxiang Lin, Xidong Peng, peishan cong et al.

ECCV 2024arXiv:2304.05645
13
citations
#5270

Fairness-aware Vision Transformer via Debiased Self-Attention

Yao Qiang, Chengyin Li, Prashant Khanduri et al.

ECCV 2024arXiv:2301.13803
13
citations
#5271

STAMP: Outlier-Aware Test-Time Adaptation with Stable Memory Replay

Yu Yongcan, Lijun Sheng, Ran He et al.

ECCV 2024arXiv:2407.15773
13
citations
#5272

Harnessing Text-to-Image Diffusion Models for Category-Agnostic Pose Estimation

Duo Peng, Zhengbo Zhang, Ping Hu et al.

ECCV 2024
13
citations
#5273

Deeper or Wider: A Perspective from Optimal Generalization Error with Sobolev Loss

Yahong Yang, Juncai He

ICML 2024arXiv:2402.00152
13
citations
#5274

Privacy-Preserving Optics for Enhancing Protection in Face De-Identification

Jhon Lopez, Carlos Hinojosa, Henry Arguello et al.

CVPR 2024arXiv:2404.00777
13
citations
#5275

Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training

Longtian Qiu, Shan Ning, Xuming He

AAAI 2024paperarXiv:2401.02347
13
citations
#5276

OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments

Jinyi Liu, Zhi Wang, Yan Zheng et al.

AAAI 2024paperarXiv:2312.12145
13
citations
#5277

Semi-supervised Segmentation of Histopathology Images with Noise-Aware Topological Consistency

Meilong Xu, Xiaoling Hu, Saumya Gupta et al.

ECCV 2024arXiv:2311.16447
13
citations
#5278

GENIXER: Empowering Multimodal Large Language Models as a Powerful Data Generator

Hengyuan Zhao, Pan Zhou, Mike Zheng Shou

ECCV 2024arXiv:2312.06731
13
citations
#5279

Towards Optimal Feature-Shaping Methods for Out-of-Distribution Detection

Qinyu Zhao, Ming Xu, Kartik Gupta et al.

ICLR 2024arXiv:2402.00865
13
citations
#5280

OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation

Kwanyoung Kim, Yujin Oh, Jong Chul Ye

ECCV 2024arXiv:2403.14183
13
citations
#5281

Few-shot Defect Image Generation based on Consistency Modeling

Qingfeng Shi, Jing Wei, Fei Shen et al.

ECCV 2024arXiv:2408.00372
13
citations
#5282

TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks

Jinjie Mai, Wenxuan Zhu, Sara Rojas Martinez et al.

ECCV 2024arXiv:2408.10739
13
citations
#5283

BAFFLE: A Baseline of Backpropagation-Free Federated Learning

Haozhe Feng, Tianyu Pang, Chao Du et al.

ECCV 2024arXiv:2301.12195
13
citations
#5284

Look, Remember and Reason: Grounded Reasoning in Videos with Language Models

Apratim Bhattacharyya, Sunny Panchal, Reza Pourreza et al.

ICLR 2024oralarXiv:2306.17778
13
citations
#5285

SportsSloMo: A New Benchmark and Baselines for Human-centric Video Frame Interpolation

Jiaben Chen, Huaizu Jiang

CVPR 2024arXiv:2308.16876
13
citations
#5286

Can Textual Semantics Mitigate Sounding Object Segmentation Preference?

Yaoting Wang, Peiwen Sun, Yuanchao Li et al.

ECCV 2024arXiv:2407.10947
13
citations
#5287

GDA: Generalized Diffusion for Robust Test-time Adaptation

Yun-Yun Tsai, Fu-Chen Chen, Albert Chen et al.

CVPR 2024arXiv:2404.00095
13
citations
#5288

Can OOD Object Detectors Learn from Foundation Models?

Jiahui Liu, Xin Wen, Shizhen Zhao et al.

ECCV 2024arXiv:2409.05162
13
citations
#5289

MusicFlow: Cascaded Flow Matching for Text Guided Music Generation

Prajwal K R, Bowen Shi, Matthew Le et al.

ICML 2024arXiv:2410.20478
13
citations
#5290

Tuning-Free Stochastic Optimization

Ahmed Khaled, Chi Jin

ICML 2024spotlightarXiv:2402.07793
13
citations
#5291

Sparse Inducing Points in Deep Gaussian Processes: Enhancing Modeling with Denoising Diffusion Variational Inference

JIAN XU, Delu Zeng, John Paisley

ICML 2024arXiv:2407.17033
13
citations
#5292

3DGazeNet: Generalizing Gaze Estimation with Weak Supervision from Synthetic Views

Evangelos Ververas, Polydefkis Gkagkos, Jiankang Deng et al.

ECCV 2024arXiv:2212.02997
13
citations
#5293

DV-3DLane: End-to-end Multi-modal 3D Lane Detection with Dual-view Representation

Yueru Luo, Shuguang Cui, Zhen Li

ICLR 2024arXiv:2406.16072
13
citations
#5294

Understanding Reconstruction Attacks with the Neural Tangent Kernel and Dataset Distillation

Noel Loo, Ramin Hasani, Mathias Lechner et al.

ICLR 2024arXiv:2302.01428
13
citations
#5295

GAD-PVI: A General Accelerated Dynamic-Weight Particle-Based Variational Inference Framework

Fangyikang Wang, Huminhao Zhu, Chao Zhang et al.

AAAI 2024paperarXiv:2312.16429
13
citations
#5296

MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy Projections

mude hui, Zihao Wei, Hongru Zhu et al.

CVPR 2024arXiv:2403.10815
13
citations
#5297

On the hardness of learning under symmetries

Bobak Kiani, Thien Le, Hannah Lawrence et al.

ICLR 2024spotlightarXiv:2401.01869
13
citations
#5298

Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather

Junsung Park, Kyungmin Kim, Hyunjung Shim

ECCV 2024arXiv:2407.02286
13
citations
#5299

Towards Image Ambient Lighting Normalization

Florin-Alexandru Vasluianu, Tim Seizinger, Zongwei Wu et al.

ECCV 2024arXiv:2403.18730
13
citations
#5300

Distilling Reliable Knowledge for Instance-Dependent Partial Label Learning

Dong-Dong Wu, Deng-Bao Wang, Min-Ling Zhang

AAAI 2024paper
13
citations
#5301

Random Masking Finds Winning Tickets for Parameter Efficient Fine-tuning

Jing Xu, Jingzhao Zhang

ICML 2024arXiv:2405.02596
13
citations
#5302

Enhancing RAW-to-sRGB with Decoupled Style Structure in Fourier Domain

Xuanhua He, Tao Hu, Guoli Wang et al.

AAAI 2024paperarXiv:2401.02161
13
citations
#5303

OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition

Tongjia Chen, Hongshan Yu, Zhengeng Yang et al.

CVPR 2024arXiv:2312.00096
13
citations
#5304

Boosting Adversarial Training via Fisher-Rao Norm-based Regularization

Xiangyu Yin, Wenjie Ruan

CVPR 2024arXiv:2403.17520
13
citations
#5305

A Unified Sampling Framework for Solver Searching of Diffusion Probabilistic Models

Enshu Liu, Xuefei Ning, Huazhong Yang et al.

ICLR 2024arXiv:2312.07243
13
citations
#5306

ISP-Teacher: Image Signal Process with Disentanglement Regularization for Unsupervised Domain Adaptive Dark Object Detection

Yin Zhang, Yongqiang Zhang, Zian Zhang et al.

AAAI 2024paper
13
citations
#5307

An Image is Worth Multiple Words: Discovering Object Level Concepts using Multi-Concept Prompt Learning

Chen Jin, Ryutaro Tanno, Amrutha Saseendran et al.

ICML 2024arXiv:2310.12274
13
citations
#5308

Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D

Mukund Varma T, Peihao Wang, Zhiwen Fan et al.

CVPR 2024arXiv:2403.18922
13
citations
#5309

MultiDelete for Multimodal Machine Unlearning

Jiali Cheng, Hadi Amiri

ECCV 2024arXiv:2311.12047
13
citations
#5310

CAD: Photorealistic 3D Generation via Adversarial Distillation

Ziyu Wan, Despoina Paschalidou, Ian Huang et al.

CVPR 2024arXiv:2312.06663
13
citations
#5311

DoughNet: A Visual Predictive Model for Topological Manipulation of Deformable Objects

Dominik Bauer, Zhenjia Xu, Shuran Song

ECCV 2024arXiv:2404.12524
13
citations
#5312

Brain Decodes Deep Nets

Huzheng Yang, James Gee, Jianbo Shi

CVPR 2024highlightarXiv:2312.01280
13
citations
#5313

Self-Guided Generation of Minority Samples Using Diffusion Models

Soobin Um, Jong Chul Ye

ECCV 2024arXiv:2407.11555
13
citations
#5314

DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation

Xiaobin Hu, Xu Peng, Donghao Luo et al.

ECCV 2024arXiv:2403.06168
13
citations
#5315

On Characterizing the Trade-off in Invariant Representation Learning

Vishnu Boddeti, Sepehr Dehdashtian, Bashir Sadeghi

ICLR 2024arXiv:2109.03386
13
citations
#5316

Move Anything with Layered Scene Diffusion

Jiawei Ren, Mengmeng Xu, Jui-Chieh Wu et al.

CVPR 2024arXiv:2404.07178
13
citations
#5317

Hearing Anything Anywhere

Mason Wang, Ryosuke Sawata, Samuel Clarke et al.

CVPR 2024arXiv:2406.07532
13
citations
#5318

Prediction Error-based Classification for Class-Incremental Learning

Michał Zając, Tinne Tuytelaars, Gido M van de Ven

ICLR 2024arXiv:2305.18806
13
citations
#5319

Offline Multi-Objective Optimization

Ke Xue, Rong-Xi Tan, Xiaobin Huang et al.

ICML 2024arXiv:2406.03722
13
citations
#5320

Zero-Shot Reinforcement Learning via Function Encoders

Tyler Ingebrand, Amy Zhang, Ufuk Topcu

ICML 2024arXiv:2401.17173
13
citations
#5321

Efficient Sharpness-Aware Minimization for Molecular Graph Transformer Models

Yili Wang, Kaixiong Zhou, Ninghao Liu et al.

ICLR 2024arXiv:2406.13137
13
citations
#5322

Provable Multi-Task Representation Learning by Two-Layer ReLU Neural Networks

Liam Collins, Hamed Hassani, Mahdi Soltanolkotabi et al.

ICML 2024arXiv:2307.06887
13
citations
#5323

Generalizing across Temporal Domains with Koopman Operators

QIUHAO Zeng, Wei Wang, Fan Zhou et al.

AAAI 2024paperarXiv:2402.07834
13
citations
#5324

Self-Adaptive Reality-Guided Diffusion for Artifact-Free Super-Resolution

Qingping Zheng, Ling Zheng, Yuanfan Guo et al.

CVPR 2024arXiv:2403.16643
13
citations
#5325

Symmetric Mean-field Langevin Dynamics for Distributional Minimax Problems

Juno Kim, Kakei Yamamoto, Kazusato Oko et al.

ICLR 2024spotlightarXiv:2312.01127
13
citations
#5326

RoDLA: Benchmarking the Robustness of Document Layout Analysis Models

Yufan Chen, Jiaming Zhang, Kunyu Peng et al.

CVPR 2024arXiv:2403.14442
13
citations
#5327

Rethinking Generative Large Language Model Evaluation for Semantic Comprehension

Fangyun Wei, Xi Chen, Lin Luo

ICML 2024arXiv:2403.07872
13
citations
#5328

CVT-xRF: Contrastive In-Voxel Transformer for 3D Consistent Radiance Fields from Sparse Inputs

Yingji Zhong, Lanqing Hong, Zhenguo Li et al.

CVPR 2024arXiv:2403.16885
13
citations
#5329

An Empirical Study of the Generalization Ability of Lidar 3D Object Detectors to Unseen Domains

George Eskandar

CVPR 2024arXiv:2402.17562
13
citations
#5330

Audio-driven Talking Face Generation with Stabilized Synchronization Loss

Dogucan Yaman, Fevziye Irem Eyiokur Yaman, Leonard Bärmann et al.

ECCV 2024arXiv:2307.09368
13
citations
#5331

Enhanced Fine-Grained Motion Diffusion for Text-Driven Human Motion Synthesis

Dong Wei, Xiaoning Sun, Huaijiang Sun et al.

AAAI 2024paperarXiv:2305.13773
13
citations
#5332

Harnessing Density Ratios for Online Reinforcement Learning

Philip Amortila, Dylan Foster, Nan Jiang et al.

ICLR 2024spotlightarXiv:2401.09681
13
citations
#5333

A Neural-Guided Dynamic Symbolic Network for Exploring Mathematical Expressions from Data

Wenqiang Li, Weijun Li, Lina Yu et al.

ICML 2024arXiv:2309.13705
13
citations
#5334

Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing

Vadim Titov, Madina Khalmatova, Alexandra Ivanova et al.

ECCV 2024arXiv:2409.01322
13
citations
#5335

Light Schrödinger Bridge

Alexander Korotin, Nikita Gushchin, Evgeny Burnaev

ICLR 2024arXiv:2310.01174
13
citations
#5336

Conformal Inductive Graph Neural Networks

Soroush H. Zargarbashi, Aleksandar Bojchevski

ICLR 2024arXiv:2407.09173
13
citations
#5337

Adaptive Advantage-Guided Policy Regularization for Offline Reinforcement Learning

Tenglong Liu, Yang Li, Yixing Lan et al.

ICML 2024arXiv:2405.19909
13
citations
#5338

CALICO: Self-Supervised Camera-LiDAR Contrastive Pre-training for BEV Perception

Jiachen Sun, Haizhong Zheng, Qingzhao Zhang et al.

ICLR 2024arXiv:2306.00349
13
citations
#5339

PRES: Toward Scalable Memory-Based Dynamic Graph Neural Networks

Junwei Su, Difan Zou, Chuan Wu

ICLR 2024oralarXiv:2402.04284
13
citations
#5340

Wavelet Dynamic Selection Network for Inertial Sensor Signal Enhancement

Yifeng Wang, Yi Zhao

AAAI 2024paperarXiv:2401.05416
13
citations
#5341

Polygonal Unadjusted Langevin Algorithms: Creating stable and efficient adaptive algorithms for neural networks

Dongyoung Lim, Sotirios Sabanis

ICML 2024arXiv:2105.13937
13
citations
#5342

LQMFormer: Language-aware Query Mask Transformer for Referring Image Segmentation

Nisarg Shah, Vibashan VS, Vishal M. Patel

CVPR 2024
13
citations
#5343

Effective Video Mirror Detection with Inconsistent Motion Cues

Alex Warren, Ke Xu, Jiaying Lin et al.

CVPR 2024
13
citations
#5344

A Circuit Domain Generalization Framework for Efficient Logic Synthesis in Chip Design

Zhihai Wang, Lei Chen, Jie Wang et al.

ICML 2024spotlightarXiv:2309.03208
13
citations
#5345

Taylor Videos for Action Recognition

Lei Wang, Xiuyuan Yuan, Tom Gedeon et al.

ICML 2024oralarXiv:2402.03019
13
citations
#5346

OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models

Zijian Zhou, Zheng Zhu, Holger Caesar et al.

ECCV 2024arXiv:2407.11213
13
citations
#5347

Flattening the Parent Bias: Hierarchical Semantic Segmentation in the Poincaré Ball

Simon Weber, Barış Zöngür, Nikita Araslanov et al.

CVPR 2024arXiv:2404.03778
13
citations
#5348

Fine-Grained Knowledge Selection and Restoration for Non-exemplar Class Incremental Learning

Authors: Jiang-Tian Zhai, Xialei Liu, Lu Yu et al.

AAAI 2024paperarXiv:2312.12722
13
citations
#5349

ContextSeg: Sketch Semantic Segmentation by Querying the Context with Attention

Jiawei Wang, Changjian Li

CVPR 2024arXiv:2311.16682
13
citations
#5350

Multi-Fidelity Residual Neural Processes for Scalable Surrogate Modeling

Brooks(Ruijia) Niu, Dongxia Wu, Kai Kim et al.

ICML 2024arXiv:2402.18846
13
citations
#5351

MAFA: Managing False Negatives for Vision-Language Pre-training

Jaeseok Byun, Dohoon Kim, Taesup Moon

CVPR 2024arXiv:2312.06112
13
citations
#5352

FlashEval: Towards Fast and Accurate Evaluation of Text-to-image Diffusion Generative Models

LIn Zhao, Tianchen Zhao, Zinan Lin et al.

CVPR 2024arXiv:2403.16379
13
citations
#5353

Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models

Luo Jiayun, Siddhesh Khandelwal, Leonid Sigal et al.

CVPR 2024arXiv:2311.17095
13
citations
#5354

SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression

Qingwen Bu, Sungrae Park, Minsoo Khang et al.

AAAI 2024paperarXiv:2308.10531
13
citations
#5355

Self-Supervised High Dynamic Range Imaging with Multi-Exposure Images in Dynamic Scenes

Zhilu Zhang, Haoyu Wang, Shuai Liu et al.

ICLR 2024arXiv:2310.01840
13
citations
#5356

Community-Invariant Graph Contrastive Learning

Shiyin Tan, Dongyuan Li, Renhe Jiang et al.

ICML 2024arXiv:2405.01350
13
citations
#5357

SAPG: Split and Aggregate Policy Gradients

Jayesh Singla, Ananye Agarwal, Deepak Pathak

ICML 2024arXiv:2407.20230
13
citations
#5358

SAMFusion: Sensor-Adaptive Multimodal Fusion for 3D Object Detection in Adverse Weather

Edoardo Palladin, Roland Dietze, Praveen Narayanan et al.

ECCV 2024arXiv:2508.16408
13
citations
#5359

TiNO-Edit: Timestep and Noise Optimization for Robust Diffusion-Based Image Editing

Sherry X. Chen, Yaron Vaxman, Elad Ben Baruch et al.

CVPR 2024arXiv:2404.11120
13
citations
#5360

Identifiability of Direct Effects from Summary Causal Graphs

Simon Ferreira, Charles Assaad

AAAI 2024paperarXiv:2306.16958
13
citations
#5361

CuVLER: Enhanced Unsupervised Object Discoveries through Exhaustive Self-Supervised Transformers

Shahaf Arica, Or Rubin, Sapir Gershov et al.

CVPR 2024arXiv:2403.07700
13
citations
#5362

In-context Exploration-Exploitation for Reinforcement Learning

Zhenwen Dai, Federico Tomasi, Sina Ghiassian

ICLR 2024arXiv:2403.06826
13
citations
#5363

Sign is Not a Remedy: Multiset-to-Multiset Message Passing for Learning on Heterophilic Graphs

Langzhang Liang, Sunwoo Kim, Kijung Shin et al.

ICML 2024arXiv:2405.20652
13
citations
#5364

Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Large Models

Chen Ju, Haicheng Wang, Haozhe Cheng et al.

ECCV 2024arXiv:2407.11717
13
citations
#5365

Representing Part-Whole Hierarchies in Foundation Models by Learning Localizability Composability and Decomposability from Anatomy via Self Supervision

Mohammad Reza Hosseinzadeh Taher, Michael Gotway, Jianming Liang

CVPR 2024
13
citations
#5366

Rethinking Features-Fused-Pyramid-Neck for Object Detection

Hulin Li

ECCV 2024arXiv:2505.12820
13
citations
#5367

Optimal Hessian/Jacobian-Free Nonconvex-PL Bilevel Optimization

Feihu Huang

ICML 2024arXiv:2407.17823
13
citations
#5368

Beyond Prototypes: Semantic Anchor Regularization for Better Representation Learning

Yanqi Ge, Qiang Nie, Ye Huang et al.

AAAI 2024paperarXiv:2312.11872
13
citations
#5369

SeMoLi: What Moves Together Belongs Together

Jenny Seidenschwarz, Aljoša Ošep, Francesco Ferroni et al.

CVPR 2024arXiv:2402.19463
13
citations
#5370

Retrieval-Augmented Score Distillation for Text-to-3D Generation

Junyoung Seo, Susung Hong, Wooseok Jang et al.

ICML 2024arXiv:2402.02972
13
citations
#5371

Non-convex Stochastic Composite Optimization with Polyak Momentum

Yuan Gao, Anton Rodomanov, Sebastian Stich

ICML 2024arXiv:2403.02967
13
citations
#5372

Performative Federated Learning: A Solution to Model-Dependent and Heterogeneous Distribution Shifts

Kun Jin, Tongxin Yin, Zhongzhu Chen et al.

AAAI 2024paperarXiv:2305.05090
13
citations
#5373

HYPO: Hyperspherical Out-Of-Distribution Generalization

Haoyue Bai, Yifei Ming, Julian Katz-Samuels et al.

ICLR 2024arXiv:2402.07785
13
citations
#5374

Editable Image Elements for Controllable Synthesis

Jiteng Mu, Michael Gharbi, Richard Zhang et al.

ECCV 2024arXiv:2404.16029
13
citations
#5375

Towards More Unified In-context Visual Understanding

Dianmo Sheng, Dongdong Chen, Zhentao Tan et al.

CVPR 2024arXiv:2312.02520
13
citations
#5376

An Efficient Tester-Learner for Halfspaces

Aravind Gollakota, Adam Klivans, Konstantinos Stavropoulos et al.

ICLR 2024arXiv:2302.14853
13
citations
#5377

FedWon: Triumphing Multi-domain Federated Learning Without Normalization

Weiming Zhuang, Lingjuan Lyu

ICLR 2024arXiv:2306.05879
13
citations
#5378

Scale Optimization Using Evolutionary Reinforcement Learning for Object Detection on Drone Imagery

Jialu Zhang, Xiaoying Yang, Wentao He et al.

AAAI 2024paperarXiv:2312.15219
13
citations
#5379

SpectralNeRF: Physically Based Spectral Rendering with Neural Radiance Field

Ru Li, Jia Liu, Guanghui Liu et al.

AAAI 2024paperarXiv:2312.08692
13
citations
#5380

3D Multi-frame Fusion for Video Stabilization

Zhan Peng, Xinyi Ye, Weiyue Zhao et al.

CVPR 2024arXiv:2404.12887
13
citations
#5381

Enhancing Ensemble Clustering with Adaptive High-Order Topological Weights

Jiaxuan Xu, Taiyong Li, Lei Duan

AAAI 2024paper
13
citations
#5382

Neural Causal Abstractions

Kevin Xia, Elias Bareinboim

AAAI 2024paperarXiv:2401.02602
13
citations
#5383

Rephrase, Augment, Reason: Visual Grounding of Questions for Vision-Language Models

Archiki Prasad, Elias Stengel-Eskin, Mohit Bansal

ICLR 2024arXiv:2310.05861
13
citations
#5384

Recurrent Distance Filtering for Graph Representation Learning

Yuhui Ding, Antonio Orvieto, Bobby He et al.

ICML 2024arXiv:2312.01538
13
citations
#5385

Every Pixel Has its Moments: Ultra-High-Resolution Unpaired Image-to-Image Translation via Dense Normalization

Ming-Yang Ho, Che-Ming Wu, Min-Sheng Wu et al.

ECCV 2024arXiv:2407.04245
13
citations
#5386

AdapEdit: Spatio-Temporal Guided Adaptive Editing Algorithm for Text-Based Continuity-Sensitive Image Editing

Zhiyuan Ma, Guoli Jia, Bowen Zhou

AAAI 2024paperarXiv:2312.08019
13
citations
#5387

Label-Efficient Few-Shot Semantic Segmentation with Unsupervised Meta-Training

Jianwu Li, Kaiyue Shi, Guo-Sen Xie et al.

AAAI 2024paper
13
citations
#5388

GradTree: Learning Axis-Aligned Decision Trees with Gradient Descent

AAAI 2024paperarXiv:2305.03515
13
citations
#5389

DualBEV: Unifying Dual View Transformation with Probabilistic Correspondences

Peidong Li, Wancheng Shen, Qihao Huang et al.

ECCV 2024arXiv:2403.05402
13
citations
#5390

Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos

Akshay Paruchuri, Samuel Ehrenstein, Shuxian Wang et al.

ECCV 2024arXiv:2403.17915
13
citations
#5391

Measuring Vision-Language STEM Skills of Neural Models

Jianhao Shen, Ye Yuan, Srbuhi Mirzoyan et al.

ICLR 2024arXiv:2402.17205
13
citations
#5392

Teddy: Efficient Large-Scale Dataset Distillation via Taylor-Approximated Matching

Ruonan Yu, Songhua Liu, Jingwen Ye et al.

ECCV 2024arXiv:2410.07579
13
citations
#5393

Representing Topological Self-Similarity Using Fractal Feature Maps for Accurate Segmentation of Tubular Structures

Jiaxing Huang, Yanfeng Zhou, Yaoru Luo et al.

ECCV 2024arXiv:2407.14754
13
citations
#5394

Improving Feature Stability during Upsampling -- Spectral Artifacts and the Importance of Spatial Context

Shashank Agnihotri, Julia Grabinski, Margret Keuper

ECCV 2024arXiv:2311.17524
13
citations
#5395

Parameterized Approximation Algorithms for Sum of Radii Clustering and Variants

Xianrun Chen, Dachuan Xu, Yicheng Xu et al.

AAAI 2024paper
13
citations
#5396

ChEX: Interactive Localization and Region Description in Chest X-rays

Philip Müller, Georgios Kaissis, Daniel Rueckert

ECCV 2024arXiv:2404.15770
13
citations
#5397

How to Train the Teacher Model for Effective Knowledge Distillation

Shayan Mohajer Hamidi, Xizhen Deng, Renhao Tan et al.

ECCV 2024arXiv:2407.18041
13
citations
#5398

Attacking Perceptual Similarity Metrics

Abhijay Ghildyal, Feng Liu

ICLR 2024arXiv:2305.08840
13
citations
#5399

OHTA: One-shot Hand Avatar via Data-driven Implicit Priors

Xiaozheng Zheng, Chao Wen, Zhuo Su et al.

CVPR 2024arXiv:2402.18969
13
citations
#5400

Align and Aggregate: Compositional Reasoning with Video Alignment and Answer Aggregation for Video Question-Answering

Zhaohe Liao, Jiangtong Li, Li Niu et al.

CVPR 2024arXiv:2407.03008
13
citations