Most Cited 2024 "cross-model transfer" Papers

12,324 papers found • Page 17 of 62

Filters:Most Cited 2024 cross-model transfer Clear all

Conference

AAAI 2025 (3,028)COLM 2025 (418)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NEURIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,558)oral (1,594)spotlight (1,421)highlight (975)

#3201

Statistical Perspective of Top-K Sparse Softmax Gating Mixture of Experts

Huy Nguyen, Pedram Akbarian Saravi, Fanqi Yan et al.

ICLR 2024arXiv:2309.13850

citations

#3202

FRED: Towards a Full Rotation-Equivariance in Aerial Image Object Detection

Chanho Lee, Jinsu Son, Hyounguk Shon et al.

AAAI 2024paperarXiv:2401.06159

citations

#3203

Text-Based Occluded Person Re-identification via Multi-Granularity Contrastive Consistency Learning

Xinyi Wu, Wentao Ma, Dan Guo et al.

AAAI 2024paper

citations

#3204

3D Human Pose Perception from Egocentric Stereo Videos

Hiroyasu Akada, Jian Wang, Vladislav Golyanik et al.

CVPR 2024highlightarXiv:2401.00889

citations

#3205

BiSHop: Bi-Directional Cellular Learning for Tabular Data with Generalized Sparse Modern Hopfield Model

Chenwei Xu, Yu-Chao Huang, Jerry Yao-Chieh Hu et al.

ICML 2024arXiv:2404.03830

citations

#3206

GALA: Generating Animatable Layered Assets from a Single Scan

Taeksoo Kim, Byungjun Kim, Shunsuke Saito et al.

CVPR 2024arXiv:2401.12979

citations

#3207

Zero Bubble (Almost) Pipeline Parallelism

Penghui Qi, Xinyi Wan, Guangxing Huang et al.

ICLR 2024

citations

#3208

Prioritized Semantic Learning for Zero-shot Instance Navigation

Xinyu Sun, Lizhao Liu, Hongyan Zhi et al.

ECCV 2024arXiv:2403.11650

citations

#3209

Transforming and Combining Rewards for Aligning Large Language Models

Zihao Wang, Chirag Nagpal, Jonathan Berant et al.

ICML 2024arXiv:2402.00742

citations

#3210

2382 SSMG: Spatial-Semantic Map Guided Diffusion Model for Free-Form Layout-to-Image Generation

Chengyou Jia, Minnan Luo, Zhuohang Dang et al.

AAAI 2024paper

citations

#3211

Image Clustering with External Guidance

Yunfan Li, Peng Hu, Dezhong Peng et al.

ICML 2024arXiv:2310.11989

citations

#3212

SYMBOL: Generating Flexible Black-Box Optimizers through Symbolic Equation Learning

Jiacheng Chen, Zeyuan Ma, Hongshu Guo et al.

ICLR 2024arXiv:2402.02355

citations

#3213

Selective Visual Representations Improve Convergence and Generalization for Embodied AI

Ainaz Eftekhar, Kuo-Hao Zeng, Jiafei Duan et al.

ICLR 2024spotlightarXiv:2311.04193

citations

#3214

Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition

Sihyun Yu, Weili Nie, De-An Huang et al.

ICLR 2024arXiv:2403.14148

citations

#3215

Generative Rendering: Controllable 4D-Guided Video Generation with 2D Diffusion Models

Shengqu Cai, Duygu Ceylan, Matheus Gadelha et al.

CVPR 2024arXiv:2312.01409

citations

#3216

MULDE: Multiscale Log-Density Estimation via Denoising Score Matching for Video Anomaly Detection

Jakub Micorek, Horst Possegger, Dominik Narnhofer et al.

CVPR 2024arXiv:2403.14497

citations

#3217

Improved baselines for vision-language pre-training

Jakob Verbeek, Enrico Fini, Michal Drozdzal et al.

ICLR 2024

citations

#3218

HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning

Fucai Ke, Zhixi Cai, Simindokht Jahangard et al.

ECCV 2024arXiv:2403.12884

citations

#3219

Probabilistically Rewired Message-Passing Neural Networks

Chendi Qian, Andrei Manolache, Kareem Ahmed et al.

ICLR 2024arXiv:2310.02156

citations

#3220

PointBeV: A Sparse Approach for BeV Predictions

Loick Chambon, Éloi Zablocki, Mickaël Chen et al.

CVPR 2024arXiv:2312.00703

citations

#3221

Multi-Class Support Vector Machine with Maximizing Minimum Margin

Feiping Nie, Zhezheng Hao, Rong Wang

AAAI 2024paperarXiv:2312.06578

citations

#3222

GroundVLP: Harnessing Zero-Shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection

Haozhan Shen, Tiancheng Zhao, Mingwei Zhu et al.

AAAI 2024paperarXiv:2312.15043

citations

#3223

Rethinking Boundary Discontinuity Problem for Oriented Object Detection

Hang Xu, Xinyuan Liu, Haonan Xu et al.

CVPR 2024arXiv:2305.10061

citations

#3224

DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing

Jia-Wei Liu, Yan-Pei Cao, Jay Zhangjie Wu et al.

CVPR 2024arXiv:2310.10624

citations

#3225

Out-of-Domain Generalization in Dynamical Systems Reconstruction

Niclas Göring, Florian Hess, Manuel Brenner et al.

ICML 2024arXiv:2402.18377

citations

#3226

UniGS: Unified Representation for Image Generation and Segmentation

Lu Qi, Lehan Yang, Weidong Guo et al.

CVPR 2024arXiv:2312.01985

citations

#3227

Thermometer: Towards Universal Calibration for Large Language Models

Maohao Shen, Subhro Das, Kristjan Greenewald et al.

ICML 2024arXiv:2403.08819

citations

#3228

FuRL: Visual-Language Models as Fuzzy Rewards for Reinforcement Learning

Yuwei Fu, Haichao Zhang, di wu et al.

ICML 2024arXiv:2406.00645

citations

#3229

T-Cal: An Optimal Test for the Calibration of Predictive Models

Donghwan Lee, Xinmeng Huang, Hamed Hassani et al.

ICML 2024arXiv:2203.01850

citations

#3230

HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation

Ce Zhang, Simon Stepputtis, Joseph Campbell et al.

CVPR 2024arXiv:2403.12033

citations

#3231

PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient Task Adaptation

Yizhe Xiong, Hui Chen, Tianxiang Hao et al.

ECCV 2024arXiv:2403.09192

citations

#3232

Learning to design protein-protein interactions with enhanced generalization

Anton Bushuiev, Roman Bushuiev, Petr Kouba et al.

ICLR 2024arXiv:2310.18515

citations

#3233

GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding

Chengyao Wang, Li Jiang, Xiaoyang Wu et al.

CVPR 2024arXiv:2403.09639

citations

#3234

Beyond task performance: evaluating and reducing the flaws of large multimodal models with in-context-learning

Mustafa Shukor, Alexandre Rame, Corentin Dancette et al.

ICLR 2024arXiv:2310.00647

citations

#3235

Generalist Equivariant Transformer Towards 3D Molecular Interaction Learning

Xiangzhe Kong, Wenbing Huang, Yang Liu

ICML 2024arXiv:2306.01474

citations

#3236

SGFormer: Semantic Graph Transformer for Point Cloud-Based 3D Scene Graph Generation

Changsheng Lv, Mengshi Qi, Xia Li et al.

AAAI 2024paperarXiv:2303.11048

citations

#3237

MAPSeg: Unified Unsupervised Domain Adaptation for Heterogeneous Medical Image Segmentation Based on 3D Masked Autoencoding and Pseudo-Labeling

Xuzhe Zhang, Yuhao Wu, Elsa Angelini et al.

CVPR 2024arXiv:2303.09373

citations

#3238

Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators

Yifan Pu, Xia Zhuofan, Jiayi Guo et al.

ECCV 2024arXiv:2408.05710

citations

#3239

DTL: Disentangled Transfer Learning for Visual Recognition

Minghao Fu, Ke Zhu, Jianxin Wu

AAAI 2024paperarXiv:2312.07856

citations

#3240

Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching

Meng Chu, Zhedong Zheng, Wei Ji et al.

ECCV 2024arXiv:2311.12751

citations

#3241

Time-Series Forecasting for Out-of-Distribution Generalization Using Invariant Learning

Haoxin Liu, Harshavardhan Kamarthi, Lingkai Kong et al.

ICML 2024oralarXiv:2406.09130

citations

#3242

ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy

Kirill Vishniakov, Zhiqiang Shen, Zhuang Liu

ICML 2024arXiv:2311.09215

citations

#3243

Towards Robust Offline Reinforcement Learning under Diverse Data Corruption

Rui Yang, Han Zhong, Jiawei Xu et al.

ICLR 2024spotlightarXiv:2310.12955

citations

#3244

StegoGAN: Leveraging Steganography for Non-Bijective Image-to-Image Translation

Sidi Wu, Yizi Chen, Loic Landrieu et al.

CVPR 2024arXiv:2403.20142

citations

#3245

MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation

Petru-Daniel Tudosiu, Yongxin Yang, Shifeng Zhang et al.

CVPR 2024arXiv:2404.02790

citations

#3246

LEGO: Learning EGOcentric Action Frame Generation via Visual Instruction Tuning

Bolin Lai, Xiaoliang Dai, Lawrence Chen et al.

ECCV 2024arXiv:2312.03849

citations

#3247

Recasting Regional Lighting for Shadow Removal

Yuhao Liu, Zhanghan Ke, Ke Xu et al.

AAAI 2024paperarXiv:2402.00341

citations

#3248

Supervised Knowledge Makes Large Language Models Better In-context Learners

Linyi Yang, Shuibai Zhang, Zhuohao Yu et al.

ICLR 2024arXiv:2312.15918

citations

#3249

Delving into the Trajectory Long-tail Distribution for Muti-object Tracking

Sijia Chen, En Yu, Jinyang Li et al.

CVPR 2024arXiv:2403.04700

citations

#3250

Emergence of In-Context Reinforcement Learning from Noise Distillation

Ilya Zisman, Vladislav Kurenkov, Alexander Nikulin et al.

ICML 2024arXiv:2312.12275

citations

#3251

Rethinking Prior Information Generation with CLIP for Few-Shot Segmentation

Jin Wang, Bingfeng Zhang, Jian Pang et al.

CVPR 2024arXiv:2405.08458

citations

#3252

IntrinsicAvatar: Physically Based Inverse Rendering of Dynamic Humans from Monocular Videos via Explicit Ray Tracing

Shaofei Wang, Bozidar Antic, Andreas Geiger et al.

CVPR 2024arXiv:2312.05210

citations

#3253

Decomposed Linear Dynamical Systems (dLDS) for learning the latent components of neural dynamics

Noga Mudrik, Yenho Chen, Eva Yezerets et al.

ICML 2024arXiv:2206.02972

citations

#3254

SchurVINS: Schur Complement-Based Lightweight Visual Inertial Navigation System

Yunfei Fan, Tianyu Zhao, Guidong Wang

CVPR 2024arXiv:2312.01616

citations

#3255

PrE-Text: Training Language Models on Private Federated Data in the Age of LLMs

Charlie Hou, Akshat Shrivastava, Hongyuan Zhan et al.

ICML 2024arXiv:2406.02958

citations

#3256

In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation

Dahyun Kang, Minsu Cho

ECCV 2024arXiv:2408.04961

citations

#3257

Efficient Inference of Vision Instruction-Following Models with Elastic Cache

ZUYAN LIU, Benlin Liu, Jiahui Wang et al.

ECCV 2024arXiv:2407.18121

citations

#3258

PartDistill: 3D Shape Part Segmentation by Vision-Language Model Distillation

Ardian Umam, Cheng-Kun Yang, Min-Hung Chen et al.

CVPR 2024arXiv:2312.04016

citations

#3259

Enhancing Diffusion Models with Text-Encoder Reinforcement Learning

Chaofeng Chen, Annan Wang, Haoning Wu et al.

ECCV 2024arXiv:2311.15657

citations

#3260

Multi-criteria Token Fusion with One-step-ahead Attention for Efficient Vision Transformers

Sanghyeok Lee, Joonmyung Choi, Hyunwoo J. Kim

CVPR 2024arXiv:2403.10030

citations

#3261

DiffusionPoser: Real-time Human Motion Reconstruction From Arbitrary Sparse Sensors Using Autoregressive Diffusion

Tom Van Wouwe, Seunghwan Lee, Antoine Falisse et al.

CVPR 2024arXiv:2308.16682

citations

#3262

Understanding Reasoning Ability of Language Models From the Perspective of Reasoning Paths Aggregation

Xinyi Wang, Alfonso Amayuelas, Kexun Zhang et al.

ICML 2024arXiv:2402.03268

citations

#3263

FedRC: Tackling Diverse Distribution Shifts Challenge in Federated Learning by Robust Clustering

Yongxin Guo, Xiaoying Tang, Tao Lin

ICML 2024arXiv:2301.12379

citations

#3264

A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning

Yinmin Zhang, Jie Liu, Chuming Li et al.

AAAI 2024paperarXiv:2312.07685

citations

#3265

Robust Multi-Task Learning with Excess Risks

Yifei He, Shiji Zhou, Guojun Zhang et al.

ICML 2024arXiv:2402.02009

citations

#3266

On Error Propagation of Diffusion Models

Yangming Li, Mihaela van der Schaar

ICLR 2024arXiv:2308.05021

citations

#3267

CCIL: Continuity-Based Data Augmentation for Corrective Imitation Learning

Liyiming Ke, Yunchu Zhang, Abhay Deshpande et al.

ICLR 2024arXiv:2310.12972

citations

#3268

Catapults in SGD: spikes in the training loss and their impact on generalization through feature learning

Libin Zhu, Chaoyue Liu, Adityanarayanan Radhakrishnan et al.

ICML 2024arXiv:2306.04815

citations

#3269

Dual DETRs for Multi-Label Temporal Action Detection

Yuhan Zhu, Guozhen Zhang, Jing Tan et al.

CVPR 2024arXiv:2404.00653

citations

#3270

Tailored Visions: Enhancing Text-to-Image Generation with Personalized Prompt Rewriting

Zijie Chen, Lichao Zhang, Fangsheng Weng et al.

CVPR 2024arXiv:2310.08129

citations

#3271

Low-Res Leads the Way: Improving Generalization for Super-Resolution by Self-Supervised Learning

Haoyu Chen, Wenbo Li, Jinjin Gu et al.

CVPR 2024arXiv:2403.02601

citations

#3272

Template Free Reconstruction of Human-object Interaction with Procedural Interaction Generation

Xianghui Xie, Bharat Lal Bhatnagar, Jan Lenssen et al.

CVPR 2024highlightarXiv:2312.07063

citations

#3273

Fantastic Generalization Measures are Nowhere to be Found

Michael Gastpar, Ido Nachum, Jonathan Shafer et al.

ICLR 2024arXiv:2309.13658

citations

#3274

WSI-VQA: Interpreting Whole Slide Images by Generative Visual Question Answering

Pingyi Chen, Chenglu Zhu, Sunyi Zheng et al.

ECCV 2024arXiv:2407.05603

citations

#3275

Repeated Random Sampling for Minimizing the Time-to-Accuracy of Learning

Patrik Okanovic, Roger Waleffe, Vasilis Mageirakos et al.

ICLR 2024arXiv:2305.18424

citations

#3276

Multi-Object Tracking in the Dark

Xinzhe Wang, Kang Ma, Qiankun Liu et al.

CVPR 2024arXiv:2405.06600

citations

#3277

Reinformer: Max-Return Sequence Modeling for Offline RL

Zifeng Zhuang, Dengyun Peng, Jinxin Liu et al.

ICML 2024arXiv:2405.08740

citations

#3278

Learning and Forgetting Unsafe Examples in Large Language Models

Jiachen Zhao, Zhun Deng, David Madras et al.

ICML 2024oralarXiv:2312.12736

citations

#3279

Scaling Laws for Associative Memories

Vivien Cabannes, Elvis Dohmatob, Alberto Bietti

ICLR 2024spotlightarXiv:2310.02984

citations

#3280

Psychometry: An Omnifit Model for Image Reconstruction from Human Brain Activity

Ruijie Quan, Wenguan Wang, Zhibo Tian et al.

CVPR 2024arXiv:2403.20022

citations

#3281

Toward Open Vocabulary Aerial Object Detection with CLIP-Activated Student-Teacher Learning

Yan Li, Weiwei Guo, Xue Yang et al.

ECCV 2024arXiv:2311.11646

citations

#3282

AesFA: An Aesthetic Feature

Aware Arbitrary Neural Style Transfer

AAAI 2024paperarXiv:2312.05928

citations

#3283

Generate Subgoal Images before Act: Unlocking the Chain-of-Thought Reasoning in Diffusion Model for Robot Manipulation with Multimodal Prompts

Fei Ni, Jianye Hao, Shiguang Wu et al.

CVPR 2024

citations

#3284

Cross-Domain Policy Adaptation by Capturing Representation Mismatch

Jiafei Lyu, Chenjia Bai, Jing-Wen Yang et al.

ICML 2024arXiv:2405.15369

citations

#3285

MagiCapture: High-Resolution Multi-Concept Portrait Customization

9256 Junha Hyung, Jaeyo Shin, Jaegul Choo

AAAI 2024paperarXiv:2309.06895

citations

#3286

Rayleigh Quotient Graph Neural Networks for Graph-level Anomaly Detection

Xiangyu Dong, Xingyi Zhang, Sibo WANG

ICLR 2024arXiv:2310.02861

citations

#3287

SAN: Inducing Metrizability of GAN with Discriminative Normalized Linear Layer

Yuhta Takida, Masaaki Imaizumi, Takashi Shibuya et al.

ICLR 2024arXiv:2301.12811

citations

#3288

OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection

Hu Zhang, xu jianhua, Tao Tang et al.

ECCV 2024arXiv:2312.08876

citations

#3289

Decomposing and Editing Predictions by Modeling Model Computation

Harshay Shah, Andrew Ilyas, Aleksander Madry

ICML 2024arXiv:2404.11534

citations

#3290

UC-NERF: Neural Radiance Field for Under-Calibrated Multi-View Cameras in Autonomous Driving

Kai Cheng, Xiaoxiao Long, Wei Yin et al.

ICLR 2024oralarXiv:2311.16945

citations

#3291

Learning Iterative Reasoning through Energy Diffusion

Yilun Du, Jiayuan Mao, Josh Tenenbaum

ICML 2024arXiv:2406.11179

citations

#3292

Pairwise Alignment Improves Graph Domain Adaptation

Shikun Liu, Deyu Zou, Han Zhao et al.

ICML 2024spotlightarXiv:2403.01092

citations

#3293

Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation

Yuan Wang, Rui Sun, Naisong Luo et al.

CVPR 2024arXiv:2404.00262

citations

#3294

WateRF: Robust Watermarks in Radiance Fields for Protection of Copyrights

Youngdong Jang, Dong In Lee, MinHyuk Jang et al.

CVPR 2024arXiv:2405.02066

citations

#3295

VITA: ‘Carefully Chosen and Weighted Less’ Is Better in Medication Recommendation

AAAI 2024paperarXiv:2312.12100

citations

#3296

CPP-Net: Embracing Multi-Scale Feature Fusion into Deep Unfolding CP-PPA Network for Compressive Sensing

Zhen Guo, Hongping Gan

CVPR 2024

citations

#3297

Fewer Truncations Improve Language Modeling

Hantian Ding, Zijian Wang, Giovanni Paolini et al.

ICML 2024arXiv:2404.10830

citations

#3298

A Closer Look at GAN Priors: Exploiting Intermediate Features for Enhanced Model Inversion Attacks

Yixiang Qiu, Hao Fang, Hongyao Yu et al.

ECCV 2024arXiv:2407.13863

citations

#3299

Human vs. Generative AI in Content Creation Competition: Symbiosis or Conflict?

Fan Yao, Chuanhao Li, Denis Nekipelov et al.

ICML 2024arXiv:2402.15467

citations

#3300

Federated Generalized Category Discovery

Nan Pu, Wenjing Li, Xinyuan Ji et al.

CVPR 2024arXiv:2305.14107

citations

#3301

Some Fundamental Aspects about Lipschitz Continuity of Neural Networks

Grigory Khromov, Sidak Pal Singh

ICLR 2024arXiv:2302.10886

citations

#3302

Proximal Policy Gradient Arborescence for Quality Diversity Reinforcement Learning

Sumeet Batra, Bryon Tjanaka, Matthew Fontaine et al.

ICLR 2024oralarXiv:2305.13795

citations

#3303

Compositional Preference Models for Aligning LMs

DONGYOUNG GO, Tomek Korbak, Germàn Kruszewski et al.

ICLR 2024arXiv:2310.13011

citations

#3304

Perception-Oriented Video Frame Interpolation via Asymmetric Blending

Guangyang Wu, Xin Tao, Changlin Li et al.

CVPR 2024arXiv:2404.06692

citations

#3305

Improving Single Domain-Generalized Object Detection: A Focus on Diversification and Alignment

Muhammad Sohail Danish, Muhammad Haris Khan, Muhammad Akhtar Munir et al.

CVPR 2024arXiv:2405.14497

citations

#3306

What's in a Prior? Learned Proximal Networks for Inverse Problems

Zhenghan Fang, Sam Buchanan, Jeremias Sulam

ICLR 2024arXiv:2310.14344

citations

#3307

MANUS: Markerless Grasp Capture using Articulated 3D Gaussians

Chandradeep Pokhariya, Ishaan Shah, Angela Xing et al.

CVPR 2024arXiv:2312.02137

citations

#3308

Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes

Yaoting Wang, Peiwen Sun, Dongzhan Zhou et al.

ECCV 2024arXiv:2407.10957

citations

#3309

SimDistill: Simulated Multi-Modal Distillation for BEV 3D Object Detection

Haimei Zhao, Qiming Zhang, Shanshan Zhao et al.

AAAI 2024paperarXiv:2303.16818

citations

#3310

SasWOT: Real-Time Semantic Segmentation Architecture Search WithOut Training

Chendi Zhu, Lujun Li, Yuli Wu et al.

AAAI 2024paper

citations

#3311

Context-based and Diversity-driven Specificity in Compositional Zero-Shot Learning

Yun Li, Zhe Liu, Hang Chen et al.

CVPR 2024arXiv:2402.17251

citations

#3312

Diffusion Time-step Curriculum for One Image to 3D Generation

YI Xuanyu, Zike Wu, Qingshan Xu et al.

CVPR 2024arXiv:2404.04562

citations

#3313

Stochastic Localization via Iterative Posterior Sampling

Louis Grenioux, Maxence Noble, Marylou Gabrié et al.

ICML 2024spotlightarXiv:2402.10758

citations

#3314

Whole-Song Hierarchical Generation of Symbolic Music Using Cascaded Diffusion Models

Ziyu Wang, Lejun Min, Gus Xia

ICLR 2024spotlightarXiv:2405.09901

citations

#3315

VQCNIR: Clearer Night Image Restoration with Vector-Quantized Codebook

Wenbin Zou, Hongxia Gao, Tian Ye et al.

AAAI 2024paperarXiv:2312.08606

citations

#3316

Permutation Equivariance of Transformers and Its Applications

Hengyuan Xu, Liyao Xiang, Hangyu Ye et al.

CVPR 2024arXiv:2304.07735

citations

#3317

Dual Associated Encoder for Face Restoration

Yu-Ju Tsai, Yu-Lun Liu, Lu Qi et al.

ICLR 2024arXiv:2308.07314

citations

#3318

Would Deep Generative Models Amplify Bias in Future Models?

Tianwei Chen, Yusuke Hirota, Mayu Otani et al.

CVPR 2024arXiv:2404.03242

citations

#3319

StableSSM: Alleviating the Curse of Memory in State-space Models through Stable Reparameterization

Shida Wang, Qianxiao Li

ICML 2024arXiv:2311.14495

citations

#3320

Quick-Tune: Quickly Learning Which Pretrained Model to Finetune and How

Sebastian Pineda Arango, Fabio Ferreira, Arlind Kadra et al.

ICLR 2024arXiv:2306.03828

citations

#3321

Mask2Map: Vectorized HD Map Construction Using Bird's Eye View Segmentation Masks

Sehwan Choi, Jun Won Choi, JUNGHO KIM et al.

ECCV 2024arXiv:2407.13517

citations

#3322

Critical windows: non-asymptotic theory for feature emergence in diffusion models

Marvin Li, Sitan Chen

ICML 2024arXiv:2403.01633

citations

#3323

FastMAC: Stochastic Spectral Sampling of Correspondence Graph

Yifei Zhang, Hao Zhao, Hongyang Li et al.

CVPR 2024arXiv:2403.08770

citations

#3324

Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation

Can Yaras, Peng Wang, Laura Balzano et al.

ICML 2024arXiv:2406.04112

citations

#3325

Do Large Language Models Perform the Way People Expect? Measuring the Human Generalization Function

Keyon Vafa, Ashesh Rambachan, Sendhil Mullainathan

ICML 2024arXiv:2406.01382

citations

#3326

Doubly Abductive Counterfactual Inference for Text-based Image Editing

Xue Song, Jiequan Cui, Hanwang Zhang et al.

CVPR 2024arXiv:2403.02981

citations

#3327

Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences

Minyoung Hwang, Luca Weihs, Chanwoo Park et al.

CVPR 2024arXiv:2312.09337

citations

#3328

Efficient Modulation for Vision Networks

Xu Ma, Xiyang Dai, Jianwei Yang et al.

ICLR 2024arXiv:2403.19963

citations

#3329

Denoising Diffusion Step-aware Models

Shuai Yang, Yukang Chen, Luozhou WANG et al.

ICLR 2024arXiv:2310.03337

citations

#3330

GeoCalib: Learning Single-image Calibration with Geometric Optimization

Alexander Veicht, Paul-Edouard Sarlin, Philipp Lindenberger et al.

ECCV 2024arXiv:2409.06704

citations

#3331

Improving protein optimization with smoothed fitness landscapes

Andrew Kirjner, Jason Yim, Raman Samusevich et al.

ICLR 2024arXiv:2307.00494

citations

#3332

On the Duality Between Sharpness-Aware Minimization and Adversarial Training

Yihao Zhang, Hangzhou He, Jingyu Zhu et al.

ICML 2024arXiv:2402.15152

citations

#3333

FLD: Fourier Latent Dynamics for Structured Motion Representation and Learning

Chenhao Li, Elijah Stanger-Jones, Steve Heim et al.

ICLR 2024oralarXiv:2402.13820

citations

#3334

Accelerated Algorithms for Constrained Nonconvex-Nonconcave Min-Max Optimization and Comonotone Inclusion

Yang Cai, Argyris Oikonomou, Weiqiang Zheng

ICML 2024arXiv:2206.05248

citations

#3335

MTKD: Multi-Teacher Knowledge Distillation for Image Super-Resolution

Yuxuan Jiang, Chen Feng, Fan Zhang et al.

ECCV 2024arXiv:2404.09571

citations

#3336

EarnHFT: Efficient Hierarchical Reinforcement Learning for High Frequency Trading

Molei Qin, Shuo Sun, Wentao Zhang et al.

AAAI 2024paperarXiv:2309.12891

citations

#3337

Contrastive Pre-Training with Multi-View Fusion for No-Reference Point Cloud Quality Assessment

Ziyu Shan, Yujie Zhang, Qi Yang et al.

CVPR 2024arXiv:2403.10066

citations

#3338

SI-MIL: Taming Deep MIL for Self-Interpretability in Gigapixel Histopathology

Saarthak Kapse, Pushpak Pati, Srijan Das et al.

CVPR 2024arXiv:2312.15010

citations

#3339

TACTiS-2: Better, Faster, Simpler Attentional Copulas for Multivariate Time Series

Arjun Ashok, Étienne Marcotte, Valentina Zantedeschi et al.

ICLR 2024arXiv:2310.01327

citations

#3340

Noise Map Guidance: Inversion with Spatial Context for Real Image Editing

Hansam Cho, Jonghyun Lee, Seoung Bum Kim et al.

ICLR 2024arXiv:2402.04625

citations

#3341

Efficient Deweahter Mixture-of-Experts with Uncertainty-Aware Feature-Wise Linear Modulation

Rongyu Zhang, Yulin Luo, Jiaming Liu et al.

AAAI 2024paper

citations

#3342

LLMGA: Multimodal Large Language Model based Generation Assistant

Bin Xia, Shiyin Wang, Yingfan Tao et al.

ECCV 2024arXiv:2311.16500

citations

#3343

Bayes Conditional Distribution Estimation for Knowledge Distillation Based on Conditional Mutual Information

Linfeng Ye, Shayan Mohajer Hamidi, Renhao Tan et al.

ICLR 2024arXiv:2401.08732

citations

#3344

Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion

Linzhan Mou, Jun-Kun Chen, Yu-Xiong Wang

CVPR 2024arXiv:2406.09402

citations

#3345

Empirical Analysis of Model Selection for Heterogeneous Causal Effect Estimation

Divyat Mahajan, Ioannis Mitliagkas, Brady Neal et al.

ICLR 2024spotlightarXiv:2211.01939

citations

#3346

High-Probability Convergence for Composite and Distributed Stochastic Minimization and Variational Inequalities with Heavy-Tailed Noise

Eduard Gorbunov, Abdurakhmon Sadiev, Marina Danilova et al.

ICML 2024arXiv:2310.01860

citations

#3347

In-Context Reinforcement Learning for Variable Action Spaces

Viacheslav Sinii, Alexander Nikulin, Vladislav Kurenkov et al.

ICML 2024arXiv:2312.13327

citations

#3348

Task-Agnostic Privacy-Preserving Representation Learning for Federated Learning against Attribute Inference Attacks

Caridad Arroyo Arevalo, Sayedeh Leila Noorbakhsh, Yun Dong et al.

AAAI 2024paperarXiv:2312.06989

citations

#3349

SCoFT: Self-Contrastive Fine-Tuning for Equitable Image Generation

Zhixuan Liu, Peter Schaldenbrand, Beverley-Claire Okogwu et al.

CVPR 2024arXiv:2401.08053

citations

#3350

Towards Certified Unlearning for Deep Neural Networks

Binchi Zhang, Yushun Dong, Tianhao Wang et al.

ICML 2024arXiv:2408.00920

citations

#3351

TrojVLM: Backdoor Attack Against Vision Language Models

Weimin Lyu, Lu Pang, Tengfei Ma et al.

ECCV 2024arXiv:2409.19232

citations

#3352

Learning without Exact Guidance: Updating Large-scale High-resolution Land Cover Maps from Low-resolution Historical Labels

Zhuohong Li, Wei He, Jiepan Li et al.

CVPR 2024highlightarXiv:2403.02746

citations

#3353

Unifying Bayesian Flow Networks and Diffusion Models through Stochastic Differential Equations

Kaiwen Xue, Yuhao Zhou, Shen Nie et al.

ICML 2024arXiv:2404.15766

citations

#3354

SparseLIF: High-Performance Sparse LiDAR-Camera Fusion for 3D Object Detection

Hongcheng Zhang, Liu Liang, Pengxin Zeng et al.

ECCV 2024arXiv:2403.07284

citations

#3355

The Entropy Enigma: Success and Failure of Entropy Minimization

Ori Press, Ravid Shwartz-Ziv, Yann LeCun et al.

ICML 2024arXiv:2405.05012

citations

#3356

MoDE: CLIP Data Experts via Clustering

Jiawei Ma, Po-Yao Huang, Saining Xie et al.

CVPR 2024arXiv:2404.16030

citations

#3357

Unifying Top-down and Bottom-up Scanpath Prediction Using Transformers

Zhibo Yang, Sounak Mondal, Seoyoung Ahn et al.

CVPR 2024arXiv:2303.09383

citations

#3358

Improving Agent Behaviors with RL Fine-tuning for Autonomous Driving

Zhenghao Peng, Wenjie Luo, Yiren Lu et al.

ECCV 2024arXiv:2409.18343

citations

#3359

Enhancing Vectorized Map Perception with Historical Rasterized Maps

Xiaoyu Zhang, Guangwei Liu, Zihao Liu et al.

ECCV 2024arXiv:2409.00620

citations

#3360

A Dual Stealthy Backdoor: From Both Spatial and Frequency Perspectives

Yudong Gao, Honglong Chen, Peng Sun et al.

AAAI 2024paperarXiv:2307.10184

citations

#3361

CLIM: Contrastive Language-Image Mosaic for Region Representation

Size Wu, Wenwei Zhang, Lumin XU et al.

AAAI 2024paperarXiv:2312.11376

citations

#3362

An Empirical Study of Realized GNN Expressiveness

Yanbo Wang, Muhan Zhang

ICML 2024arXiv:2304.07702

citations

#3363

Curriculum Point Prompting for Weakly-Supervised Referring Image Segmentation

Qiyuan Dai, Sibei Yang

CVPR 2024arXiv:2404.11998

citations

#3364

Improved Visual Grounding through Self-Consistent Explanations

Ruozhen He, Paola Cascante-Bonilla, Ziyan Yang et al.

CVPR 2024arXiv:2312.04554

citations

#3365

Multi-Modal Proxy Learning Towards Personalized Visual Multiple Clustering

Jiawei Yao, Qi Qian, Juhua Hu

CVPR 2024arXiv:2404.15655

citations

#3366

NetTrack: Tracking Highly Dynamic Objects with a Net

Guangze Zheng, Shijie Lin, Haobo Zuo et al.

CVPR 2024arXiv:2403.11186

citations

#3367

MUSES: The Multi-Sensor Semantic Perception Dataset for Driving under Uncertainty

Tim Broedermann, David Brüggemann, Christos Sakaridis et al.

ECCV 2024arXiv:2401.12761

citations

#3368

Small Scale Data-Free Knowledge Distillation

He Liu, Yikai Wang, Huaping Liu et al.

CVPR 2024arXiv:2406.07876

citations

#3369

Bad Students Make Great Teachers: Active Learning Accelerates Large-Scale Visual Understanding

Talfan Evans, Shreya Pathak, Hamza Merzic et al.

ECCV 2024arXiv:2312.05328

citations

#3370

Denoising Task Routing for Diffusion Models

Byeongjun Park, Sangmin Woo, Hyojun Go et al.

ICLR 2024arXiv:2310.07138

citations

#3371

SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation

Junyan Ye, Qiyan Luo, Jinhua Yu et al.

CVPR 2024highlightarXiv:2404.02638

citations

#3372

Accelerating Parallel Sampling of Diffusion Models

Zhiwei Tang, Jiasheng Tang, Hao Luo et al.

ICML 2024arXiv:2402.09970

citations

#3373

SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow

Yuanzhi Zhu, Xingchao Liu, Qiang Liu

ECCV 2024arXiv:2407.12718

citations

#3374

Provably Robust Conformal Prediction with Improved Efficiency

Ge Yan, Yaniv Romano, Tsui-Wei Weng

ICLR 2024arXiv:2404.19651

citations

#3375

360+x: A Panoptic Multi-modal Scene Understanding Dataset

Hao Chen, Yuqi Hou, Chenyuan Qu et al.

CVPR 2024arXiv:2404.00989

citations

#3376

Isomorphic Pruning for Vision Models

Gongfan Fang, Xinyin Ma, Michael Bi Mi et al.

ECCV 2024arXiv:2407.04616

citations

#3377

SQ-LLaVA: Self-Questioning for Large Vision-Language Assistant

Guohao Sun, Can Qin, JIAMINAN WANG et al.

ECCV 2024arXiv:2403.11299

citations

#3378

Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation

Jihyun Kim, Changjae Oh, Hoseok Do et al.

CVPR 2024arXiv:2405.04356

citations

#3379

Do Language Models Exhibit the Same Cognitive Biases in Problem Solving as Human Learners?

Andreas Opedal, Alessandro Stolfo, Haruki Shirakami et al.

ICML 2024arXiv:2401.18070

citations

#3380

Matrix Information Theory for Self-Supervised Learning

Yifan Zhang, Zhiquan Tan, Jingqin Yang et al.

ICML 2024arXiv:2305.17326

citations

#3381

Unleashing the Power of Prompt-driven Nucleus Instance Segmentation

Zhongyi Shui, Yunlong Zhang, Kai Yao et al.

ECCV 2024arXiv:2311.15939

citations

#3382

Score Distillation Sampling with Learned Manifold Corrective

Thiemo Alldieck, Nikos Kolotouros, Cristian Sminchisescu

ECCV 2024arXiv:2401.05293

citations

#3383

Position: Why We Must Rethink Empirical Research in Machine Learning

Moritz Herrmann, F. Julian D. Lange, Katharina Eggensperger et al.

ICML 2024arXiv:2405.02200

citations

#3384

ID-Blau: Image Deblurring by Implicit Diffusion-based reBLurring AUgmentation

Jia-Hao Wu, Fu-Jen Tsai, Yan-Tsung Peng et al.

CVPR 2024arXiv:2312.10998

citations

#3385

Context-Aware Meta-Learning

Christopher Fifty, Dennis Duan, Ronald Junkins et al.

ICLR 2024arXiv:2310.10971

citations

#3386

EgoPoser: Robust Real-Time Egocentric Pose Estimation from Sparse and Intermittent Observations Everywhere

Jiaxi Jiang, Paul Streli, Manuel Meier et al.

ECCV 2024arXiv:2308.06493

citations

#3387

Causality-inspired Discriminative Feature Learning in Triple Domains for Gait Recognition

Haijun Xiong, Bin Feng, Xinggang Wang et al.

ECCV 2024arXiv:2407.12519

citations

#3388

Cascade Prompt Learning for Visual-Language Model Adaptation

Ge Wu, Xin Zhang, Zheng Li et al.

ECCV 2024

citations

#3389

Adaptive Horizon Actor-Critic for Policy Learning in Contact-Rich Differentiable Simulation

Ignat Georgiev, Krishnan Srinivasan, Jie Xu et al.

ICML 2024arXiv:2405.17784

citations

#3390

AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer

Zhuguanyu Wu, Jiaxin Chen, Hanwen Zhong et al.

ECCV 2024arXiv:2407.12951

citations

#3391

ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations

Maitreya Patel, Changhoon Kim, Sheng Cheng et al.

CVPR 2024arXiv:2312.04655

citations

#3392

Self-supervised Feature Adaptation for 3D Industrial Anomaly Detection

Yuanpeng Tu, Boshen Zhang, Liang Liu et al.

ECCV 2024arXiv:2401.03145

citations

#3393

Self-Correcting Self-Consuming Loops for Generative Model Training

Nate Gillman, Michael Freeman, Daksh Aggarwal et al.

ICML 2024arXiv:2402.07087

citations

#3394

Training-Free Pretrained Model Merging

Zhengqi Xu, Ke Yuan, Huiqiong Wang et al.

CVPR 2024arXiv:2403.01753

citations

#3395

Log Neural Controlled Differential Equations: The Lie Brackets Make A Difference

Benjamin Walker, Andrew McLeod, Tiexin QIN et al.

ICML 2024arXiv:2402.18512

citations

#3396

LISO: Lidar-only Self-Supervised 3D Object Detection

Stefan Baur, Frank Moosmann, Andreas Geiger

ECCV 2024arXiv:2403.07071

citations

#3397

HoloVIC: Large-scale Dataset and Benchmark for Multi-Sensor Holographic Intersection and Vehicle-Infrastructure Cooperative

CONG MA, Qiao Lei, Chengkai Zhu et al.

CVPR 2024arXiv:2403.02640

citations

#3398

Fast Diffusion-Based Counterfactuals for Shortcut Removal and Generation

Nina Weng, Paraskevas Pegios, Eike Petersen et al.

ECCV 2024arXiv:2312.14223

citations

#3399

Text-Conditioned Resampler For Long Form Video Understanding

Bruno Korbar, Yongqin Xian, Alessio Tonioni et al.

ECCV 2024arXiv:2312.11897

citations

#3400

AMU-Tuning: Effective Logit Bias for CLIP-based Few-shot Learning

Yuwei Tang, ZhenYi Lin, Qilong Wang et al.

CVPR 2024arXiv:2404.08958

citations

← Previous

1...15 16 17 18 19...62