Most Cited 2024 "dense matching" Papers

12,324 papers found • Page 21 of 62

#4001

Gaussian Processes on Cellular Complexes

Mathieu Alain, So Takao, Brooks Paige et al.

ICML 2024arXiv:2311.01198
20
citations
#4002

Upper Bounding Barlow Twins: A Novel Filter for Multi-Relational Clustering

Xiaowei Qian, Bingheng Li, Zhao Kang

AAAI 2024paperarXiv:2312.14066
20
citations
#4003

Cross-Platform Video Person ReID: A New Benchmark Dataset and Adaptation Approach

Shizhou Zhang, Wenlong Luo, De Cheng et al.

ECCV 2024arXiv:2408.07500
20
citations
#4004

MoST: Multi-Modality Scene Tokenization for Motion Prediction

Norman Mu, Jingwei Ji, Zhenpei Yang et al.

CVPR 2024arXiv:2404.19531
20
citations
#4005

Symmetry Induces Structure and Constraint of Learning

Liu Ziyin

ICML 2024arXiv:2309.16932
20
citations
#4006

Correspondence-Free Non-Rigid Point Set Registration Using Unsupervised Clustering Analysis

Mingyang Zhao, Jiang Jingen, Lei Ma et al.

CVPR 2024highlightarXiv:2406.18817
20
citations
#4007

Is Temperature Sample Efficient for Softmax Gaussian Mixture of Experts?

Huy Nguyen, Pedram Akbarian, Nhat Ho

ICML 2024arXiv:2401.13875
20
citations
#4008

BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model

song yiran, Qianyu Zhou, Xiangtai Li et al.

CVPR 2024arXiv:2401.02317
20
citations
#4009

BEVSpread: Spread Voxel Pooling for Bird’s-Eye-View Representation in Vision-based Roadside 3D Object Detection

Wenjie Wang, Yehao Lu, Guangcong Zheng et al.

CVPR 2024arXiv:2406.08785
20
citations
#4010

SpikeZIP-TF: Conversion is All You Need for Transformer-based SNN

kang you, Zekai Xu, Chen Nie et al.

ICML 2024arXiv:2406.03470
20
citations
#4011

Pre-training Sequence, Structure, and Surface Features for Comprehensive Protein Representation Learning

Youhan Lee, Hasun Yu, Jaemyung Lee et al.

ICLR 2024
20
citations
#4012

Conformal Autoregressive Generation: Beam Search with Coverage Guarantees

Nicolas Deutschmann, Marvin Alberts, María Rodríguez Martínez

AAAI 2024paperarXiv:2309.03797
20
citations
#4013

StableMask: Refining Causal Masking in Decoder-only Transformer

Qingyu Yin, Xuzheng He, Xiang Zhuang et al.

ICML 2024arXiv:2402.04779
20
citations
#4014

Searching for High-Value Molecules Using Reinforcement Learning and Transformers

Raj Ghugare, Santiago Miret, Adriana Hugessen et al.

ICLR 2024arXiv:2310.02902
20
citations
#4015

Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation

Yunheng Li, Zhong-Yu Li, Quan-Sheng Zeng et al.

ICML 2024arXiv:2406.00670
20
citations
#4016

Improving Transferable Targeted Adversarial Attacks with Model Self-Enhancement

Han Wu, Guanyan Ou, Weibin Wu et al.

CVPR 2024
20
citations
#4017

Making Vision Transformers Truly Shift-Equivariant

Renan A. Rojas-Gomez, Teck-Yian Lim, Minh Do et al.

CVPR 2024arXiv:2305.16316
20
citations
#4018

Decentralized Directed Collaboration for Personalized Federated Learning

Yingqi Liu, Yifan Shi, Qinglun Li et al.

CVPR 2024arXiv:2405.17876
20
citations
#4019

Investigating the Benefits of Projection Head for Representation Learning

Yihao Xue, Eric Gan, Jiayi Ni et al.

ICLR 2024arXiv:2403.11391
20
citations
#4020

MoMo: Momentum Models for Adaptive Learning Rates

Fabian Schaipp, Ruben Ohana, Michael Eickenberg et al.

ICML 2024arXiv:2305.07583
20
citations
#4021

ConGeo: Robust Cross-view Geo-localization across Ground View Variations

Li Mi, Chang Xu, Javiera Castillo Navarro et al.

ECCV 2024arXiv:2403.13965
20
citations
#4022

Weakly-Supervised Emotion Transition Learning for Diverse 3D Co-speech Gesture Generation

Xingqun Qi, Jiahao Pan, Peng Li et al.

CVPR 2024arXiv:2311.17532
20
citations
#4023

Unsupervised Semantic Segmentation Through Depth-Guided Feature Correlation and Sampling

Leon Sick, Dominik Engel, Pedro Hermosilla et al.

CVPR 2024arXiv:2309.12378
20
citations
#4024

Diffusion for Natural Image Matting

Yihan Hu, Yiheng Lin, Wei Wang et al.

ECCV 2024arXiv:2312.05915
20
citations
#4025

HandDiff: 3D Hand Pose Estimation with Diffusion on Image-Point Cloud

WENCAN CHENG, Hao Tang, Luc Van Gool et al.

CVPR 2024highlightarXiv:2404.03159
20
citations
#4026

Diff-Tracker: Text-to-Image Diffusion Models are Unsupervised Trackers

Zhengbo Zhang, Li Xu, Duo Peng et al.

ECCV 2024arXiv:2407.08394
20
citations
#4027

As-Plausible-As-Possible: Plausibility-Aware Mesh Deformation Using 2D Diffusion Priors

Seungwoo Yoo, Kunho Kim, Vladimir G. Kim et al.

CVPR 2024arXiv:2311.16739
19
citations
#4028

Decentralized Monte Carlo Tree Search for Partially Observable Multi-Agent Pathfinding

Alexey Skrynnik, Anton Andreychuk, Konstantin Yakovlev et al.

AAAI 2024paperarXiv:2312.15908
19
citations
#4029

DreamComposer: Controllable 3D Object Generation via Multi-View Conditions

Yunhan Yang, Yukun Huang, Xiaoyang Wu et al.

CVPR 2024arXiv:2312.03611
19
citations
#4030

NViST: In the Wild New View Synthesis from a Single Image with Transformers

Wonbong Jang, Lourdes Agapito

CVPR 2024arXiv:2312.08568
19
citations
#4031

Explorations of Self-Repair in Language Models

Cody Rushing, Neel Nanda

ICML 2024arXiv:2402.15390
19
citations
#4032

CLIP-MUSED: CLIP-Guided Multi-Subject Visual Neural Information Semantic Decoding

Qiongyi Zhou, Changde Du, Shengpei Wang et al.

ICLR 2024arXiv:2402.08994
19
citations
#4033

HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras

Zhongyu Xia, ZhiWei Lin, Xinhao Wang et al.

ECCV 2024arXiv:2404.02517
19
citations
#4034

SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding

Weitai Kang, Gaowen Liu, Shah Mubarak et al.

ECCV 2024arXiv:2407.03200
19
citations
#4035

PhysPT: Physics-aware Pretrained Transformer for Estimating Human Dynamics from Monocular Videos

Yufei Zhang, Jeffrey Kephart, Zijun Cui et al.

CVPR 2024arXiv:2404.04430
19
citations
#4036

StructLDM: Structured Latent Diffusion for 3D Human Generation

Tao Hu, Fangzhou Hong, Ziwei Liu

ECCV 2024arXiv:2404.01241
19
citations
#4037

DriveTrack: A Benchmark for Long-Range Point Tracking in Real-World Videos

Arjun Balasingam, Joseph Chandler, Chenning Li et al.

CVPR 2024arXiv:2312.09523
19
citations
#4038

WebVLN: Vision-and-Language Navigation on Websites

Qi Chen, Dileepa Pitawela, Chongyang Zhao et al.

AAAI 2024paperarXiv:2312.15820
19
citations
#4039

EAT: Towards Long-Tailed Out-of-Distribution Detection

Tong Wei, Bo-Lin Wang, Min-Ling Zhang

AAAI 2024paperarXiv:2312.08939
19
citations
#4040

Trustless Audits without Revealing Data or Models

Suppakit Waiwitlikhit, Ion Stoica, Yi Sun et al.

ICML 2024arXiv:2404.04500
19
citations
#4041

PDHG-Unrolled Learning-to-Optimize Method for Large-Scale Linear Programming

Bingheng Li, Linxin Yang, Yupeng Chen et al.

ICML 2024arXiv:2406.01908
19
citations
#4042

Overload: Latency Attacks on Object Detection for Edge Devices

Erh-Chung Chen, Pin-Yu Chen, I-Hsin Chung et al.

CVPR 2024arXiv:2304.05370
19
citations
#4043

On the Over-Memorization During Natural, Robust and Catastrophic Overfitting

Runqi Lin, Chaojian Yu, Bo Han et al.

ICLR 2024arXiv:2310.08847
19
citations
#4044

Privileged Sensing Scaffolds Reinforcement Learning

Edward Hu, James Springer, Oleh Rybkin et al.

ICLR 2024spotlightarXiv:2405.14853
19
citations
#4045

Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive

Yumeng Li, Margret Keuper, Dan Zhang et al.

ICLR 2024arXiv:2401.08815
19
citations
#4046

CHEMREASONER: Heuristic Search over a Large Language Model’s Knowledge Space using Quantum-Chemical Feedback

Henry W. Sprueill, Carl Edwards, Khushbu Agarwal et al.

ICML 2024arXiv:2402.10980
19
citations
#4047

Transformer-Based Selective Super-resolution for Efficient Image Refinement

Tianyi Zhang, Kishore Kasichainula, Yaoxin Zhuo et al.

AAAI 2024paperarXiv:2312.05803
19
citations
#4048

The Joint Effect of Task Similarity and Overparameterization on Catastrophic Forgetting — An Analytical Model

Daniel Goldfarb, Itay Evron, Nir Weinberger et al.

ICLR 2024arXiv:2401.12617
19
citations
#4049

Driving-Video Dehazing with Non-Aligned Regularization for Safety Assistance

Junkai Fan, Jiangwei Weng, Kun Wang et al.

CVPR 2024arXiv:2405.09996
19
citations
#4050

Bounds on Representation-Induced Confounding Bias for Treatment Effect Estimation

Valentyn Melnychuk, Dennis Frauen, Stefan Feuerriegel

ICLR 2024spotlightarXiv:2311.11321
19
citations
#4051

HO-Gaussian: Hybrid Optimization of 3D Gaussian Splatting for Urban Scenes

Zhuopeng Li, Yilin Zhang, Chenming Wu et al.

ECCV 2024arXiv:2403.20032
19
citations
#4052

Exemplar-free Continual Representation Learning via Learnable Drift Compensation

Alex Gomez-Villa, Dipam Goswami, Kai Wang et al.

ECCV 2024arXiv:2407.08536
19
citations
#4053

Diffusion-Driven Data Replay: A Novel Approach to Combat Forgetting in Federated Class Continual Learning

Jinglin Liang, Jin Zhong, Hanlin Gu et al.

ECCV 2024arXiv:2409.01128
19
citations
#4054

Parameter Efficient Fine-tuning via Cross Block Orchestration for Segment Anything Model

Zelin Peng, Zhengqin Xu, Zhilin Zeng et al.

CVPR 2024arXiv:2311.17112
19
citations
#4055

PromptCCD: Learning Gaussian Mixture Prompt Pool for Continual Category Discovery

Fernando Julio Cendra, Bingchen Zhao, Kai Han

ECCV 2024arXiv:2407.19001
19
citations
#4056

Any2Point: Empowering Any-modality Transformers for Efficient 3D Understanding

YIWEN TANG, Renrui Zhang, Jiaming Liu et al.

ECCV 2024
19
citations
#4057

Fair-VPT: Fair Visual Prompt Tuning for Image Classification

Sungho Park, Hyeran Byun

CVPR 2024
19
citations
#4058

Identifiable Latent Polynomial Causal Models through the Lens of Change

Yuhang Liu, Zhen Zhang, Dong Gong et al.

ICLR 2024arXiv:2310.15580
19
citations
#4059

Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data

Yuhui Zhang, Elaine Sui, Serena Yeung

ICLR 2024arXiv:2401.08567
19
citations
#4060

Improved Generalization of Weight Space Networks via Augmentations

Aviv Shamsian, Aviv Navon, David Zhang et al.

ICML 2024arXiv:2402.04081
19
citations
#4061

Adversarial Adaptive Sampling: Unify PINN and Optimal Transport for the Approximation of PDEs

Kejun Tang, Jiayu Zhai, Xiaoliang Wan et al.

ICLR 2024arXiv:2305.18702
19
citations
#4062

Position: Leverage Foundational Models for Black-Box Optimization

Xingyou Song, Yingtao Tian, Robert Lange et al.

ICML 2024arXiv:2405.03547
19
citations
#4063

MoST: Motion Style Transformer Between Diverse Action Contents

Boeun Kim, Jungho Kim, Hyung Jin Chang et al.

CVPR 2024arXiv:2403.06225
19
citations
#4064

UniTalker: Scaling up Audio-Driven 3D Facial Animation through A Unified Model

Xiangyu Fan, Jiaqi Li, Zhiqian Lin et al.

ECCV 2024arXiv:2408.00762
19
citations
#4065

Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models

Ruibin Li, Ruihuang Li, Song Guo et al.

ECCV 2024arXiv:2403.11105
19
citations
#4066

Holoported Characters: Real-time Free-viewpoint Rendering of Humans from Sparse RGB Cameras

Ashwath Shetty, Marc Habermann, Guoxing Sun et al.

CVPR 2024arXiv:2312.07423
19
citations
#4067

Isometric Representation Learning for Disentangled Latent Space of Diffusion Models

Jaehoon Hahm, Junho Lee, Sunghyun Kim et al.

ICML 2024arXiv:2407.11451
19
citations
#4068

Bongard-OpenWorld: Few-Shot Reasoning for Free-form Visual Concepts in the Real World

Rujie Wu, Xiaojian Ma, Zhenliang Zhang et al.

ICLR 2024arXiv:2310.10207
19
citations
#4069

Patch-Wise Graph Contrastive Learning for Image Translation

Chanyong Jung, Gihyun Kwon, Jong Chul Ye

AAAI 2024paperarXiv:2312.08223
19
citations
#4070

PSC-CPI: Multi-Scale Protein Sequence-Structure Contrasting for Efficient and Generalizable Compound-Protein Interaction Prediction

Lirong Wu, Yufei Huang, Cheng Tan et al.

AAAI 2024paperarXiv:2402.08198
19
citations
#4071

Position: Optimization in SciML Should Employ the Function Space Geometry

Johannes Müller, Marius Zeinhofer

ICML 2024arXiv:2402.07318
19
citations
#4072

Learning Continuous 3D Words for Text-to-Image Generation

Ta-Ying Cheng, Matheus Gadelha, Thibault Groueix et al.

CVPR 2024arXiv:2402.08654
19
citations
#4073

Adaptive Bounding Box Uncertainties via Two-Step Conformal Prediction

Alexander Timans, Christoph-Nikolas Straehle, Kaspar Sakmann et al.

ECCV 2024arXiv:2403.07263
19
citations
#4074

Towards Category Unification of 3D Single Object Tracking on Point Clouds

Jiahao Nie, Zhiwei He, Xudong Lv et al.

ICLR 2024arXiv:2401.11204
19
citations
#4075

Information Bottleneck Analysis of Deep Neural Networks via Lossy Compression

Ivan Butakov, Aleksandr Tolmachev, Sofia Malanchuk et al.

ICLR 2024arXiv:2305.08013
19
citations
#4076

FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion

Zehan Wang, Ziang Zhang, xize cheng et al.

ICML 2024arXiv:2405.04883
19
citations
#4077

DSGG: Dense Relation Transformer for an End-to-end Scene Graph Generation

Zeeshan Hayder, Xuming He

CVPR 2024arXiv:2403.14886
19
citations
#4078

Prompting a Pretrained Transformer Can Be a Universal Approximator

Aleksandar Petrov, Phil Torr, Adel Bibi

ICML 2024arXiv:2402.14753
19
citations
#4079

Explain via Any Concept: Concept Bottleneck Model with Open Vocabulary Concepts

Andong Tan, Fengtao Zhou, Hao Chen

ECCV 2024arXiv:2408.02265
19
citations
#4080

HOISDF: Constraining 3D Hand-Object Pose Estimation with Global Signed Distance Fields

Haozhe Qi, Chen Zhao, Mathieu Salzmann et al.

CVPR 2024arXiv:2402.17062
19
citations
#4081

Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models

Yang Zhang, Tze Tzun Teoh, Wei Hern Lim et al.

ECCV 2024arXiv:2403.06381
19
citations
#4082

Discriminability-Driven Channel Selection for Out-of-Distribution Detection

Yue Yuan, Rundong He, Yicong Dong et al.

CVPR 2024
19
citations
#4083

Liouville Flow Importance Sampler

Yifeng Tian, Nishant Panda, Yen Ting Lin

ICML 2024arXiv:2405.06672
19
citations
#4084

Temporally and Distributionally Robust Optimization for Cold-Start Recommendation

Xinyu Lin, Wenjie Wang, Jujia Zhao et al.

AAAI 2024paperarXiv:2312.09901
19
citations
#4085

Image2Sentence based Asymmetrical Zero-shot Composed Image Retrieval

Yongchao Du, Min Wang, Wengang Zhou et al.

ICLR 2024spotlightarXiv:2403.01431
19
citations
#4086

Segment Every Out-of-Distribution Object

Wenjie Zhao, Jia Li, Xin Dong et al.

CVPR 2024arXiv:2311.16516
19
citations
#4087

InterFusion: Text-Driven Generation of 3D Human-Object Interaction

Sisi Dai, Wenhao Li, Haowen Sun et al.

ECCV 2024arXiv:2403.15612
19
citations
#4088

RPSC: Robust Pseudo-Labeling for Semantic Clustering

Sihang Liu, Wenming Cao, Ruigang Fu et al.

AAAI 2024paper
19
citations
#4089

Window Attention is Bugged: How not to Interpolate Position Embeddings

Daniel Bolya, Chaitanya Ryali, Judy Hoffman et al.

ICLR 2024arXiv:2311.05613
19
citations
#4090

ZeroI2V: Zero-Cost Adaptation of Pre-Trained Transformers from Image to Video

Xinhao Li, Yuhan Zhu, Limin Wang

ECCV 2024arXiv:2310.01324
19
citations
#4091

MINDE: Mutual Information Neural Diffusion Estimation

Giulio Franzese, Mustapha BOUNOUA, Pietro Michiardi

ICLR 2024arXiv:2310.09031
19
citations
#4092

From Isolated Islands to Pangea: Unifying Semantic Space for Human Action Understanding

Yonglu Li, Xiaoqian Wu, Xinpeng Liu et al.

CVPR 2024highlightarXiv:2304.00553
19
citations
#4093

DreamSampler: Unifying Diffusion Sampling and Score Distillation for Image Manipulation

Jeongsol Kim, Geon Yeong Park, Jong Chul Ye

ECCV 2024arXiv:2403.11415
19
citations
#4094

M-BEV: Masked BEV Perception for Robust Autonomous Driving

Siran Chen, Yue Ma, Yu Qiao et al.

AAAI 2024paperarXiv:2312.12144
19
citations
#4095

Constructing and Exploring Intermediate Domains in Mixed Domain Semi-supervised Medical Image Segmentation

Qinghe Ma, Jian Zhang, Lei Qi et al.

CVPR 2024arXiv:2404.08951
19
citations
#4096

FreestyleRet: Retrieving Images from Style-Diversified Queries

Hao Li, Yanhao Jia, Peng Jin et al.

ECCV 2024arXiv:2312.02428
19
citations
#4097

Demystifying Embedding Spaces using Large Language Models

Guy Tennenholtz, Yinlam Chow, ChihWei Hsu et al.

ICLR 2024arXiv:2310.04475
19
citations
#4098

Image Translation as Diffusion Visual Programmers

Cheng Han, James Liang, Qifan Wang et al.

ICLR 2024arXiv:2401.09742
19
citations
#4099

Weighted Envy-Freeness for Submodular Valuations

Luisa Montanari, Ulrike Schmidt-Kraepelin, Warut Suksompong et al.

AAAI 2024paperarXiv:2209.06437
19
citations
#4100

Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback

Yifu Yuan, Jianye HAO, Yi Ma et al.

ICLR 2024arXiv:2402.02423
19
citations
#4101

Shaping Up SHAP: Enhancing Stability through Layer-Wise Neighbor Selection

Gwladys Kelodjou, Laurence Rozé, Véronique Masson et al.

AAAI 2024paperarXiv:2312.12115
19
citations
#4102

Spiking Wavelet Transformer

Yuetong Fang, Ziqing Wang, Lingfeng Zhang et al.

ECCV 2024arXiv:2403.11138
19
citations
#4103

Augmented Commonsense Knowledge for Remote Object Grounding

Bahram Mohammadi, Yicong Hong, Yuankai Qi et al.

AAAI 2024paperarXiv:2406.01256
19
citations
#4104

Constrained Bayesian Optimization under Partial Observations: Balanced Improvements and Provable Convergence

Shengbo Wang, Ke Li

AAAI 2024paperarXiv:2312.03212
19
citations
#4105

OmniNOCS: A unified NOCS dataset and model for 3D lifting of 2D objects

Akshay Krishnan, Abhijit Kundu, Kevis Maninis et al.

ECCV 2024arXiv:2407.08711
19
citations
#4106

What Improves the Generalization of Graph Transformers? A Theoretical Dive into the Self-attention and Positional Encoding

Hongkang Li, Meng Wang, Tengfei Ma et al.

ICML 2024arXiv:2406.01977
19
citations
#4107

Towards Real-world Event-guided Low-light Video Enhancement and Deblurring

Taewoo Kim, Jaeseok Jeong, Hoonhee Cho et al.

ECCV 2024arXiv:2408.14916
19
citations
#4108

GlitchBench: Can Large Multimodal Models Detect Video Game Glitches?

Mohammad Reza Taesiri, Tianjun Feng, Cor-Paul Bezemer et al.

CVPR 2024arXiv:2312.05291
19
citations
#4109

Learning with Language-Guided State Abstractions

Andi Peng, Ilia Sucholutsky, Belinda Li et al.

ICLR 2024arXiv:2402.18759
19
citations
#4110

Good Teachers Explain: Explanation-Enhanced Knowledge Distillation

Amin Parchami, Moritz Böhle, Sukrut Rao et al.

ECCV 2024arXiv:2402.03119
19
citations
#4111

Topological data analysis on noisy quantum computers

Ismail Akhalwaya, Shashanka Ubaru, Kenneth Clarkson et al.

ICLR 2024arXiv:2209.09371
19
citations
#4112

Neuroformer: Multimodal and Multitask Generative Pretraining for Brain Data

Antonis Antoniades, Yiyi Yu, Joe Canzano et al.

ICLR 2024oralarXiv:2311.00136
19
citations
#4113

Principal-Agent Reward Shaping in MDPs

Omer Ben-Porat, Yishay Mansour, Michal Moshkovitz et al.

AAAI 2024paperarXiv:2401.00298
19
citations
#4114

LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models

Yabin Zhang, Wenjie Zhu, Chenhang He et al.

ECCV 2024arXiv:2407.08966
19
citations
#4115

Image Clustering via the Principle of Rate Reduction in the Age of Pretrained Models

Tianzhe Chu, Shengbang Tong, Tianjiao Ding et al.

ICLR 2024arXiv:2306.05272
19
citations
#4116

Attribute-Guided Pedestrian Retrieval: Bridging Person Re-ID with Internal Attribute Variability

Yan Huang, Zhang Zhang, Qiang Wu et al.

CVPR 2024
19
citations
#4117

HAMLET: Graph Transformer Neural Operator for Partial Differential Equations

Andrey Bryutkin, Jiahao Huang, Zhongying Deng et al.

ICML 2024arXiv:2402.03541
19
citations
#4118

LPViT: Low-Power Semi-structured Pruning for Vision Transformers

KAIXIN Xu, Zhe Wang, Chunyun Chen et al.

ECCV 2024arXiv:2407.02068
19
citations
#4119

SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs

Yang Miao, Francis Engelmann, Olga Vysotska et al.

ECCV 2024arXiv:2404.00469
19
citations
#4120

Unified Embedding Alignment for Open-Vocabulary Video Instance Segmentation

Hao Fang, Peng Wu, Yawei Li et al.

ECCV 2024arXiv:2407.07427
19
citations
#4121

Split to Merge: Unifying Separated Modalities for Unsupervised Domain Adaptation

Xinyao Li, Yuke Li, Zhekai Du et al.

CVPR 2024arXiv:2403.06946
19
citations
#4122

Fine-Tuning Graph Neural Networks by Preserving Graph Generative Patterns

Yifei Sun, Qi Zhu, Yang Yang et al.

AAAI 2024paperarXiv:2312.13583
19
citations
#4123

Centering the Value of Every Modality: Towards Efficient and Resilient Modality-agnostic Semantic Segmentation

Xu Zheng, Yuanhuiyi Lyu, jiazhou zhou et al.

ECCV 2024arXiv:2407.11344
19
citations
#4124

Towards Neuro-Symbolic Video Understanding

Minkyu Choi, Harsh Goel, Mohammad Omama et al.

ECCV 2024arXiv:2403.11021
19
citations
#4125

Modality-Agnostic Structural Image Representation Learning for Deformable Multi-Modality Medical Image Registration

Tony C. W. MOK, Zi Li, Yunhao Bai et al.

CVPR 2024highlightarXiv:2402.18933
19
citations
#4126

Minimum Coverage Sets for Training Robust Ad Hoc Teamwork Agents

Arrasy Rahman, Jiaxun Cui, Peter Stone

AAAI 2024paperarXiv:2308.09595
19
citations
#4127

Comparing the Robustness of Modern No-Reference Image- and Video-Quality Metrics to Adversarial Attacks

Anastasia Antsiferova, Khaled Abud, Aleksandr Gushchin et al.

AAAI 2024paperarXiv:2310.06958
19
citations
#4128

Learning from History: Task-agnostic Model Contrastive Learning for Image Restoration

Gang Wu, Junjun Jiang, Kui Jiang et al.

AAAI 2024paperarXiv:2309.06023
19
citations
#4129

Perturbing Attention Gives You More Bang for the Buck: Subtle Imaging Perturbations That Efficiently Fool Customized Diffusion Models

Jingyao Xu, Yuetong Lu, Yandong Li et al.

CVPR 2024arXiv:2404.15081
19
citations
#4130

Leveraging Cross-Modal Neighbor Representation for Improved CLIP Classification

Chao Yi, Lu Ren, De-Chuan Zhan et al.

CVPR 2024arXiv:2404.17753
19
citations
#4131

Diffusion Language-Shapelets for Semi-supervised Time-Series Classification

Zhen Liu, Wenbin Pei, Disen Lan et al.

AAAI 2024paper
19
citations
#4132

CLAMP-ViT: Contrastive Data-Free Learning for Adaptive Post-Training Quantization of ViTs

Akshat Ramachandran, Souvik Kundu, Tushar Krishna

ECCV 2024arXiv:2407.05266
19
citations
#4133

NeRFCodec: Neural Feature Compression Meets Neural Radiance Fields for Memory-Efficient Scene Representation

Sicheng Li, Hao Li, Yiyi Liao et al.

CVPR 2024arXiv:2404.02185
19
citations
#4134

The Unreasonable Effectiveness of Pre-Trained Features for Camera Pose Refinement

Gabriele Trivigno, Carlo Masone, Barbara Caputo et al.

CVPR 2024highlightarXiv:2404.10438
19
citations
#4135

Agile Multi-Source-Free Domain Adaptation

Xinyao Li, Jingjing Li, Fengling Li et al.

AAAI 2024paperarXiv:2403.05062
19
citations
#4136

$\mathcal{B}$-Coder: Value-Based Deep Reinforcement Learning for Program Synthesis

Zishun Yu, Yunzhe Tao, Liyu Chen et al.

ICLR 2024spotlightarXiv:2310.03173
19
citations
#4137

URHand: Universal Relightable Hands

Zhaoxi Chen, Gyeongsik Moon, Kaiwen Guo et al.

CVPR 2024arXiv:2401.05334
19
citations
#4138

Text Image Inpainting via Global Structure-Guided Diffusion Models

Shipeng Zhu, Pengfei Fang, Chenjie Zhu et al.

AAAI 2024paperarXiv:2401.14832
19
citations
#4139

Enhancing Cognitive Diagnosis Using Un-interacted Exercises: A Collaboration-Aware Mixed Sampling Approach

Haiping Ma, Changqian Wang, Hengshu Zhu et al.

AAAI 2024paperarXiv:2312.10110
19
citations
#4140

Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling

Raunaq Bhirangi, Chenyu Wang, Venkatesh Pattabiraman et al.

ICML 2024oralarXiv:2402.10211
19
citations
#4141

LatentEditor: Text Driven Local Editing of 3D Scenes

Umar Khalid, Hasan Iqbal, Muhammad Tayyab et al.

ECCV 2024arXiv:2312.09313
19
citations
#4142

FusionFormer: A Concise Unified Feature Fusion Transformer for 3D Pose Estimation

Yanlu Cai, Weizhong Zhang, Yuan Wu et al.

AAAI 2024paper
19
citations
#4143

Caltech Aerial RGB-Thermal Dataset in the Wild

Connor Lee, Matthew Anderson, Nikhil Ranganathan et al.

ECCV 2024arXiv:2403.08997
19
citations
#4144

Beyond Regular Grids: Fourier-Based Neural Operators on Arbitrary Domains

Levi Lingsch, Mike Yan Michelis, Emmanuel de Bézenac et al.

ICML 2024arXiv:2305.19663
19
citations
#4145

ReGCL: Rethinking Message Passing in Graph Contrastive Learning

Cheng Ji, Zixuan Huang, Qingyun Sun et al.

AAAI 2024paper
19
citations
#4146

Scalable 3D Registration via Truncated Entry-wise Absolute Residuals

Tianyu Huang, Liangzu Peng, Rene Vidal et al.

CVPR 2024arXiv:2404.00915
19
citations
#4147

Equivariance via Minimal Frame Averaging for More Symmetries and Efficiency

Yuchao Lin, Jacob Helwig, Shurui Gui et al.

ICML 2024spotlightarXiv:2406.07598
19
citations
#4148

Osmosis: RGBD Diffusion Prior for Underwater Image Restoration

Opher Bar Nathan, Deborah Steinberger-Levy, Tali Treibitz et al.

ECCV 2024arXiv:2403.14837
19
citations
#4149

Aspect-Based Sentiment Analysis with Explicit Sentiment Augmentations

Jihong Ouyang, Zhiyao Yang, Silong Liang et al.

AAAI 2024paperarXiv:2312.10961
19
citations
#4150

Detecting Machine-Generated Texts by Multi-Population Aware Optimization for Maximum Mean Discrepancy

Shuhai Zhang, Yiliao Song, Jiahao Yang et al.

ICLR 2024arXiv:2402.16041
19
citations
#4151

Protein-ligand binding representation learning from fine-grained interactions

Shikun Feng, Minghao Li, Yinjun JIA et al.

ICLR 2024arXiv:2311.16160
19
citations
#4152

Class Incremental Learning via Likelihood Ratio Based Task Prediction

Haowei Lin, Yijia Shao, Weinan Qian et al.

ICLR 2024arXiv:2309.15048
19
citations
#4153

Text-Enhanced Data-free Approach for Federated Class-Incremental Learning

Minh-Tuan Tran, Trung Le, Xuan-May Le et al.

CVPR 2024arXiv:2403.14101
19
citations
#4154

InfMAE: A Foundation Model in The Infrared Modality

Fangcen liu, Chenqiang Gao, Yaming Zhang et al.

ECCV 2024arXiv:2402.00407
19
citations
#4155

LiDAR-based Person Re-identification

Wenxuan Guo, Zhiyu Pan, Yingping Liang et al.

CVPR 2024arXiv:2312.03033
19
citations
#4156

LAKE-RED: Camouflaged Images Generation by Latent Background Knowledge Retrieval-Augmented Diffusion

Pancheng Zhao, Peng Xu, Pengda Qin et al.

CVPR 2024arXiv:2404.00292
19
citations
#4157

CoMusion: Towards Consistent Stochastic Human Motion Prediction via Motion Diffusion

Jiarui Sun, Girish Chowdhary

ECCV 2024arXiv:2305.12554
19
citations
#4158

A Stealthy Wrongdoer: Feature-Oriented Reconstruction Attack against Split Learning

Xiaoyang Xu, Mengda Yang, Wenzhe Yi et al.

CVPR 2024arXiv:2405.04115
19
citations
#4159

ParamISP: Learned Forward and Inverse ISPs using Camera Parameters

Woohyeok Kim, Geonu Kim, Junyong Lee et al.

CVPR 2024arXiv:2312.13313
19
citations
#4160

OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propagation

Zhenyu Wang, Ya-Li Li, TAICHI LIU et al.

ECCV 2024arXiv:2403.19580
19
citations
#4161

MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing

Haoyu Zhao, Tianyi Lu, Jiaxi Gu et al.

ECCV 2024arXiv:2311.17338
19
citations
#4162

SuperSVG: Superpixel-based Scalable Vector Graphics Synthesis

Teng Hu, Ran Yi, Baihong Qian et al.

CVPR 2024arXiv:2406.09794
19
citations
#4163

Overcoming Data and Model heterogeneities in Decentralized Federated Learning via Synthetic Anchors

Chun-Yin Huang, Kartik Srinivas, Xin Zhang et al.

ICML 2024arXiv:2405.11525
19
citations
#4164

GarmentCodeData: A Dataset of 3D Made-to-Measure Garments With Sewing Patterns

Maria Korosteleva, Timur Levent Kesdogan, Fabian Kemper et al.

ECCV 2024arXiv:2405.17609
19
citations
#4165

Crowd-SAM:SAM as a smart annotator for object detection in crowded scenes

Zhi Cai, Yingjie Gao, Yaoyan Zheng et al.

ECCV 2024arXiv:2407.11464
19
citations
#4166

Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging

Max Zimmer, Christoph Spiegel, Sebastian Pokutta

ICLR 2024arXiv:2306.16788
19
citations
#4167

Conformal Prediction with Learned Features

Shayan Kiyani, George J. Pappas, Hamed Hassani

ICML 2024arXiv:2404.17487
19
citations
#4168

Distinguished In Uniform: Self-Attention Vs. Virtual Nodes

Eran Rosenbluth, Jan Tönshoff, Martin Ritzert et al.

ICLR 2024
19
citations
#4169

SSL-Cleanse: Trojan Detection and Mitigation in Self-Supervised Learning

Mengxin Zheng, Jiaqi Xue, Zihao Wang et al.

ECCV 2024arXiv:2303.09079
19
citations
#4170

Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation

Bolin Lai, Fiona Ryan, Wenqi Jia et al.

ECCV 2024arXiv:2305.03907
19
citations
#4171

How Well Do Supervised 3D Models Transfer to Medical Imaging Tasks?

Wenxuan Li, Alan Yuille, Zongwei Zhou

ICLR 2024arXiv:2501.11253
19
citations
#4172

eTraM: Event-based Traffic Monitoring Dataset

Aayush Atul Verma, Bharatesh Chakravarthi, Arpitsinh Vaghela et al.

CVPR 2024highlightarXiv:2403.19976
19
citations
#4173

In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-Thought

sili huang, Jifeng Hu, Hechang Chen et al.

ICML 2024arXiv:2405.20692
19
citations
#4174

EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval

Thomas Hummel, Shyamgopal Karthik, Mariana-Iuliana Georgescu et al.

ECCV 2024arXiv:2407.16658
19
citations
#4175

UNIC: Universal Classification Models via Multi-teacher Distillation

Yannis Kalantidis, Larlus Diane, Mert Bulent SARIYILDIZ et al.

ECCV 2024arXiv:2408.05088
19
citations
#4176

SparseCraft: Few-Shot Neural Reconstruction through Stereopsis Guided Geometric Linearization

Mae Younes, Amine Ouasfi, Adnane Boukhayma

ECCV 2024arXiv:2407.14257
19
citations
#4177

Empowering Graph Invariance Learning with Deep Spurious Infomax

Tianjun Yao, Yongqiang Chen, Zhenhao Chen et al.

ICML 2024arXiv:2407.11083
19
citations
#4178

Why Do You Grok? A Theoretical Analysis on Grokking Modular Addition

Mohamad Amin Mohamadi, Zhiyuan Li, Lei Wu et al.

ICML 2024arXiv:2407.12332
19
citations
#4179

Code-Style In-Context Learning for Knowledge-Based Question Answering

Zhijie Nie, Richong Zhang, Zhongyuan Wang et al.

AAAI 2024paperarXiv:2309.04695
19
citations
#4180

MESA: Matching Everything by Segmenting Anything

Yesheng Zhang, Xu Zhao

CVPR 2024arXiv:2401.16741
19
citations
#4181

Drug Discovery with Dynamic Goal-aware Fragments

Seul Lee, Seanie Lee, Kenji Kawaguchi et al.

ICML 2024arXiv:2310.00841
19
citations
#4182

NICP: Neural ICP for 3D Human Registration at Scale

Riccardo Marin, Enric Corona, Gerard Pons-Moll

ECCV 2024arXiv:2312.14024
19
citations
#4183

Bridging the Synthetic-to-Authentic Gap: Distortion-Guided Unsupervised Domain Adaptation for Blind Image Quality Assessment

Aobo Li, Jinjian Wu, Yongxu Liu et al.

CVPR 2024arXiv:2405.04167
19
citations
#4184

Repeated Fair Allocation of Indivisible Items

Ayumi Igarashi, Martin Lackner, Oliviero Nardi et al.

AAAI 2024paperarXiv:2304.01644
19
citations
#4185

FairProof : Confidential and Certifiable Fairness for Neural Networks

Chhavi Yadav, Amrita Roy Chowdhury, Dan Boneh et al.

ICML 2024arXiv:2402.12572
19
citations
#4186

Contourlet Residual for Prompt Learning Enhanced Infrared Image Super-Resolution

Xingyuan Li, Jinyuan Liu, ZHIXIN CHEN et al.

ECCV 2024
19
citations
#4187

Conformal Validity Guarantees Exist for Any Data Distribution (and How to Find Them)

Drew Prinster, Samuel Stanton, Anqi Liu et al.

ICML 2024arXiv:2405.06627
19
citations
#4188

Efficient Subgraph GNNs by Learning Effective Selection Policies

Beatrice Bevilacqua, Moshe Eliasof, Eli Meirom et al.

ICLR 2024arXiv:2310.20082
19
citations
#4189

Rotation-Agnostic Image Representation Learning for Digital Pathology

Saghir Alfasly, Abubakr Shafique, Peyman Nejat et al.

CVPR 2024arXiv:2311.08359
19
citations
#4190

Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective

Wu Lin, Felix Dangel, Runa Eschenhagen et al.

ICML 2024arXiv:2402.03496
19
citations
#4191

Unsupervised Keypoints from Pretrained Diffusion Models

Eric Hedlin, Gopal Sharma, Shweta Mahajan et al.

CVPR 2024highlightarXiv:2312.00065
19
citations
#4192

BCLNet: Bilateral Consensus Learning for Two-View Correspondence Pruning

Xiangyang Miao, Guobao Xiao, Shiping Wang et al.

AAAI 2024paperarXiv:2401.03459
19
citations
#4193

StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion

Ming Tao, BINGKUN BAO, Hao Tang et al.

ECCV 2024arXiv:2404.05979
19
citations
#4194

OVOR: OnePrompt with Virtual Outlier Regularization for Rehearsal-Free Class-Incremental Learning

Wei-Cheng Huang, Chun-Fu Chen, Hsiang Hsu

ICLR 2024arXiv:2402.04129
19
citations
#4195

Learning to Unlearn for Robust Machine Unlearning

Mark HUANG, Lin Geng Foo, Jun Liu

ECCV 2024arXiv:2407.10494
19
citations
#4196

Deep Incomplete Multi-View Learning Network with Insufficient Label Information

Zhangqi Jiang, Tingjin Luo, Xinyan Liang

AAAI 2024paper
19
citations
#4197

Exploring Efficient Asymmetric Blind-Spots for Self-Supervised Denoising in Real-World Scenarios

Shiyan Chen, Jiyuan Zhang, Zhaofei Yu et al.

CVPR 2024arXiv:2303.16783
19
citations
#4198

COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL

Xiyao Wang, Ruijie Zheng, Yanchao Sun et al.

ICLR 2024arXiv:2310.07220
18
citations
#4199

DiffInDScene: Diffusion-based High-Quality 3D Indoor Scene Generation

Xiaoliang Ju, Zhaoyang Huang, Yijin Li et al.

CVPR 2024arXiv:2306.00519
18
citations
#4200

Generating Novel Leads for Drug Discovery Using LLMs with Logical Feedback

Shreyas Bhat Brahmavar, Ashwin Srinivasan, Tirtharaj Dash et al.

AAAI 2024paper
18
citations