Most Cited 2024 "in-distribution separability" Papers

12,324 papers found • Page 9 of 62

#1601

Self-Distillation Regularized Connectionist Temporal Classification Loss for Text Recognition: A Simple Yet Effective Approach

Ziyin Zhang, Ning Lu, Minghui Liao et al.

AAAI 2024paperarXiv:2308.08806
20
citations
#1602

ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models

Yi-Lin Sung, Jaehong Yoon, Mohit Bansal

ICLR 2024posterarXiv:2310.02998
20
citations
#1603

ColorPeel: Color Prompt Learning with Diffusion Models via Color and Shape Disentanglement

Muhammad Atif Butt, Kai Wang, Javier Vazquez-Corral et al.

ECCV 2024posterarXiv:2407.07197
20
citations
#1604

CVT-Occ: Cost Volume Temporal Fusion for 3D Occupancy Prediction

Zhangchen Ye, Tao Jiang, Chenfeng Xu et al.

ECCV 2024posterarXiv:2409.13430
19
citations
#1605

SmartControl: Enhancing ControlNet for Handling Rough Visual Conditions

XIAOYU LIU, Yuxiang WEI, Ming LIU et al.

ECCV 2024posterarXiv:2404.06451
19
citations
#1606

ZeroI2V: Zero-Cost Adaptation of Pre-Trained Transformers from Image to Video

Xinhao Li, Yuhan Zhu, Limin Wang

ECCV 2024posterarXiv:2310.01324
19
citations
#1607

iHuman: Instant Animatable Digital Humans From Monocular Videos

Pramish Paudel, Anubhav Khanal, Danda Pani Paudel et al.

ECCV 2024posterarXiv:2407.11174
19
citations
#1608

The Unreasonable Effectiveness of Pre-Trained Features for Camera Pose Refinement

Gabriele Trivigno, Carlo Masone, Barbara Caputo et al.

CVPR 2024highlightarXiv:2404.10438
19
citations
#1609

LSK3DNet: Towards Effective and Efficient 3D Perception with Large Sparse Kernels

Tuo Feng, Wenguan Wang, Fan Ma et al.

CVPR 2024posterarXiv:2403.15173
19
citations
#1610

Unified Embedding Alignment for Open-Vocabulary Video Instance Segmentation

Hao Fang, Peng Wu, Yawei Li et al.

ECCV 2024posterarXiv:2407.07427
19
citations
#1611

Adversarial Adaptive Sampling: Unify PINN and Optimal Transport for the Approximation of PDEs

Kejun Tang, Jiayu Zhai, Xiaoliang Wan et al.

ICLR 2024posterarXiv:2305.18702
19
citations
#1612

Exploring Efficient Asymmetric Blind-Spots for Self-Supervised Denoising in Real-World Scenarios

Shiyan Chen, Jiyuan Zhang, Zhaofei Yu et al.

CVPR 2024posterarXiv:2303.16783
19
citations
#1613

Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks

Siyu Zou, Jiji Tang, Yiyi Zhou et al.

AAAI 2024paperarXiv:2401.07709
19
citations
#1614

Improving Video Segmentation via Dynamic Anchor Queries

Yikang Zhou, Tao Zhang, Xiangtai Li et al.

ECCV 2024posterarXiv:2404.00086
19
citations
#1615

Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models

Yang Zhang, Tze Tzun Teoh, Wei Hern Lim et al.

ECCV 2024posterarXiv:2403.06381
19
citations
#1616

AMEGO: Active Memory from long EGOcentric videos

Gabriele Goletto, Tushar Nagarajan, Giuseppe Averta et al.

ECCV 2024posterarXiv:2409.10917
19
citations
#1617

Decoupled Pseudo-labeling for Semi-Supervised Monocular 3D Object Detection

Jiacheng Zhang, Jiaming Li, Xiangru Lin et al.

CVPR 2024posterarXiv:2403.17387
19
citations
#1618

Joint Reconstruction of 3D Human and Object via Contact-Based Refinement Transformer

Hyeongjin Nam, Daniel Jung, Gyeongsik Moon et al.

CVPR 2024posterarXiv:2404.04819
19
citations
#1619

Split to Merge: Unifying Separated Modalities for Unsupervised Domain Adaptation

Xinyao Li, Yuke Li, Zhekai Du et al.

CVPR 2024posterarXiv:2403.06946
19
citations
#1620

Distinguished In Uniform: Self-Attention Vs. Virtual Nodes

Eran Rosenbluth, Jan Tönshoff, Martin Ritzert et al.

ICLR 2024poster
19
citations
#1621

Improving Transferable Targeted Adversarial Attacks with Model Self-Enhancement

Han Wu, Guanyan Ou, Weibin Wu et al.

CVPR 2024poster
19
citations
#1622

InterFusion: Text-Driven Generation of 3D Human-Object Interaction

Sisi Dai, Wenhao Li, Haowen Sun et al.

ECCV 2024posterarXiv:2403.15612
19
citations
#1623

Minimum Coverage Sets for Training Robust Ad Hoc Teamwork Agents

Arrasy Rahman, Jiaxun Cui, Peter Stone

AAAI 2024paperarXiv:2308.09595
19
citations
#1624

HR-Pro: Point-Supervised Temporal Action Localization via Hierarchical Reliability Propagation

Huaxin Zhang, Xiang Wang, Xiaohao Xu et al.

AAAI 2024paperarXiv:2308.12608
19
citations
#1625

OmniNOCS: A unified NOCS dataset and model for 3D lifting of 2D objects

Akshay Krishnan, Abhijit Kundu, Kevis Maninis et al.

ECCV 2024posterarXiv:2407.08711
19
citations
#1626

Scalable 3D Registration via Truncated Entry-wise Absolute Residuals

Tianyu Huang, Liangzu Peng, Rene Vidal et al.

CVPR 2024posterarXiv:2404.00915
19
citations
#1627

Diffusion Language-Shapelets for Semi-supervised Time-Series Classification

Zhen Liu, Wenbin Pei, Disen Lan et al.

AAAI 2024paper
19
citations
#1628

Centering the Value of Every Modality: Towards Efficient and Resilient Modality-agnostic Semantic Segmentation

Xu Zheng, Yuanhuiyi Lyu, jiazhou zhou et al.

ECCV 2024posterarXiv:2407.11344
19
citations
#1629

WebVLN: Vision-and-Language Navigation on Websites

Qi Chen, Dileepa Pitawela, Chongyang Zhao et al.

AAAI 2024paperarXiv:2312.15820
19
citations
#1630

Leveraging Cross-Modal Neighbor Representation for Improved CLIP Classification

Chao Yi, Lu Ren, De-Chuan Zhan et al.

CVPR 2024posterarXiv:2404.17753
19
citations
#1631

Bounds on Representation-Induced Confounding Bias for Treatment Effect Estimation

Valentyn Melnychuk, Dennis Frauen, Stefan Feuerriegel

ICLR 2024spotlightarXiv:2311.11321
19
citations
#1632

NICP: Neural ICP for 3D Human Registration at Scale

Riccardo Marin, Enric Corona, Gerard Pons-Moll

ECCV 2024posterarXiv:2312.14024
19
citations
#1633

Image Clustering via the Principle of Rate Reduction in the Age of Pretrained Models

Tianzhe Chu, Shengbang Tong, Tianjiao Ding et al.

ICLR 2024posterarXiv:2306.05272
19
citations
#1634

Customizing Language Model Responses with Contrastive In-Context Learning

Xiang Gao, Kamalika Das

AAAI 2024paperarXiv:2401.17390
19
citations
#1635

Towards Neuro-Symbolic Video Understanding

Minkyu Choi, Harsh Goel, Mohammad Omama et al.

ECCV 2024posterarXiv:2403.11021
19
citations
#1636

LiDAR-based Person Re-identification

Wenxuan Guo, Zhiyu Pan, Yingping Liang et al.

CVPR 2024posterarXiv:2312.03033
19
citations
#1637

RecDiffusion: Rectangling for Image Stitching with Diffusion Models

Tianhao Zhou, Li Haipeng, Ziyi Wang et al.

CVPR 2024posterarXiv:2403.19164
19
citations
#1638

HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras

Zhongyu Xia, ZhiWei Lin, Xinhao Wang et al.

ECCV 2024posterarXiv:2404.02517
19
citations
#1639

Discriminability-Driven Channel Selection for Out-of-Distribution Detection

Yue Yuan, Rundong He, Yicong Dong et al.

CVPR 2024poster
19
citations
#1640

ProS: Prompting-to-simulate Generalized knowledge for Universal Cross-Domain Retrieval

Fang Kaipeng, Jingkuan Song, Lianli Gao et al.

CVPR 2024posterarXiv:2312.12478
19
citations
#1641

Any2Point: Empowering Any-modality Transformers for Efficient 3D Understanding

YIWEN TANG, Renrui Zhang, Jiaming Liu et al.

ECCV 2024poster
19
citations
#1642

Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive

Yumeng Li, Margret Keuper, Dan Zhang et al.

ICLR 2024posterarXiv:2401.08815
19
citations
#1643

Dexterous Grasp Transformer

Guo-Hao Xu, Yi-Lin Wei, Dian Zheng et al.

CVPR 2024posterarXiv:2404.18135
19
citations
#1644

You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation

Mehdi Noroozi, Isma Hadji, Brais Martinez et al.

ECCV 2024posterarXiv:2401.17258
19
citations
#1645

Visible and Clear: Finding Tiny Objects in Difference Map

Bing Cao, Haiyu Yao, Pengfei Zhu et al.

ECCV 2024posterarXiv:2405.11276
19
citations
#1646

Instrumental Variable Estimation for Causal Inference in Longitudinal Data with Time-Dependent Latent Confounders

Debo Cheng, Ziqi Xu, Jiuyong Li et al.

AAAI 2024paperarXiv:2312.07175
19
citations
#1647

Efficient Subgraph GNNs by Learning Effective Selection Policies

Beatrice Bevilacqua, Moshe Eliasof, Eli Meirom et al.

ICLR 2024posterarXiv:2310.20082
19
citations
#1648

Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos

Changan Chen, Puyuan Peng, Ami Baid et al.

ECCV 2024posterarXiv:2406.09272
19
citations
#1649

DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided Transformer

Wei-Ting Chen, Gurunandan Krishnan, Qiang Gao et al.

CVPR 2024posterarXiv:2406.09622
19
citations
#1650

GarmentCodeData: A Dataset of 3D Made-to-Measure Garments With Sewing Patterns

Maria Korosteleva, Timur Levent Kesdogan, Fabian Kemper et al.

ECCV 2024posterarXiv:2405.17609
19
citations
#1651

Underwater Organism Color Fine-Tuning via Decomposition and Guidance

Xiaofeng Cong, Jie Gui, Junming Hou

AAAI 2024paper
19
citations
#1652

Is Retain Set All You Need in Machine Unlearning? Restoring Performance of Unlearned Models with Out-Of-Distribution Images

Jacopo Bonato, Marco Cotogni, Luigi Sabetta

ECCV 2024posterarXiv:2404.12922
19
citations
#1653

CA-Jaccard: Camera-aware Jaccard Distance for Person Re-identification

Yiyu Chen, Zheyi Fan, Zhaoru Chen et al.

CVPR 2024posterarXiv:2311.10605
19
citations
#1654

Unsupervised Keypoints from Pretrained Diffusion Models

Eric Hedlin, Gopal Sharma, Shweta Mahajan et al.

CVPR 2024highlightarXiv:2312.00065
19
citations
#1655

VTQA: Visual Text Question Answering via Entity Alignment and Cross-Media Reasoning

Kang Chen, Xiangqian Wu

CVPR 2024posterarXiv:2303.02635
19
citations
#1656

ConGeo: Robust Cross-view Geo-localization across Ground View Variations

Li Mi, Chang Xu, Javiera Castillo Navarro et al.

ECCV 2024posterarXiv:2403.13965
19
citations
#1657

TexGen: Text-Guided 3D Texture Generation with Multi-view Sampling and Resampling

Dong Huo, Zixin Guo, Xinxin Zuo et al.

ECCV 2024posterarXiv:2408.01291
19
citations
#1658

LatentEditor: Text Driven Local Editing of 3D Scenes

Umar Khalid, Hasan Iqbal, Muhammad Tayyab et al.

ECCV 2024posterarXiv:2312.09313
19
citations
#1659

Theoretically Achieving Continuous Representation of Oriented Bounding Boxes

Zikai Xiao, Guo-Ye Yang, Xue Yang et al.

CVPR 2024posterarXiv:2402.18975
19
citations
#1660

SPIRE: Semantic Prompt-Driven Image Restoration

Chenyang Qi, Zhengzhong Tu, Keren Ye et al.

ECCV 2024posterarXiv:2312.11595
19
citations
#1661

NViST: In the Wild New View Synthesis from a Single Image with Transformers

Wonbong Jang, Lourdes Agapito

CVPR 2024posterarXiv:2312.08568
19
citations
#1662

Deep Orthogonal Hypersphere Compression for Anomaly Detection

Yunhe Zhang, Yan Sun, Jinyu Cai et al.

ICLR 2024spotlightarXiv:2302.06430
19
citations
#1663

LAN: Learning to Adapt Noise for Image Denoising

Changjin Kim, Tae Hyun Kim, Sungyong Baik

CVPR 2024posterarXiv:2412.10651
19
citations
#1664

Towards Real-world Event-guided Low-light Video Enhancement and Deblurring

Taewoo Kim, Jaeseok Jeong, Hoonhee Cho et al.

ECCV 2024posterarXiv:2408.14916
19
citations
#1665

Federated Q-Learning: Linear Regret Speedup with Low Communication Cost

Zhong Zheng, Fengyu Gao, Lingzhou Xue et al.

ICLR 2024posterarXiv:2312.15023
19
citations
#1666

SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding

Weitai Kang, Gaowen Liu, Shah Mubarak et al.

ECCV 2024posterarXiv:2407.03200
19
citations
#1667

As-Plausible-As-Possible: Plausibility-Aware Mesh Deformation Using 2D Diffusion Priors

Seungwoo Yoo, Kunho Kim, Vladimir G. Kim et al.

CVPR 2024posterarXiv:2311.16739
19
citations
#1668

You'll Never Walk Alone: A Sketch and Text Duet for Fine-Grained Image Retrieval

Subhadeep Koley, Ayan Kumar Bhunia, Aneeshan Sain et al.

CVPR 2024posterarXiv:2403.07222
19
citations
#1669

LayoutFlow: Flow Matching for Layout Generation

Julian Jorge Andrade Guerreiro, Naoto Inoue, Kento Masui et al.

ECCV 2024posterarXiv:2403.18187
19
citations
#1670

To Grok or not to Grok: Disentangling Generalization and Memorization on Corrupted Algorithmic Datasets

Darshil Doshi, Aritra Das, Tianyu He et al.

ICLR 2024posterarXiv:2310.13061
19
citations
#1671

Diffusion-Driven Data Replay: A Novel Approach to Combat Forgetting in Federated Class Continual Learning

Jinglin Liang, Jin Zhong, Hanlin Gu et al.

ECCV 2024posterarXiv:2409.01128
19
citations
#1672

HOISDF: Constraining 3D Hand-Object Pose Estimation with Global Signed Distance Fields

Haozhe Qi, Chen Zhao, Mathieu Salzmann et al.

CVPR 2024posterarXiv:2402.17062
19
citations
#1673

SparseCraft: Few-Shot Neural Reconstruction through Stereopsis Guided Geometric Linearization

Mae Younes, Amine Ouasfi, Adnane Boukhayma

ECCV 2024posterarXiv:2407.14257
19
citations
#1674

GlitchBench: Can Large Multimodal Models Detect Video Game Glitches?

Mohammad Reza Taesiri, Tianjun Feng, Cor-Paul Bezemer et al.

CVPR 2024posterarXiv:2312.05291
19
citations
#1675

Robust-Wide: Robust Watermarking against Instruction-driven Image Editing

Runyi Hu, Jie Zhang, Ting Xu et al.

ECCV 2024posterarXiv:2402.12688
19
citations
#1676

Mixed-Precision Quantization for Federated Learning on Resource-Constrained Heterogeneous Devices

Huancheng Chen, Haris Vikalo

CVPR 2024posterarXiv:2311.18129
19
citations
#1677

ODIN: A Single Model for 2D and 3D Segmentation

Ayush Jain, Pushkal Katara, Nikolaos Gkanatsios et al.

CVPR 2024highlightarXiv:2401.02416
19
citations
#1678

When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset

Yi Zhang, Wang Zeng, Sheng Jin et al.

ECCV 2024posterarXiv:2407.10125
19
citations
#1679

StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion

Ming Tao, BINGKUN BAO, Hao Tang et al.

ECCV 2024posterarXiv:2404.05979
19
citations
#1680

Boosting of Thoughts: Trial-and-Error Problem Solving with Large Language Models

Sijia Chen, Baochun Li, Di Niu

ICLR 2024posterarXiv:2402.11140
19
citations
#1681

Continuous Memory Representation for Anomaly Detection

Joo Chan Lee, Taejune Kim, Eunbyung Park et al.

ECCV 2024posterarXiv:2402.18293
19
citations
#1682

LAKE-RED: Camouflaged Images Generation by Latent Background Knowledge Retrieval-Augmented Diffusion

Pancheng Zhao, Peng Xu, Pengda Qin et al.

CVPR 2024posterarXiv:2404.00292
19
citations
#1683

Weighted Envy-Freeness for Submodular Valuations

Luisa Montanari, Ulrike Schmidt-Kraepelin, Warut Suksompong et al.

AAAI 2024paperarXiv:2209.06437
19
citations
#1684

FedFixer: Mitigating Heterogeneous Label Noise in Federated Learning

Xinyuan Ji, Zhaowei Zhu, Wei Xi et al.

AAAI 2024paperarXiv:2403.16561
19
citations
#1685

DreamComposer: Controllable 3D Object Generation via Multi-View Conditions

Yunhan Yang, Yukun Huang, Xiaoyang Wu et al.

CVPR 2024posterarXiv:2312.03611
19
citations
#1686

ParamISP: Learned Forward and Inverse ISPs using Camera Parameters

Woohyeok Kim, Geonu Kim, Junyong Lee et al.

CVPR 2024posterarXiv:2312.13313
19
citations
#1687

Low-Rank Rescaled Vision Transformer Fine-Tuning: A Residual Design Approach

Wei Dong, Xing Zhang, Bihui Chen et al.

CVPR 2024posterarXiv:2403.19067
19
citations
#1688

Neuroformer: Multimodal and Multitask Generative Pretraining for Brain Data

Antonis Antoniades, Yiyi Yu, Joe Canzano et al.

ICLR 2024oralarXiv:2311.00136
19
citations
#1689

Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation

Ji-Jia Wu, Andy Chia-Hao Chang, Chieh-Yu Chuang et al.

CVPR 2024posterarXiv:2404.04231
19
citations
#1690

Constrained Bayesian Optimization under Partial Observations: Balanced Improvements and Provable Convergence

Shengbo Wang, Ke Li

AAAI 2024paperarXiv:2312.03212
19
citations
#1691

Robust Image Denoising through Adversarial Frequency Mixup

Donghun Ryou, Inju Ha, Hyewon Yoo et al.

CVPR 2024poster
19
citations
#1692

Good Teachers Explain: Explanation-Enhanced Knowledge Distillation

Amin Parchami, Moritz Böhle, Sukrut Rao et al.

ECCV 2024posterarXiv:2402.03119
18
citations
#1693

Beta-Tuned Timestep Diffusion Model

Tianyi Zheng, Peng-Tao Jiang, Ben Wan et al.

ECCV 2024poster
18
citations
#1694

Diffusion Model for Robust Multi-Sensor Fusion in 3D Object Detection and BEV Segmentation

Duy Tho Le, Hengcan Shi, Jianfei Cai et al.

ECCV 2024posterarXiv:2404.04629
18
citations
#1695

Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring

Huicong Zhang, Haozhe Xie, Hongxun Yao

CVPR 2024posterarXiv:2406.07551
18
citations
#1696

PILoRA: Prototype Guided Incremental LoRA for Federated Class-Incremental Learning

Haiyang Guo, Fei Zhu, Wenzhuo Liu et al.

ECCV 2024posterarXiv:2401.02094
18
citations
#1697

Class Incremental Learning via Likelihood Ratio Based Task Prediction

Haowei Lin, Yijia Shao, Weinan Qian et al.

ICLR 2024posterarXiv:2309.15048
18
citations
#1698

DriveTrack: A Benchmark for Long-Range Point Tracking in Real-World Videos

Arjun Balasingam, Joseph Chandler, Chenning Li et al.

CVPR 2024posterarXiv:2312.09523
18
citations
#1699

PhysPT: Physics-aware Pretrained Transformer for Estimating Human Dynamics from Monocular Videos

Yufei Zhang, Jeffrey Kephart, Zijun Cui et al.

CVPR 2024posterarXiv:2404.04430
18
citations
#1700

OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propagation

Zhenyu Wang, Ya-Li Li, TAICHI LIU et al.

ECCV 2024posterarXiv:2403.19580
18
citations
#1701

DVSAI: Diverse View-Shared Anchors Based Incomplete Multi-View Clustering

Shengju Yu, Siwei Wang, Pei Zhang et al.

AAAI 2024paper
18
citations
#1702

Rethinking the Evaluation Protocol of Domain Generalization

Han Yu, Xingxuan Zhang, Renzhe Xu et al.

CVPR 2024posterarXiv:2305.15253
18
citations
#1703

Exploring Diverse Representations for Open Set Recognition

Yu Wang, Junxian Mu, Pengfei Zhu et al.

AAAI 2024paperarXiv:2401.06521
18
citations
#1704

Unmixing Diffusion for Self-Supervised Hyperspectral Image Denoising

Haijin Zeng, Jiezhang Cao, Yongyong Chen et al.

CVPR 2024poster
18
citations
#1705

AssistGUI: Task-Oriented PC Graphical User Interface Automation

Difei Gao, Lei Ji, Zechen Bai et al.

CVPR 2024poster
18
citations
#1706

Benchmarking Algorithms for Federated Domain Generalization

Ruqi Bai, Saurabh Bagchi, David Inouye

ICLR 2024spotlightarXiv:2307.04942
18
citations
#1707

Diverse Person: Customize Your Own Dataset for Text-Based Person Search

Zifan Song, Guosheng Hu, Cairong Zhao

AAAI 2024paper
18
citations
#1708

Cloud-Device Collaborative Learning for Multimodal Large Language Models

Guanqun Wang, Jiaming Liu, Chenxuan Li et al.

CVPR 2024posterarXiv:2312.16279
18
citations
#1709

Enhancing High-Resolution 3D Generation through Pixel-wise Gradient Clipping

Zijie Pan, Jiachen Lu, Xiatian Zhu et al.

ICLR 2024posterarXiv:2310.12474
18
citations
#1710

A Simple and Effective Point-based Network for Event Camera 6-DOFs Pose Relocalization

Hongwei Ren, Jiadong Zhu, Yue Zhou et al.

CVPR 2024posterarXiv:2403.19412
18
citations
#1711

A Dual-Way Enhanced Framework from Text Matching Point of View for Multimodal Entity Linking

Shezheng Song, Shan Zhao, ChengYu Wang et al.

AAAI 2024paperarXiv:2312.11816
18
citations
#1712

Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models

Ruibin Li, Ruihuang Li, Song Guo et al.

ECCV 2024posterarXiv:2403.11105
18
citations
#1713

EAT: Towards Long-Tailed Out-of-Distribution Detection

Tong Wei, Bo-Lin Wang, Min-Ling Zhang

AAAI 2024paperarXiv:2312.08939
18
citations
#1714

SuperSVG: Superpixel-based Scalable Vector Graphics Synthesis

Teng Hu, Ran Yi, Baihong Qian et al.

CVPR 2024posterarXiv:2406.09794
18
citations
#1715

Open-Set Domain Adaptation for Semantic Segmentation

Seun-An Choe, Ah-Hyung Shin, Keon Hee Park et al.

CVPR 2024posterarXiv:2405.19899
18
citations
#1716

HiPose: Hierarchical Binary Surface Encoding and Correspondence Pruning for RGB-D 6DoF Object Pose Estimation

Yongliang Lin, Yongzhi Su, Praveen Nathan et al.

CVPR 2024posterarXiv:2311.12588
18
citations
#1717

Learning to Detect Multi-class Anomalies with Just One Normal Image Prompt

Bin-Bin Gao

ECCV 2024posterarXiv:2505.09264
18
citations
#1718

Composing Object Relations and Attributes for Image-Text Matching

Khoi Pham, Chuong Huynh, Ser-Nam Lim et al.

CVPR 2024posterarXiv:2406.11820
18
citations
#1719

RMem: Restricted Memory Banks Improve Video Object Segmentation

Junbao Zhou, Ziqi Pang, Yu-Xiong Wang

CVPR 2024posterarXiv:2406.08476
18
citations
#1720

Text Image Inpainting via Global Structure-Guided Diffusion Models

Shipeng Zhu, Pengfei Fang, Chenjie Zhu et al.

AAAI 2024paperarXiv:2401.14832
18
citations
#1721

OVOR: OnePrompt with Virtual Outlier Regularization for Rehearsal-Free Class-Incremental Learning

Wei-Cheng Huang, Chun-Fu Chen, Hsiang Hsu

ICLR 2024posterarXiv:2402.04129
18
citations
#1722

Implicit Concept Removal of Diffusion Models

Zhili LIU, Kai Chen, Yifan Zhang et al.

ECCV 2024posterarXiv:2310.05873
18
citations
#1723

FusionFormer: A Concise Unified Feature Fusion Transformer for 3D Pose Estimation

Yanlu Cai, Weizhong Zhang, Yuan Wu et al.

AAAI 2024paper
18
citations
#1724

Shadow Generation for Composite Image Using Diffusion Model

Qingyang Liu, Junqi You, Jian-Ting Wang et al.

CVPR 2024posterarXiv:2403.15234
18
citations
#1725

Connecting Consistency Distillation to Score Distillation for Text-to-3D Generation

Zongrui Li, Minghui Hu, Qian Zheng et al.

ECCV 2024posterarXiv:2407.13584
18
citations
#1726

MoST: Motion Style Transformer Between Diverse Action Contents

Boeun Kim, Jungho Kim, Hyung Jin Chang et al.

CVPR 2024posterarXiv:2403.06225
18
citations
#1727

Correspondence-Free Non-Rigid Point Set Registration Using Unsupervised Clustering Analysis

Mingyang Zhao, Jiang Jingen, Lei Ma et al.

CVPR 2024highlightarXiv:2406.18817
18
citations
#1728

Fair-VPT: Fair Visual Prompt Tuning for Image Classification

Sungho Park, Hyeran Byun

CVPR 2024poster
18
citations
#1729

STDiff: Spatio-Temporal Diffusion for Continuous Stochastic Video Prediction

Xi Ye, Guillaume-Alexandre Bilodeau

AAAI 2024paperarXiv:2312.06486
18
citations
#1730

PSC-CPI: Multi-Scale Protein Sequence-Structure Contrasting for Efficient and Generalizable Compound-Protein Interaction Prediction

Lirong Wu, Yufei Huang, Cheng Tan et al.

AAAI 2024paperarXiv:2402.08198
18
citations
#1731

Deep Quantum Error Correction

Yoni Choukroun, Lior Wolf

AAAI 2024paperarXiv:2301.11930
18
citations
#1732

UNIC: Universal Classification Models via Multi-teacher Distillation

Yannis Kalantidis, Larlus Diane, Mert Bulent SARIYILDIZ et al.

ECCV 2024posterarXiv:2408.05088
18
citations
#1733

Zero-Shot Aerial Object Detection with Visual Description Regularization

Chenyu Lin, Zhengqing Zang, Chenwei Tang et al.

AAAI 2024paperarXiv:2402.18233
18
citations
#1734

MM-Point: Multi-View Information-Enhanced Multi-Modal Self-Supervised 3D Point Cloud Understanding

HaiTao Yu, Mofei Song

AAAI 2024paperarXiv:2402.10002
18
citations
#1735

Temporally and Distributionally Robust Optimization for Cold-Start Recommendation

Xinyu Lin, Wenjie Wang, Jujia Zhao et al.

AAAI 2024paperarXiv:2312.09901
18
citations
#1736

SemiReward: A General Reward Model for Semi-supervised Learning

Siyuan Li, Weiyang Jin, Zedong Wang et al.

ICLR 2024posterarXiv:2310.03013
18
citations
#1737

T4P: Test-Time Training of Trajectory Prediction via Masked Autoencoder and Actor-specific Token Memory

Daehee Park, Jaeseok Jeong, Sung-Hoon Yoon et al.

CVPR 2024posterarXiv:2403.10052
18
citations
#1738

Towards Faithful XAI Evaluation via Generalization-Limited Backdoor Watermark

Mengxi Ya, Yiming Li, Tao Dai et al.

ICLR 2024poster
18
citations
#1739

Relightable and Animatable Neural Avatars from Videos

Wenbin Lin, Chengwei Zheng, Jun-hai Yong et al.

AAAI 2024paperarXiv:2312.12877
18
citations
#1740

Geometric-Facilitated Denoising Diffusion Model for 3D Molecule Generation

6428 Can Xu, Haosen Wang, Weigang Wang et al.

AAAI 2024paperarXiv:2401.02683
18
citations
#1741

Text-Enhanced Data-free Approach for Federated Class-Incremental Learning

Minh-Tuan Tran, Trung Le, Xuan-May Le et al.

CVPR 2024posterarXiv:2403.14101
18
citations
#1742

MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing

Haoyu Zhao, Tianyi Lu, Jiaxi Gu et al.

ECCV 2024posterarXiv:2311.17338
18
citations
#1743

Crowd-SAM:SAM as a smart annotator for object detection in crowded scenes

Zhi Cai, Yingjie Gao, Yaoyan Zheng et al.

ECCV 2024posterarXiv:2407.11464
18
citations
#1744

Exemplar-free Continual Representation Learning via Learnable Drift Compensation

Alex Gomez-Villa, Dipam Goswami, Kai Wang et al.

ECCV 2024posterarXiv:2407.08536
18
citations
#1745

Traffic Flow Optimisation for Lifelong Multi-Agent Path Finding

Zhe Chen, Daniel Harabor, Jiaoyang Li et al.

AAAI 2024paperarXiv:2308.11234
18
citations
#1746

Repeated Fair Allocation of Indivisible Items

Ayumi Igarashi, Martin Lackner, Oliviero Nardi et al.

AAAI 2024paperarXiv:2304.01644
18
citations
#1747

Code-Style In-Context Learning for Knowledge-Based Question Answering

Zhijie Nie, Richong Zhang, Zhongyuan Wang et al.

AAAI 2024paperarXiv:2309.04695
18
citations
#1748

Unprocessing Seven Years of Algorithmic Fairness

André F. Cruz, Moritz Hardt

ICLR 2024posterarXiv:2306.07261
18
citations
#1749

InsMapper: Exploring Inner-instance Information for Vectorized HD Mapping

Zhenhua Xu, Kwan-Yee K. Wong, Hengshuang ZHAO

ECCV 2024posterarXiv:2308.08543
18
citations
#1750

BCLNet: Bilateral Consensus Learning for Two-View Correspondence Pruning

Xiangyang Miao, Guobao Xiao, Shiping Wang et al.

AAAI 2024paperarXiv:2401.03459
18
citations
#1751

Lifting by Image – Leveraging Image Cues for Accurate 3D Human Pose Estimation

Feng Zhou, Jianqin Yin, Peiyang Li

AAAI 2024paperarXiv:2312.15636
18
citations
#1752

Adaptive Bounding Box Uncertainties via Two-Step Conformal Prediction

Alexander Timans, Christoph-Nikolas Straehle, Kaspar Sakmann et al.

ECCV 2024posterarXiv:2403.07263
18
citations
#1753

MetaCoCo: A New Few-Shot Classification Benchmark with Spurious Correlation

Min Zhang, Haoxuan Li, Fei Wu et al.

ICLR 2024posterarXiv:2404.19644
18
citations
#1754

TAMM: TriAdapter Multi-Modal Learning for 3D Shape Understanding

Zhihao Zhang, Shengcao Cao, Yu-Xiong Wang

CVPR 2024posterarXiv:2402.18490
18
citations
#1755

FreePoint: Unsupervised Point Cloud Instance Segmentation

Zhikai Zhang, Jian Ding, Li Jiang et al.

CVPR 2024posterarXiv:2305.06973
18
citations
#1756

Project-Fair and Truthful Mechanisms for Budget Aggregation

Rupert Freeman, Ulrike Schmidt-Kraepelin

AAAI 2024paperarXiv:2309.02613
18
citations
#1757

Contourlet Residual for Prompt Learning Enhanced Infrared Image Super-Resolution

Xingyuan Li, Jinyuan Liu, ZHIXIN CHEN et al.

ECCV 2024poster
18
citations
#1758

Spectral-Based Graph Neutral Networks for Complementary Item Recommendation

Haitong Luo, Xuying Meng, Suhang Wang et al.

AAAI 2024paper
18
citations
#1759

FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio

Chao Xu, Yang Liu, Jiazheng Xing et al.

CVPR 2024posterarXiv:2403.01901
18
citations
#1760

Locality Sensitive Sparse Encoding for Learning World Models Online

Zichen Liu, Chao Du, Wee Sun Lee et al.

ICLR 2024posterarXiv:2401.13034
18
citations
#1761

Improving Virtual Try-On with Garment-focused Diffusion Models

Siqi Wan, Yehao Li, Jingwen Chen et al.

ECCV 2024posterarXiv:2409.08258
18
citations
#1762

Dense Projection for Anomaly Detection

Dazhi Fu, Zhao Zhang, Jicong Fan

AAAI 2024paper
18
citations
#1763

A New Mechanism for Eliminating Implicit Conflict in Graph Contrastive Learning

Dongxiao He, Jitao Zhao, Cuiying Huo et al.

AAAI 2024paper
18
citations
#1764

ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining

Dezhi Peng, Chongyu Liu, Yuliang Liu et al.

AAAI 2024paperarXiv:2306.12106
18
citations
#1765

Beat-It: Beat-Synchronized Multi-Condition 3D Dance Generation

Zikai Huang, Xuemiao Xu, Cheng Xu et al.

ECCV 2024posterarXiv:2407.07554
18
citations
#1766

FAR: Flexible Accurate and Robust 6DoF Relative Camera Pose Estimation

Chris Rockwell, Nilesh Kulkarni, Linyi Jin et al.

CVPR 2024highlightarXiv:2403.03221
18
citations
#1767

CoDA: Instructive Chain-of-Domain Adaptation with Severity-Aware Visual Prompt Tuning

Ziyang Gong, FuHao Li, Yupeng Deng et al.

ECCV 2024posterarXiv:2403.17369
18
citations
#1768

Generating Novel Leads for Drug Discovery Using LLMs with Logical Feedback

Shreyas Bhat Brahmavar, Ashwin Srinivasan, Tirtharaj Dash et al.

AAAI 2024paper
18
citations
#1769

PromptCCD: Learning Gaussian Mixture Prompt Pool for Continual Category Discovery

Fernando Julio Cendra, Bingchen Zhao, Kai Han

ECCV 2024posterarXiv:2407.19001
18
citations
#1770

CC-SAM: Enhancing SAM with Cross-feature Attention and Context for Ultrasound Image Segmentation

Shreyank Narayana Gowda, David A Clifton

ECCV 2024poster
18
citations
#1771

FedMef: Towards Memory-efficient Federated Dynamic Pruning

Hong Huang, Weiming Zhuang, Chen Chen et al.

CVPR 2024posterarXiv:2403.14737
18
citations
#1772

Enhancing 3D Object Detection with 2D Detection-Guided Query Anchors

Haoxuanye Ji, Pengpeng Liang, Erkang Cheng

CVPR 2024posterarXiv:2403.06093
17
citations
#1773

SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds

Yanbo Wang, Wentao Zhao, Cao Chuan et al.

ECCV 2024posterarXiv:2407.11569
17
citations
#1774

Multiagent Multitraversal Multimodal Self-Driving: Open MARS Dataset

Yiming Li, Zhiheng Li, Nuo Chen et al.

CVPR 2024posterarXiv:2406.09383
17
citations
#1775

Synchronization is All You Need: Exocentric-to-Egocentric Transfer for Temporal Action Segmentation with Unlabeled Synchronized Video Pairs

Camillo Quattrocchi, Antonino Furnari, Daniele Di Mauro et al.

ECCV 2024posterarXiv:2312.02638
17
citations
#1776

SuperNormal: Neural Surface Reconstruction via Multi-View Normal Integration

Xu Cao, Takafumi Taketomi

CVPR 2024posterarXiv:2312.04803
17
citations
#1777

SfmCAD: Unsupervised CAD Reconstruction by Learning Sketch-based Feature Modeling Operations

Pu Li, Jianwei Guo, HUIBIN LI et al.

CVPR 2024poster
17
citations
#1778

eTraM: Event-based Traffic Monitoring Dataset

Aayush Atul Verma, Bharatesh Chakravarthi, Arpitsinh Vaghela et al.

CVPR 2024highlightarXiv:2403.19976
17
citations
#1779

CoReS: Orchestrating the Dance of Reasoning and Segmentation

Xiaoyi Bao, Siyang Sun, Shuailei Ma et al.

ECCV 2024posterarXiv:2404.05673
17
citations
#1780

Explain via Any Concept: Concept Bottleneck Model with Open Vocabulary Concepts

Andong Tan, Fengtao Zhou, Hao Chen

ECCV 2024posterarXiv:2408.02265
17
citations
#1781

COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL

Xiyao Wang, Ruijie Zheng, Yanchao Sun et al.

ICLR 2024posterarXiv:2310.07220
17
citations
#1782

PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor

Vidit Goel, Elia Peruzzo, Yifan Jiang et al.

CVPR 2024poster
17
citations
#1783

PetFace: A Large-Scale Dataset and Benchmark for Animal Identification

Risa Shinoda, Kaede Shiohara

ECCV 2024posterarXiv:2407.13555
17
citations
#1784

Data Valuation and Detections in Federated Learning

Wenqian Li, Shuran Fu, Fengrui Zhang et al.

CVPR 2024posterarXiv:2311.05304
17
citations
#1785

Neural Visibility Field for Uncertainty-Driven Active Mapping

Shangjie Xue, Jesse Dill, Pranay Mathur et al.

CVPR 2024posterarXiv:2406.06948
17
citations
#1786

Dataset Enhancement with Instance-Level Augmentations

Orest Kupyn, Christian Rupprecht

ECCV 2024posterarXiv:2406.08249
17
citations
#1787

Image Inpainting via Iteratively Decoupled Probabilistic Modeling

Wenbo Li, Xin Yu, Kun Zhou et al.

ICLR 2024spotlightarXiv:2212.02963
17
citations
#1788

Mismatch Quest: Visual and Textual Feedback for Image-Text Misalignment

Brian Gordon, Yonatan Bitton, Yonatan Shafir et al.

ECCV 2024posterarXiv:2312.03766
17
citations
#1789

Scaling and Masking: A New Paradigm of Data Sampling for Image and Video Quality Assessment

Yongxu Liu, Yinghui Quan, Guoyao Xiao et al.

AAAI 2024paperarXiv:2401.02614
17
citations
#1790

Differentiable Information Bottleneck for Deterministic Multi-view Clustering

Xiaoqiang Yan, Zhixiang Jin, Fengshou Han et al.

CVPR 2024posterarXiv:2403.15681
17
citations
#1791

Exploring the Transferability of Visual Prompting for Multimodal Large Language Models

Yichi Zhang, Yinpeng Dong, Siyuan Zhang et al.

CVPR 2024highlightarXiv:2404.11207
17
citations
#1792

Bongard-OpenWorld: Few-Shot Reasoning for Free-form Visual Concepts in the Real World

Rujie Wu, Xiaojian Ma, Zhenliang Zhang et al.

ICLR 2024posterarXiv:2310.10207
17
citations
#1793

Raindrop Clarity: A Dual-Focused Dataset for Day and Night Raindrop Removal

Yeying Jin, Xin Li, Jiadong Wang et al.

ECCV 2024posterarXiv:2407.16957
17
citations
#1794

RGMComm: Return Gap Minimization via Discrete Communications in Multi-Agent Reinforcement Learning

Jingdi Chen, Tian Lan, Carlee Joe-Wong

AAAI 2024paperarXiv:2308.03358
17
citations
#1795

Understanding Augmentation-based Self-Supervised Representation Learning via RKHS Approximation and Regression

Runtian Zhai, Bingbin Liu, Andrej Risteski et al.

ICLR 2024spotlightarXiv:2306.00788
17
citations
#1796

GSD: View-Guided Gaussian Splatting Diffusion for 3D Reconstruction

Yuxuan Mu, Xinxin Zuo, Chuan Guo et al.

ECCV 2024posterarXiv:2407.04237
17
citations
#1797

Understanding Video Transformers via Universal Concept Discovery

Matthew Kowal, Achal Dave, Rares Andrei Ambrus et al.

CVPR 2024highlightarXiv:2401.10831
17
citations
#1798

Adaptive VIO: Deep Visual-Inertial Odometry with Online Continual Learning

Youqi Pan, Wugen Zhou, Yingdian Cao et al.

CVPR 2024posterarXiv:2405.16754
17
citations
#1799

Crystalformer: Infinitely Connected Attention for Periodic Structure Encoding

Tatsunori Taniai, Ryo Igarashi, Yuta Suzuki et al.

ICLR 2024posterarXiv:2403.11686
17
citations
#1800

DeiT-LT: Distillation Strikes Back for Vision Transformer Training on Long-Tailed Datasets

Harsh Rangwani, Pradipto Mondal, Mayank Mishra et al.

CVPR 2024posterarXiv:2404.02900
17
citations