Most Cited 2024 "gpu kernel optimization" Papers

AAAI 2024 paper arXiv:2312.10921

#1602

AE-NeRF: Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis

Dongze Li, Kang Zhao, Wei Wang et al.

AAAI 2024 paper arXiv:2312.15184

#1603

ZO-AdaMU Optimizer: Adapting Perturbation by the Momentum and Uncertainty in Zeroth-Order Optimization

Shuoran Jiang, Qingcai Chen, Yang Xiang et al.

ICLR 2024 poster arXiv:2311.06792

#1604

IMPUS: Image Morphing with Perceptually-Uniform Sampling Using Diffusion Models

Zhaoyuan Yang, Zhengyang Yu, Zhiwei Xu et al.

ECCV 2024 poster arXiv:2505.09263

#1605

Few-Shot Anomaly-Driven Generation for Anomaly Classification and Segmentation

Guan Gui, Bin-Bin Gao, Jun Liu et al.

CVPR 2024 poster arXiv:2311.17048

#1606

Zero-shot Referring Expression Comprehension via Structural Similarity Between Images and Captions

Zeyu Han, Fangrui Zhu, Qianru Lao et al.

AAAI 2024 paper arXiv:2312.11850

#1607

GCNext: Towards the Unity of Graph Convolutions for Human Motion Prediction

Xinshun Wang, Qiongjie Cui, Chen Chen et al.

CVPR 2024 poster arXiv:2404.14542

#1608

UVEB: A Large-scale Benchmark and Baseline Towards Real-World Underwater Video Enhancement

Yaofeng Xie, Lingwei Kong, Kai Chen et al.

ICLR 2024 poster arXiv:2310.10732

#1609

MOFDiff: Coarse-grained Diffusion for Metal-Organic Framework Design

Xiang Fu, Tian Xie, Andrew Rosen et al.

ECCV 2024 poster arXiv:2408.13320

#1610

Online Zero-Shot Classification with CLIP

Qi Qian, Juhua Hu

ECCV 2024 poster arXiv:2403.09079

#1611

PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors

Tianyuan Yuan, Mao Yucheng, Jiawei Yang et al.

#1612

Unmixing Diffusion for Self-Supervised Hyperspectral Image Denoising

Haijin Zeng, Jiezhang Cao, Yongyong Chen et al.

ECCV 2024 poster arXiv:2407.15350

#1613

WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding

Quan Kong, Yuki Kawana, Rajat Saini et al.

ECCV 2024 poster arXiv:2405.11276

#1614

Visible and Clear: Finding Tiny Objects in Difference Map

Bing Cao, Haiyu Yao, Pengfei Zhu et al.

ECCV 2024 poster arXiv:2310.08820

#1615

Learning to Adapt SAM for Segmenting Cross-domain Point Clouds

Xidong Peng, Runnan Chen, Feng Qiao et al.

CVPR 2024 highlight arXiv:2410.18355

#1616

Real-time 3D-aware Portrait Video Relighting

Ziqi Cai, Kaiwen Jiang, Shu-Yu Chen et al.

ECCV 2024 poster arXiv:2407.12345

#1617

VisionTrap: Vision-Augmented Trajectory Prediction Guided by Textual Descriptions

Seokha Moon, Hyun Woo, Hongbeen Park et al.

ICLR 2024 poster arXiv:2302.00704

#1618

Pathologies of Predictive Diversity in Deep Ensembles

Geoff Pleiss, Taiga Abe, E. Kelly Buchanan et al.

#1619

Boosting Neural Cognitive Diagnosis with Student’s Affective State Modeling

Shanshan Wang, Zhen Zeng, Xun Yang et al.

CVPR 2024 highlight arXiv:2311.17082

#1620

DreamPropeller: Supercharge Text-to-3D Generation with Parallel Sampling

Linqi Zhou, Andy Shih, Chenlin Meng et al.

CVPR 2024 poster arXiv:2403.00592

#1621

Rethinking Few-shot 3D Point Cloud Semantic Segmentation

Zhaochong An, Guolei Sun, Yun Liu et al.

CVPR 2024 poster arXiv:2310.18285

#1622

Unlocking the Potential of Prompt-Tuning in Bridging Generalized and Personalized Federated Learning

Wenlong Deng, Christos Thrampoulidis, Xiaoxiao Li

ICLR 2024 poster arXiv:2306.11251

#1623

Lipschitz Singularities in Diffusion Models

Zhantao Yang, Ruili Feng, Han Zhang et al.

ECCV 2024 poster arXiv:2407.10749

#1624

SEED: A Simple and Effective 3D DETR in Point Clouds

Zhe Liu, Jinghua Hou, Xiaoqing Ye et al.

#1625

Surface Reconstruction for 3D Gaussian Splatting via Local Structural Hints

Qianyi Wu, Jianmin Zheng, Jianfei Cai

AAAI 2024 paper arXiv:2402.13188

#1626

Question Calibration and Multi-Hop Modeling for Temporal Question Answering

Chao Xue, Di Liang, Pengfei Wang et al.

#1627

Conditional Information Bottleneck Approach for Time Series Imputation

MinGyu Choi, Changhee Lee

ICLR 2024 oral

CVPR 2024 poster arXiv:2403.16646

#1628

Clustering Propagation for Universal Medical Image Segmentation

Yuhang Ding, Liulei Li, Wenguan Wang et al.

CVPR 2024 highlight arXiv:2403.03122

#1629

NRDF: Neural Riemannian Distance Fields for Learning Articulated Pose Priors

Yannan He, Garvita Tiwari, Tolga Birdal et al.

CVPR 2024 poster arXiv:2301.13096

#1630

Language-Driven Anchors for Zero-Shot Adversarial Robustness

Xiao Li, Wei Zhang, Yining Liu et al.

CVPR 2024 poster arXiv:2404.09011

#1631

PracticalDG: Perturbation Distillation on Vision-Language Models for Hybrid Domain Generalization

Zining Chen, Weiqiu Wang, Zhicheng Zhao et al.

CVPR 2024 poster arXiv:2403.09439

#1632

3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation

Songchun Zhang, Yibo Zhang, Quan Zheng et al.

ECCV 2024 poster arXiv:2404.19149

#1633

SAGS: Structure-Aware 3D Gaussian Splatting

Evangelos Ververas, Rolandos Alexandros Potamias, Song Jifei et al.

ECCV 2024 poster arXiv:2407.13987

#1634

RealViformer: Investigating Attention for Real-World Video Super-Resolution

Yuehan Zhang, Angela Yao

CVPR 2024 poster arXiv:2404.11732

#1635

Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach

Mir Rayat Imtiaz Hossain, Mennatullah Siam, Leonid Sigal et al.

CVPR 2024 poster arXiv:2404.16306

#1636

TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models

Haomiao Ni, Bernhard Egger, Suhas Lohit et al.

ECCV 2024 poster arXiv:2409.04004

#1637

One-Shot Diffusion Mimicker for Handwritten Text Generation

Gang Dai, Yifan Zhang, Quhui Ke et al.

ECCV 2024 poster arXiv:2404.11615

#1638

Factorized Diffusion: Perceptual Illusions by Noise Decomposition

Daniel Geng, Inbum Park, Andrew Owens

CVPR 2024 poster arXiv:2312.13980

#1639

Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning

Desai Xie, Jiahao Li, Hao Tan et al.

CVPR 2024 poster arXiv:2312.01407

#1640

VideoRF: Rendering Dynamic Radiance Fields as 2D Feature Video Streams

Liao Wang, Kaixin Yao, Chengcheng Guo et al.

ECCV 2024 poster arXiv:2401.04730

#1641

A Simple Baseline for Spoken Language to Sign Language Translation with 3D Avatars

Ronglai Zuo, Fangyun Wei, Zenggui Chen et al.

CVPR 2024 poster arXiv:2403.03561

#1642

HMD-Poser: On-Device Real-time Human Motion Tracking from Scalable Sparse Observations

Peng Dai, Yang Zhang, Tao Liu et al.

ECCV 2024 poster arXiv:2407.04616

#1643

Isomorphic Pruning for Vision Models

Gongfan Fang, Xinyin Ma, Michael Bi Mi et al.

ICLR 2024 poster arXiv:2310.07449

#1644

PoRF: Pose Residual Field for Accurate Neural Surface Reconstruction

Jia-Wang Bian, Wenjing Bian, Victor Prisacariu et al.

#1645

Learning to Predict Activity Progress by Self-Supervised Video Alignment

Gerard Donahue, Ehsan Elhamifar

AAAI 2024 paper arXiv:2401.06521

#1646

Exploring Diverse Representations for Open Set Recognition

Yu Wang, Junxian Mu, Pengfei Zhu et al.

CVPR 2024 highlight arXiv:2402.17483

#1647

AlignMiF: Geometry-Aligned Multimodal Implicit Field for LiDAR-Camera Joint Synthesis

Tao Tang, Guangrun Wang, Yixing Lao et al.

AAAI 2024 paper arXiv:2309.03797

#1648

Conformal Autoregressive Generation: Beam Search with Coverage Guarantees

Nicolas Deutschmann, Marvin Alberts, María Rodríguez Martínez

CVPR 2024 poster arXiv:2404.02405

#1649

TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate Expression

Ho-Joong Kim, Jung-Ho Hong, Heejo Kong et al.

ICLR 2024 poster arXiv:2305.19044

#1650

Exploring the Promise and Limits of Real-Time Recurrent Learning

Kazuki Irie, Anand Gopalakrishnan, Jürgen Schmidhuber

ECCV 2024 poster arXiv:2407.07586

#1651

Simplifying Source-Free Domain Adaptation for Object Detection: Effective Self-Training Strategies and Performance Insights

Yan Hao, Florent Forest, Olga Fink

CVPR 2024 poster arXiv:2312.02152

#1652

Steerers: A Framework for Rotation Equivariant Keypoint Descriptors

Georg Bökman, Johan Edstedt, Michael Felsberg et al.

CVPR 2024 poster arXiv:2404.00168

#1653

Multi-Level Neural Scene Graphs for Dynamic Urban Environments

Tobias Fischer, Lorenzo Porzi, Samuel Rota Bulò et al.

ECCV 2024 poster arXiv:2407.07197

#1654

ColorPeel: Color Prompt Learning with Diffusion Models via Color and Shape Disentanglement

Muhammad Atif Butt, Kai Wang, Javier Vazquez-Corral et al.

AAAI 2024 paper arXiv:2305.15090

#1655

STAR: Boosting Low-Resource Information Extraction by Structure-to-Text Data Generation with Large Language Models

Mingyu Derek Ma, Xiaoxuan Wang, Po-Nien Kung et al.

AAAI 2024 paper arXiv:2401.17390

#1656

Customizing Language Model Responses with Contrastive In-Context Learning

Xiang Gao, Kamalika Das

CVPR 2024 poster arXiv:2311.17389

#1657

360Loc: A Dataset and Benchmark for Omnidirectional Visual Localization with Cross-device Queries

Huajian Huang, Changkun Liu, Yipeng Zhu et al.

CVPR 2024 poster arXiv:2403.17537

#1658

NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation

Jiahao Chen, Yipeng Qin, Lingjie Liu et al.

CVPR 2024 poster arXiv:2307.04570

#1659

A Call to Reflect on Evaluation Practices for Age Estimation: Comparative Analysis of the State-of-the-Art and a Unified Benchmark

Jakub Paplham, Vojtech Franc

ICLR 2024 poster arXiv:2304.01910

#1660

On the Variance of Neural Network Training with respect to Test Sets and Distributions

Keller Jordan

ECCV 2024 poster arXiv:2403.17541

#1661

WordRobe: Text-Guided Generation of Textured 3D Garments

Astitva Srivastava, Pranav Manu, Amit Raj et al.

CVPR 2024 poster arXiv:2403.02626

#1662

Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use

Imad Eddine Toubal, Aditya Avinash, Neil Alldrin et al.

#1663

Underwater Organism Color Fine-Tuning via Decomposition and Guidance

Xiaofeng Cong, Jie Gui, Junming Hou

AAAI 2024 paper arXiv:2312.07175

#1664

Instrumental Variable Estimation for Causal Inference in Longitudinal Data with Time-Dependent Latent Confounders

Debo Cheng, Ziqi Xu, Jiuyong Li et al.

AAAI 2024 paper arXiv:2312.08865

#1665

Improving Cross-Modal Alignment with Synthetic Pairs for Text-Only Image Captioning

Zhiyue Liu, Jinyuan Liu, Fanrong Ma

ECCV 2024 poster arXiv:2404.00086

#1666

Improving Video Segmentation via Dynamic Anchor Queries

Yikang Zhou, Tao Zhang, Xiangtai Li et al.

CVPR 2024 poster arXiv:2401.06129

#1667

Distilling Vision-Language Models on Millions of Videos

Yue Zhao, Long Zhao, Xingyi Zhou et al.

ECCV 2024 poster arXiv:2407.03056

#1668

Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation

Marco Mistretta, Alberto Baldrati, Marco Bertini et al.

CVPR 2024 poster arXiv:2403.15760

#1669

An Upload-Efficient Scheme for Transferring Knowledge From a Server-Side Pre-trained Generator to Clients in Heterogeneous Federated Learning

Jianqing Zhang, Yang Liu, Yang Hua et al.

ECCV 2024 poster arXiv:2408.16219

#1670

Training-free Video Temporal Grounding using Large-scale Pre-trained Models

Minghang Zheng, Xinhao Cai, Qingchao Chen et al.

#1671

Improved Graph Contrastive Learning for Short Text Classification

Yonghao Liu, Lan Huang, Fausto Giunchiglia et al.

AAAI 2024 paper arXiv:2305.16830

#1672

Leaving the Nest: Going beyond Local Loss Functions for Predict-Then-Optimize

Sanket Shah, Bryan Wilder, Andrew Perrault et al.

CVPR 2024 poster arXiv:2402.17563

#1673

Structure-Guided Adversarial Training of Diffusion Models

Ling Yang, Haotian Qian, Zhilong Zhang et al.

ECCV 2024 poster arXiv:2406.10708

#1674

MMVR: Millimeter-wave Multi-View Radar Dataset and Benchmark for Indoor Perception

Mohammad Mahbubur Rahman, Ryoma Yataka, Sorachi Kato et al.

CVPR 2024 poster arXiv:2405.00256

#1675

ASAM: Boosting Segment Anything Model with Adversarial Tuning

Bo Li, Haoke Xiao, Lv Tang

ECCV 2024 poster arXiv:2409.10917

#1676

AMEGO: Active Memory from long EGOcentric videos

Gabriele Goletto, Tushar Nagarajan, Giuseppe Averta et al.

ECCV 2024 poster arXiv:2409.19403

#1677

Restore Anything with Masks: Leveraging Mask Image Modeling for Blind All-in-One Image Restoration

Chu Jie Qin, Ruiqi Wu, Zikun Liu et al.

CVPR 2024 poster arXiv:2312.06713

#1678

TeTriRF: Temporal Tri-Plane Radiance Fields for Efficient Free-Viewpoint Video

Minye Wu, Zehao Wang, Georgios Kouros et al.

#1679

Embarrassingly Simple Dataset Distillation

Yunzhen Feng, Shanmukha Ramakrishna Vedantam, Julia Kempe

ICLR 2024 poster

ECCV 2024 poster arXiv:2407.15087

#1680

Navigation Instruction Generation with BEV Perception and Large Language Models

Sheng Fan, Rui Liu, Wenguan Wang et al.

CVPR 2024 poster arXiv:2403.20236

#1681

Long-Tailed Anomaly Detection with Learnable Class Names

Chih-Hui Ho, Kuan-Chuan Peng, Nuno Vasconcelos

CVPR 2024 poster arXiv:2312.00600

#1682

Improving Plasticity in Online Continual Learning via Collaborative Learning

Maorong Wang, Nicolas Michel, Ling Xiao et al.

ECCV 2024 poster arXiv:2311.11227

#1683

FedRA: A Random Allocation Strategy for Federated Tuning to Unleash the Power of Heterogeneous Clients

Shangchao Su, Bin Li, Xiangyang Xue

ICLR 2024 poster arXiv:2404.09403

#1684

Neuro-Inspired Information-Theoretic Hierarchical Perception for Multimodal Learning

Xiongye Xiao, Gengshuo Liu, Gaurav Gupta et al.

ECCV 2024 poster arXiv:2405.11921

#1685

MirrorGaussian: Reflecting 3D Gaussians for Reconstructing Mirror Reflections

Jiayue Liu, Tang Xiao, Freeman Cheng et al.

#1686

Pre-training Sequence, Structure, and Surface Features for Comprehensive Protein Representation Learning

Youhan Lee, Hasun Yu, Jaemyung Lee et al.

ICLR 2024 poster

ICLR 2024 poster arXiv:2310.02998

#1687

ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models

Yi-Lin Sung, Jaehong Yoon, Mohit Bansal

AAAI 2024 paper arXiv:2401.06176

#1688

GOODAT: Towards Test-Time Graph Out-of-Distribution Detection

Luzhi Wang, Di Jin, He Zhang et al.

ECCV 2024 poster arXiv:2408.14371

#1689

SelEx: Self-Expertise in Fine-Grained Generalized Category Discovery

Sarah Rastegar, Mohammadreza Salehi, Yuki M Asano et al.

AAAI 2024 paper arXiv:2308.08806

#1690

Self-Distillation Regularized Connectionist Temporal Classification Loss for Text Recognition: A Simple Yet Effective Approach

Ziyin Zhang, Ning Lu, Minghui Liao et al.

CVPR 2024 poster arXiv:2402.19082

#1691

VideoMAC: Video Masked Autoencoders Meet ConvNets

Gensheng Pei, Tao Chen, Xiruo Jiang et al.

CVPR 2024 poster arXiv:2302.06637

#1692

PerAda: Parameter-Efficient Federated Learning Personalization with Generalization Guarantees

Chulin Xie, De-An Huang, Wenda Chu et al.

AAAI 2024 paper arXiv:2401.14729

#1693

Sketch and Refine: Towards Fast and Accurate Lane Detection

Chao Chen, Jie Liu, Chang Zhou et al.

#1694

Gaussian Frosting: Editable Complex Radiance Fields with Real-Time Rendering

Antoine Guedon, Vincent Lepetit

ECCV 2024 poster arXiv:2312.05915

#1695

Diffusion for Natural Image Matting

Yihan Hu, Yiheng Lin, Wei Wang et al.

#1696

One-Class Face Anti-spoofing via Spoof Cue Map-Guided Feature Learning

Pei-Kai Huang, Cheng-Hsuan Chiang, Tzu-Hsien Chen et al.

CVPR 2024 poster arXiv:2404.00234

#1697

Grid Diffusion Models for Text-to-Video Generation

Taegyeong Lee, Soyeong Kwon, Taehwan Kim

ECCV 2024 poster arXiv:2405.18483

#1698

Towards Open Domain Text-Driven Synthesis of Multi-Person Motions

Shan Mengyi, Lu Dong, Yutao Han et al.

CVPR 2024 poster arXiv:2404.05657

#1699

MLP Can Be A Good Transformer Learner

Sihao Lin, Pumeng Lyu, Dongrui Liu et al.

AAAI 2024 paper arXiv:2312.15911

#1700

Generating and Reweighting Dense Contrastive Patterns for Unsupervised Anomaly Detection

Songmin Dai, Yifan Wu, Xiaoqiang Li et al.

ECCV 2024 poster arXiv:2311.17891

#1701

A Graph-Based Approach for Category-Agnostic Pose Estimation

Or Hirschorn, Shai Avidan

ECCV 2024 poster arXiv:2311.12066

#1702

EditShield: Protecting Unauthorized Image Editing by Instruction-guided Diffusion Models

Ruoxi Chen, Haibo Jin, Yixin Liu et al.

#1703

FlowTrack: Revisiting Optical Flow for Long-Range Dense Tracking

Seokju Cho, Gabriel Huang, Seungryong Kim et al.

ICLR 2024 poster arXiv:2311.01885

#1704

Domain Randomization via Entropy Maximization

Gabriele Tiboni, Pascal Klink, Jan Peters et al.

CVPR 2024 poster arXiv:2404.04231

#1705

Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation

Ji-Jia Wu, Andy Chia-Hao Chang, Chieh-Yu Chuang et al.

ICLR 2024 poster arXiv:2307.13883

#1706

ExeDec: Execution Decomposition for Compositional Generalization in Neural Program Synthesis

Kensen Shi, Joey Hong, Yinlin Deng et al.

ICLR 2024 poster arXiv:2309.06651

#1707

ConR: Contrastive Regularizer for Deep Imbalanced Regression

Mahsa Keramati, Lili Meng, R. Evans

AAAI 2024 paper arXiv:2312.14066

#1708

Upper Bounding Barlow Twins: A Novel Filter for Multi-Relational Clustering

Xiaowei Qian, Bingheng Li, Zhao Kang

AAAI 2024 paper arXiv:2402.18233

#1709

Zero-Shot Aerial Object Detection with Visual Description Regularization

Chenyu Lin, Zhengqing Zang, Chenwei Tang et al.

AAAI 2024 paper arXiv:2312.12703

#1710

Federated Learning with Extremely Noisy Clients via Negative Distillation

Yang Lu, Lin Chen, Yonggang Zhang et al.

AAAI 2024 paper arXiv:2308.11971

#1711

EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE

Junyi Chen, Longteng Guo, Jia Sun et al.

AAAI 2024 paper arXiv:2401.07709

#1712

Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks

Siyu Zou, Jiji Tang, Yiyi Zhou et al.

CVPR 2024 poster arXiv:2403.07222

#1713

You'll Never Walk Alone: A Sketch and Text Duet for Fine-Grained Image Retrieval

Subhadeep Koley, Ayan Kumar Bhunia, Aneeshan Sain et al.

CVPR 2024 poster arXiv:2404.02755

#1714

DIBS: Enhancing Dense Video Captioning with Unlabeled Videos via Pseudo Boundary Enrichment and Online Refinement

Hao Wu, Huabin Liu, Yu Qiao et al.

#1715

Improved Self-Training for Test-Time Adaptation

Jing Ma

#1716

Dual Prior Unfolding for Snapshot Compressive Imaging

Jiancheng Zhang, Haijin Zeng, Jiezhang Cao et al.

ECCV 2024 poster arXiv:2407.10625

#1717

WildVidFit: Video Virtual Try-On in the Wild via Image-Based Controlled Diffusion Models

Zijian He, Peixin Chen, Guangrun Wang et al.

#1718

Loose Inertial Poser: Motion Capture with IMU-attached Loose-Wear Jacket

Chengxu Zuo, Yiming Wang, Lishuang Zhan et al.

ICML 2024 poster arXiv:2406.00670

#1719

Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation

Yunheng Li, Zhong-Yu Li, Quan-Sheng Zeng et al.

CVPR 2024 poster arXiv:2404.04819

#1720

Joint Reconstruction of 3D Human and Object via Contact-Based Refinement Transformer

Hyeongjin Nam, Daniel Jung, Gyeongsik Moon et al.

AAAI 2024 paper arXiv:2308.09595

#1721

Minimum Coverage Sets for Training Robust Ad Hoc Teamwork Agents

Arrasy Rahman, Jiaxun Cui, Peter Stone

ECCV 2024 poster arXiv:2404.02517

#1722

HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras

Zhongyu Xia, ZhiWei Lin, Xinhao Wang et al.

ICLR 2024 poster arXiv:2306.05272

#1723

Image Clustering via the Principle of Rate Reduction in the Age of Pretrained Models

Tianzhe Chu, Shengbang Tong, Tianjiao Ding et al.

AAAI 2024 paper arXiv:2308.12608

#1724

HR-Pro: Point-Supervised Temporal Action Localization via Hierarchical Reliability Propagation

Huaxin Zhang, Xiang Wang, Xiaohao Xu et al.

#1725

RPSC: Robust Pseudo-Labeling for Semantic Clustering

Sihang Liu, Wenming Cao, Ruigang Fu et al.

ECCV 2024 poster arXiv:2404.06451

#1726

SmartControl: Enhancing ControlNet for Handling Rough Visual Conditions

Xiaoyu Liu, Yuxiang Wei, Ming Liu et al.

#1727

Diffusion Language-Shapelets for Semi-supervised Time-Series Classification

Zhen Liu, Wenbin Pei, Disen Lan et al.

CVPR 2024 poster arXiv:2403.17387

#1728

Decoupled Pseudo-labeling for Semi-Supervised Monocular 3D Object Detection

Jiacheng Zhang, Jiaming Li, Xiangru Lin et al.

CVPR 2024 poster arXiv:2303.16783

#1729

Exploring Efficient Asymmetric Blind-Spots for Self-Supervised Denoising in Real-World Scenarios

Shiyan Chen, Jiyuan Zhang, Zhaofei Yu et al.

CVPR 2024 poster arXiv:2312.09523

#1730

DriveTrack: A Benchmark for Long-Range Point Tracking in Real-World Videos

Arjun Balasingam, Joseph Chandler, Chenning Li et al.

AAAI 2024 paper arXiv:2304.01644

#1731

Repeated Fair Allocation of Indivisible Items

Ayumi Igarashi, Martin Lackner, Oliviero Nardi et al.

CVPR 2024 poster arXiv:2404.17753

#1732

Leveraging Cross-Modal Neighbor Representation for Improved CLIP Classification

Chao Yi, Lu Ren, De-Chuan Zhan et al.

ECCV 2024 poster arXiv:2407.08536

#1733

Exemplar-free Continual Representation Learning via Learnable Drift Compensation

Alex Gomez-Villa, Dipam Goswami, Kai Wang et al.

CVPR 2024 poster arXiv:2412.10651

#1734

LAN: Learning to Adapt Noise for Image Denoising

Changjin Kim, Tae Hyun Kim, Sungyong Baik

AAAI 2024 paper arXiv:2403.16561

#1735

FedFixer: Mitigating Heterogeneous Label Noise in Federated Learning

Xinyuan Ji, Zhaowei Zhu, Wei Xi et al.

#1736

Any2Point: Empowering Any-modality Transformers for Efficient 3D Understanding

Yiwen Tang, Renrui Zhang, Jiaming Liu et al.

CVPR 2024 poster arXiv:2403.06946

#1737

Split to Merge: Unifying Separated Modalities for Unsupervised Domain Adaptation

Xinyao Li, Yuke Li, Zhekai Du et al.

ECCV 2024 poster arXiv:2407.11174

#1738

iHuman: Instant Animatable Digital Humans From Monocular Videos

Pramish Paudel, Anubhav Khanal, Danda Pani Paudel et al.

ECCV 2024 poster arXiv:2402.18293

#1739

Continuous Memory Representation for Anomaly Detection

Joo Chan Lee, Taejune Kim, Eunbyung Park et al.

CVPR 2024 poster arXiv:2403.19164

#1740

RecDiffusion: Rectangling for Image Stitching with Diffusion Models

Tianhao Zhou, Li Haipeng, Ziyi Wang et al.

CVPR 2024 poster arXiv:2404.00292

#1741

LAKE-RED: Camouflaged Images Generation by Latent Background Knowledge Retrieval-Augmented Diffusion

Pancheng Zhao, Peng Xu, Pengda Qin et al.

#1742

FusionFormer: A Concise Unified Feature Fusion Transformer for 3D Pose Estimation

Yanlu Cai, Weizhong Zhang, Yuan Wu et al.

CVPR 2024 poster arXiv:2312.13313

#1743

ParamISP: Learned Forward and Inverse ISPs using Camera Parameters

Woohyeok Kim, Geonu Kim, Junyong Lee et al.

ICLR 2024 poster arXiv:2310.20082

#1744

Efficient Subgraph GNNs by Learning Effective Selection Policies

Beatrice Bevilacqua, Moshe Eliasof, Eli Meirom et al.

ECCV 2024 poster arXiv:2312.11595

#1745

SPIRE: Semantic Prompt-Driven Image Restoration

Chenyang Qi, Zhengzhong Tu, Keren Ye et al.

ICLR 2024 poster arXiv:2312.15023

#1746

Federated Q-Learning: Linear Regret Speedup with Low Communication Cost

Zhong Zheng, Fengyu Gao, Lingzhou Xue et al.

CVPR 2024 poster arXiv:2312.03033

#1747

LiDAR-based Person Re-identification

Wenxuan Guo, Zhiyu Pan, Yingping Liang et al.

ECCV 2024 poster arXiv:2408.01291

#1748

TexGen: Text-Guided 3D Texture Generation with Multi-view Sampling and Resampling

Dong Huo, Zixin Guo, Xinxin Zuo et al.

ECCV 2024 poster arXiv:2403.06381

#1749

Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models

Yang Zhang, Tze Tzun Teoh, Wei Hern Lim et al.

ECCV 2024 poster arXiv:2403.15612

#1750

InterFusion: Text-Driven Generation of 3D Human-Object Interaction

Sisi Dai, Wenhao Li, Haowen Sun et al.

AAAI 2024 paper arXiv:2312.08939

#1751

EAT: Towards Long-Tailed Out-of-Distribution Detection

Tong Wei, Bo-Lin Wang, Min-Ling Zhang

ECCV 2024 poster arXiv:2310.01324

#1752

ZeroI2V: Zero-Cost Adaptation of Pre-Trained Transformers from Image to Video

Xinhao Li, Yuhan Zhu, Limin Wang

CVPR 2024 poster arXiv:2303.02635

#1753

VTQA: Visual Text Question Answering via Entity Alignment and Cross-Media Reasoning

Kang Chen, Xiangqian Wu

ICLR 2024 poster arXiv:2310.13061

#1754

To Grok or not to Grok: Disentangling Generalization and Memorization on Corrupted Algorithmic Datasets

Darshil Doshi, Aritra Das, Tianyu He et al.

#1755

Deep Incomplete Multi-View Learning Network with Insufficient Label Information

Zhangqi Jiang, Tingjin Luo, Xinyan Liang

ICLR 2024 spotlight arXiv:2302.06430

#1756

Deep Orthogonal Hypersphere Compression for Anomaly Detection

Yunhe Zhang, Yan Sun, Jinyu Cai et al.

CVPR 2024 highlight arXiv:2403.19976

#1757

eTraM: Event-based Traffic Monitoring Dataset

Aayush Atul Verma, Bharatesh Chakravarthi, Arpitsinh Vaghela et al.

AAAI 2024 paper arXiv:2312.15820

#1758

WebVLN: Vision-and-Language Navigation on Websites

Qi Chen, Dileepa Pitawela, Chongyang Zhao et al.

CVPR 2024 poster arXiv:2311.10605

#1759

CA-Jaccard: Camera-aware Jaccard Distance for Person Re-identification

Yiyu Chen, Zheyi Fan, Zhaoru Chen et al.

CVPR 2024 poster arXiv:2312.03611

#1760

DreamComposer: Controllable 3D Object Generation via Multi-View Conditions

Yunhan Yang, Yukun Huang, Xiaoyang Wu et al.

ECCV 2024 poster arXiv:2404.12922

#1761

Is Retain Set All You Need in Machine Unlearning? Restoring Performance of Unlearned Models with Out-Of-Distribution Images

Jacopo Bonato, Marco Cotogni, Luigi Sabetta

#1762

Improving Transferable Targeted Adversarial Attacks with Model Self-Enhancement

Han Wu, Guanyan Ou, Weibin Wu et al.

CVPR 2024 poster arXiv:2312.08568

#1763

NViST: In the Wild New View Synthesis from a Single Image with Transformers

Wonbong Jang, Lourdes Agapito

AAAI 2024 paper arXiv:2312.03212

#1764

Constrained Bayesian Optimization under Partial Observations: Balanced Improvements and Provable Convergence

Shengbo Wang, Ke Li

ICLR 2024 oral arXiv:2311.00136

#1765

Neuroformer: Multimodal and Multitask Generative Pretraining for Brain Data

Antonis Antoniades, Yiyi Yu, Joe Canzano et al.

ECCV 2024 poster arXiv:2407.08711

#1766

OmniNOCS: A unified NOCS dataset and model for 3D lifting of 2D objects

Akshay Krishnan, Abhijit Kundu, Kevis Maninis et al.

ECCV 2024 poster arXiv:2408.14916

#1767

Towards Real-world Event-guided Low-light Video Enhancement and Deblurring

Taewoo Kim, Jaeseok Jeong, Hoonhee Cho et al.

CVPR 2024 poster arXiv:2402.18975

#1768

Theoretically Achieving Continuous Representation of Oriented Bounding Boxes

Zikai Xiao, Guo-Ye Yang, Xue Yang et al.

ECCV 2024 poster arXiv:2409.13430

#1769

CVT-Occ: Cost Volume Temporal Fusion for 3D Occupancy Prediction

Zhangchen Ye, Tao Jiang, Chenfeng Xu et al.

ECCV 2024 poster arXiv:2407.14257

#1770

SparseCraft: Few-Shot Neural Reconstruction through Stereopsis Guided Geometric Linearization

Mae Younes, Amine Ouasfi, Adnane Boukhayma

CVPR 2024 poster arXiv:2311.16739

#1771

As-Plausible-As-Possible: Plausibility-Aware Mesh Deformation Using 2D Diffusion Priors

Seungwoo Yoo, Kunho Kim, Vladimir G. Kim et al.

#1772

Distinguished In Uniform: Self-Attention Vs. Virtual Nodes

Eran Rosenbluth, Jan Tönshoff, Martin Ritzert et al.

ICLR 2024 poster

#1773

Discriminability-Driven Channel Selection for Out-of-Distribution Detection

Yue Yuan, Rundong He, Yicong Dong et al.

CVPR 2024 poster arXiv:2312.12478

#1774

ProS: Prompting-to-simulate Generalized knowledge for Universal Cross-Domain Retrieval

Fang Kaipeng, Jingkuan Song, Lianli Gao et al.

ECCV 2024 poster arXiv:2406.09272

#1775

Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos

Changan Chen, Puyuan Peng, Ami Baid et al.

ECCV 2024 poster arXiv:2407.07427

#1776

Unified Embedding Alignment for Open-Vocabulary Video Instance Segmentation

Hao Fang, Peng Wu, Yawei Li et al.

ECCV 2024 poster arXiv:2401.17258

#1777

You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation

Mehdi Noroozi, Isma Hadji, Brais Martinez et al.

CVPR 2024 poster arXiv:2311.18129

#1778

Mixed-Precision Quantization for Federated Learning on Resource-Constrained Heterogeneous Devices

Huancheng Chen, Haris Vikalo

CVPR 2024 poster arXiv:2403.19067

#1779

Low-Rank Rescaled Vision Transformer Fine-Tuning: A Residual Design Approach

Wei Dong, Xing Zhang, Bihui Chen et al.

ECCV 2024 poster arXiv:2407.11344

#1780

Centering the Value of Every Modality: Towards Efficient and Resilient Modality-agnostic Semantic Segmentation

Xu Zheng, Yuanhuiyi Lyu, Jiazhou Zhou et al.

CVPR 2024 poster arXiv:2404.00915

#1781

Scalable 3D Registration via Truncated Entry-wise Absolute Residuals

Tianyu Huang, Liangzu Peng, Rene Vidal et al.

ECCV 2024 poster arXiv:2403.11021

#1782

Towards Neuro-Symbolic Video Understanding

Minkyu Choi, Harsh Goel, Mohammad Omama et al.

#1783

Contourlet Residual for Prompt Learning Enhanced Infrared Image Super-Resolution

Xingyuan Li, Jinyuan Liu, Zhixin Chen et al.

ECCV 2024 poster arXiv:2402.12688

#1784

Robust-Wide: Robust Watermarking against Instruction-driven Image Editing

Runyi Hu, Jie Zhang, Ting Xu et al.

CVPR 2024 poster arXiv:2404.18135

#1785

Dexterous Grasp Transformer

Guo-Hao Xu, Yi-Lin Wei, Dian Zheng et al.

ECCV 2024 poster arXiv:2403.13965

#1786

ConGeo: Robust Cross-view Geo-localization across Ground View Variations

Li Mi, Chang Xu, Javiera Castillo Navarro et al.

#1787

Robust Image Denoising through Adversarial Frequency Mixup

Donghun Ryou, Inju Ha, Hyewon Yoo et al.

ICLR 2024 poster arXiv:2305.18702

#1788

Adversarial Adaptive Sampling: Unify PINN and Optimal Transport for the Approximation of PDEs

Kejun Tang, Jiayu Zhai, Xiaoliang Wan et al.

CVPR 2024 highlight arXiv:2312.00065

#1789

Unsupervised Keypoints from Pretrained Diffusion Models

Eric Hedlin, Gopal Sharma, Shweta Mahajan et al.

ICLR 2024 poster arXiv:2402.11140

#1790

Boosting of Thoughts: Trial-and-Error Problem Solving with Large Language Models

Sijia Chen, Baochun Li, Di Niu

ECCV 2024 poster arXiv:2312.09313

#1791

LatentEditor: Text Driven Local Editing of 3D Scenes

Umar Khalid, Hasan Iqbal, Muhammad Tayyab et al.

#1792

ReGCL: Rethinking Message Passing in Graph Contrastive Learning

Cheng Ji, Zixuan Huang, Qingyun Sun et al.

CVPR 2024 highlight arXiv:2401.02416

#1793

ODIN: A Single Model for 2D and 3D Segmentation

Ayush Jain, Pushkal Katara, Nikolaos Gkanatsios et al.

ICLR 2024 spotlight arXiv:2311.11321

#1794

Bounds on Representation-Induced Confounding Bias for Treatment Effect Estimation

Valentyn Melnychuk, Dennis Frauen, Stefan Feuerriegel

ECCV 2024 poster arXiv:2403.18187

#1795

LayoutFlow: Flow Matching for Layout Generation

Julian Jorge Andrade Guerreiro, Naoto Inoue, Kento Masui et al.

ECCV 2024 poster arXiv:2407.03200

#1796

SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding

Weitai Kang, Gaowen Liu, Shah Mubarak et al.

CVPR 2024 poster arXiv:2402.17062

#1797

HOISDF: Constraining 3D Hand-Object Pose Estimation with Global Signed Distance Fields

Haozhe Qi, Chen Zhao, Mathieu Salzmann et al.

ECCV 2024 poster arXiv:2403.19580

#1798

OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propagation

Zhenyu Wang, Ya-Li Li, Taichi Liu et al.

ECCV 2024 poster arXiv:2405.17609

#1799

GarmentCodeData: A Dataset of 3D Made-to-Measure Garments With Sewing Patterns

Maria Korosteleva, Timur Levent Kesdogan, Fabian Kemper et al.

CVPR 2024 highlight arXiv:2404.10438

#1800

The Unreasonable Effectiveness of Pre-Trained Features for Camera Pose Refinement

Gabriele Trivigno, Carlo Masone, Barbara Caputo et al.