Most Cited 2024 "fourier embedding" Papers

12,324 papers found

#2601

Zero-Shot Structure-Preserving Diffusion Model for High Dynamic Range Tone Mapping

Ruoxi Zhu, Shusong Xu, Peiye Liu et al.

CVPR 2024 • highlight
10 citations
#2602

Uncertainty-aware sign language video retrieval with probability distribution modeling

Xuan Wu, Hongxiang Li, Yuanjiang Luo et al.

ECCV 2024 • poster • arXiv:2405.19689
10 citations
#2603

MAGR: Manifold-Aligned Graph Regularization for Continual Action Quality Assessment

Kanglei Zhou, Liyuan Wang, Xingxing Zhang et al.

ECCV 2024 • poster • arXiv:2403.04398
10 citations
#2604

Learning Decentralized Partially Observable Mean Field Control for Artificial Collective Behavior

Kai Cui, Sascha Hauck, Christian Fabian et al.

ICLR 2024 • poster • arXiv:2307.06175
10 citations
#2605

Spectrum AUC Difference (SAUCD): Human-aligned 3D Shape Evaluation

Tianyu Luan, Zhong Li, Lele Chen et al.

CVPR 2024 • poster • arXiv:2403.01619
10 citations
#2606

Improved Bandits in Many-to-One Matching Markets with Incentive Compatibility

Fang Kong, Shuai Li

AAAI 2024 • paper • arXiv:2401.01528
10 citations
#2607

Cumulative Regret Analysis of the Piyavskii–Shubert Algorithm and Its Variants for Global Optimization

Kaan Gokcesu, Hakan Gökcesu

AAAI 2024 • paper • arXiv:2108.10859
10 citations
#2608

BEAF: Observing BEfore-AFter Changes to Evaluate Hallucination in Vision-language Models

Ye-Bin Moon, Nam Hyeon-Woo, Wonseok Choi et al.

ECCV 2024 • poster • arXiv:2407.13442
10 citations
#2609

ContextSeg: Sketch Semantic Segmentation by Querying the Context with Attention

Jiawei Wang, Changjian Li

CVPR 2024 • poster • arXiv:2311.16682
10 citations
#2610

Multiscale Sliced Wasserstein Distances as Perceptual Color Difference Measures

Jiaqi He, Zhihua Wang, Leon Wang et al.

ECCV 2024 • poster • arXiv:2407.10181
10 citations
#2611

PARSAC: Accelerating Robust Multi-Model Fitting with Parallel Sample Consensus

Florian Kluger, Bodo Rosenhahn

AAAI 2024 • paper • arXiv:2401.14919
10 citations
#2612

Understanding and Improving Optimization in Predictive Coding Networks

Nicholas Alonso, Jeffrey Krichmar, Emre Neftci

AAAI 2024 • paper • arXiv:2305.13562
10 citations
#2613

Integrating Efficient Optimal Transport and Functional Maps For Unsupervised Shape Correspondence Learning

Tung Le, Khai Nguyen, Shanlin Sun et al.

CVPR 2024 • poster • arXiv:2403.01781
10 citations
#2614

MemoNav: Working Memory Model for Visual Navigation

Hongxin Li, Zeyu Wang, Xu Yang et al.

CVPR 2024 • highlight • arXiv:2402.19161
10 citations
#2615

Idempotent Unsupervised Representation Learning for Skeleton-Based Action Recognition

Lilang Lin, Lehong Wu, Jiahang Zhang et al.

ECCV 2024 • poster • arXiv:2410.20349
10 citations
#2616

PMT: Progressive Mean Teacher via Exploring Temporal Consistency for Semi-Supervised Medical Image Segmentation

Ning Gao, Sanping Zhou, Le Wang et al.

ECCV 2024 • poster • arXiv:2409.05122
10 citations
#2617

Leveraging Thermal Modality to Enhance Reconstruction in Low-Light Conditions

Jiacong Xu, Mingqian Liao, Ram Prabhakar Kathirvel et al.

ECCV 2024 • poster • arXiv:2403.14053
10 citations
#2618

Neural Point Cloud Diffusion for Disentangled 3D Shape and Appearance Generation

Philipp Schröppel, Christopher Wewer, Jan Lenssen et al.

CVPR 2024 • poster • arXiv:2312.14124
10 citations
#2619

Graph Neural Network Causal Explanation via Neural Causal Models

Arman Behnam, Binghui Wang

ECCV 2024 • poster • arXiv:2407.09378
10 citations
#2620

SC-NeuS: Consistent Neural Surface Reconstruction from Sparse and Noisy Views

Shi-Sheng Huang, Zixin Zou, Yichi Zhang et al.

AAAI 2024 • paper • arXiv:2307.05892
10 citations
#2621

YolOOD: Utilizing Object Detection Concepts for Multi-Label Out-of-Distribution Detection

Alon Zolfi, Guy Amit, Amit Baras et al.

CVPR 2024 • poster • arXiv:2212.02081
10 citations
#2622

Text-guided Explorable Image Super-resolution

Kanchana Vaishnavi Gandikota, Paramanand Chandramouli

CVPR 2024 • poster • arXiv:2403.01124
10 citations
#2623

Emerging Property of Masked Token for Effective Pre-training

Hyesong Choi, Hunsang Lee, Seyoung Joung et al.

ECCV 2024 • poster • arXiv:2404.08330
10 citations
#2624

RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception

Shen Jianbing, Chunliang Li, Wencheng Han et al.

ECCV 2024 • poster • arXiv:2407.10876
10 citations
#2625

VVS: Video-to-Video Retrieval with Irrelevant Frame Suppression

Won Jo, Geuntaek Lim, Gwangjin Lee et al.

AAAI 2024 • paper • arXiv:2303.08906
10 citations
#2626

Demystifying Poisoning Backdoor Attacks from a Statistical Perspective

Ganghua Wang, Xun Xian, Ashish Kundu et al.

ICLR 2024 • poster • arXiv:2310.10780
10 citations
#2627

OctOcc: High-Resolution 3D Occupancy Prediction with Octree

Wenzhe Ouyang, Xiaolin Song, Bailan Feng et al.

AAAI 2024 • paper
10 citations
#2628

Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data

Tuo Feng, Wenguan Wang, Ruijie Quan et al.

ECCV 2024 • poster • arXiv:2407.10200
10 citations
#2629

Motion and Structure from Event-based Normal Flow

Zhongyang Ren, Bangyan Liao, Delei Kong et al.

ECCV 2024 • poster • arXiv:2407.12239
10 citations
#2630

Dual-level Adaptive Self-Labeling for Novel Class Discovery in Point Cloud Segmentation

Ruijie Xu, Chuyu Zhang, Hui Ren et al.

ECCV 2024 • poster • arXiv:2407.12489
9 citations
#2631

TACIT: A Target-Agnostic Feature Disentanglement Framework for Cross-Domain Text Classification

Rui Song, Fausto Giunchiglia, Yingji Li et al.

AAAI 2024 • paper • arXiv:2312.17263
9 citations
#2632

Federated Contextual Cascading Bandits with Asynchronous Communication and Heterogeneous Users

Hantao Yang, Xutong Liu, Zhiyong Wang et al.

AAAI 2024 • paper • arXiv:2402.16312
9 citations
#2633

EventRPG: Event Data Augmentation with Relevance Propagation Guidance

Mingyuan Sun, Donghao Zhang, Zongyuan Ge et al.

ICLR 2024 • poster • arXiv:2403.09274
9 citations
#2634

Event-Aided Time-To-Collision Estimation for Autonomous Driving

Jinghang Li, Bangyan Liao, Xiuyuan Lu et al.

ECCV 2024 • poster • arXiv:2407.07324
9 citations
#2635

Learning Visual Abstract Reasoning through Dual-Stream Networks

Kai Zhao, Chang Xu, Bailu Si

AAAI 2024 • paper • arXiv:2411.19451
9 citations
#2636

RELI11D: A Comprehensive Multimodal Human Motion Dataset and Method

Ming Yan, Yan Zhang, Shuqiang Cai et al.

CVPR 2024 • poster • arXiv:2403.19501
9 citations
#2637

Hybrid Distillation: Connecting Masked Autoencoders with Contrastive Learners

Bowen Shi, Xiaopeng Zhang, Yaoming Wang et al.

ICLR 2024 • poster • arXiv:2306.15876
9 citations
#2638

LDReg: Local Dimensionality Regularized Self-Supervised Learning

Hanxun Huang, Ricardo Campello, Sarah Erfani et al.

ICLR 2024 • poster • arXiv:2401.10474
9 citations
#2639

Foundation Model-oriented Robustness: Robust Image Model Evaluation with Pretrained Models

Peiyan Zhang, Haoyang Liu, Chaozhuo Li et al.

ICLR 2024 • poster • arXiv:2308.10632
9 citations
#2640

3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance

Xiaoxu Xu, Yitian Yuan, Jinlong Li et al.

ECCV 2024 • poster • arXiv:2407.09826
9 citations
#2641

Enhancing Perceptual Quality in Video Super-Resolution through Temporally-Consistent Detail Synthesis using Diffusion Models

Claudio Rota, Marco Buzzelli, Joost Van de Weijer

ECCV 2024 • poster • arXiv:2311.15908
9 citations
#2642

TextField3D: Towards Enhancing Open-Vocabulary 3D Generation with Noisy Text Fields

Tianyu Huang, Yihan Zeng, Bowen Dong et al.

ICLR 2024 • poster • arXiv:2309.17175
9 citations
#2643

Efficient Axiomatization of OWL 2 EL Ontologies from Data by Means of Formal Concept Analysis

Francesco Kriegel

AAAI 2024 • paper
9 citations
#2644

Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in RL

Xiangyu Liu, Souradip Chakraborty, Yanchao Sun et al.

ICLR 2024 • poster • arXiv:2305.17342
9 citations
#2645

Link Prediction in Multilayer Networks via Cross-Network Embedding

Guojing Ren, Xiao Ding, Xiao-Ke Xu et al.

AAAI 2024 • paper
9 citations
#2646

Statewide Visual Geolocalization in the Wild

Florian Fervers, Sebastian Bullinger, Christoph Bodensteiner et al.

ECCV 2024 • poster • arXiv:2409.16763
9 citations
#2647

Axiomatic Aggregations of Abductive Explanations

Gagan Biradar, Yacine Izza, Elita Lobo et al.

AAAI 2024 • paper • arXiv:2310.03131
9 citations
#2648

GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning

Xiaojie Li, Yibo Yang, Xiangtai Li et al.

ECCV 2024 • poster • arXiv:2403.12003
9 citations
#2649

Noisy Interpolation Learning with Shallow Univariate ReLU Networks

Nirmit Joshi, Gal Vardi, Nathan Srebro

ICLR 2024 • spotlight • arXiv:2307.15396
9 citations
#2650

Learning Diffusion Models for Multi-View Anomaly Detection

Chieh Liu, Yu-Min Chu, Ting-I Hsieh et al.

ECCV 2024 • poster
9 citations
#2651

Make a Strong Teacher with Label Assistance: A Novel Knowledge Distillation Approach for Semantic Segmentation

Shoumeng Qiu, Jie Chen, Xinrun Li et al.

ECCV 2024 • poster • arXiv:2407.13254
9 citations
#2652

Comprehensive View Embedding Learning for Single-Cell Multimodal Integration

Zhenchao Tang, Jiehui Huang, Guanxing Chen et al.

AAAI 2024 • paper
9 citations
#2653

Poly-View Contrastive Learning

Amitis Shidani, R Devon Hjelm, Jason Ramapuram et al.

ICLR 2024 • poster • arXiv:2403.05490
9 citations
#2654

Self-supervised visual learning from interactions with objects

Arthur Aubret, Céline Teulière, Jochen Triesch

ECCV 2024 • poster • arXiv:2407.06704
9 citations
#2655

ZOOM: Learning Video Mirror Detection with Extremely-Weak Supervision

Ke Xu, Tsun Wai Siu, Rynson W.H. Lau

AAAI 2024 • paper
9 citations
#2656

Linear Log-Normal Attention with Unbiased Concentration

Yury Nahshan, Joseph Kampeas, Emir Haleva

ICLR 2024 • poster • arXiv:2311.13541
9 citations
#2657

SMILe: Leveraging Submodular Mutual Information For Robust Few-Shot Object Detection

Anay Majee, Ryan X Sharp, Rishabh Iyer

ECCV 2024 • poster • arXiv:2407.02665
9 citations
#2658

Equivariant Matrix Function Neural Networks

Ilyes Batatia, Lars Leon Schaaf, Gábor Csányi et al.

ICLR 2024 • spotlight • arXiv:2310.10434
9 citations
#2659

Multi-Domain Recommendation to Attract Users via Domain Preference Modeling

Hyunjun Ju, SeongKu Kang, Dongha Lee et al.

AAAI 2024 • paper • arXiv:2403.17374
9 citations
#2660

DNI: Dilutional Noise Initialization for Diffusion Video Editing

Sunjae Yoon, Gwanhyeong Koo, Ji Woo Hong et al.

ECCV 2024 • poster • arXiv:2409.13037
9 citations
#2661

Cs2K: Class-specific and Class-shared Knowledge Guidance for Incremental Semantic Segmentation

Wei Cong, Yang Cong, Yuyang Liu et al.

ECCV 2024 • poster • arXiv:2407.09047
9 citations
#2662

Factorized Diffusion Autoencoder for Unsupervised Disentangled Representation Learning

Ancong Wu, Wei-shi Zheng

AAAI 2024 • paper
9 citations
#2663

Bellman Optimal Stepsize Straightening of Flow-Matching Models

Bao Nguyen, Binh Nguyen, Viet Anh Nguyen

ICLR 2024 • poster • arXiv:2312.16414
9 citations
#2664

Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model

Danni Yang, Ruohan Dong, Jiayi Ji et al.

ECCV 2024 • poster • arXiv:2407.05352
9 citations
#2665

Coupling Graph Neural Networks with Fractional Order Continuous Dynamics: A Robustness Study

Qiyu Kang, Kai Zhao, Yang Song et al.

AAAI 2024 • paper • arXiv:2401.04331
9 citations
#2666

Unlocking the Power of Representations in Long-term Novelty-based Exploration

Alaa Saade, Steven Kapturowski, Daniele Calandriello et al.

ICLR 2024 • spotlight • arXiv:2305.01521
9 citations
#2667

CanonicalFusion: Generating Drivable 3D Human Avatars from Multiple Images

Jisu Shin, Junmyeong Lee, Seongmin Lee et al.

ECCV 2024 • poster • arXiv:2407.04345
9 citations
#2668

Unsupervised Group Re-identification via Adaptive Clustering-Driven Progressive Learning

Hongxu Chen, Quan Zhang, Jian-Huang Lai et al.

AAAI 2024 • paper
9 citations
#2669

Towards More Faithful Natural Language Explanation Using Multi-Level Contrastive Learning in VQA

Chengen Lai, Shengli Song, Shiqi Meng et al.

AAAI 2024 • paper • arXiv:2312.13594
9 citations
#2670

Enhancing Cross-Subject fMRI-to-Video Decoding with Global-Local Functional Alignment

Chong Li, Xuelin Qian, Yun Wang et al.

ECCV 2024 • poster
9 citations
#2671

Unraveling Batch Normalization for Realistic Test-Time Adaptation

Zixian Su, Jingwei Guo, Kai Yao et al.

AAAI 2024 • paper • arXiv:2312.09486
9 citations
#2672

Brain Netflix: Scaling Data to Reconstruct Videos from Brain Signals

Camilo Fosco, Benjamin Lahner, Bowen Pan et al.

ECCV 2024 • poster
9 citations
#2673

LG-Gaze: Learning Geometry-aware Continuous Prompts for Language-Guided Gaze Estimation

Pengwei Yin, Jingjing Wang, Guanzhong Zeng et al.

ECCV 2024 • poster • arXiv:2411.08606
9 citations
#2674

DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning

Xinghao Wang, Junliang He, Pengyu Wang et al.

AAAI 2024 • paper • arXiv:2401.13621
9 citations
#2675

A Unified Masked Autoencoder with Patchified Skeletons for Motion Synthesis

Esteve Valls Mascaro, Hyemin Ahn, Dongheui Lee

AAAI 2024 • paper • arXiv:2308.07301
9 citations
#2676

Perceptual Evaluation of Audio-Visual Synchrony Grounded in Viewers’ Opinion Scores

Lucas Goncalves, Prashant Mathur, Chandrashekhar Lavania et al.

ECCV 2024 • poster • arXiv:2404.07336
9 citations
#2677

A Primal-Dual Algorithm for Hybrid Federated Learning

Tom Overman, Garrett Blum, Diego Klabjan

AAAI 2024 • paper • arXiv:2210.08106
9 citations
#2678

MetaCap: Meta-learning Priors from Multi-View Imagery for Sparse-view Human Performance Capture and Rendering

Guoxing Sun, Rishabh Dabral, Pascal Fua et al.

ECCV 2024 • poster • arXiv:2403.18820
9 citations
#2679

MinD-3D: Reconstruct High-quality 3D objects in Human Brain

Jianxiong Gao, Yuqian Fu, Yun Wang et al.

ECCV 2024 • poster • arXiv:2312.07485
9 citations
#2680

Learning to Make Keypoints Sub-Pixel Accurate

Shinjeong Kim, Marc Pollefeys, Daniel Barath

ECCV 2024 • poster • arXiv:2407.11668
9 citations
#2681

Make Prompts Adaptable: Bayesian Modeling for Vision-Language Prompt Learning with Data-Dependent Prior

Youngjae Cho, HeeSun Bae, Seungjae Shin et al.

AAAI 2024 • paper • arXiv:2401.06799
9 citations
#2682

Depth-Guided Robust and Fast Point Cloud Fusion NeRF for Sparse Input Views

Shuai Guo, Qiuwen Wang, Yijie Gao et al.

AAAI 2024 • paper • arXiv:2403.02063
9 citations
#2683

A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives

Simone Alberto Peirone, Francesca Pistilli, Antonio Alliegro et al.

CVPR 2024 • poster • arXiv:2403.03037
9 citations
#2684

Quantized Prompt for Efficient Generalization of Vision-Language Models

Tianxiang Hao, Xiaohan Ding, Juexiao Feng et al.

ECCV 2024 • poster • arXiv:2407.10704
9 citations
#2685

DIUSum: Dynamic Image Utilization for Multimodal Summarization

Min Xiao, Junnan Zhu, Feifei Zhai et al.

AAAI 2024 • paper
9 citations
#2686

Decoupling Degradations with Recurrent Network for Video Restoration in Under-Display Camera

Chengxu Liu, Xuan Wang, Yuanting Fan et al.

AAAI 2024 • paper • arXiv:2403.05660
9 citations
#2687

Epitopological learning and Cannistraci-Hebb network shape intelligence brain-inspired theory for ultra-sparse advantage in deep learning

Yingtao Zhang, Jialin Zhao, Wenjing Wu et al.

ICLR 2024 • poster
9 citations
#2688

ZoLA: Zero-Shot Creative Long Animation Generation with Short Video Model

Fu-Yun Wang, Zhaoyang Huang, Qiang Ma et al.

ECCV 2024 • poster
9 citations
#2689

TriNeRFLet: A Wavelet Based Triplane NeRF Representation

Rajaei Khatib, Raja Giryes

ECCV 2024 • poster • arXiv:2401.06191
9 citations
#2690

ActionVOS: Actions as Prompts for Video Object Segmentation

Liangyang Ouyang, Ruicong Liu, Yifei Huang et al.

ECCV 2024 • poster • arXiv:2407.07402
9 citations
#2691

PQ-SAM: Post-training Quantization for Segment Anything Model

Xiaoyu Liu, Xin Ding, Lei Yu et al.

ECCV 2024 • poster
9 citations
#2692

EraseDraw: Learning to Insert Objects by Erasing Them from Images

Alper Canberk, Maksym Bondarenko, Ege Ozguroglu et al.

ECCV 2024 • poster
9 citations
#2693

RoScenes: A Large-scale Multi-view 3D Dataset for Roadside Perception

Xiaosu Zhu, Hualian Sheng, Sijia Cai et al.

ECCV 2024 • poster • arXiv:2405.09883
9 citations
#2694

PredBench: Benchmarking Spatio-Temporal Prediction across Diverse Disciplines

Zidong Wang, Zeyu Lu, Di Huang et al.

ECCV 2024 • poster • arXiv:2407.08418
9 citations
#2695

Rethinking Video Deblurring with Wavelet-Aware Dynamic Transformer and Diffusion Model

Chen Rao, Guangyuan Li, Zehua Lan et al.

ECCV 2024 • poster • arXiv:2408.13459
9 citations
#2696

Symbol as Points: Panoptic Symbol Spotting via Point-based Representation

Wenlong Liu, Tianyu Yang, Yuhan Wang et al.

ICLR 2024 • poster • arXiv:2401.10556
9 citations
#2697

Improved Active Learning via Dependent Leverage Score Sampling

Atsushi Shimizu, Xiaoou Cheng, Christopher Musco et al.

ICLR 2024 • poster • arXiv:2310.04966
9 citations
#2698

Arbitrary-Scale Point Cloud Upsampling by Voxel-Based Network with Latent Geometric-Consistent Learning

Hang Du, Xuejun Yan, Jingjing Wang et al.

AAAI 2024 • paper • arXiv:2403.05117
9 citations
#2699

RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting

Qi Wang, Ruijie Lu, Xudong Xu et al.

ECCV 2024 • poster • arXiv:2406.02461
9 citations
#2700

Embracing Language Inclusivity and Diversity in CLIP through Continual Language Learning

Bang Yang, Yong Dai, Xuxin Cheng et al.

AAAI 2024 • paper • arXiv:2401.17186
9 citations
#2701

Fully Convolutional Slice-to-Volume Reconstruction for Single-Stack MRI

Sean I. Young, Yaël Balbastre, Bruce Fischl et al.

CVPR 2024 • poster • arXiv:2312.03102
9 citations
#2702

NVS-Adapter: Plug-and-Play Novel View Synthesis from a Single Image

Yoonwoo Jeong, Jinwoo Lee, Chiheon Kim et al.

ECCV 2024 • poster • arXiv:2312.07315
9 citations
#2703

Efficient 3D Implicit Head Avatar with Mesh-anchored Hash Table Blendshapes

Ziqian Bai, Feitong Tan, Sean Fanello et al.

CVPR 2024 • poster • arXiv:2404.01543
9 citations
#2704

PiTe: Pixel-Temporal Alignment for Large Video-Language Model

Yang Liu, Pengxiang Ding, Siteng Huang et al.

ECCV 2024 • poster • arXiv:2409.07239
9 citations
#2705

Live and Learn: Continual Action Clustering with Incremental Views

Xiaoqiang Yan, Yingtao Gan, Yiqiao Mao et al.

AAAI 2024 • paper • arXiv:2404.07962
9 citations
#2706

Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression

Hancheng Ye, Chong Yu, Peng Ye et al.

CVPR 2024 • poster • arXiv:2403.15835
9 citations
#2707

On Pretraining Data Diversity for Self-Supervised Learning

Hasan Abed El Kader Hammoud, Tuhin Das, Fabio Pizzati et al.

ECCV 2024 • poster • arXiv:2403.13808
9 citations
#2708

Relation Rectification in Diffusion Model

Yinwei Wu, Xingyi Yang, Xinchao Wang

CVPR 2024 • poster • arXiv:2403.20249
9 citations
#2709

A Unified and Interpretable Emotion Representation and Expression Generation

Reni Paskaleva, Mykyta Holubakha, Andela Ilic et al.

CVPR 2024 • poster • arXiv:2404.01243
9 citations
#2710

Semantics-aware Motion Retargeting with Vision-Language Models

Haodong Zhang, ZhiKe Chen, Haocheng Xu et al.

CVPR 2024 • poster • arXiv:2312.01964
9 citations
#2711

CDFormer: When Degradation Prediction Embraces Diffusion Model for Blind Image Super-Resolution

Qingguo Liu, Chenyi Zhuang, Pan Gao et al.

CVPR 2024 • poster
9 citations
#2712

PTMQ: Post-training Multi-Bit Quantization of Neural Networks

Ke Xu, Zhongcheng Li, Shanshan Wang et al.

AAAI 2024 • paper
9 citations
#2713

BK-SDM: A Lightweight, Fast, and Cheap Version of Stable Diffusion

Bo-Kyeong Kim, Hyoung-Kyu Song, Thibault Castells et al.

ECCV 2024 • poster • arXiv:2305.15798
9 citations
#2714

Knowledge Guided Semi-supervised Learning for Quality Assessment of User Generated Videos

Shankhanil Mitra, Rajiv Soundararajan

AAAI 2024 • paper • arXiv:2312.15425
9 citations
#2715

Evidential Uncertainty-Guided Mitochondria Segmentation for 3D EM Images

Ruohua Shi, Lingyu Duan, Tiejun Huang et al.

AAAI 2024 • paper
9 citations
#2716

Self-Adapting Large Visual-Language Models to Edge Devices across Visual Modalities

Kaiwen Cai, ZheKai Duan, Gaowen Liu et al.

ECCV 2024 • poster • arXiv:2403.04908
9 citations
#2717

Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing

Wonjun Kang, Kevin Galim, Hyung Il Koo

ECCV 2024 • poster • arXiv:2403.09468
9 citations
#2718

3D Small Object Detection with Dynamic Spatial Pruning

Xiuwei Xu, Zhihao Sun, Ziwei Wang et al.

ECCV 2024 • poster • arXiv:2305.03716
9 citations
#2719

FairDeDup: Detecting and Mitigating Vision-Language Fairness Disparities in Semantic Dataset Deduplication

Eric Slyman, Stefan Lee, Scott Cohen et al.

CVPR 2024 • poster • arXiv:2404.16123
9 citations
#2720

TurboSL: Dense, Accurate and Fast 3D by Neural Inverse Structured Light

Parsa Mirdehghan, Maxx Wu, Wenzheng Chen et al.

CVPR 2024 • poster
9 citations
#2721

Adaptive Window Pruning for Efficient Local Motion Deblurring

Haoying Li, Jixin Zhao, Shangchen Zhou et al.

ICLR 2024 • poster • arXiv:2306.14268
9 citations
#2722

GS-Pose: Category-Level Object Pose Estimation via Geometric and Semantic Correspondence

Pengyuan Wang, Takuya Ikeda, Robert Lee et al.

ECCV 2024 • poster • arXiv:2311.13777
9 citations
#2723

GenRC: Generative 3D Room Completion from Sparse Image Collections

Ming-Feng Li, Yueh-Feng Ku, Hong-Xuan Yen et al.

ECCV 2024 • poster • arXiv:2407.12939
9 citations
#2724

X-Former: Unifying Contrastive and Reconstruction Learning for MLLMs

Swetha Sirnam, Jinyu Yang, Tal Neiman et al.

ECCV 2024 • poster • arXiv:2407.13851
9 citations
#2725

Neural Super-Resolution for Real-time Rendering with Radiance Demodulation

Jia Li, Ziling Chen, Xiaolong Wu et al.

CVPR 2024 • poster • arXiv:2308.06699
9 citations
#2726

Layer-Wise Relevance Propagation with Conservation Property for ResNet

Seitaro Otsuki, Tsumugi Iida, Félix Doublet et al.

ECCV 2024 • poster • arXiv:2407.09115
9 citations
#2727

BOK-VQA: Bilingual Outside Knowledge-Based Visual Question Answering via Graph Representation Pretraining

Minjun Kim, SeungWoo Song, Youhan Lee et al.

AAAI 2024 • paper • arXiv:2401.06443
9 citations
#2728

Improving Physics-Augmented Continuum Neural Radiance Field-Based Geometry-Agnostic System Identification with Lagrangian Particle Optimization

Takuhiro Kaneko

CVPR 2024 • poster • arXiv:2406.04155
9 citations
#2729

Understanding Expressivity of GNN in Rule Learning

Haiquan Qiu, Yongqi Zhang, Yong Li et al.

ICLR 2024 • poster • arXiv:2303.12306
9 citations
#2730

Active Object Detection with Knowledge Aggregation and Distillation from Large Models

Dejie Yang, Yang Liu

CVPR 2024 • poster • arXiv:2405.12509
9 citations
#2731

Multi-Dimensional Fair Federated Learning

Cong Su, Guoxian Yu, Jun Wang et al.

AAAI 2024 • paper • arXiv:2312.05551
9 citations
#2732

Learning Subject-Aware Cropping by Outpainting Professional Photos

James Hong, Lu Yuan, Michaël Gharbi et al.

AAAI 2024 • paper • arXiv:2312.12080
9 citations
#2733

Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation

Jiawei Han, Kaiqi Liu, Wei Li et al.

ECCV 2024 • poster • arXiv:2408.10537
9 citations
#2734

One More Step: A Versatile Plug-and-Play Module for Rectifying Diffusion Schedule Flaws and Enhancing Low-Frequency Controls

Minghui Hu, Jianbin Zheng, Chuanxia Zheng et al.

CVPR 2024 • poster • arXiv:2311.15744
9 citations
#2735

O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation

Muer Tie, Julong Wei, Zhengjun Wang et al.

ECCV 2024 • poster • arXiv:2404.06836
9 citations
#2736

Fusing Personal and Environmental Cues for Identification and Segmentation of First-Person Camera Wearers in Third-Person Views

Ziwei Zhao, Yuchen Wang, Chuhua Wang

CVPR 2024 • poster
9 citations
#2737

Memory-Scalable and Simplified Functional Map Learning

Robin Magnet, Maks Ovsjanikov

CVPR 2024 • poster • arXiv:2404.00330
9 citations
#2738

High-Quality Facial Geometry and Appearance Capture at Home

Yuxuan Han, Junfeng Lyu, Feng Xu

CVPR 2024 • poster • arXiv:2312.03442
9 citations
#2739

Compress3D: a Compressed Latent Space for 3D Generation from a Single Image

Bowen Zhang, Tianyu Yang, Yu Li et al.

ECCV 2024 • poster • arXiv:2403.13524
9 citations
#2740

Intrinsic Phase-Preserving Networks for Depth Super Resolution

Xuanhong Chen, Hang Wang, Jinfan Liu et al.

AAAI 2024 • paper
9 citations
#2741

Clockwork Diffusion: Efficient Generation With Model-Step Distillation

Amirhossein Habibian, Amir Ghodrati, Noor Fathima et al.

CVPR 2024 • highlight • arXiv:2312.08128
9 citations
#2742

DynoSurf: Neural Deformation-based Temporally Consistent Dynamic Surface Reconstruction

Yuxin Yao, Siyu Ren, Junhui Hou et al.

ECCV 2024 • poster • arXiv:2403.11586
9 citations
#2743

Naturally Supervised 3D Visual Grounding with Language-Regularized Concept Learners

Chun Feng, Joy Hsu, Weiyu Liu et al.

CVPR 2024 • poster • arXiv:2404.19696
9 citations
#2744

AFF-ttention! Affordances and Attention models for Short-Term Object Interaction Anticipation

Lorenzo Mur Labadia, Ruben Martinez-Cantin, Jose J Guerrero et al.

ECCV 2024 • poster • arXiv:2406.01194
9 citations
#2745

AddBiomechanics Dataset: Capturing the Physics of Human Motion at Scale

Keenon Werling, Janelle M Kaneda, Tian Tan et al.

ECCV 2024 • poster • arXiv:2406.18537
9 citations
#2746

GOV-NeSF: Generalizable Open-Vocabulary Neural Semantic Fields

Fangyin Wei, Hanlin Chen, Gim Hee Lee

CVPR 2024 • poster
9 citations
#2747

Flying with Photons: Rendering Novel Views of Propagating Light

Anagh Malik, Noah Juravsky, Ryan Po et al.

ECCV 2024 • poster • arXiv:2404.06493
9 citations
#2748

Exploring Vulnerabilities in Spiking Neural Networks: Direct Adversarial Attacks on Raw Event Data

Yanmeng Yao, Xiaohan Zhao, Bin Gu

ECCV 2024 • poster
9 citations
#2749

C3Net: Compound Conditioned ControlNet for Multimodal Content Generation

Juntao Zhang, Yuehuai Liu, Yu-Wing Tai et al.

CVPR 2024 • poster • arXiv:2311.17951
9 citations
#2750

AAMDM: Accelerated Auto-regressive Motion Diffusion Model

Tianyu Li, Calvin Zhuhan Qiao, Ren Guanqiao et al.

CVPR 2024 • poster • arXiv:2401.06146
9 citations
#2751

HSR: Holistic 3D Human-Scene Reconstruction from Monocular Videos

Lixin Xue, Chen Guo, Chengwei Zheng et al.

ECCV 2024 • poster
9 citations
#2752

ProMotion: Prototypes As Motion Learners

Yawen Lu, Dongfang Liu, Qifan Wang et al.

CVPR 2024 • poster • arXiv:2406.04999
9 citations
#2753

Certifiably Robust Image Watermark

Zhengyuan Jiang, Moyang Guo, Yuepeng Hu et al.

ECCV 2024 • poster • arXiv:2407.04086
9 citations
#2754

Making Visual Sense of Oracle Bones for You and Me

Runqi Qiao, Lan Yang, Kaiyue Pang et al.

CVPR 2024 • poster
9 citations
#2755

Taming Lookup Tables for Efficient Image Retouching

Sidi Yang, Binxiao Huang, Mingdeng Cao et al.

ECCV 2024 • poster • arXiv:2403.19238
9 citations
#2756

Motion Diversification Networks

Hee Jae Kim, Eshed Ohn-Bar

CVPR 2024 • poster
9 citations
#2757

BlenderAlchemy: Editing 3D Graphics with Vision-Language Models

Ian Huang, Guandao Yang, Leonidas Guibas

ECCV 2024 • poster • arXiv:2404.17672
9 citations
#2758

PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion

Runsong Zhu, Shi Qiu, Qianyi Wu et al.

ECCV 2024 • poster • arXiv:2410.10659
9 citations
#2759

Semi-supervised 3D Object Detection with PatchTeacher and PillarMix

Xiaopei Wu, Liang Peng, Liang Xie et al.

AAAI 2024 • paper • arXiv:2407.09787
9 citations
#2760

VIXEN: Visual Text Comparison Network for Image Difference Captioning

Alexander Black, Jing Shi, Yifei Fan et al.

AAAI 2024 • paper • arXiv:2402.19119
9 citations
#2761

Safeguard Text-to-Image Diffusion Models with Human Feedback Inversion

Sanghyun Kim, Seohyeon Jung, Balhae Kim et al.

ECCV 2024 • poster • arXiv:2407.21032
9 citations
#2762

DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization

Wenze Chen, Shiyu Huang, Yuan Chiang et al.

AAAI 2024 • paper • arXiv:2207.05631
9 citations
#2763

One at a Time: Progressive Multi-Step Volumetric Probability Learning for Reliable 3D Scene Perception

Bohan Li, Yasheng Sun, Jingxin Dong et al.

AAAI 2024 • paper • arXiv:2306.12681
9 citations
#2764

Pareto Front-Diverse Batch Multi-Objective Bayesian Optimization

Alaleh Ahmadianshalchi, Syrine Belakaria, Janardhan Rao Doppa

AAAI 2024 • paper • arXiv:2406.08799
9 citations
#2765

Bi-TTA: Bidirectional Test-Time Adapter for Remote Physiological Measurement

Haodong Li, Hao Lu, Yingcong Chen

ECCV 2024 • poster • arXiv:2409.17316
9 citations
#2766

Towards More Accurate Diffusion Model Acceleration with A Timestep Tuner

Mengfei Xia, Yujun Shen, Changsong Lei et al.

CVPR 2024 • poster • arXiv:2310.09469
9 citations
#2767

T-CorresNet: Template Guided 3D Point Cloud Completion with Correspondence Pooling Query Generation Strategy

Fan Duan, Jiahao Yu, Li Chen

ECCV 2024 • poster • arXiv:2407.05008
9 citations
#2768

Take A Step Back: Rethinking the Two Stages in Visual Reasoning

Mingyu Zhang, Jiting Cai, Mingyu Liu et al.

ECCV 2024 • poster • arXiv:2407.19666
9 citations
#2769

Vanishing-Point-Guided Video Semantic Segmentation of Driving Scenes

Diandian Guo, Deng-Ping Fan, Tongyu Lu et al.

CVPR 2024 • highlight • arXiv:2401.15261
9 citations
#2770

Building Minimal and Reusable Causal State Abstractions for Reinforcement Learning

Zizhao Wang, Caroline Wang, Xuesu Xiao et al.

AAAI 2024 • paper • arXiv:2401.12497
9 citations
#2771

CityGuessr: City-Level Video Geo-Localization on a Global Scale

Parth Parag Kulkarni, Gaurav Kumar Nayak, Shah Mubarak

ECCV 2024 • poster • arXiv:2411.06344
9 citations
#2772

Hierarchical Correlation Clustering and Tree Preserving Embedding

Morteza Haghir Chehreghani, Mostafa Haghir Chehreghani

CVPR 2024 • poster • arXiv:2002.07756
9 citations
#2773

CountFormer: Multi-View Crowd Counting Transformer

Hong Mo, Xiong Zhang, Jianchao Tan et al.

ECCV 2024 • poster • arXiv:2407.02047
9 citations
#2774

Neural structure learning with stochastic differential equations

Benjie Wang, Joel Jennings, Wenbo Gong

ICLR 2024 • oral • arXiv:2311.03309
9 citations
#2775

Boosting Flow-based Generative Super-Resolution Models via Learned Prior

Li-Yuan Tsao, Yi-Chen Lo, Chia-Che Chang et al.

CVPR 2024 • poster • arXiv:2403.10988
9 citations
#2776

ASMR: Activation-Sharing Multi-Resolution Coordinate Networks for Efficient Inference

Jason Chun Lok Li, Steven Luo, Le Xu et al.

ICLR 2024 • poster • arXiv:2405.12398
9 citations
#2777

DiG-IN: Diffusion Guidance for Investigating Networks - Uncovering Classifier Differences, Neuron Visualisations, and Visual Counterfactual Explanations

Maximilian Augustin, Yannic Neuhaus, Matthias Hein

CVPR 2024 • poster • arXiv:2311.17833
9 citations
#2778

Bilateral Event Mining and Complementary for Event Stream Super-Resolution

Zhilin Huang, Quanmin Liang, Yijie Yu et al.

CVPR 2024 • poster • arXiv:2405.10037
9 citations
#2779

UnSAMFlow: Unsupervised Optical Flow Guided by Segment Anything Model

Shuai Yuan, Lei Luo, Zhuo Hui et al.

CVPR 2024 • poster • arXiv:2405.02608
9 citations
#2780

DeepCalliFont: Few-Shot Chinese Calligraphy Font Synthesis by Integrating Dual-Modality Generative Models

Yitian Liu, Zhouhui Lian

AAAI 2024 • paper • arXiv:2312.10314
9 citations
#2781

Learning Semantic Latent Directions for Accurate and Controllable Human Motion Prediction

Guowei Xu, Jiale Tao, Wen Li et al.

ECCV 2024 • poster • arXiv:2407.11494
9 citations
#2782

Residual Hyperbolic Graph Convolution Networks

Yangkai Xue, Jindou Dai, Zhipeng Lu et al.

AAAI 2024 • paper • arXiv:2412.03825
8 citations
#2783

Improved Metric Distortion via Threshold Approvals

Elliot Anshelevich, Aris Filos-Ratsikas, Christopher Jerrett et al.

AAAI 2024 • paper • arXiv:2305.14024
8 citations
#2784

Interventional Fairness on Partially Known Causal Graphs: A Constrained Optimization Approach

Aoqi Zuo, Yiqing Li, Susan Wei et al.

ICLR 2024 • poster • arXiv:2401.10632
8 citations
#2785

Power Variable Projection for Initialization-Free Large-Scale Bundle Adjustment

Simon Weber, Je Hyeong Hong, Daniel Cremers

ECCV 2024 • poster • arXiv:2405.05079
8 citations
#2786

Learning Personalized Causally Invariant Representations for Heterogeneous Federated Clients

Xueyang Tang, Song Guo, Jie Zhang et al.

ICLR 2024 • poster
8 citations
#2787

IVP-VAE: Modeling EHR Time Series with Initial Value Problem Solvers

Jingge Xiao, Leonie Basso, Wolfgang Nejdl et al.

AAAI 2024 • paper • arXiv:2305.06741
8 citations
#2788

Graph Context Transformation Learning for Progressive Correspondence Pruning

Junwen Guo, Guobao Xiao, Shiping Wang et al.

AAAI 2024 • paper • arXiv:2312.15971
8 citations
#2789

Learning with a Mole: Transferable latent spatial representations for navigation without reconstruction

Guillaume Bono, Leonid Antsfeld, Assem Sadek et al.

ICLR 2024 • poster • arXiv:2306.03857
8 citations
#2790

Self-Supervised Representation Learning for Adversarial Attack Detection

Yi Li, Plamen Angelov, Neeraj Suri

ECCV 2024 • poster • arXiv:2407.04382
8 citations
#2791

UniCal: Unified Neural Sensor Calibration

Ze Yang, George G Chen, Haowei Zhang et al.

ECCV 2024 • poster • arXiv:2409.18953
8 citations
#2792

Gaze from Origin: Learning for Generalized Gaze Estimation by Embedding the Gaze Frontalization Process

Mingjie Xu, Feng Lu

AAAI 2024 • paper
8 citations
#2793

Spectrum Translation for Refinement of Image Generation (STIG) Based on Contrastive Learning and Spectral Filter Profile

Seokjun Lee, Seung-Won Jung, Hyunseok Seo

AAAI 2024 • paper • arXiv:2403.05093
8 citations
#2794

Towards Continual Learning Desiderata via HSIC-Bottleneck Orthogonalization and Equiangular Embedding

Depeng Li, Tianqi Wang, Junwei Chen et al.

AAAI 2024 • paper • arXiv:2401.09067
8 citations
#2795

Accelerating Online Mapping and Behavior Prediction via Direct BEV Feature Attention

Xunjiang Gu, Guanyu Song, Igor Gilitschenski et al.

ECCV 2024 • poster • arXiv:2407.06683
8 citations
#2796

Auto-DAS: Automated Proxy Discovery for Training-free Distillation-aware Architecture Search

Haosen Sun, Lujun Li, Peijie Dong et al.

ECCV 2024 • poster
8 citations
#2797

Backdoor Adjustment via Group Adaptation for Debiased Coupon Recommendations

Junpeng Fang, Gongduo Zhang, Qing Cui et al.

AAAI 2024 • paper
8 citations
#2798

Object-Centric Learning with Slot Mixture Module

Daniil Kirilenko, Vitaliy Vorobyov, Aleksey Kovalev et al.

ICLR 2024 • poster • arXiv:2311.04640
8 citations
#2799

Robust Policy Learning via Offline Skill Diffusion

Woo Kyung Kim, Minjong Yoo, Honguk Woo

AAAI 2024 • paper • arXiv:2403.00225
8 citations
#2800

1/2-Approximate MMS Allocation for Separable Piecewise Linear Concave Valuations

Chandra Chekuri, Pooja Kulkarni, Rucha Kulkarni et al.

AAAI 2024 • paper • arXiv:2312.08504
8 citations