Most Cited 2024 "local search operators" Papers

12,324 papers found • Page 11 of 62

#2001

CSL: Class-Agnostic Structure-Constrained Learning for Segmentation including the Unseen

Hao Zhang, Fang Li, Lu Qi et al.

AAAI 2024paperarXiv:2312.05538
15
citations
#2002

A Video is Worth 256 Bases: Spatial-Temporal Expectation-Maximization Inversion for Zero-Shot Video Editing

Li Maomao, Yu Li, Tianyu Yang et al.

CVPR 2024posterarXiv:2312.05856
15
citations
#2003

Faceptor: A Generalist Model for Face Perception

Lixiong Qin, Mei Wang, Xuannan Liu et al.

ECCV 2024posterarXiv:2403.09500
15
citations
#2004

Accelerating Image Generation with Sub-path Linear Approximation Model

Chen Xu, Tianhui Song, Weixin Feng et al.

ECCV 2024posterarXiv:2404.13903
15
citations
#2005

CO2: Efficient Distributed Training with Full Communication-Computation Overlap

Weigao Sun, Qin Zhen, Weixuan Sun et al.

ICLR 2024spotlightarXiv:2401.16265
15
citations
#2006

VSFormer: Visual-Spatial Fusion Transformer for Correspondence Pruning

Tangfei Liao, Xiaoqin Zhang, Li Zhao et al.

AAAI 2024paperarXiv:2312.08774
15
citations
#2007

DiffHuman: Probabilistic Photorealistic 3D Reconstruction of Humans

Akash Sengupta, Thiemo Alldieck, NIKOS KOLOTOUROS et al.

CVPR 2024posterarXiv:2404.00485
15
citations
#2008

ColorMNet: A Memory-based Deep Spatial-Temporal Feature Propagation Network for Video Colorization

Yixin Yang, Jiangxin Dong, Jinhui Tang et al.

ECCV 2024posterarXiv:2404.06251
15
citations
#2009

Large-Scale Multi-Hypotheses Cell Tracking Using Ultrametric Contours Maps

Jordao Bragantini, Merlin Lange, Loïc A Royer

ECCV 2024posterarXiv:2308.04526
15
citations
#2010

Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns

Brian DuSell, David Chiang

ICLR 2024spotlightarXiv:2310.01749
15
citations
#2011

Dynamic Retraining-Updating Mean Teacher for Source-Free Object Detection

BA KHANH TRINH LE, Huy-Hung Nguyen, Long Hoang Pham et al.

ECCV 2024posterarXiv:2407.16497
15
citations
#2012

VEON: Vocabulary-Enhanced Occupancy Prediction

Jilai Zheng, Pin Tang, Zhongdao Wang et al.

ECCV 2024posterarXiv:2407.12294
15
citations
#2013

Is attention required for ICL? Exploring the Relationship Between Model Architecture and In-Context Learning Ability

Ivan Lee, Nan Jiang, Taylor Berg-Kirkpatrick

ICLR 2024posterarXiv:2310.08049
15
citations
#2014

ScanFormer: Referring Expression Comprehension by Iteratively Scanning

Wei Su, Peihan Miao, Huanzhang Dou et al.

CVPR 2024posterarXiv:2406.18048
15
citations
#2015

Efficient Vision-Language Pre-training by Cluster Masking

Zihao Wei, Zixuan Pan, Andrew Owens

CVPR 2024posterarXiv:2405.08815
15
citations
#2016

One-Shot Structure-Aware Stylized Image Synthesis

Hansam Cho, Jonghyun Lee, Seunggyu Chang et al.

CVPR 2024posterarXiv:2402.17275
15
citations
#2017

Towards Robust Fidelity for Evaluating Explainability of Graph Neural Networks

Xu Zheng, Farhad Shirani, Tianchun Wang et al.

ICLR 2024spotlightarXiv:2310.01820
15
citations
#2018

AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation

Zanlin Ni, Yulin Wang, Renping Zhou et al.

ECCV 2024posterarXiv:2409.00342
15
citations
#2019

Kandinsky Conformal Prediction: Efficient Calibration of Image Segmentation Algorithms

Joren Brunekreef, Eric Marcus, Ray Sheombarsing et al.

CVPR 2024posterarXiv:2311.11837
15
citations
#2020

Combating Data Imbalances in Federated Semi-supervised Learning with Dual Regulators

Sikai Bai, Shuaicheng Li, Weiming Zhuang et al.

AAAI 2024paperarXiv:2307.05358
15
citations
#2021

UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection

Yingsen Zeng, Yujie Zhong, Chengjian Feng et al.

ECCV 2024posterarXiv:2404.04933
15
citations
#2022

DG-PIC: Domain Generalized Point-In-Context Learning for Point Cloud Understanding

Jincen Jiang, Qianyu Zhou, Yuhang Li et al.

ECCV 2024posterarXiv:2407.08801
15
citations
#2023

Compositional Generative Inverse Design

Tailin Wu, Takashi Maruyama, Long Wei et al.

ICLR 2024spotlightarXiv:2401.13171
15
citations
#2024

Bidirectional Autoregessive Diffusion Model for Dance Generation

Canyu Zhang, Youbao Tang, NING Zhang et al.

CVPR 2024poster
15
citations
#2025

Predicated Diffusion: Predicate Logic-Based Attention Guidance for Text-to-Image Diffusion Models

Kota Sueyoshi, Takashi Matsubara

CVPR 2024highlightarXiv:2311.16117
15
citations
#2026

Diffusion Bridges for 3D Point Cloud Denoising

Mathias Vogel, Keisuke Tateno, Marc Pollefeys et al.

ECCV 2024posterarXiv:2408.16325
15
citations
#2027

The Audio-Visual Conversational Graph: From an Egocentric-Exocentric Perspective

Wenqi Jia, Miao Liu, Hao Jiang et al.

CVPR 2024posterarXiv:2312.12870
15
citations
#2028

A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting

Wouter Van Gansbeke, Bert De Brabandere

ECCV 2024posterarXiv:2401.10227
15
citations
#2029

Versatile Medical Image Segmentation Learned from Multi-Source Datasets via Model Self-Disambiguation

Xiaoyang Chen, Hao Zheng, Yuemeng LI et al.

CVPR 2024posterarXiv:2311.10696
15
citations
#2030

Learning Camouflaged Object Detection from Noisy Pseudo Label

Jin Zhang, Ruiheng Zhang, Yanjiao Shi et al.

ECCV 2024posterarXiv:2407.13157
15
citations
#2031

The Need for Speed: Pruning Transformers with One Recipe

Samir Khaki, Konstantinos Plataniotis

ICLR 2024posterarXiv:2403.17921
14
citations
#2032

Slice3D: Multi-Slice Occlusion-Revealing Single View 3D Reconstruction

Yizhi Wang, Wallace Lira, Wenqi Wang et al.

CVPR 2024poster
14
citations
#2033

Sync-NeRF: Generalizing Dynamic NeRFs to Unsynchronized Videos

Seoha Kim, Jeongmin Bae, Youngsik Yun et al.

AAAI 2024paperarXiv:2310.13356
14
citations
#2034

DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction

Junwen Xiong, Peng Zhang, Tao You et al.

CVPR 2024posterarXiv:2403.01226
14
citations
#2035

TetraDiffusion: Tetrahedral Diffusion Models for 3D Shape Generation

Nikolai Kalischek, Torben Peters, Jan Dirk Wegner et al.

ECCV 2024posterarXiv:2211.13220
14
citations
#2036

Pre-training with Random Orthogonal Projection Image Modeling

Maryam Haghighat, Peyman Moghadam, Shaheer Mohamed et al.

ICLR 2024spotlightarXiv:2310.18737
14
citations
#2037

Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning

Fan-Ming Luo, Tian Xu, Xingchen Cao et al.

ICLR 2024spotlightarXiv:2310.05422
14
citations
#2038

Tri-Modal Motion Retrieval by Learning a Joint Embedding Space

Kangning Yin, Shihao Zou, Yuxuan Ge et al.

CVPR 2024highlightarXiv:2403.00691
14
citations
#2039

Complementing Event Streams and RGB Frames for Hand Mesh Reconstruction

Jianping Jiang, xinyu zhou, Bingxuan Wang et al.

CVPR 2024posterarXiv:2403.07346
14
citations
#2040

NeuSDFusion: A Spatial-Aware Generative Model for 3D Shape Completion, Reconstruction, and Generation

Ruikai Cui, Weizhe Liu, Weixuan Sun et al.

ECCV 2024posterarXiv:2403.18241
14
citations
#2041

MoVideo: Motion-Aware Video Generation with Diffusion Models

Jingyun Liang, Yuchen Fan, Kai Zhang et al.

ECCV 2024posterarXiv:2311.11325
14
citations
#2042

Vision-Language Action Knowledge Learning for Semantic-Aware Action Quality Assessment

Huangbiao Xu, Xiao Ke, Yuezhou Li et al.

ECCV 2024poster
14
citations
#2043

Bootstrapping Chest CT Image Understanding by Distilling Knowledge from X-ray Expert Models

Weiwei Cao, Jianpeng Zhang, Yingda Xia et al.

CVPR 2024posterarXiv:2404.04936
14
citations
#2044

CoBIT: A Contrastive Bi-directional Image-Text Generation Model

Haoxuan You, Xiaoyue Guo, Zhecan Wang et al.

ICLR 2024posterarXiv:2303.13455
14
citations
#2045

Finding Visual Task Vectors

Alberto Hojel, Yutong Bai, Trevor Darrell et al.

ECCV 2024posterarXiv:2404.05729
14
citations
#2046

Customization Assistant for Text-to-Image Generation

Yufan Zhou, Ruiyi Zhang, Jiuxiang Gu et al.

CVPR 2024posterarXiv:2312.03045
14
citations
#2047

Improving Text-guided Object Inpainting with Semantic Pre-inpainting

Yifu Chen, Jingwen Chen, Yingwei Pan et al.

ECCV 2024posterarXiv:2409.08260
14
citations
#2048

Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations

Renzhe Zhou, Chen-Xiao Gao, Zongzhang Zhang et al.

AAAI 2024paperarXiv:2312.15909
14
citations
#2049

Non-Exemplar Domain Incremental Learning via Cross-Domain Concept Integration

Qiang Wang, Yuhang He, Songlin Dong et al.

ECCV 2024poster
14
citations
#2050

Learning to Learn Better Visual Prompts

Fengxiang Wang, Wanrong Huang, Shaowu Yang et al.

AAAI 2024paper
14
citations
#2051

Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation

Shihao Zhao, Shaozhe Hao, Bojia Zi et al.

ECCV 2024posterarXiv:2403.07860
14
citations
#2052

Event Camera Data Dense Pre-training

Yan Yang, Liyuan Pan, Liu liu

ECCV 2024posterarXiv:2311.11533
14
citations
#2053

The Effect of Intrinsic Dataset Properties on Generalization: Unraveling Learning Differences Between Natural and Medical Images

Nicholas Konz, Maciej Mazurowski

ICLR 2024posterarXiv:2401.08865
14
citations
#2054

AMD: Automatic Multi-step Distillation of Large-scale Vision Models

Cheng Han, Qifan Wang, Sohail A Dianat et al.

ECCV 2024posterarXiv:2407.04208
14
citations
#2055

ProTeCt: Prompt Tuning for Taxonomic Open Set Classification

Tz-Ying Wu, Chih-Hui Ho, Nuno Vasconcelos

CVPR 2024posterarXiv:2306.02240
14
citations
#2056

MERGE: Fast Private Text Generation

Zi Liang, Pinghui Wang, Ruofei Zhang et al.

AAAI 2024paperarXiv:2305.15769
14
citations
#2057

Spiking NeRF: Representing the Real-World Geometry by a Discontinuous Representation

Zhanfeng Liao, Yan Liu, Qian Zheng et al.

AAAI 2024paperarXiv:2311.09077
14
citations
#2058

EntAugment: Entropy-Driven Adaptive Data Augmentation Framework for Image Classification

Suorong Yang, Furao Shen, Jian Zhao

ECCV 2024posterarXiv:2409.06290
14
citations
#2059

Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object Detection

QIJIE MO, Yipeng Gao, Shenghao Fu et al.

ECCV 2024posterarXiv:2407.11499
14
citations
#2060

Can We Evaluate Domain Adaptation Models Without Target-Domain Labels?

JIANFEI YANG, Hanjie Qian, Yuecong Xu et al.

ICLR 2024posterarXiv:2305.18712
14
citations
#2061

Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments

Djamahl Etchegaray, Zi Helen Huang, Tatsuya Harada et al.

ECCV 2024posterarXiv:2403.13556
14
citations
#2062

MFABA: A More Faithful and Accelerated Boundary-Based Attribution Method for Deep Neural Networks

Zhiyu Zhu, Huaming Chen, Jiayu Zhang et al.

AAAI 2024paperarXiv:2312.13630
14
citations
#2063

Deep Structural Knowledge Exploitation and Synergy for Estimating Node Importance Value on Heterogeneous Information Networks

Yankai Chen, Yixiang Fang, Qiongyan Wang et al.

AAAI 2024paperarXiv:2402.12411
14
citations
#2064

RoDUS: Robust Decomposition of Static and Dynamic Elements in Urban Scenes

Thang-Anh-Quan Nguyen, Luis G Roldao Jimenez, Nathan Piasco et al.

ECCV 2024posterarXiv:2403.09419
14
citations
#2065

BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation

Yunhao Ge, Yihe Tang, Jiashu Xu et al.

CVPR 2024highlightarXiv:2405.09546
14
citations
#2066

Referring Atomic Video Action Recognition

Kunyu Peng, Jia Fu, Kailun Yang et al.

ECCV 2024posterarXiv:2407.01872
14
citations
#2067

CAMIL: Context-Aware Multiple Instance Learning for Cancer Detection and Subtyping in Whole Slide Images

olga fourkioti, Matt De Vries, Chris Bakal

ICLR 2024spotlightarXiv:2305.05314
14
citations
#2068

SURE: SUrvey REcipes for building reliable and robust deep networks

Yuting Li, Yingyi Chen, Xuanlong Yu et al.

CVPR 2024posterarXiv:2403.00543
14
citations
#2069

A Good Learner can Teach Better: Teacher-Student Collaborative Knowledge Distillation

Ayan Sengupta, Shantanu Dixit, Md Shad Akhtar et al.

ICLR 2024poster
14
citations
#2070

Hyperbolic Learning with Synthetic Captions for Open-World Detection

Fanjie Kong, Yanbei Chen, Jiarui Cai et al.

CVPR 2024posterarXiv:2404.05016
14
citations
#2071

Binarized Low-light Raw Video Enhancement

Gengchen Zhang, Yulun Zhang, Xin Yuan et al.

CVPR 2024posterarXiv:2403.19944
14
citations
#2072

Learning Instance-Aware Correspondences for Robust Multi-Instance Point Cloud Registration in Cluttered Scenes

Zhiyuan Yu, Zheng Qin, lintao zheng et al.

CVPR 2024posterarXiv:2404.04557
14
citations
#2073

LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation

Archana Swaminathan, Anubhav Anubhav, Kamal Gupta et al.

ECCV 2024posterarXiv:2409.06703
14
citations
#2074

Iterative Ensemble Training with Anti-Gradient Control for Mitigating Memorization in Diffusion Models

Xiao Liu, Xiaoliu Guan, Yu Wu et al.

ECCV 2024posterarXiv:2407.15328
14
citations
#2075

On the Road to Portability: Compressing End-to-End Motion Planner for Autonomous Driving

Kaituo Feng, Changsheng Li, Dongchun Ren et al.

CVPR 2024posterarXiv:2403.01238
14
citations
#2076

PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs

Michael Dorkenwald, Nimrod Barazani, Cees G. M. Snoek et al.

CVPR 2024posterarXiv:2402.08657
14
citations
#2077

3426 Regulating Intermediate 3D Features for Vision-Centric Autonomous Driving

Junkai Xu, Liang Peng, Haoran Cheng et al.

AAAI 2024paper
14
citations
#2078

Free-Editor: Zero-shot Text-driven 3D Scene Editing

Md Nazmul Karim, Hasan Iqbal, Umar Khalid et al.

ECCV 2024posterarXiv:2312.13663
14
citations
#2079

FoSp: Focus and Separation Network for Early Smoke Segmentation

Lujian Yao, Haitao Zhao, Jingchao Peng et al.

AAAI 2024paperarXiv:2306.04474
14
citations
#2080

FoX: Formation-Aware Exploration in Multi-Agent Reinforcement Learning

Yonghyeon Jo, Sunwoo Lee, Junghyuk Yum et al.

AAAI 2024paperarXiv:2308.11272
14
citations
#2081

Image Demoireing in RAW and sRGB Domains

Shuning Xu, Binbin Song, Xiangyu Chen et al.

ECCV 2024posterarXiv:2312.09063
14
citations
#2082

NeRF Analogies: Example-Based Visual Attribute Transfer for NeRFs

Michael Fischer, Zhengqin Li, Thu Nguyen-Phuoc et al.

CVPR 2024posterarXiv:2402.08622
14
citations
#2083

In2SET: Intra-Inter Similarity Exploiting Transformer for Dual-Camera Compressive Hyperspectral Imaging

Xin Wang, Lizhi Wang, Xiangtian Ma et al.

CVPR 2024posterarXiv:2312.13319
14
citations
#2084

TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation

Xiaopei Wu, Yuenan Hou, Xiaoshui Huang et al.

CVPR 2024posterarXiv:2407.09751
14
citations
#2085

En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data

Yifang Men, Biwen Lei, Yuan Yao et al.

CVPR 2024posterarXiv:2401.01173
14
citations
#2086

Multi-Objective Bayesian Optimization with Active Preference Learning

Ryota Ozaki, Kazuki Ishikawa, Youhei Kanzaki et al.

AAAI 2024paperarXiv:2311.13460
14
citations
#2087

Online GNN Evaluation Under Test-time Graph Distribution Shifts

Xin Zheng, Dongjin Song, Qingsong Wen et al.

ICLR 2024spotlightarXiv:2403.09953
14
citations
#2088

Towards Robust Image Stitching: An Adaptive Resistance Learning against Compatible Attacks

Zhiying Jiang, Xingyuan Li, Jinyuan Liu et al.

AAAI 2024paperarXiv:2402.15959
14
citations
#2089

Protein Multimer Structure Prediction via Prompt Learning

Ziqi Gao, Xiangguo SUN, Zijing Liu et al.

ICLR 2024posterarXiv:2402.18813
14
citations
#2090

CNN Kernels Can Be the Best Shapelets

Eric Qu, Yansen Wang, Xufang Luo et al.

ICLR 2024poster
14
citations
#2091

ProCC: Progressive Cross-Primitive Compatibility for Open-World Compositional Zero-Shot Learning

Fushuo Huo, Wenchao Xu, Song Guo et al.

AAAI 2024paperarXiv:2211.12417
14
citations
#2092

Regroup Median Loss for Combating Label Noise

Authors: Fengpeng Li, Kemou Li, Jinyu Tian et al.

AAAI 2024paperarXiv:2312.06273
14
citations
#2093

Unifying Automatic and Interactive Matting with Pretrained ViTs

Zixuan Ye, Wenze Liu, He Guo et al.

CVPR 2024poster
14
citations
#2094

CDPNet: Cross-Modal Dual Phases Network for Point Cloud Completion

Zhenjiang Du, Jiale Dou, Zhitao Liu et al.

AAAI 2024paper
14
citations
#2095

ReCoRe: Regularized Contrastive Representation Learning of World Model

Rudra P, K. Poudel, Harit Pandya et al.

CVPR 2024posterarXiv:2312.09056
14
citations
#2096

CLIFF: Continual Latent Diffusion for Open-Vocabulary Object Detection

Wuyang Li, Xinyu Liu, Jiayi Ma et al.

ECCV 2024poster
14
citations
#2097

Reinforcement Learning Meets Visual Odometry

Nico Messikommer, Giovanni Cioffi, Mathias Gehrig et al.

ECCV 2024posterarXiv:2407.15626
14
citations
#2098

UniCode : Learning a Unified Codebook for Multimodal Large Language Models

Sipeng Zheng, Bohan Zhou, Yicheng Feng et al.

ECCV 2024posterarXiv:2403.09072
14
citations
#2099

HUMOS: Human Motion Model Conditioned on Body Shape

Shashank Tripathi, Omid Taheri, Christoph Lassner et al.

ECCV 2024posterarXiv:2409.03944
14
citations
#2100

Retrieval-Guided Reinforcement Learning for Boolean Circuit Minimization

Animesh Basak Chowdhury, Marco Romanelli, Benjamin Tan et al.

ICLR 2024posterarXiv:2401.12205
14
citations
#2101

Estimating Noisy Class Posterior with Part-level Labels for Noisy Label Learning

Rui Zhao, Bin Shi, Jianfei Ruan et al.

CVPR 2024posterarXiv:2405.05714
14
citations
#2102

HandBooster: Boosting 3D Hand-Mesh Reconstruction by Conditional Synthesis and Sampling of Hand-Object Interactions

Hao Xu, Li Haipeng, Yinqiao Wang et al.

CVPR 2024posterarXiv:2403.18575
14
citations
#2103

Neural Volumetric World Models for Autonomous Driving

Zanming Huang, Jimuyang Zhang, Eshed Ohn-Bar

ECCV 2024poster
14
citations
#2104

Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning

Kaibin Tian, Yanhua Cheng, Yi Liu et al.

AAAI 2024paperarXiv:2401.00701
14
citations
#2105

Robust Distillation via Untargeted and Targeted Intermediate Adversarial Samples

Junhao Dong, Piotr Koniusz, Junxi Chen et al.

CVPR 2024poster
14
citations
#2106

Region-Based Representations Revisited

Michal Shlapentokh-Rothman, Ansel Blume, Yao Xiao et al.

CVPR 2024posterarXiv:2402.02352
14
citations
#2107

A Restoration Network as an Implicit Prior

Yuyang Hu, Mauricio Delbracio, Peyman Milanfar et al.

ICLR 2024posterarXiv:2310.01391
14
citations
#2108

Long-term Temporal Context Gathering for Neural Video Compression

Linfeng Qi, Zhaoyang Jia, Jiahao Li et al.

ECCV 2024poster
14
citations
#2109

Foster Adaptivity and Balance in Learning with Noisy Labels

Mengmeng Sheng, Zeren Sun, Tao Chen et al.

ECCV 2024posterarXiv:2407.02778
14
citations
#2110

Towards Fair Graph Federated Learning via Incentive Mechanisms

12794 Chenglu Pan, Jiarong Xu, Yue Yu et al.

AAAI 2024paperarXiv:2312.13306
14
citations
#2111

Free-VSC: Free Semantics from Visual Foundation Models for Unsupervised Video Semantic Compression

Yuan Tian, Guo Lu, Guangtao Zhai

ECCV 2024posterarXiv:2409.11718
14
citations
#2112

EmoTalk3D: High-Fidelity Free-View Synthesis of Emotional 3D Talking Head

Qianyun He, Xinya Ji, Yicheng Gong et al.

ECCV 2024posterarXiv:2408.00297
14
citations
#2113

HEAL-SWIN: A Vision Transformer On The Sphere

Oscar Carlsson, Jan E. Gerken, Hampus Linander et al.

CVPR 2024posterarXiv:2307.07313
14
citations
#2114

GenN2N: Generative NeRF2NeRF Translation

Xiangyue Liu, Han Xue, Kunming Luo et al.

CVPR 2024posterarXiv:2404.02788
14
citations
#2115

PointNeRF++: A multi-scale, point-based Neural Radiance Field

Weiwei Sun, Eduard Trulls, Yang-Che Tseng et al.

ECCV 2024posterarXiv:2312.02362
14
citations
#2116

Neural-Symbolic Recursive Machine for Systematic Generalization

Qing Li, Yixin Zhu, Yitao Liang et al.

ICLR 2024posterarXiv:2210.01603
14
citations
#2117

One Forward is Enough for Neural Network Training via Likelihood Ratio Method

Jinyang Jiang, Zeliang Zhang, Chenliang Xu et al.

ICLR 2024posterarXiv:2305.08960
14
citations
#2118

Norface: Improving Facial Expression Analysis by Identity Normalization

Hanwei Liu, Rudong An, Zhimeng Zhang et al.

ECCV 2024posterarXiv:2407.15617
14
citations
#2119

What You See is What You GAN: Rendering Every Pixel for High-Fidelity Geometry in 3D GANs

Alex Trevithick, Matthew Chan, Towaki Takikawa et al.

CVPR 2024posterarXiv:2401.02411
14
citations
#2120

Adversarial Backdoor Attack by Naturalistic Data Poisoning on Trajectory Prediction in Autonomous Driving

Mozhgan Pourkeshavarz, Mohammad Sabokrou, Amir Rasouli

CVPR 2024posterarXiv:2306.15755
14
citations
#2121

LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction

Penghui Du, Yu Wang, Yifan Sun et al.

ECCV 2024posterarXiv:2407.11335
14
citations
#2122

SPADE: Semi-supervised Anomaly Detection under Distribution Mismatch

Chun-Liang Li, Tomas Pfister, Kihyuk Sohn et al.

ICLR 2024posterarXiv:2212.00173
14
citations
#2123

FineMatch: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction

Hang Hua, Jing Shi, Kushal Kafle et al.

ECCV 2024posterarXiv:2404.14715
14
citations
#2124

Event-Adapted Video Super-Resolution

Zeyu Xiao, Dachun Kai, Yueyi Zhang et al.

ECCV 2024poster
14
citations
#2125

Dynamic Feature Pruning and Consolidation for Occluded Person Re-identification

YuTeng Ye, Hang Zhou, Jiale Cai et al.

AAAI 2024paperarXiv:2211.14742
14
citations
#2126

Learning Explicit Contact for Implicit Reconstruction of Hand-Held Objects from Monocular Images

Junxing Hu, Hongwen Zhang, Zerui Chen et al.

AAAI 2024paperarXiv:2305.20089
14
citations
#2127

Reinforcement Learning Friendly Vision-Language Model for Minecraft

Haobin Jiang, Junpeng Yue, Hao Luo et al.

ECCV 2024posterarXiv:2303.10571
14
citations
#2128

Tri^{2}-plane: Thinking Head Avatar via Feature Pyramid

Luchuan Song, Pinxin Liu, Lele Chen et al.

ECCV 2024posterarXiv:2401.09386
14
citations
#2129

X-Pose: Detecting Any Keypoints

Jie Yang, AILING ZENG, Ruimao Zhang et al.

ECCV 2024posterarXiv:2310.08530
14
citations
#2130

LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer

Ning Yu, Chia-Chih Chen, Zeyuan Chen et al.

ECCV 2024posterarXiv:2212.09877
14
citations
#2131

Learning Intra-view and Cross-view Geometric Knowledge for Stereo Matching

Rui Gong, Weide Liu, ZAIWANG GU et al.

CVPR 2024posterarXiv:2402.19270
14
citations
#2132

DiffSCI: Zero-Shot Snapshot Compressive Imaging via Iterative Spectral Diffusion Model

Zhenghao Pan, Haijin Zeng, Jiezhang Cao et al.

CVPR 2024posterarXiv:2311.11417
14
citations
#2133

HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance

Guian Fang, Wenbiao Yan, Yuanfan Guo et al.

ECCV 2024posterarXiv:2407.06937
14
citations
#2134

M2Doc: A Multi-Modal Fusion Approach for Document Layout Analysis

Ning Zhang, Hiuyi Cheng, Jiayu Chen et al.

AAAI 2024paper
14
citations
#2135

Accelerating Data Generation for Neural Operators via Krylov Subspace Recycling

Hong Wang, Zhongkai Hao, Jie Wang et al.

ICLR 2024spotlightarXiv:2401.09516
14
citations
#2136

AutoCast++: Enhancing World Event Prediction with Zero-shot Ranking-based Context Retrieval

Qi Yan, Raihan Seraj, Jiawei He et al.

ICLR 2024oralarXiv:2310.01880
14
citations
#2137

Exploiting Auxiliary Caption for Video Grounding

Hongxiang Li, Meng Cao, Xuxin Cheng et al.

AAAI 2024paperarXiv:2301.05997
14
citations
#2138

HONGAT: Graph Attention Networks in the Presence of High-Order Neighbors

Heng-Kai Zhang, Yi-Ge Zhang, Zhi Zhou et al.

AAAI 2024paper
14
citations
#2139

Temporal Event Stereo via Joint Learning with Stereoscopic Flow

Hoonhee Cho, Jae-young Kang, Kuk-Jin Yoon

ECCV 2024posterarXiv:2407.10831
14
citations
#2140

GRIDS: Grouped Multiple-Degradation Restoration with Image Degradation Similarity

Shuo Cao, Yihao Liu, Wenlong Zhang et al.

ECCV 2024posterarXiv:2407.12273
14
citations
#2141

Rating-Based Reinforcement Learning

Devin White, Mingkang Wu, Ellen Novoseller et al.

AAAI 2024paperarXiv:2307.16348
13
citations
#2142

Differentiable Euler Characteristic Transforms for Shape Classification

Ernst Roell, Bastian Rieck

ICLR 2024posterarXiv:2310.07630
13
citations
#2143

Brain-ID: Learning Contrast-agnostic Anatomical Representations for Brain Imaging

Peirong Liu, Oula Puonti, Xiaoling Hu et al.

ECCV 2024posterarXiv:2311.16914
13
citations
#2144

Compound Text-Guided Prompt Tuning via Image-Adaptive Cues

Hao Tan, Jun Li, Yizhuang Zhou et al.

AAAI 2024paperarXiv:2312.06401
13
citations
#2145

ScanTalk: 3D Talking Heads from Unregistered Scans

Federico Nocentini, Thomas Besnier, Claudio Ferrari et al.

ECCV 2024posterarXiv:2403.10942
13
citations
#2146

UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding

Bowen Shi, Peisen Zhao, Zichen Wang et al.

ECCV 2024posterarXiv:2401.06397
13
citations
#2147

3DGazeNet: Generalizing Gaze Estimation with Weak Supervision from Synthetic Views

Evangelos Ververas, Polydefkis Gkagkos, Jiankang Deng et al.

ECCV 2024posterarXiv:2212.02997
13
citations
#2148

Kalman-Inspired Feature Propagation for Video Face Super-Resolution

Ruicheng Feng, Chongyi Li, Chen Change Loy

ECCV 2024posterarXiv:2408.05205
13
citations
#2149

Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation

Peng Jin, Hao Li, Zesen Cheng et al.

ECCV 2024posterarXiv:2407.10528
13
citations
#2150

Rephrase, Augment, Reason: Visual Grounding of Questions for Vision-Language Models

Archiki Prasad, Elias Stengel-Eskin, Mohit Bansal

ICLR 2024posterarXiv:2310.05861
13
citations
#2151

DV-3DLane: End-to-end Multi-modal 3D Lane Detection with Dual-view Representation

Yueru Luo, Shuguang Cui, Zhen Li

ICLR 2024posterarXiv:2406.16072
13
citations
#2152

ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention

Chenhang He, Ruihuang Li, Guowen Zhang et al.

ECCV 2024posterarXiv:2401.00912
13
citations
#2153

InstructGIE: Towards Generalizable Image Editing

Zichong Meng, Changdi Yang, Jun Liu et al.

ECCV 2024posterarXiv:2403.05018
13
citations
#2154

Harnessing Text-to-Image Diffusion Models for Category-Agnostic Pose Estimation

Duo Peng, Zhengbo Zhang, Ping Hu et al.

ECCV 2024poster
13
citations
#2155

Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL

Fangwei Zhong, Kui Wu, Hai Ci et al.

ECCV 2024posterarXiv:2404.09857
13
citations
#2156

Fed-QSSL: A Framework for Personalized Federated Learning under Bitwidth and Data Heterogeneity

Yiyue Chen, Haris Vikalo, Chianing Wang

AAAI 2024paperarXiv:2312.13380
13
citations
#2157

GDA: Generalized Diffusion for Robust Test-time Adaptation

Yun-Yun Tsai, Fu-Chen Chen, Albert Chen et al.

CVPR 2024posterarXiv:2404.00095
13
citations
#2158

Distilling Reliable Knowledge for Instance-Dependent Partial Label Learning

Dong-Dong Wu, Deng-Bao Wang, Min-Ling Zhang

AAAI 2024paper
13
citations
#2159

USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation

Xiaoqi Wang, Wenbin He, Xiwei Xuan et al.

CVPR 2024posterarXiv:2406.05271
13
citations
#2160

Boosting 3D Single Object Tracking with 2D Matching Distillation and 3D Pre-training

qiangqiang wu, Yan Xia, Jia Wan et al.

ECCV 2024poster
13
citations
#2161

STAMP: Outlier-Aware Test-Time Adaptation with Stable Memory Replay

Yu Yongcan, Lijun Sheng, Ran He et al.

ECCV 2024posterarXiv:2407.15773
13
citations
#2162

ISP-Teacher: Image Signal Process with Disentanglement Regularization for Unsupervised Domain Adaptive Dark Object Detection

Yin Zhang, Yongqiang Zhang, Zian Zhang et al.

AAAI 2024paper
13
citations
#2163

Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D

Mukund Varma T, Peihao Wang, Zhiwen Fan et al.

CVPR 2024posterarXiv:2403.18922
13
citations
#2164

DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation

Xiaobin Hu, Xu Peng, Donghao Luo et al.

ECCV 2024posterarXiv:2403.06168
13
citations
#2165

Full Bayesian Significance Testing via Neural Networks

Zehua Liu, Zimeng Li, Jingyuan Wang et al.

AAAI 2024paper
13
citations
#2166

Semi-supervised Segmentation of Histopathology Images with Noise-Aware Topological Consistency

Meilong Xu, Xiaoling Hu, Saumya Gupta et al.

ECCV 2024posterarXiv:2311.16447
13
citations
#2167

DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion

Liao Shen, Tianqi Liu, Huiqiang Sun et al.

ECCV 2024posterarXiv:2409.09605
13
citations
#2168

CardiacNet: Learning to Reconstruct Abnormalities for Cardiac Disease Assessment from Echocardiogram Videos

JIEWEN YANG, Yiqun Lin, Bin Pu et al.

ECCV 2024posterarXiv:2410.20769
13
citations
#2169

Light Schrödinger Bridge

Alexander Korotin, Nikita Gushchin, Evgeny Burnaev

ICLR 2024posterarXiv:2310.01174
13
citations
#2170

RoDLA: Benchmarking the Robustness of Document Layout Analysis Models

Yufan Chen, Jiaming Zhang, Kunyu Peng et al.

CVPR 2024posterarXiv:2403.14442
13
citations
#2171

Improving Geo-diversity of Generated Images with Contextualized Vendi Score Guidance

Reyhane Askari Hemmat, Melissa Hall, Alicia Yi Sun et al.

ECCV 2024posterarXiv:2406.04551
13
citations
#2172

MultiDelete for Multimodal Machine Unlearning

Jiali Cheng, Hadi Amiri

ECCV 2024posterarXiv:2311.12047
13
citations
#2173

Federated Causality Learning with Explainable Adaptive Optimization

Dezhi Yang, Xintong He, Jun Wang et al.

AAAI 2024paperarXiv:2312.05540
13
citations
#2174

Scribble Hides Class: Promoting Scribble-Based Weakly-Supervised Semantic Segmentation

Xinliang Zhang, Lei Zhu, Hangzhou He et al.

AAAI 2024paperarXiv:2402.17555
13
citations
#2175

Rethinking the Paradigm of Content Constraints in Unpaired Image-to-Image Translation

Xiuding Cai, Yaoyao Zhu, Dong Miao et al.

AAAI 2024paperarXiv:2211.10867
13
citations
#2176

FashionERN: Enhance-and-Refine Network for Composed Fashion Image Retrieval

Yanzhe Chen, Huasong Zhong, Xiangteng He et al.

AAAI 2024paper
13
citations
#2177

MCPNet: An Interpretable Classifier via Multi-Level Concept Prototypes

Bor Shiun Wang, Chien-Yi Wang, Wei-Chen Chiu

CVPR 2024posterarXiv:2404.08968
13
citations
#2178

UniHuman: A Unified Model For Editing Human Images in the Wild

Nannan Li, Qing Liu, Krishna Kumar Singh et al.

CVPR 2024posterarXiv:2312.14985
13
citations
#2179

Adaptive Compressed Sensing with Diffusion-Based Posterior Sampling

Noam Elata, Tomer Michaeli, Michael Elad

ECCV 2024posterarXiv:2407.08256
13
citations
#2180

Multi-Scene Generalized Trajectory Global Graph Solver with Composite Nodes for Multiple Object Tracking

Yan Gao, Haojun Xu, Jie Li et al.

AAAI 2024paperarXiv:2312.08951
13
citations
#2181

Dual-Window Multiscale Transformer for Hyperspectral Snapshot Compressive Imaging

Fulin Luo, Xi Chen, Xiuwen Gong et al.

AAAI 2024paper
13
citations
#2182

OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation

Kwanyoung Kim, Yujin Oh, Jong Chul Ye

ECCV 2024posterarXiv:2403.14183
13
citations
#2183

Partial-to-Partial Shape Matching with Geometric Consistency

Viktoria Ehm, Maolin Gao, Paul Roetzer et al.

CVPR 2024posterarXiv:2404.12209
13
citations
#2184

BENO: Boundary-embedded Neural Operators for Elliptic PDEs

Haixin Wang, Jiaxin Li, Anubhav Dwivedi et al.

ICLR 2024posterarXiv:2401.09323
13
citations
#2185

Symmetric Mean-field Langevin Dynamics for Distributional Minimax Problems

Juno Kim, Kakei Yamamoto, Kazusato Oko et al.

ICLR 2024spotlightarXiv:2312.01127
13
citations
#2186

Unsupervised Cross-Domain Image Retrieval via Prototypical Optimal Transport

Bin Li, Ye Shi, Qian Yu et al.

AAAI 2024paperarXiv:2402.18411
13
citations
#2187

Robust Multimodal Learning via Representation Decoupling

Shicai Wei, Yang Luo, Yuji Wang et al.

ECCV 2024posterarXiv:2407.04458
13
citations
#2188

3D Multi-frame Fusion for Video Stabilization

Zhan Peng, Xinyi Ye, Weiyue Zhao et al.

CVPR 2024posterarXiv:2404.12887
13
citations
#2189

Hebbian Learning based Orthogonal Projection for Continual Learning of Spiking Neural Networks

Mingqing Xiao, Qingyan Meng, Zongpeng Zhang et al.

ICLR 2024posterarXiv:2402.11984
13
citations
#2190

Single-View Scene Point Cloud Human Grasp Generation

Yan-Kang Wang, Chengyi Xing, Yi-Lin Wei et al.

CVPR 2024posterarXiv:2404.15815
13
citations
#2191

Cycle-Consistency Learning for Captioning and Grounding

Ning Wang, Jiajun Deng, Mingbo Jia

AAAI 2024paperarXiv:2312.15162
13
citations
#2192

Self-Guided Generation of Minority Samples Using Diffusion Models

Soobin Um, Jong Chul Ye

ECCV 2024posterarXiv:2407.11555
13
citations
#2193

Task-Driven Causal Feature Distillation: Towards Trustworthy Risk Prediction

Zhixuan Chu, Mengxuan Hu, Qing Cui et al.

AAAI 2024paperarXiv:2312.16113
13
citations
#2194

Fine-Grained Knowledge Selection and Restoration for Non-exemplar Class Incremental Learning

Authors: Jiang-Tian Zhai, Xialei Liu, Lu Yu et al.

AAAI 2024paperarXiv:2312.12722
13
citations
#2195

3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation

Zihao Xiao, Longlong Jing, Shangxuan Wu et al.

ECCV 2024posterarXiv:2401.02402
13
citations
#2196

Move Anything with Layered Scene Diffusion

Jiawei Ren, Mengmeng Xu, Jui-Chieh Wu et al.

CVPR 2024posterarXiv:2404.07178
13
citations
#2197

ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic Environments

Taewoong Kim, Cheolhong Min, Byeonghwi Kim et al.

ECCV 2024posterarXiv:2407.18550
13
citations
#2198

SPD-DDPM: Denoising Diffusion Probabilistic Models in the Symmetric Positive Definite Space

Yunchen Li, Zhou Yu, Gaoqi He et al.

AAAI 2024paperarXiv:2312.08200
13
citations
#2199

IMMA: Immunizing text-to-image Models against Malicious Adaptation

Amber Yijia Zheng, Raymond Yeh

ECCV 2024posterarXiv:2311.18815
13
citations
#2200

How to Handle Sketch-Abstraction in Sketch-Based Image Retrieval?

Subhadeep Koley, Ayan Kumar Bhunia, Aneeshan Sain et al.

CVPR 2024posterarXiv:2403.07203
13
citations