Most Cited 2024 "neural pde solver" Papers

12,324 papers found • Page 12 of 62

#2201

Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation

Shihao Zhao, Shaozhe Hao, Bojia Zi et al.

ECCV 2024posterarXiv:2403.07860
14
citations
#2202

X-Pose: Detecting Any Keypoints

Jie Yang, AILING ZENG, Ruimao Zhang et al.

ECCV 2024posterarXiv:2310.08530
14
citations
#2203

Free-Editor: Zero-shot Text-driven 3D Scene Editing

Md Nazmul Karim, Hasan Iqbal, Umar Khalid et al.

ECCV 2024posterarXiv:2312.13663
14
citations
#2204

A Good Learner can Teach Better: Teacher-Student Collaborative Knowledge Distillation

Ayan Sengupta, Shantanu Dixit, Md Shad Akhtar et al.

ICLR 2024poster
14
citations
#2205

En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data

Yifang Men, Biwen Lei, Yuan Yao et al.

CVPR 2024posterarXiv:2401.01173
14
citations
#2206

HEAL-SWIN: A Vision Transformer On The Sphere

Oscar Carlsson, Jan E. Gerken, Hampus Linander et al.

CVPR 2024posterarXiv:2307.07313
14
citations
#2207

CAMIL: Context-Aware Multiple Instance Learning for Cancer Detection and Subtyping in Whole Slide Images

olga fourkioti, Matt De Vries, Chris Bakal

ICLR 2024spotlightarXiv:2305.05314
14
citations
#2208

CoBIT: A Contrastive Bi-directional Image-Text Generation Model

Haoxuan You, Xiaoyue Guo, Zhecan Wang et al.

ICLR 2024posterarXiv:2303.13455
14
citations
#2209

What You See is What You GAN: Rendering Every Pixel for High-Fidelity Geometry in 3D GANs

Alex Trevithick, Matthew Chan, Towaki Takikawa et al.

CVPR 2024posterarXiv:2401.02411
14
citations
#2210

AttnZero: Efficient Attention Discovery for Vision Transformers

Lujun Li, Zimian Wei, Peijie Dong et al.

ECCV 2024poster
14
citations
#2211

Referring Atomic Video Action Recognition

Kunyu Peng, Jia Fu, Kailun Yang et al.

ECCV 2024posterarXiv:2407.01872
14
citations
#2212

MoVideo: Motion-Aware Video Generation with Diffusion Models

Jingyun Liang, Yuchen Fan, Kai Zhang et al.

ECCV 2024posterarXiv:2311.11325
14
citations
#2213

Online GNN Evaluation Under Test-time Graph Distribution Shifts

Xin Zheng, Dongjin Song, Qingsong Wen et al.

ICLR 2024spotlightarXiv:2403.09953
14
citations
#2214

Event Camera Data Dense Pre-training

Yan Yang, Liyuan Pan, Liu liu

ECCV 2024posterarXiv:2311.11533
14
citations
#2215

CNN Kernels Can Be the Best Shapelets

Eric Qu, Yansen Wang, Xufang Luo et al.

ICLR 2024poster
14
citations
#2216

Towards Fair Graph Federated Learning via Incentive Mechanisms

12794 Chenglu Pan, Jiarong Xu, Yue Yu et al.

AAAI 2024paperarXiv:2312.13306
14
citations
#2217

Finding Visual Task Vectors

Alberto Hojel, Yutong Bai, Trevor Darrell et al.

ECCV 2024posterarXiv:2404.05729
14
citations
#2218

Vision-Language Action Knowledge Learning for Semantic-Aware Action Quality Assessment

Huangbiao Xu, Xiao Ke, Yuezhou Li et al.

ECCV 2024poster
14
citations
#2219

RoDUS: Robust Decomposition of Static and Dynamic Elements in Urban Scenes

Thang-Anh-Quan Nguyen, Luis G Roldao Jimenez, Nathan Piasco et al.

ECCV 2024posterarXiv:2403.09419
14
citations
#2220

AGS: Affordable and Generalizable Substitute Training for Transferable Adversarial Attack

Ruikui Wang, Yuanfang Guo, Yunhong Wang

AAAI 2024paper
14
citations
#2221

M2Doc: A Multi-Modal Fusion Approach for Document Layout Analysis

Ning Zhang, Hiuyi Cheng, Jiayu Chen et al.

AAAI 2024paper
14
citations
#2222

PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs

Michael Dorkenwald, Nimrod Barazani, Cees G. M. Snoek et al.

CVPR 2024posterarXiv:2402.08657
14
citations
#2223

Generative 3D Part Assembly via Part-Whole-Hierarchy Message Passing

Bi'an Du, Xiang Gao, Wei Hu et al.

CVPR 2024posterarXiv:2402.17464
14
citations
#2224

Retrieval-Guided Reinforcement Learning for Boolean Circuit Minimization

Animesh Basak Chowdhury, Marco Romanelli, Benjamin Tan et al.

ICLR 2024posterarXiv:2401.12205
14
citations
#2225

Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments

Djamahl Etchegaray, Zi Helen Huang, Tatsuya Harada et al.

ECCV 2024posterarXiv:2403.13556
14
citations
#2226

SyFormer: Structure-Guided Synergism Transformer for Large-Portion Image Inpainting

Jie Wu, Yuchao Feng, Honghui Xu et al.

AAAI 2024paper
14
citations
#2227

EntAugment: Entropy-Driven Adaptive Data Augmentation Framework for Image Classification

Suorong Yang, Furao Shen, Jian Zhao

ECCV 2024posterarXiv:2409.06290
14
citations
#2228

AMD: Automatic Multi-step Distillation of Large-scale Vision Models

Cheng Han, Qifan Wang, Sohail A Dianat et al.

ECCV 2024posterarXiv:2407.04208
14
citations
#2229

Dynamic Feature Pruning and Consolidation for Occluded Person Re-identification

YuTeng Ye, Hang Zhou, Jiale Cai et al.

AAAI 2024paperarXiv:2211.14742
14
citations
#2230

TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation

Xiaopei Wu, Yuenan Hou, Xiaoshui Huang et al.

CVPR 2024posterarXiv:2407.09751
14
citations
#2231

CMG-Net: Robust Normal Estimation for Point Clouds via Chamfer Normal Distance and Multi-Scale Geometry

Yingrui Wu, Mingyang Zhao, Keqiang Li et al.

AAAI 2024paperarXiv:2312.09154
14
citations
#2232

LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation

Archana Swaminathan, Anubhav Anubhav, Kamal Gupta et al.

ECCV 2024posterarXiv:2409.06703
14
citations
#2233

Pre-training with Random Orthogonal Projection Image Modeling

Maryam Haghighat, Peyman Moghadam, Shaheer Mohamed et al.

ICLR 2024spotlightarXiv:2310.18737
14
citations
#2234

DexFuncGrasp: A Robotic Dexterous Functional Grasp Dataset Constructed from a Cost-Effective Real-Simulation Annotation System

Jinglue Hang, Xiangbo Lin, Tianqiang Zhu et al.

AAAI 2024paper
14
citations
#2235

A Restoration Network as an Implicit Prior

Yuyang Hu, Mauricio Delbracio, Peyman Milanfar et al.

ICLR 2024posterarXiv:2310.01391
14
citations
#2236

LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer

Ning Yu, Chia-Chih Chen, Zeyuan Chen et al.

ECCV 2024posterarXiv:2212.09877
14
citations
#2237

Exploiting Auxiliary Caption for Video Grounding

Hongxiang Li, Meng Cao, Xuxin Cheng et al.

AAAI 2024paperarXiv:2301.05997
14
citations
#2238

HONGAT: Graph Attention Networks in the Presence of High-Order Neighbors

Heng-Kai Zhang, Yi-Ge Zhang, Zhi Zhou et al.

AAAI 2024paper
14
citations
#2239

HUMOS: Human Motion Model Conditioned on Body Shape

Shashank Tripathi, Omid Taheri, Christoph Lassner et al.

ECCV 2024posterarXiv:2409.03944
14
citations
#2240

DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction

Junwen Xiong, Peng Zhang, Tao You et al.

CVPR 2024posterarXiv:2403.01226
14
citations
#2241

Adversarial Backdoor Attack by Naturalistic Data Poisoning on Trajectory Prediction in Autonomous Driving

Mozhgan Pourkeshavarz, Mohammad Sabokrou, Amir Rasouli

CVPR 2024posterarXiv:2306.15755
14
citations
#2242

Neural-Symbolic Recursive Machine for Systematic Generalization

Qing Li, Yixin Zhu, Yitao Liang et al.

ICLR 2024posterarXiv:2210.01603
14
citations
#2243

UniCode : Learning a Unified Codebook for Multimodal Large Language Models

Sipeng Zheng, Bohan Zhou, Yicheng Feng et al.

ECCV 2024posterarXiv:2403.09072
14
citations
#2244

Temporal Event Stereo via Joint Learning with Stereoscopic Flow

Hoonhee Cho, Jae-young Kang, Kuk-Jin Yoon

ECCV 2024posterarXiv:2407.10831
14
citations
#2245

Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations

Renzhe Zhou, Chen-Xiao Gao, Zongzhang Zhang et al.

AAAI 2024paperarXiv:2312.15909
14
citations
#2246

Free-VSC: Free Semantics from Visual Foundation Models for Unsupervised Video Semantic Compression

Yuan Tian, Guo Lu, Guangtao Zhai

ECCV 2024posterarXiv:2409.11718
14
citations
#2247

Accelerating Data Generation for Neural Operators via Krylov Subspace Recycling

Hong Wang, Zhongkai Hao, Jie Wang et al.

ICLR 2024spotlightarXiv:2401.09516
14
citations
#2248

Robust Distillation via Untargeted and Targeted Intermediate Adversarial Samples

Junhao Dong, Piotr Koniusz, Junxi Chen et al.

CVPR 2024poster
14
citations
#2249

Spiking NeRF: Representing the Real-World Geometry by a Discontinuous Representation

Zhanfeng Liao, Yan Liu, Qian Zheng et al.

AAAI 2024paperarXiv:2311.09077
14
citations
#2250

HandBooster: Boosting 3D Hand-Mesh Reconstruction by Conditional Synthesis and Sampling of Hand-Object Interactions

Hao Xu, Li Haipeng, Yinqiao Wang et al.

CVPR 2024posterarXiv:2403.18575
14
citations
#2251

Learning Intra-view and Cross-view Geometric Knowledge for Stereo Matching

Rui Gong, Weide Liu, ZAIWANG GU et al.

CVPR 2024posterarXiv:2402.19270
14
citations
#2252

Reinforcement Learning Meets Visual Odometry

Nico Messikommer, Giovanni Cioffi, Mathias Gehrig et al.

ECCV 2024posterarXiv:2407.15626
14
citations
#2253

PointNeRF++: A multi-scale, point-based Neural Radiance Field

Weiwei Sun, Eduard Trulls, Yang-Che Tseng et al.

ECCV 2024posterarXiv:2312.02362
14
citations
#2254

DiffSCI: Zero-Shot Snapshot Compressive Imaging via Iterative Spectral Diffusion Model

Zhenghao Pan, Haijin Zeng, Jiezhang Cao et al.

CVPR 2024posterarXiv:2311.11417
14
citations
#2255

Iterative Ensemble Training with Anti-Gradient Control for Mitigating Memorization in Diffusion Models

Xiao Liu, Xiaoliu Guan, Yu Wu et al.

ECCV 2024posterarXiv:2407.15328
14
citations
#2256

Inspecting Prediction Confidence for Detecting Black-Box Backdoor Attacks

Tong Wang, Yuan Yao, Feng Xu et al.

AAAI 2024paper
14
citations
#2257

Deep Structural Knowledge Exploitation and Synergy for Estimating Node Importance Value on Heterogeneous Information Networks

Yankai Chen, Yixiang Fang, Qiongyan Wang et al.

AAAI 2024paperarXiv:2402.12411
14
citations
#2258

UnionFormer: Unified-Learning Transformer with Multi-View Representation for Image Manipulation Detection and Localization

Shuaibo Li, Wei Ma, Jianwei Guo et al.

CVPR 2024poster
14
citations
#2259

Norface: Improving Facial Expression Analysis by Identity Normalization

Hanwei Liu, Rudong An, Zhimeng Zhang et al.

ECCV 2024posterarXiv:2407.15617
14
citations
#2260

Long-term Temporal Context Gathering for Neural Video Compression

Linfeng Qi, Zhaoyang Jia, Jiahao Li et al.

ECCV 2024poster
14
citations
#2261

NeRF Analogies: Example-Based Visual Attribute Transfer for NeRFs

Michael Fischer, Zhengqin Li, Thu Nguyen-Phuoc et al.

CVPR 2024posterarXiv:2402.08622
14
citations
#2262

Neural Volumetric World Models for Autonomous Driving

Zanming Huang, Jimuyang Zhang, Eshed Ohn-Bar

ECCV 2024poster
14
citations
#2263

Slice3D: Multi-Slice Occlusion-Revealing Single View 3D Reconstruction

Yizhi Wang, Wallace Lira, Wenqi Wang et al.

CVPR 2024poster
14
citations
#2264

Learning Instance-Aware Correspondences for Robust Multi-Instance Point Cloud Registration in Cluttered Scenes

Zhiyuan Yu, Zheng Qin, lintao zheng et al.

CVPR 2024posterarXiv:2404.04557
14
citations
#2265

MoEAD: A Parameter-efficient Model for Multi-class Anomaly Detection

Shiyuan Meng, Wenchao Meng, Qihang Zhou et al.

ECCV 2024poster
14
citations
#2266

FoSp: Focus and Separation Network for Early Smoke Segmentation

Lujian Yao, Haitao Zhao, Jingchao Peng et al.

AAAI 2024paperarXiv:2306.04474
14
citations
#2267

Foster Adaptivity and Balance in Learning with Noisy Labels

Mengmeng Sheng, Zeren Sun, Tao Chen et al.

ECCV 2024posterarXiv:2407.02778
14
citations
#2268

Region-Based Representations Revisited

Michal Shlapentokh-Rothman, Ansel Blume, Yao Xiao et al.

CVPR 2024posterarXiv:2402.02352
14
citations
#2269

Rating-Based Reinforcement Learning

Devin White, Mingkang Wu, Ellen Novoseller et al.

AAAI 2024paperarXiv:2307.16348
14
citations
#2270

Estimating Noisy Class Posterior with Part-level Labels for Noisy Label Learning

Rui Zhao, Bin Shi, Jianfei Ruan et al.

CVPR 2024posterarXiv:2405.05714
14
citations
#2271

Towards Robust Image Stitching: An Adaptive Resistance Learning against Compatible Attacks

Zhiying Jiang, Xingyuan Li, Jinyuan Liu et al.

AAAI 2024paperarXiv:2402.15959
14
citations
#2272

Regroup Median Loss for Combating Label Noise

Authors: Fengpeng Li, Kemou Li, Jinyu Tian et al.

AAAI 2024paperarXiv:2312.06273
14
citations
#2273

Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object Detection

QIJIE MO, Yipeng Gao, Shenghao Fu et al.

ECCV 2024posterarXiv:2407.11499
14
citations
#2274

FineMatch: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction

Hang Hua, Jing Shi, Kushal Kafle et al.

ECCV 2024posterarXiv:2404.14715
14
citations
#2275

CDPNet: Cross-Modal Dual Phases Network for Point Cloud Completion

Zhenjiang Du, Jiale Dou, Zhitao Liu et al.

AAAI 2024paper
14
citations
#2276

ProTeCt: Prompt Tuning for Taxonomic Open Set Classification

Tz-Ying Wu, Chih-Hui Ho, Nuno Vasconcelos

CVPR 2024posterarXiv:2306.02240
14
citations
#2277

EmoTalk3D: High-Fidelity Free-View Synthesis of Emotional 3D Talking Head

Qianyun He, Xinya Ji, Yicheng Gong et al.

ECCV 2024posterarXiv:2408.00297
14
citations
#2278

Learning to Learn Better Visual Prompts

Fengxiang Wang, Wanrong Huang, Shaowu Yang et al.

AAAI 2024paper
14
citations
#2279

UniHuman: A Unified Model For Editing Human Images in the Wild

Nannan Li, Qing Liu, Krishna Kumar Singh et al.

CVPR 2024posterarXiv:2312.14985
14
citations
#2280

Can We Evaluate Domain Adaptation Models Without Target-Domain Labels?

JIANFEI YANG, Hanjie Qian, Yuecong Xu et al.

ICLR 2024posterarXiv:2305.18712
14
citations
#2281

RoDLA: Benchmarking the Robustness of Document Layout Analysis Models

Yufan Chen, Jiaming Zhang, Kunyu Peng et al.

CVPR 2024posterarXiv:2403.14442
13
citations
#2282

Parameterized Approximation Algorithms for Sum of Radii Clustering and Variants

Xianrun Chen, Dachuan Xu, Yicheng Xu et al.

AAAI 2024paper
13
citations
#2283

Grounding Language Models for Visual Entity Recognition

Zilin Xiao, Ming Gong, Paola Cascante-Bonilla et al.

ECCV 2024posterarXiv:2402.18695
13
citations
#2284

Full Bayesian Significance Testing via Neural Networks

Zehua Liu, Zimeng Li, Jingyuan Wang et al.

AAAI 2024paper
13
citations
#2285

PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects

Junyi Li, Junfeng Wu, Weizhi Zhao et al.

ECCV 2024posterarXiv:2407.16696
13
citations
#2286

DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion

Liao Shen, Tianqi Liu, Huiqiang Sun et al.

ECCV 2024posterarXiv:2409.09605
13
citations
#2287

InstaStyle: Inversion Noise of a Stylized Image is Secretly a Style Adviser

Xing Cui, Zekun Li, Peipei Li et al.

ECCV 2024posterarXiv:2311.15040
13
citations
#2288

ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer

Jiazhi Guan, Zhiliang Xu, Hang Zhou et al.

ECCV 2024posterarXiv:2408.03284
13
citations
#2289

SketchINR: A First Look into Sketches as Implicit Neural Representations

Hmrishav Bandyopadhyay, Ayan Kumar Bhunia, Pinaki Nath Chowdhury et al.

CVPR 2024posterarXiv:2403.09344
13
citations
#2290

How to Handle Sketch-Abstraction in Sketch-Based Image Retrieval?

Subhadeep Koley, Ayan Kumar Bhunia, Aneeshan Sain et al.

CVPR 2024posterarXiv:2403.07203
13
citations
#2291

Federated Causality Learning with Explainable Adaptive Optimization

Dezhi Yang, Xintong He, Jun Wang et al.

AAAI 2024paperarXiv:2312.05540
13
citations
#2292

BENO: Boundary-embedded Neural Operators for Elliptic PDEs

Haixin Wang, Jiaxin Li, Anubhav Dwivedi et al.

ICLR 2024posterarXiv:2401.09323
13
citations
#2293

ScanTalk: 3D Talking Heads from Unregistered Scans

Federico Nocentini, Thomas Besnier, Claudio Ferrari et al.

ECCV 2024posterarXiv:2403.10942
13
citations
#2294

Learning Representations of Satellite Images From Metadata Supervision

Jules Bourcier, Gohar Dashyan, Karteek Alahari et al.

ECCV 2024poster
13
citations
#2295

Scribble Hides Class: Promoting Scribble-Based Weakly-Supervised Semantic Segmentation

Xinliang Zhang, Lei Zhu, Hangzhou He et al.

AAAI 2024paperarXiv:2402.17555
13
citations
#2296

Rethinking the Paradigm of Content Constraints in Unpaired Image-to-Image Translation

Xiuding Cai, Yaoyao Zhu, Dong Miao et al.

AAAI 2024paperarXiv:2211.10867
13
citations
#2297

MagicEraser: Erasing Any Objects via Semantics-Aware Control

FAN LI, Zixiao Zhang, Yi Huang et al.

ECCV 2024posterarXiv:2410.10207
13
citations
#2298

FashionERN: Enhance-and-Refine Network for Composed Fashion Image Retrieval

Yanzhe Chen, Huasong Zhong, Xiangteng He et al.

AAAI 2024paper
13
citations
#2299

Learning Representations on the Unit Sphere: Investigating Angular Gaussian and Von Mises-Fisher Distributions for Online Continual Learning

Nicolas Michel, Giovanni Chierchia, Romain Negrel et al.

AAAI 2024paperarXiv:2306.03364
13
citations
#2300

Beyond Prototypes: Semantic Anchor Regularization for Better Representation Learning

Yanqi Ge, Qiang Nie, Ye Huang et al.

AAAI 2024paperarXiv:2312.11872
13
citations
#2301

On the Utility of 3D Hand Poses for Action Recognition

Md Salman Shamil, Dibyadip Chatterjee, Fadime Sener et al.

ECCV 2024posterarXiv:2403.09805
13
citations
#2302

Differentiable Euler Characteristic Transforms for Shape Classification

Ernst Roell, Bastian Rieck

ICLR 2024posterarXiv:2310.07630
13
citations
#2303

Brain-ID: Learning Contrast-agnostic Anatomical Representations for Brain Imaging

Peirong Liu, Oula Puonti, Xiaoling Hu et al.

ECCV 2024posterarXiv:2311.16914
13
citations
#2304

MCPNet: An Interpretable Classifier via Multi-Level Concept Prototypes

Bor Shiun Wang, Chien-Yi Wang, Wei-Chen Chiu

CVPR 2024posterarXiv:2404.08968
13
citations
#2305

Multi-Scene Generalized Trajectory Global Graph Solver with Composite Nodes for Multiple Object Tracking

Yan Gao, Haojun Xu, Jie Li et al.

AAAI 2024paperarXiv:2312.08951
13
citations
#2306

Learning Temporal Resolution in Spectrogram for Audio Classification

Haohe Liu, Xubo Liu, Qiuqiang Kong et al.

AAAI 2024paperarXiv:2210.01719
13
citations
#2307

UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding

Bowen Shi, Peisen Zhao, Zichen Wang et al.

ECCV 2024posterarXiv:2401.06397
13
citations
#2308

3iGS: Factorised Tensorial Illumination for 3D Gaussian Splatting

Zhe Jun Tang, Tat-Jen Cham

ECCV 2024posterarXiv:2408.03753
13
citations
#2309

Boosting Adversarial Training via Fisher-Rao Norm-based Regularization

Xiangyu Yin, Wenjie Ruan

CVPR 2024posterarXiv:2403.17520
13
citations
#2310

Compound Text-Guided Prompt Tuning via Image-Adaptive Cues

Hao Tan, Jun Li, Yizhuang Zhou et al.

AAAI 2024paperarXiv:2312.06401
13
citations
#2311

ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention

Chenhang He, Ruihuang Li, Guowen Zhang et al.

ECCV 2024posterarXiv:2401.00912
13
citations
#2312

Kalman-Inspired Feature Propagation for Video Face Super-Resolution

Ruicheng Feng, Chongyi Li, Chen Change Loy

ECCV 2024posterarXiv:2408.05205
13
citations
#2313

Chronic Poisoning: Backdoor Attack against Split Learning

Fangchao Yu, Bo Zeng, Kai Zhao et al.

AAAI 2024paper
13
citations
#2314

Boosting 3D Single Object Tracking with 2D Matching Distillation and 3D Pre-training

qiangqiang wu, Yan Xia, Jia Wan et al.

ECCV 2024poster
13
citations
#2315

BKDSNN: Enhancing the Performance of Learning-based Spiking Neural Networks Training with Blurred Knowledge Distillation

Zekai Xu, Kang You, Qinghai Guo et al.

ECCV 2024posterarXiv:2407.09083
13
citations
#2316

Single-View Scene Point Cloud Human Grasp Generation

Yan-Kang Wang, Chengyi Xing, Yi-Lin Wei et al.

CVPR 2024posterarXiv:2404.15815
13
citations
#2317

Effective Video Mirror Detection with Inconsistent Motion Cues

Alex Warren, Ke Xu, Jiaying Lin et al.

CVPR 2024poster
13
citations
#2318

InstructGIE: Towards Generalizable Image Editing

Zichong Meng, Changdi Yang, Jun Liu et al.

ECCV 2024posterarXiv:2403.05018
13
citations
#2319

LQMFormer: Language-aware Query Mask Transformer for Referring Image Segmentation

Nisarg Shah, Vibashan VS, Vishal M. Patel

CVPR 2024poster
13
citations
#2320

Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation

Peng Jin, Hao Li, Zesen Cheng et al.

ECCV 2024posterarXiv:2407.10528
13
citations
#2321

DV-3DLane: End-to-end Multi-modal 3D Lane Detection with Dual-view Representation

Yueru Luo, Shuguang Cui, Zhen Li

ICLR 2024posterarXiv:2406.16072
13
citations
#2322

Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image

Pengkun Jiao, Na Zhao, Jingjing Chen et al.

ECCV 2024posterarXiv:2407.05256
13
citations
#2323

Task-Driven Causal Feature Distillation: Towards Trustworthy Risk Prediction

Zhixuan Chu, Mengxuan Hu, Qing Cui et al.

AAAI 2024paperarXiv:2312.16113
13
citations
#2324

SportsSloMo: A New Benchmark and Baselines for Human-centric Video Frame Interpolation

Jiaben Chen, Huaizu Jiang

CVPR 2024posterarXiv:2308.16876
13
citations
#2325

WildRefer: 3D Object Localization in Large-scale Dynamic Scenes with Multi-modal Visual Data and Natural Language

Zhenxiang Lin, Xidong Peng, peishan cong et al.

ECCV 2024posterarXiv:2304.05645
13
citations
#2326

STAMP: Outlier-Aware Test-Time Adaptation with Stable Memory Replay

Yu Yongcan, Lijun Sheng, Ran He et al.

ECCV 2024posterarXiv:2407.15773
13
citations
#2327

Harnessing Text-to-Image Diffusion Models for Category-Agnostic Pose Estimation

Duo Peng, Zhengbo Zhang, Ping Hu et al.

ECCV 2024poster
13
citations
#2328

Trust Region Methods for Nonconvex Stochastic Optimization beyond Lipschitz Smoothness

Chenghan Xie, Chenxi Li, Chuwen Zhang et al.

AAAI 2024paperarXiv:2310.17319
13
citations
#2329

PH-Net: Semi-Supervised Breast Lesion Segmentation via Patch-wise Hardness

Siyao Jiang, Huisi Wu, Junyang Chen et al.

CVPR 2024poster
13
citations
#2330

Adaptive Compressed Sensing with Diffusion-Based Posterior Sampling

Noam Elata, Tomer Michaeli, Michael Elad

ECCV 2024posterarXiv:2407.08256
13
citations
#2331

Semi-supervised Segmentation of Histopathology Images with Noise-Aware Topological Consistency

Meilong Xu, Xiaoling Hu, Saumya Gupta et al.

ECCV 2024posterarXiv:2311.16447
13
citations
#2332

OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation

Kwanyoung Kim, Yujin Oh, Jong Chul Ye

ECCV 2024posterarXiv:2403.14183
13
citations
#2333

Where am I? Scene Retrieval with Language

Jiaqi Chen, Daniel Barath, Iro Armeni et al.

ECCV 2024posterarXiv:2404.14565
13
citations
#2334

GDA: Generalized Diffusion for Robust Test-time Adaptation

Yun-Yun Tsai, Fu-Chen Chen, Albert Chen et al.

CVPR 2024posterarXiv:2404.00095
13
citations
#2335

Light Schrödinger Bridge

Alexander Korotin, Nikita Gushchin, Evgeny Burnaev

ICLR 2024posterarXiv:2310.01174
13
citations
#2336

Hebbian Learning based Orthogonal Projection for Continual Learning of Spiking Neural Networks

Mingqing Xiao, Qingyan Meng, Zongpeng Zhang et al.

ICLR 2024posterarXiv:2402.11984
13
citations
#2337

Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL

Fangwei Zhong, Kui Wu, Hai Ci et al.

ECCV 2024posterarXiv:2404.09857
13
citations
#2338

3DGazeNet: Generalizing Gaze Estimation with Weak Supervision from Synthetic Views

Evangelos Ververas, Polydefkis Gkagkos, Jiankang Deng et al.

ECCV 2024posterarXiv:2212.02997
13
citations
#2339

Robust Multimodal Learning via Representation Decoupling

Shicai Wei, Yang Luo, Yuji Wang et al.

ECCV 2024posterarXiv:2407.04458
13
citations
#2340

Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather

Junsung Park, Kyungmin Kim, Hyunjung Shim

ECCV 2024posterarXiv:2407.02286
13
citations
#2341

Real-World Mobile Image Denoising Dataset with Efficient Baselines

Roman Flepp, Andrey Ignatov, Radu Timofte et al.

CVPR 2024poster
13
citations
#2342

USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation

Xiaoqi Wang, Wenbin He, Xiwei Xuan et al.

CVPR 2024posterarXiv:2406.05271
13
citations
#2343

SpectralNeRF: Physically Based Spectral Rendering with Neural Radiance Field

Ru Li, Jia Liu, Guanghui Liu et al.

AAAI 2024paperarXiv:2312.08692
13
citations
#2344

MultiDelete for Multimodal Machine Unlearning

Jiali Cheng, Hadi Amiri

ECCV 2024posterarXiv:2311.12047
13
citations
#2345

Self-Guided Generation of Minority Samples Using Diffusion Models

Soobin Um, Jong Chul Ye

ECCV 2024posterarXiv:2407.11555
13
citations
#2346

DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation

Xiaobin Hu, Xu Peng, Donghao Luo et al.

ECCV 2024posterarXiv:2403.06168
13
citations
#2347

3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation

Zihao Xiao, Longlong Jing, Shangxuan Wu et al.

ECCV 2024posterarXiv:2401.02402
13
citations
#2348

ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic Environments

Taewoong Kim, Cheolhong Min, Byeonghwi Kim et al.

ECCV 2024posterarXiv:2407.18550
13
citations
#2349

IMMA: Immunizing text-to-image Models against Malicious Adaptation

Amber Yijia Zheng, Raymond Yeh

ECCV 2024posterarXiv:2311.18815
13
citations
#2350

F3Loc: Fusion and Filtering for Floorplan Localization

Changan Chen, Rui Wang, Christoph Vogel et al.

CVPR 2024highlight
13
citations
#2351

OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments

Jinyi Liu, Zhi Wang, Yan Zheng et al.

AAAI 2024paperarXiv:2312.12145
13
citations
#2352

Partial-to-Partial Shape Matching with Geometric Consistency

Viktoria Ehm, Maolin Gao, Paul Roetzer et al.

CVPR 2024posterarXiv:2404.12209
13
citations
#2353

Improving Geo-diversity of Generated Images with Contextualized Vendi Score Guidance

Reyhane Askari Hemmat, Melissa Hall, Alicia Yi Sun et al.

ECCV 2024posterarXiv:2406.04551
13
citations
#2354

OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models

Zijian Zhou, Zheng Zhu, Holger Caesar et al.

ECCV 2024posterarXiv:2407.11213
13
citations
#2355

MaxFusion: Plug&Play Multi-Modal Generation in Text-to-Image Diffusion Models

Nithin Gopalakrishnan Nair, Jeya Maria Jose Valanarasu, Vishal Patel

ECCV 2024posterarXiv:2404.09977
13
citations
#2356

PAC Prediction Sets Under Label Shift

Wenwen Si, Sangdon Park, Insup Lee et al.

ICLR 2024posterarXiv:2310.12964
13
citations
#2357

Fed-QSSL: A Framework for Personalized Federated Learning under Bitwidth and Data Heterogeneity

Yiyue Chen, Haris Vikalo, Chianing Wang

AAAI 2024paperarXiv:2312.13380
13
citations
#2358

CALICO: Self-Supervised Camera-LiDAR Contrastive Pre-training for BEV Perception

Jiachen Sun, Haizhong Zheng, Qingzhao Zhang et al.

ICLR 2024posterarXiv:2306.00349
13
citations
#2359

Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Large Models

Chen Ju, Haicheng Wang, Haozhe Cheng et al.

ECCV 2024posterarXiv:2407.11717
13
citations
#2360

AdapEdit: Spatio-Temporal Guided Adaptive Editing Algorithm for Text-Based Continuity-Sensitive Image Editing

Zhiyuan Ma, Guoli Jia, Bowen Zhou

AAAI 2024paperarXiv:2312.08019
13
citations
#2361

FedLoGe: Joint Local and Generic Federated Learning under Long-tailed Data

Zikai Xiao, Zihan Chen, Liyinglan Liu et al.

ICLR 2024posterarXiv:2401.08977
13
citations
#2362

3D Neural Edge Reconstruction

Lei Li, Songyou Peng, Zehao Yu et al.

CVPR 2024posterarXiv:2405.19295
13
citations
#2363

Distilling Reliable Knowledge for Instance-Dependent Partial Label Learning

Dong-Dong Wu, Deng-Bao Wang, Min-Ling Zhang

AAAI 2024paper
13
citations
#2364

Cell Graph Transformer for Nuclei Classification

Wei Lou, Guanbin Li, Xiang Wan et al.

AAAI 2024paperarXiv:2402.12946
13
citations
#2365

Editable Image Elements for Controllable Synthesis

Jiteng Mu, Michael Gharbi, Richard Zhang et al.

ECCV 2024posterarXiv:2404.16029
13
citations
#2366

ISP-Teacher: Image Signal Process with Disentanglement Regularization for Unsupervised Domain Adaptive Dark Object Detection

Yin Zhang, Yongqiang Zhang, Zian Zhang et al.

AAAI 2024paper
13
citations
#2367

Rephrase, Augment, Reason: Visual Grounding of Questions for Vision-Language Models

Archiki Prasad, Elias Stengel-Eskin, Mohit Bansal

ICLR 2024posterarXiv:2310.05861
13
citations
#2368

TriSampler: A Better Negative Sampling Principle for Dense Retrieval

Zhen Yang, Zhou Shao, Yuxiao Dong et al.

AAAI 2024paperarXiv:2402.11855
13
citations
#2369

Every Pixel Has its Moments: Ultra-High-Resolution Unpaired Image-to-Image Translation via Dense Normalization

Ming-Yang Ho, Che-Ming Wu, Min-Sheng Wu et al.

ECCV 2024posterarXiv:2407.04245
13
citations
#2370

CVT-xRF: Contrastive In-Voxel Transformer for 3D Consistent Radiance Fields from Sparse Inputs

Yingji Zhong, Lanqing Hong, Zhenguo Li et al.

CVPR 2024posterarXiv:2403.16885
13
citations
#2371

3D Multi-frame Fusion for Video Stabilization

Zhan Peng, Xinyi Ye, Weiyue Zhao et al.

CVPR 2024posterarXiv:2404.12887
13
citations
#2372

Label-Efficient Few-Shot Semantic Segmentation with Unsupervised Meta-Training

Jianwu Li, Kaiyue Shi, Guo-Sen Xie et al.

AAAI 2024paper
13
citations
#2373

Teddy: Efficient Large-Scale Dataset Distillation via Taylor-Approximated Matching

Ruonan Yu, Songhua Liu, Jingwen Ye et al.

ECCV 2024posterarXiv:2410.07579
13
citations
#2374

Representing Topological Self-Similarity Using Fractal Feature Maps for Accurate Segmentation of Tubular Structures

Jiaxing Huang, Yanfeng Zhou, Yaoru Luo et al.

ECCV 2024posterarXiv:2407.14754
13
citations
#2375

Dual-Window Multiscale Transformer for Hyperspectral Snapshot Compressive Imaging

Fulin Luo, Xi Chen, Xiuwen Gong et al.

AAAI 2024paper
13
citations
#2376

Representing Part-Whole Hierarchies in Foundation Models by Learning Localizability Composability and Decomposability from Anatomy via Self Supervision

Mohammad Reza Hosseinzadeh Taher, Michael Gotway, Jianming Liang

CVPR 2024poster
13
citations
#2377

Symmetric Mean-field Langevin Dynamics for Distributional Minimax Problems

Juno Kim, Kakei Yamamoto, Kazusato Oko et al.

ICLR 2024spotlightarXiv:2312.01127
13
citations
#2378

Novel Class Discovery for Ultra-Fine-Grained Visual Categorization

Qi Jia, Yaqi Cai, Qi Jia et al.

CVPR 2024highlightarXiv:2405.06283
13
citations
#2379

Fine-Grained Knowledge Selection and Restoration for Non-exemplar Class Incremental Learning

Authors: Jiang-Tian Zhai, Xialei Liu, Lu Yu et al.

AAAI 2024paperarXiv:2312.12722
13
citations
#2380

SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression

Qingwen Bu, Sungrae Park, Minsoo Khang et al.

AAAI 2024paperarXiv:2308.10531
13
citations
#2381

Just a Hint: Point-Supervised Camouflaged Object Detection

Huafeng Chen, Dian SHAO, Guangqian Guo et al.

ECCV 2024posterarXiv:2408.10777
13
citations
#2382

SPD-DDPM: Denoising Diffusion Probabilistic Models in the Symmetric Positive Definite Space

Yunchen Li, Zhou Yu, Gaoqi He et al.

AAAI 2024paperarXiv:2312.08200
13
citations
#2383

Move Anything with Layered Scene Diffusion

Jiawei Ren, Mengmeng Xu, Jui-Chieh Wu et al.

CVPR 2024posterarXiv:2404.07178
13
citations
#2384

An Empirical Study of the Generalization Ability of Lidar 3D Object Detectors to Unseen Domains

George Eskandar

CVPR 2024posterarXiv:2402.17562
13
citations
#2385

Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D

Mukund Varma T, Peihao Wang, Zhiwen Fan et al.

CVPR 2024posterarXiv:2403.18922
13
citations
#2386

Identifiability of Direct Effects from Summary Causal Graphs

Simon Ferreira, Charles Assaad

AAAI 2024paperarXiv:2306.16958
13
citations
#2387

Generalizability of Adversarial Robustness Under Distribution Shifts

Bernard Ghanem, Kumail Alhamoud, Hasan Hammoud et al.

ICLR 2024poster
12
citations
#2388

Discriminative Sample-Guided and Parameter-Efficient Feature Space Adaptation for Cross-Domain Few-Shot Learning

Rashindrie Perera, Saman Halgamuge

CVPR 2024posterarXiv:2403.04492
12
citations
#2389

Multi-Sentence Grounding for Long-term Instructional Video

Zeqian Li, QIRUI CHEN, Tengda Han et al.

ECCV 2024posterarXiv:2312.14055
12
citations
#2390

ZeroFlow: Scalable Scene Flow via Distillation

Kyle Vedder, Neehar Peri, Nathaniel Chodosh et al.

ICLR 2024oralarXiv:2305.10424
12
citations
#2391

CF-NeRF: Camera Parameter Free Neural Radiance Fields with Incremental Learning

Qingsong Yan, Qiang Wang, Kaiyong Zhao et al.

AAAI 2024paperarXiv:2312.08760
12
citations
#2392

Adapting Fine-Grained Cross-View Localization to Areas without Fine Ground Truth

Zimin Xia, Yujiao Shi, HONGDONG LI et al.

ECCV 2024posterarXiv:2406.00474
12
citations
#2393

Generalized Planning for the Abstraction and Reasoning Corpus

Chao Lei, Nir Lipovetzky, Krista A. Ehinger

AAAI 2024paperarXiv:2401.07426
12
citations
#2394

DiffFAS: Face Anti-Spoofing via Generative Diffusion Models

Xinxu Ge, Xin Liu, Zitong Yu et al.

ECCV 2024posterarXiv:2409.08572
12
citations
#2395

Total-Decom: Decomposed 3D Scene Reconstruction with Minimal Interaction

Xiaoyang Lyu, Chirui Chang, Peng Dai et al.

CVPR 2024highlightarXiv:2403.19314
12
citations
#2396

H2O-SDF: Two-phase Learning for 3D Indoor Reconstruction using Object Surface Fields

Minyoung Park, MIRAE DO, Yeon Jae Shin et al.

ICLR 2024spotlightarXiv:2402.08138
12
citations
#2397

Physical-Based Event Camera Simulator

Haiqian Han, Jiacheng Lyu, Jianing Li et al.

ECCV 2024poster
12
citations
#2398

M3SOT: Multi-Frame, Multi-Field, Multi-Space 3D Single Object Tracking

Jiaming Liu, Yue Wu, Maoguo Gong et al.

AAAI 2024paperarXiv:2312.06117
12
citations
#2399

FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior

Zhekai Chen, Wen Wang, Zhen Yang et al.

ECCV 2024posterarXiv:2407.04947
12
citations
#2400

RGNet: A Unified Clip Retrieval and Grounding Network for Long Videos

Tanveer Hannan, Mohaiminul Islam, Thomas Seidl et al.

ECCV 2024posterarXiv:2312.06729
12
citations