Most Cited 2024 "box constraints" Papers

12,324 papers found • Page 12 of 62

#2201

Partial-to-Partial Shape Matching with Geometric Consistency

Viktoria Ehm, Maolin Gao, Paul Roetzer et al.

CVPR 2024posterarXiv:2404.12209
13
citations
#2202

SAFNet: Selective Alignment Fusion Network for Efficient HDR Imaging

Lingtong Kong, Bo Li, Yike Xiong et al.

ECCV 2024posterarXiv:2407.16308
13
citations
#2203

3D Multi-frame Fusion for Video Stabilization

Zhan Peng, Xinyi Ye, Weiyue Zhao et al.

CVPR 2024posterarXiv:2404.12887
13
citations
#2204

EvSign: Sign Language Recognition and Translation with Streaming Events

Pengyu Zhang, Hao Yin, Zeren Wang et al.

ECCV 2024posterarXiv:2407.12593
13
citations
#2205

Editable Image Elements for Controllable Synthesis

Jiteng Mu, Michael Gharbi, Richard Zhang et al.

ECCV 2024posterarXiv:2404.16029
13
citations
#2206

ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention

Chenhang He, Ruihuang Li, Guowen Zhang et al.

ECCV 2024posterarXiv:2401.00912
13
citations
#2207

Single-View Scene Point Cloud Human Grasp Generation

Yan-Kang Wang, Chengyi Xing, Yi-Lin Wei et al.

CVPR 2024posterarXiv:2404.15815
13
citations
#2208

Rephrase, Augment, Reason: Visual Grounding of Questions for Vision-Language Models

Archiki Prasad, Elias Stengel-Eskin, Mohit Bansal

ICLR 2024posterarXiv:2310.05861
13
citations
#2209

Teddy: Efficient Large-Scale Dataset Distillation via Taylor-Approximated Matching

Ruonan Yu, Songhua Liu, Jingwen Ye et al.

ECCV 2024posterarXiv:2410.07579
13
citations
#2210

3DGazeNet: Generalizing Gaze Estimation with Weak Supervision from Synthetic Views

Evangelos Ververas, Polydefkis Gkagkos, Jiankang Deng et al.

ECCV 2024posterarXiv:2212.02997
13
citations
#2211

DV-3DLane: End-to-end Multi-modal 3D Lane Detection with Dual-view Representation

Yueru Luo, Shuguang Cui, Zhen Li

ICLR 2024posterarXiv:2406.16072
13
citations
#2212

Move Anything with Layered Scene Diffusion

Jiawei Ren, Mengmeng Xu, Jui-Chieh Wu et al.

CVPR 2024posterarXiv:2404.07178
13
citations
#2213

ScanTalk: 3D Talking Heads from Unregistered Scans

Federico Nocentini, Thomas Besnier, Claudio Ferrari et al.

ECCV 2024posterarXiv:2403.10942
13
citations
#2214

Differentiable Euler Characteristic Transforms for Shape Classification

Ernst Roell, Bastian Rieck

ICLR 2024posterarXiv:2310.07630
13
citations
#2215

Representing Part-Whole Hierarchies in Foundation Models by Learning Localizability Composability and Decomposability from Anatomy via Self Supervision

Mohammad Reza Hosseinzadeh Taher, Michael Gotway, Jianming Liang

CVPR 2024poster
13
citations
#2216

Boosting 3D Single Object Tracking with 2D Matching Distillation and 3D Pre-training

qiangqiang wu, Yan Xia, Jia Wan et al.

ECCV 2024poster
13
citations
#2217

Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image

Pengkun Jiao, Na Zhao, Jingjing Chen et al.

ECCV 2024posterarXiv:2407.05256
13
citations
#2218

SketchINR: A First Look into Sketches as Implicit Neural Representations

Hmrishav Bandyopadhyay, Ayan Kumar Bhunia, Pinaki Nath Chowdhury et al.

CVPR 2024posterarXiv:2403.09344
13
citations
#2219

Just a Hint: Point-Supervised Camouflaged Object Detection

Huafeng Chen, Dian SHAO, Guangqian Guo et al.

ECCV 2024posterarXiv:2408.10777
13
citations
#2220

Kalman-Inspired Feature Propagation for Video Face Super-Resolution

Ruicheng Feng, Chongyi Li, Chen Change Loy

ECCV 2024posterarXiv:2408.05205
13
citations
#2221

Beyond Prototypes: Semantic Anchor Regularization for Better Representation Learning

Yanqi Ge, Qiang Nie, Ye Huang et al.

AAAI 2024paperarXiv:2312.11872
13
citations
#2222

OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments

Jinyi Liu, Zhi Wang, Yan Zheng et al.

AAAI 2024paperarXiv:2312.12145
13
citations
#2223

Semi-supervised Segmentation of Histopathology Images with Noise-Aware Topological Consistency

Meilong Xu, Xiaoling Hu, Saumya Gupta et al.

ECCV 2024posterarXiv:2311.16447
13
citations
#2224

Learning Representations of Satellite Images From Metadata Supervision

Jules Bourcier, Gohar Dashyan, Karteek Alahari et al.

ECCV 2024poster
13
citations
#2225

JRDB-Social: A Multifaceted Robotic Dataset for Understanding of Context and Dynamics of Human Interactions Within Social Groups

Simindokht Jahangard, Zhixi Cai, Shiki Wen et al.

CVPR 2024posterarXiv:2404.04458
13
citations
#2226

STAMP: Outlier-Aware Test-Time Adaptation with Stable Memory Replay

Yu Yongcan, Lijun Sheng, Ran He et al.

ECCV 2024posterarXiv:2407.15773
13
citations
#2227

CVT-xRF: Contrastive In-Voxel Transformer for 3D Consistent Radiance Fields from Sparse Inputs

Yingji Zhong, Lanqing Hong, Zhenguo Li et al.

CVPR 2024posterarXiv:2403.16885
13
citations
#2228

BENO: Boundary-embedded Neural Operators for Elliptic PDEs

Haixin Wang, Jiaxin Li, Anubhav Dwivedi et al.

ICLR 2024posterarXiv:2401.09323
13
citations
#2229

BKDSNN: Enhancing the Performance of Learning-based Spiking Neural Networks Training with Blurred Knowledge Distillation

Zekai Xu, Kang You, Qinghai Guo et al.

ECCV 2024posterarXiv:2407.09083
13
citations
#2230

InstructGIE: Towards Generalizable Image Editing

Zichong Meng, Changdi Yang, Jun Liu et al.

ECCV 2024posterarXiv:2403.05018
13
citations
#2231

Harnessing Text-to-Image Diffusion Models for Category-Agnostic Pose Estimation

Duo Peng, Zhengbo Zhang, Ping Hu et al.

ECCV 2024poster
13
citations
#2232

Novel Class Discovery for Ultra-Fine-Grained Visual Categorization

Qi Jia, Yaqi Cai, Qi Jia et al.

CVPR 2024highlightarXiv:2405.06283
13
citations
#2233

DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion

Liao Shen, Tianqi Liu, Huiqiang Sun et al.

ECCV 2024posterarXiv:2409.09605
13
citations
#2234

Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL

Fangwei Zhong, Kui Wu, Hai Ci et al.

ECCV 2024posterarXiv:2404.09857
13
citations
#2235

MagicEraser: Erasing Any Objects via Semantics-Aware Control

FAN LI, Zixiao Zhang, Yi Huang et al.

ECCV 2024posterarXiv:2410.10207
13
citations
#2236

Light Schrödinger Bridge

Alexander Korotin, Nikita Gushchin, Evgeny Burnaev

ICLR 2024posterarXiv:2310.01174
13
citations
#2237

SpectralNeRF: Physically Based Spectral Rendering with Neural Radiance Field

Ru Li, Jia Liu, Guanghui Liu et al.

AAAI 2024paperarXiv:2312.08692
13
citations
#2238

LQMFormer: Language-aware Query Mask Transformer for Referring Image Segmentation

Nisarg Shah, Vibashan VS, Vishal M. Patel

CVPR 2024poster
13
citations
#2239

MultiPly: Reconstruction of Multiple People from Monocular Video in the Wild

Zeren Jiang, Chen Guo, Manuel Kaufmann et al.

CVPR 2024posterarXiv:2406.01595
13
citations
#2240

AdapEdit: Spatio-Temporal Guided Adaptive Editing Algorithm for Text-Based Continuity-Sensitive Image Editing

Zhiyuan Ma, Guoli Jia, Bowen Zhou

AAAI 2024paperarXiv:2312.08019
13
citations
#2241

On the Utility of 3D Hand Poses for Action Recognition

Md Salman Shamil, Dibyadip Chatterjee, Fadime Sener et al.

ECCV 2024posterarXiv:2403.09805
13
citations
#2242

MultiDelete for Multimodal Machine Unlearning

Jiali Cheng, Hadi Amiri

ECCV 2024posterarXiv:2311.12047
13
citations
#2243

Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation

Zhengyuan Xie, Haiquan Lu, Jia-wen Xiao et al.

ECCV 2024posterarXiv:2407.14142
12
citations
#2244

Contrastive ground-level image and remote sensing pre-training improves representation learning for natural world imagery

Andy V Huynh, Lauren Gillespie, Jael Lopez-Saucedo et al.

ECCV 2024posterarXiv:2409.19439
12
citations
#2245

DiffFAS: Face Anti-Spoofing via Generative Diffusion Models

Xinxu Ge, Xin Liu, Zitong Yu et al.

ECCV 2024posterarXiv:2409.08572
12
citations
#2246

CMTA: Cross-Modal Temporal Alignment for Event-guided Video Deblurring

Taewoo Kim, Hoonhee Cho, Kuk-Jin Yoon

ECCV 2024posterarXiv:2408.14930
12
citations
#2247

Lipsum-FT: Robust Fine-Tuning of Zero-Shot Models Using Random Text Guidance

Giung Nam, Byeongho Heo, Juho Lee

ICLR 2024posterarXiv:2404.00860
12
citations
#2248

OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection

Jinghua Hou, Tong Wang, Xiaoqing Ye et al.

ECCV 2024posterarXiv:2407.10753
12
citations
#2249

EX-Graph: A Pioneering Dataset Bridging Ethereum and X

Qian Wang, Zhen Zhang, Zemin Liu et al.

ICLR 2024posterarXiv:2310.01015
12
citations
#2250

CF-NeRF: Camera Parameter Free Neural Radiance Fields with Incremental Learning

Qingsong Yan, Qiang Wang, Kaiyong Zhao et al.

AAAI 2024paperarXiv:2312.08760
12
citations
#2251

UpFusion: Novel View Diffusion from Unposed Sparse View Observations

Bharath Raj Nagoor Kani, Hsin-Ying Lee, Sergey Tulyakov et al.

ECCV 2024posterarXiv:2312.06661
12
citations
#2252

H2O-SDF: Two-phase Learning for 3D Indoor Reconstruction using Object Surface Fields

Minyoung Park, MIRAE DO, Yeon Jae Shin et al.

ICLR 2024spotlightarXiv:2402.08138
12
citations
#2253

FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior

Zhekai Chen, Wen Wang, Zhen Yang et al.

ECCV 2024posterarXiv:2407.04947
12
citations
#2254

Unsupervised Gaze Representation Learning from Multi-view Face Images

Yiwei Bao, Feng Lu

CVPR 2024poster
12
citations
#2255

Mixture of Efficient Diffusion Experts Through Automatic Interval and Sub-Network Selection

Alireza Ganjdanesh, Yan Kang, Yuchen Liu et al.

ECCV 2024posterarXiv:2409.15557
12
citations
#2256

Self-Supervised Disentangled Representation Learning for Robust Target Speech Extraction

Zhaoxi Mu, Xinyu Yang, Sining Sun et al.

AAAI 2024paperarXiv:2312.10305
12
citations
#2257

Discriminative Sample-Guided and Parameter-Efficient Feature Space Adaptation for Cross-Domain Few-Shot Learning

Rashindrie Perera, Saman Halgamuge

CVPR 2024posterarXiv:2403.04492
12
citations
#2258

Robust Test-Time Adaptation for Zero-Shot Prompt Tuning

Ding-Chu Zhang, Zhi Zhou, Yufeng Li

AAAI 2024paper
12
citations
#2259

Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis

Zicheng Zhang, RUOBING ZHENG, Bonan Li et al.

CVPR 2024posterarXiv:2402.17364
12
citations
#2260

Beyond Stationarity: Convergence Analysis of Stochastic Softmax Policy Gradient Methods

Sara Klein, Simon Weissmann, Leif Döring

ICLR 2024posterarXiv:2310.02671
12
citations
#2261

ConcaveQ: Non-monotonic Value Function Factorization via Concave Representations in Deep Multi-Agent Reinforcement Learning

Huiqun Li, Hanhan Zhou, Yifei Zou et al.

AAAI 2024paperarXiv:2312.15555
12
citations
#2262

RICA^2: Rubric-Informed, Calibrated Assessment of Actions

Abrar Majeedi, Viswanatha Reddy Gajjala, Satya Sai Srinath Namburi GNVV et al.

ECCV 2024poster
12
citations
#2263

Modeling and Driving Human Body Soundfields through Acoustic Primitives

Chao Huang, Dejan Markovic, Chenliang Xu et al.

ECCV 2024posterarXiv:2407.13083
12
citations
#2264

Data-efficient Large Vision Models through Sequential Autoregression

Zhiwei Hao, Jianyuan Guo, Chengcheng Wang et al.

ICML 2024posterarXiv:2402.04841
12
citations
#2265

Enhancing RAW-to-sRGB with Decoupled Style Structure in Fourier Domain

Xuanhua He, Tao Hu, Guoli Wang et al.

AAAI 2024paperarXiv:2401.02161
12
citations
#2266

Improving Bird's Eye View Semantic Segmentation by Task Decomposition

Tianhao Zhao, Yongcan Chen, Yu Wu et al.

CVPR 2024posterarXiv:2404.01925
12
citations
#2267

PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance

Aoming Liu, Zhong Li, Zhang Chen et al.

ECCV 2024posterarXiv:2408.02157
12
citations
#2268

Federated Online Adaptation for Deep Stereo

Matteo Poggi, Fabio Tosi

CVPR 2024posterarXiv:2405.14873
12
citations
#2269

Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Large Models

Chen Ju, Haicheng Wang, Haozhe Cheng et al.

ECCV 2024posterarXiv:2407.11717
12
citations
#2270

Symbolic Regression Enhanced Decision Trees for Classification Tasks

Kei Sen Fong, Mehul Motani

AAAI 2024paper
12
citations
#2271

DailyDVS-200: A Comprehensive Benchmark Dataset for Event-Based Action Recognition

Qi Wang, Zhou Xu, Yuming Lin et al.

ECCV 2024posterarXiv:2407.05106
12
citations
#2272

Adapting Fine-Grained Cross-View Localization to Areas without Fine Ground Truth

Zimin Xia, Yujiao Shi, HONGDONG LI et al.

ECCV 2024posterarXiv:2406.00474
12
citations
#2273

TriSampler: A Better Negative Sampling Principle for Dense Retrieval

Zhen Yang, Zhou Shao, Yuxiao Dong et al.

AAAI 2024paperarXiv:2402.11855
12
citations
#2274

Generalized Planning for the Abstraction and Reasoning Corpus

Chao Lei, Nir Lipovetzky, Krista A. Ehinger

AAAI 2024paperarXiv:2401.07426
12
citations
#2275

Accelerating Neural Field Training via Soft Mining

Shakiba Kheradmand, Daniel Rebain, Gopal Sharma et al.

CVPR 2024posterarXiv:2312.00075
12
citations
#2276

SINDER: Repairing the Singular Defects of DINOv2

Haoqi Wang, Tong Zhang, Mathieu Salzmann

ECCV 2024posterarXiv:2407.16826
12
citations
#2277

Double-Layer Hybrid-Label Identification Feature Selection for Multi-View Multi-Label Learning

Pingting Hao, Kunpeng Liu, Wanfu Gao

AAAI 2024paper
12
citations
#2278

A Generalized Shuffle Framework for Privacy Amplification: Strengthening Privacy Guarantees and Enhancing Utility

Chen E, Yang Cao, Ge Yifei

AAAI 2024paperarXiv:2312.14388
12
citations
#2279

Can OOD Object Detectors Learn from Foundation Models?

Jiahui Liu, Xin Wen, Shizhen Zhao et al.

ECCV 2024posterarXiv:2409.05162
12
citations
#2280

M3SOT: Multi-Frame, Multi-Field, Multi-Space 3D Single Object Tracking

Jiaming Liu, Yue Wu, Maoguo Gong et al.

AAAI 2024paperarXiv:2312.06117
12
citations
#2281

Explorative Inbetweening of Time and Space

Haiwen Feng, Zheng Ding, Zhihao Xia et al.

ECCV 2024posterarXiv:2403.14611
12
citations
#2282

Asymmetric Masked Distillation for Pre-Training Small Foundation Models

Zhiyu Zhao, Bingkun Huang, Sen Xing et al.

CVPR 2024posterarXiv:2311.03149
12
citations
#2283

BRIDGE: Bridging Gaps in Image Captioning Evaluation with Stronger Visual Cues

Sara Sarto, Marcella Cornia, Lorenzo Baraldi et al.

ECCV 2024posterarXiv:2407.20341
12
citations
#2284

Real Appearance Modeling for More General Deepfake Detection

Jiahe Tian, Yu Cai, Xi Wang et al.

ECCV 2024poster
12
citations
#2285

AdjointDPM: Adjoint Sensitivity Method for Gradient Backpropagation of Diffusion Probabilistic Models

Jiachun Pan, Jiachun Pan, Jun Hao Liew et al.

ICLR 2024posterarXiv:2307.10711
12
citations
#2286

Functional Diffusion

Biao Zhang, Peter Wonka

CVPR 2024posterarXiv:2311.15435
12
citations
#2287

SAMFusion: Sensor-Adaptive Multimodal Fusion for 3D Object Detection in Adverse Weather

Edoardo Palladin, Roland Dietze, Praveen Narayanan et al.

ECCV 2024posterarXiv:2508.16408
12
citations
#2288

Improving Feature Stability during Upsampling -- Spectral Artifacts and the Importance of Spatial Context

Shashank Agnihotri, Julia Grabinski, Margret Keuper

ECCV 2024posterarXiv:2311.17524
12
citations
#2289

FutureDepth: Learning to Predict the Future Improves Video Depth Estimation

Rajeev Yasarla, Manish Kumar Singh, Hong Cai et al.

ECCV 2024posterarXiv:2403.12953
12
citations
#2290

PointInfinity: Resolution-Invariant Point Diffusion Models

Zixuan Huang, Justin Johnson, Shoubhik Debnath et al.

CVPR 2024posterarXiv:2404.03566
12
citations
#2291

BAFFLE: A Baseline of Backpropagation-Free Federated Learning

Haozhe Feng, Tianyu Pang, Chao Du et al.

ECCV 2024posterarXiv:2301.12195
12
citations
#2292

UCIP: A Universal Framework for Compressed Image Super-Resolution using Dynamic Prompt

Xin Li, Bingchen Li, Yeying Jin et al.

ECCV 2024posterarXiv:2407.13108
12
citations
#2293

Eliminating Warping Shakes for Unsupervised Online Video Stitching

Lang Nie, Chunyu Lin, Kang Liao et al.

ECCV 2024posterarXiv:2403.06378
12
citations
#2294

Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations

Yongyuan Liang, Yanchao Sun, Ruijie Zheng et al.

ICLR 2024oralarXiv:2307.12062
12
citations
#2295

Robust Overfitting Does Matter: Test-Time Adversarial Purification With FGSM

Linyu Tang, Lei Zhang

CVPR 2024posterarXiv:2403.11448
12
citations
#2296

CamoTeacher: Dual-Rotation Consistency Learning for Semi-Supervised Camouflaged Object Detection

Xunfa Lai, Zhiyu Yang, Jie Hu et al.

ECCV 2024posterarXiv:2408.08050
12
citations
#2297

Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation

Seonghoon Yu, Paul Hongsuck Seo, Jeany Son

ECCV 2024posterarXiv:2407.07412
12
citations
#2298

SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression

Qingwen Bu, Sungrae Park, Minsoo Khang et al.

AAAI 2024paperarXiv:2308.10531
12
citations
#2299

DoughNet: A Visual Predictive Model for Topological Manipulation of Deformable Objects

Dominik Bauer, Zhenjia Xu, Shuran Song

ECCV 2024posterarXiv:2404.12524
12
citations
#2300

Generative Powers of Ten

Xiaojuan Wang, Janne Kontkanen, Brian Curless et al.

CVPR 2024highlightarXiv:2312.02149
12
citations
#2301

Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models

Luo Jiayun, Siddhesh Khandelwal, Leonid Sigal et al.

CVPR 2024posterarXiv:2311.17095
12
citations
#2302

Unexplored Faces of Robustness and Out-of-Distribution: Covariate Shifts in Environment and Sensor Domains

Eunsu Baek, Keondo Park, Ji-yoon Kim et al.

CVPR 2024posterarXiv:2404.15882
12
citations
#2303

Correcting Diffusion Generation through Resampling

Yujian Liu, Yang Zhang, Tommi Jaakkola et al.

CVPR 2024highlightarXiv:2312.06038
12
citations
#2304

Learning Video Context as Interleaved Multimodal Sequences

Qinghong Lin, Pengchuan Zhang, Difei Gao et al.

ECCV 2024posterarXiv:2407.21757
12
citations
#2305

Total-Decom: Decomposed 3D Scene Reconstruction with Minimal Interaction

Xiaoyang Lyu, Chirui Chang, Peng Dai et al.

CVPR 2024highlightarXiv:2403.19314
12
citations
#2306

Universal Novelty Detection Through Adaptive Contrastive Learning

Hossein Mirzaei, Mojtaba Nafez, Mohammad Jafari et al.

CVPR 2024posterarXiv:2408.10798
12
citations
#2307

Generalizable Fourier Augmentation for Unsupervised Video Object Segmentation

Huihui Song, Tiankang Su, Yuhui Zheng et al.

AAAI 2024paper
12
citations
#2308

Kernel Diffusion: An Alternate Approach to Blind Deconvolution

Yash Sanghvi, Yiheng Chi, Stanley Chan

ECCV 2024posterarXiv:2312.02319
12
citations
#2309

Which Model Generated This Image? A Model-Agnostic Approach for Origin Attribution

Fengyuan Liu, Haochen Luo, Yiming Li et al.

ECCV 2024posterarXiv:2404.02697
12
citations
#2310

Towards Real-World Adverse Weather Image Restoration: Enhancing Clearness and Semantics with Vision-Language Models

Jiaqi Xu, Mengyang Wu, Xiaowei Hu et al.

ECCV 2024posterarXiv:2409.02101
12
citations
#2311

Mitigating the Curse of Dimensionality for Certified Robustness via Dual Randomized Smoothing

Song Xia, Yi Yu, Jiang Xudong et al.

ICLR 2024posterarXiv:2404.09586
12
citations
#2312

Considering Nonstationary within Multivariate Time Series with Variational Hierarchical Transformer for Forecasting

Muyao Wang, Wenchao Chen, Bo Chen

AAAI 2024paperarXiv:2403.05406
12
citations
#2313

Learning Efficient and Robust Multi-Agent Communication via Graph Information Bottleneck

Shifei Ding, Wei Du, Ling Ding et al.

AAAI 2024paper
12
citations
#2314

Bidirectional Temporal Plan Graph: Enabling Switchable Passing Orders for More Efficient Multi-Agent Path Finding Plan Execution

Yifan Su, Rishi Veerapaneni, Jiaoyang Li

AAAI 2024paperarXiv:2401.00315
12
citations
#2315

DreamDiffusion: High-Quality EEG-to-Image Generation with Temporal Masked Signal Modeling and CLIP Alignment

Yunpeng Bai, Xintao Wang, Yanpei Cao et al.

ECCV 2024poster
12
citations
#2316

Temporally Consistent Stereo Matching

Jiaxi Zeng, Chengtang Yao, Yuwei Wu et al.

ECCV 2024posterarXiv:2407.11950
12
citations
#2317

Weakly Supervised Monocular 3D Detection with a Single-View Image

Xueying Jiang, Sheng Jin, Lewei Lu et al.

CVPR 2024posterarXiv:2402.19144
12
citations
#2318

TexOct: Generating Textures of 3D Models with Octree-based Diffusion

Jialun Liu, Chenming Wu, Xinqi Liu et al.

CVPR 2024poster
12
citations
#2319

Boosting Adversarial Training via Fisher-Rao Norm-based Regularization

Xiangyu Yin, Wenjie Ruan

CVPR 2024posterarXiv:2403.17520
12
citations
#2320

Discover and Mitigate Multiple Biased Subgroups in Image Classifiers

Zeliang Zhang, Mingqian Feng, Zhiheng Li et al.

CVPR 2024posterarXiv:2403.12777
12
citations
#2321

Multi-Label Cluster Discrimination for Visual Representation Learning

Xiang An, Kaicheng Yang, Xiangzi Dai et al.

ECCV 2024posterarXiv:2407.17331
12
citations
#2322

Adaptive Self-training Framework for Fine-grained Scene Graph Generation

Kibum Kim, Kanghoon Yoon, Yeonjun In et al.

ICLR 2024posterarXiv:2401.09786
12
citations
#2323

ChEX: Interactive Localization and Region Description in Chest X-rays

Philip Müller, Georgios Kaissis, Daniel Rueckert

ECCV 2024posterarXiv:2404.15770
12
citations
#2324

DeTra: A Unified Model for Object Detection and Trajectory Forecasting

Sergio Casas, Ben T Agro, Jiageng Mao et al.

ECCV 2024posterarXiv:2406.04426
12
citations
#2325

Mastering Symbolic Operations: Augmenting Language Models with Compiled Neural Networks

Yixuan Weng, Minjun Zhu, Fei Xia et al.

ICLR 2024posterarXiv:2304.01665
12
citations
#2326

CuVLER: Enhanced Unsupervised Object Discoveries through Exhaustive Self-Supervised Transformers

Shahaf Arica, Or Rubin, Sapir Gershov et al.

CVPR 2024posterarXiv:2403.07700
12
citations
#2327

Pareto Deep Long-Tailed Recognition: A Conflict-Averse Solution

Zhipeng Zhou, Liu Liu, Peilin Zhao et al.

ICLR 2024oral
12
citations
#2328

MoEAD: A Parameter-efficient Model for Multi-class Anomaly Detection

Shiyuan Meng, Wenchao Meng, Qihang Zhou et al.

ECCV 2024poster
12
citations
#2329

On the hardness of learning under symmetries

Bobak Kiani, Thien Le, Hannah Lawrence et al.

ICLR 2024spotlightarXiv:2401.01869
12
citations
#2330

Mitigating Background Shift in Class-Incremental Semantic Segmentation

gilhan Park, WonJun Moon, SuBeen Lee et al.

ECCV 2024posterarXiv:2407.11859
12
citations
#2331

PairAug: What Can Augmented Image-Text Pairs Do for Radiology?

Yutong Xie, Qi Chen, Sinuo Wang et al.

CVPR 2024posterarXiv:2404.04960
12
citations
#2332

∞-Brush: Controllable Large Image Synthesis with Diffusion Models in Infinite Dimensions

Minh Quan Le, Alexandros Graikos, Srikar Yellapragada et al.

ECCV 2024posterarXiv:2407.14709
12
citations
#2333

Closed-Loop Unsupervised Representation Disentanglement with $\beta$-VAE Distillation and Diffusion Probabilistic Feedback

Xin Jin, Bohan Li, Baao Xie et al.

ECCV 2024poster
12
citations
#2334

ZeroFlow: Scalable Scene Flow via Distillation

Kyle Vedder, Neehar Peri, Nathaniel Chodosh et al.

ICLR 2024oralarXiv:2305.10424
12
citations
#2335

COHO: Context-Sensitive City-Scale Hierarchical Urban Layout Generation

Liu He, Daniel Aliaga

ECCV 2024posterarXiv:2407.11294
12
citations
#2336

Perturbation-Invariant Adversarial Training for Neural Ranking Models: Improving the Effectiveness-Robustness Trade-Off

Yuansan Liu, Ruqing Zhang, Mingkun Zhang et al.

AAAI 2024paperarXiv:2312.10329
12
citations
#2337

Continuous Treatment Effect Estimation Using Gradient Interpolation and Kernel Smoothing

Lokesh Nagalapatti, Akshay Iyer, Abir De et al.

AAAI 2024paperarXiv:2401.15447
12
citations
#2338

Adversarial Attacks on the Interpretation of Neuron Activation Maximization

Géraldin Nanfack, Alexander Fulleringer, Jonathan Marty et al.

AAAI 2024paperarXiv:2306.07397
12
citations
#2339

Towards Reliable Evaluation and Fast Training of Robust Semantic Segmentation Models

Francesco Croce, Naman D. Singh, Matthias Hein

ECCV 2024posterarXiv:2306.12941
12
citations
#2340

Performative Federated Learning: A Solution to Model-Dependent and Heterogeneous Distribution Shifts

Kun Jin, Tongxin Yin, Zhongzhu Chen et al.

AAAI 2024paperarXiv:2305.05090
12
citations
#2341

CrossGLG: LLM Guides One-shot Skeleton-based 3D Action Recognition in a Cross-level Manner

Tingbing Yan, Wenzheng Zeng, Yang Xiao et al.

ECCV 2024posterarXiv:2403.10082
12
citations
#2342

PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation

Shilin Yan, Xiaohao Xu, Renrui Zhang et al.

ECCV 2024posterarXiv:2309.12303
12
citations
#2343

Multi-modal Crowd Counting via a Broker Modality

Haoliang Meng, Xiaopeng Hong, Chenhao Wang et al.

ECCV 2024posterarXiv:2407.07518
12
citations
#2344

Generalizability of Adversarial Robustness Under Distribution Shifts

Bernard Ghanem, Kumail Alhamoud, Hasan Hammoud et al.

ICLR 2024poster
12
citations
#2345

Rate-Distortion-Cognition Controllable Versatile Neural Image Compression

Jinming Liu, Ruoyu Feng, Yunpeng Qi et al.

ECCV 2024posterarXiv:2407.11700
12
citations
#2346

R-EDL: Relaxing Nonessential Settings of Evidential Deep Learning

Mengyuan Chen, Junyu Gao, Changsheng Xu

ICLR 2024spotlight
12
citations
#2347

Multi-Sentence Grounding for Long-term Instructional Video

Zeqian Li, QIRUI CHEN, Tengda Han et al.

ECCV 2024posterarXiv:2312.14055
12
citations
#2348

Exploring Transformer Extrapolation

Zhen Qin, Yiran Zhong, Hui Deng

AAAI 2024paperarXiv:2307.10156
12
citations
#2349

RGNet: A Unified Clip Retrieval and Grounding Network for Long Videos

Tanveer Hannan, Mohaiminul Islam, Thomas Seidl et al.

ECCV 2024posterarXiv:2312.06729
12
citations
#2350

Neural Causal Abstractions

Kevin Xia, Elias Bareinboim

AAAI 2024paperarXiv:2401.02602
12
citations
#2351

MICDrop: Masking Image and Depth Features via Complementary Dropout for Domain-Adaptive Semantic Segmentation

Linyan Yang, Lukas Hoyer, Mark Weber et al.

ECCV 2024posterarXiv:2408.16478
12
citations
#2352

C2C: Component-to-Composition Learning for Zero-Shot Compositional Action Recognition

Rongchang Li, Zhenhua Feng, Tianyang Xu et al.

ECCV 2024posterarXiv:2407.06113
12
citations
#2353

Language-Informed Visual Concept Learning

Sharon Lee, Yunzhi Zhang, Shangzhe Wu et al.

ICLR 2024posterarXiv:2312.03587
12
citations
#2354

Minimum-Norm Interpolation Under Covariate Shift

Neil Mallinar, Austin Zane, Spencer Frei et al.

ICML 2024posterarXiv:2404.00522
12
citations
#2355

Cell Graph Transformer for Nuclei Classification

Wei Lou, Guanbin Li, Xiang Wan et al.

AAAI 2024paperarXiv:2402.12946
12
citations
#2356

DiffusionPen: Towards Controlling the Style of Handwritten Text Generation

KONSTANTINA NIKOLAIDOU, George Retsinas, Giorgos Sfikas et al.

ECCV 2024posterarXiv:2409.06065
11
citations
#2357

Quantifying Task Priority for Multi-Task Optimization

Wooseong Jeong, Kuk-Jin Yoon

CVPR 2024posterarXiv:2406.02996
11
citations
#2358

EDformer: Transformer-Based Event Denoising Across Varied Noise Levels

Bin Jiang, Bo Xiong, Bohan Qu et al.

ECCV 2024poster
11
citations
#2359

CDMAD: Class-Distribution-Mismatch-Aware Debiasing for Class-Imbalanced Semi-Supervised Learning

Hyuck Lee, Heeyoung Kim

CVPR 2024posterarXiv:2403.10391
11
citations
#2360

3x2: 3D Object Part Segmentation by 2D Semantic Correspondences

Anh Thai, Weiyao Wang, Hao Tang et al.

ECCV 2024posterarXiv:2407.09648
11
citations
#2361

Class-Agnostic Object Counting with Text-to-Image Diffusion Model

Xiaofei Hui, Qian Wu, Hossein Rahmani et al.

ECCV 2024poster
11
citations
#2362

SoundingActions: Learning How Actions Sound from Narrated Egocentric Videos

Changan Chen, Kumar Ashutosh, Rohit Girdhar et al.

CVPR 2024posterarXiv:2404.05206
11
citations
#2363

Bridging the Gap Between End-to-End and Two-Step Text Spotting

Mingxin Huang, Hongliang Li, Yuliang Liu et al.

CVPR 2024posterarXiv:2404.04624
11
citations
#2364

Dataset Quantization with Active Learning based Adaptive Sampling

Zhenghao Zhao, Yuzhang Shang, Junyi Wu et al.

ECCV 2024posterarXiv:2407.07268
11
citations
#2365

StraightPCF: Straight Point Cloud Filtering

Dasith de Silva Edirimuni, Xuequan Lu, Gang Li et al.

CVPR 2024posterarXiv:2405.08322
11
citations
#2366

Finsler-Laplace-Beltrami Operators with Application to Shape Analysis

Simon Weber, Thomas Dagès, Maolin Gao et al.

CVPR 2024posterarXiv:2404.03999
11
citations
#2367

Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models

Xiaoyu Zhu, Hao Zhou, Pengfei Xing et al.

ECCV 2024posterarXiv:2407.13642
11
citations
#2368

SDGMNet: Statistic-Based Dynamic Gradient Modulation for Local Descriptor Learning

Yuxin Deng, Jiayi Ma

AAAI 2024paperarXiv:2106.04434
11
citations
#2369

Benchmarking Spurious Bias in Few-Shot Image Classifiers

Guangtao Zheng, Wenqian Ye, Aidong Zhang

ECCV 2024posterarXiv:2409.02882
11
citations
#2370

Learning Spatially Collaged Fourier Bases for Implicit Neural Representation

Jason Chun Lok Li, Chang Liu, Binxiao Huang et al.

AAAI 2024paperarXiv:2312.17018
11
citations
#2371

Bilevel Optimization under Unbounded Smoothness: A New Algorithm and Convergence Analysis

Jie Hao, Xiaochuan Gong, Mingrui Liu

ICLR 2024spotlightarXiv:2401.09587
11
citations
#2372

Symphony: Symmetry-Equivariant Point-Centered Spherical Harmonics for 3D Molecule Generation

Ameya Daigavane, Song Eun Kim, Mario Geiger et al.

ICLR 2024posterarXiv:2311.16199
11
citations
#2373

Instance Tracking in 3D Scenes from Egocentric Videos

Yunhan Zhao, Haoyu Ma, Shu Kong et al.

CVPR 2024posterarXiv:2312.04117
11
citations
#2374

Task-Disruptive Background Suppression for Few-Shot Segmentation

Suho Park, SuBeen Lee, Sangeek Hyun et al.

AAAI 2024paperarXiv:2312.15894
11
citations
#2375

Diff-Reg: Diffusion Model in Doubly Stochastic Matrix Space for Registration Problem

Qianliang Wu, Haobo Jiang, Lei Luo et al.

ECCV 2024poster
11
citations
#2376

UNEX-RL: Reinforcing Long-Term Rewards in Multi-Stage Recommender Systems with UNidirectional EXecution

Gengrui Zhang, Xiaoshuang Chen, Yao WANG et al.

AAAI 2024paperarXiv:2401.06470
11
citations
#2377

Union Subgraph Neural Networks

Jiaxing Xu, Aihu Zhang, Qingtian Bian et al.

AAAI 2024paperarXiv:2305.15747
11
citations
#2378

Workflow Discovery from Dialogues in the Low Data Regime

David Vazquez, Stefania Raimondo, Christopher Pal et al.

ICLR 2024poster
11
citations
#2379

Physical Backdoor: Towards Temperature-based Backdoor Attacks in the Physical World

Wen Yin, Jian Lou, Pan Zhou et al.

CVPR 2024posterarXiv:2404.19417
11
citations
#2380

OneVOS: Unifying Video Object Segmentation with All-in-One Transformer Framework

Wanyun Li, Pinxue Guo, Xinyu Zhou et al.

ECCV 2024posterarXiv:2403.08682
11
citations
#2381

X4D-SceneFormer: Enhanced Scene Understanding on 4D Point Cloud Videos through Cross-Modal Knowledge Transfer

Linglin Jing, Ying Xue, Xu Yan et al.

AAAI 2024paperarXiv:2312.07378
11
citations
#2382

Test-Time Personalization with Meta Prompt for Gaze Estimation

Huan Liu, Julia Qi, Zhenhao Li et al.

AAAI 2024paperarXiv:2401.01577
11
citations
#2383

SAVE: Protagonist Diversification with Structure Agnostic Video Editing

Yeji Song, Wonsik Shin, Junsoo Lee et al.

ECCV 2024posterarXiv:2312.02503
11
citations
#2384

Fairness-aware Vision Transformer via Debiased Self-Attention

Yao Qiang, Chengyin Li, Prashant Khanduri et al.

ECCV 2024posterarXiv:2301.13803
11
citations
#2385

BatteryML: An Open-source Platform for Machine Learning on Battery Degradation

Han Zhang, Xiaofan Gui, Shun Zheng et al.

ICLR 2024spotlight
11
citations
#2386

Causal Fairness under Unobserved Confounding: A Neural Sensitivity Framework

Maresa Schröder, Dennis Frauen, Stefan Feuerriegel

ICLR 2024posterarXiv:2311.18460
11
citations
#2387

Global Counterfactual Directions

Bartlomiej Sobieski, Przemyslaw Biecek

ECCV 2024posterarXiv:2404.12488
11
citations
#2388

MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment

Anurag Das, Xinting Hu, Li Jiang et al.

ECCV 2024posterarXiv:2407.21654
11
citations
#2389

Compressing Image-to-Image Translation GANs Using Local Density Structures on Their Learned Manifold

Alireza Ganjdanesh, Shangqian Gao, Hirad Alipanah et al.

AAAI 2024paperarXiv:2312.14776
11
citations
#2390

Unleashing the Potential of the Semantic Latent Space in Diffusion Models for Image Dehazing

Zizheng Yang, Hu Yu, Bing Li et al.

ECCV 2024posterarXiv:2509.20091
11
citations
#2391

One Step Closer to Unbiased Aleatoric Uncertainty Estimation

Wang Zhang, Ziwen Martin Ma, Subhro Das et al.

AAAI 2024paperarXiv:2312.10469
11
citations
#2392

Training-Free Model Merging for Multi-target Domain Adaptation

Wenyi Li, Huan-ang Gao, Mingju Gao et al.

ECCV 2024posterarXiv:2407.13771
11
citations
#2393

Open the Black Box: Step-based Policy Updates for Temporally-Correlated Episodic Reinforcement Learning

Ge Li, Hongyi Zhou, Dominik Roth et al.

ICLR 2024oralarXiv:2401.11437
11
citations
#2394

Privacy-Preserving Optics for Enhancing Protection in Face De-Identification

Jhon Lopez, Carlos Hinojosa, Henry Arguello et al.

CVPR 2024posterarXiv:2404.00777
11
citations
#2395

Robust Depth Enhancement via Polarization Prompt Fusion Tuning

Kei IKEMURA, Yiming Huang, Felix Heide et al.

CVPR 2024posterarXiv:2404.04318
11
citations
#2396

Continuous Piecewise-Affine Based Motion Model for Image Animation

Hexiang Wang, Fengqi Liu, Qianyu Zhou et al.

AAAI 2024paperarXiv:2401.09146
11
citations
#2397

Uncertainty Regularized Evidential Regression

Kai Ye, Tiejin Chen, Hua Wei et al.

AAAI 2024paperarXiv:2401.01484
11
citations
#2398

Synergistic Anchored Contrastive Pre-training for Few-Shot Relation Extraction

Da Luo, Yanglei Gan, Rui Hou et al.

AAAI 2024paperarXiv:2312.12021
11
citations
#2399

BLADE: Box-Level Supervised Amodal Segmentation through Directed Expansion

Zhaochen Liu, Zhixuan Li, Tingting Jiang

AAAI 2024paperarXiv:2401.01642
11
citations
#2400

Real-time Holistic Robot Pose Estimation with Unknown States

Shikun Ban, Juling Fan, Xiaoxuan Ma et al.

ECCV 2024posterarXiv:2402.05655
11
citations