Most Cited 2024 "parameterized environment configurations" Papers

12,324 papers found • Page 13 of 62

#2401

Physical-Based Event Camera Simulator

Haiqian Han, Jiacheng Lyu, Jianing Li et al.

ECCV 2024poster
12
citations
#2402

FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior

Zhekai Chen, Wen Wang, Zhen Yang et al.

ECCV 2024posterarXiv:2407.04947
12
citations
#2403

Robust Test-Time Adaptation for Zero-Shot Prompt Tuning

Ding-Chu Zhang, Zhi Zhou, Yufeng Li

AAAI 2024paper
12
citations
#2404

Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis

Zicheng Zhang, RUOBING ZHENG, Bonan Li et al.

CVPR 2024posterarXiv:2402.17364
12
citations
#2405

RGNet: A Unified Clip Retrieval and Grounding Network for Long Videos

Tanveer Hannan, Mohaiminul Islam, Thomas Seidl et al.

ECCV 2024posterarXiv:2312.06729
12
citations
#2406

Pareto Deep Long-Tailed Recognition: A Conflict-Averse Solution

Zhipeng Zhou, Liu Liu, Peilin Zhao et al.

ICLR 2024oral
12
citations
#2407

UpFusion: Novel View Diffusion from Unposed Sparse View Observations

Bharath Raj Nagoor Kani, Hsin-Ying Lee, Sergey Tulyakov et al.

ECCV 2024posterarXiv:2312.06661
12
citations
#2408

Multi-Label Cluster Discrimination for Visual Representation Learning

Xiang An, Kaicheng Yang, Xiangzi Dai et al.

ECCV 2024posterarXiv:2407.17331
12
citations
#2409

ConcaveQ: Non-monotonic Value Function Factorization via Concave Representations in Deep Multi-Agent Reinforcement Learning

Huiqun Li, Hanhan Zhou, Yifei Zou et al.

AAAI 2024paperarXiv:2312.15555
12
citations
#2410

Weisfeiler and Lehman Go Paths: Learning Topological Features via Path Complexes

Quang Truong, Peter Chin

AAAI 2024paperarXiv:2308.06838
12
citations
#2411

Real Appearance Modeling for More General Deepfake Detection

Jiahe Tian, Yu Cai, Xi Wang et al.

ECCV 2024poster
12
citations
#2412

PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance

Aoming Liu, Zhong Li, Zhang Chen et al.

ECCV 2024posterarXiv:2408.02157
12
citations
#2413

Discriminative Sample-Guided and Parameter-Efficient Feature Space Adaptation for Cross-Domain Few-Shot Learning

Rashindrie Perera, Saman Halgamuge

CVPR 2024posterarXiv:2403.04492
12
citations
#2414

Task-Disruptive Background Suppression for Few-Shot Segmentation

Suho Park, SuBeen Lee, Sangeek Hyun et al.

AAAI 2024paperarXiv:2312.15894
12
citations
#2415

Temporally Consistent Stereo Matching

Jiaxi Zeng, Chengtang Yao, Yuwei Wu et al.

ECCV 2024posterarXiv:2407.11950
12
citations
#2416

D3: A Methodological Exploration of Domain Division, Modeling, and Balance in Multi-Domain Recommendations

Pengyue Jia, Yichao Wang, Shanru LIN et al.

AAAI 2024paper
12
citations
#2417

Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation

Zhengyuan Xie, Haiquan Lu, Jia-wen Xiao et al.

ECCV 2024posterarXiv:2407.14142
12
citations
#2418

OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection

Jinghua Hou, Tong Wang, Xiaoqing Ye et al.

ECCV 2024posterarXiv:2407.10753
12
citations
#2419

PairAug: What Can Augmented Image-Text Pairs Do for Radiology?

Yutong Xie, Qi Chen, Sinuo Wang et al.

CVPR 2024posterarXiv:2404.04960
12
citations
#2420

Unsupervised Gaze Representation Learning from Multi-view Face Images

Yiwei Bao, Feng Lu

CVPR 2024poster
12
citations
#2421

S²MVTC: a Simple yet Efficient Scalable Multi-View Tensor Clustering

Zhen Long, Qiyuan Wang, Yazhou Ren et al.

CVPR 2024poster
12
citations
#2422

Mitigating Background Shift in Class-Incremental Semantic Segmentation

gilhan Park, WonJun Moon, SuBeen Lee et al.

ECCV 2024posterarXiv:2407.11859
12
citations
#2423

DeTra: A Unified Model for Object Detection and Trajectory Forecasting

Sergio Casas, Ben T Agro, Jiageng Mao et al.

ECCV 2024posterarXiv:2406.04426
12
citations
#2424

Learning Video Context as Interleaved Multimodal Sequences

Qinghong Lin, Pengchuan Zhang, Difei Gao et al.

ECCV 2024posterarXiv:2407.21757
12
citations
#2425

Adaptive Self-training Framework for Fine-grained Scene Graph Generation

Kibum Kim, Kanghoon Yoon, Yeonjun In et al.

ICLR 2024posterarXiv:2401.09786
12
citations
#2426

Mastering Symbolic Operations: Augmenting Language Models with Compiled Neural Networks

Yixuan Weng, Minjun Zhu, Fei Xia et al.

ICLR 2024posterarXiv:2304.01665
12
citations
#2427

Generalized Planning for the Abstraction and Reasoning Corpus

Chao Lei, Nir Lipovetzky, Krista A. Ehinger

AAAI 2024paperarXiv:2401.07426
12
citations
#2428

Mitigating the Curse of Dimensionality for Certified Robustness via Dual Randomized Smoothing

Song Xia, Yi Yu, Jiang Xudong et al.

ICLR 2024posterarXiv:2404.09586
12
citations
#2429

Total-Decom: Decomposed 3D Scene Reconstruction with Minimal Interaction

Xiaoyang Lyu, Chirui Chang, Peng Dai et al.

CVPR 2024highlightarXiv:2403.19314
12
citations
#2430

UCIP: A Universal Framework for Compressed Image Super-Resolution using Dynamic Prompt

Xin Li, Bingchen Li, Yeying Jin et al.

ECCV 2024posterarXiv:2407.13108
12
citations
#2431

∞-Brush: Controllable Large Image Synthesis with Diffusion Models in Infinite Dimensions

Minh Quan Le, Alexandros Graikos, Srikar Yellapragada et al.

ECCV 2024posterarXiv:2407.14709
12
citations
#2432

Towards Reliable Evaluation and Fast Training of Robust Semantic Segmentation Models

Francesco Croce, Naman D. Singh, Matthias Hein

ECCV 2024posterarXiv:2306.12941
12
citations
#2433

CamoTeacher: Dual-Rotation Consistency Learning for Semi-Supervised Camouflaged Object Detection

Xunfa Lai, Zhiyu Yang, Jie Hu et al.

ECCV 2024posterarXiv:2408.08050
12
citations
#2434

M3SOT: Multi-Frame, Multi-Field, Multi-Space 3D Single Object Tracking

Jiaming Liu, Yue Wu, Maoguo Gong et al.

AAAI 2024paperarXiv:2312.06117
12
citations
#2435

Explorative Inbetweening of Time and Space

Haiwen Feng, Zheng Ding, Zhihao Xia et al.

ECCV 2024posterarXiv:2403.14611
12
citations
#2436

Closed-Loop Unsupervised Representation Disentanglement with $\beta$-VAE Distillation and Diffusion Probabilistic Feedback

Xin Jin, Bohan Li, Baao Xie et al.

ECCV 2024poster
12
citations
#2437

BRIDGE: Bridging Gaps in Image Captioning Evaluation with Stronger Visual Cues

Sara Sarto, Marcella Cornia, Lorenzo Baraldi et al.

ECCV 2024posterarXiv:2407.20341
12
citations
#2438

Generalizability of Adversarial Robustness Under Distribution Shifts

Bernard Ghanem, Kumail Alhamoud, Hasan Hammoud et al.

ICLR 2024poster
12
citations
#2439

ContextSeg: Sketch Semantic Segmentation by Querying the Context with Attention

Jiawei Wang, Changjian Li

CVPR 2024posterarXiv:2311.16682
12
citations
#2440

BAFFLE: A Baseline of Backpropagation-Free Federated Learning

Haozhe Feng, Tianyu Pang, Chao Du et al.

ECCV 2024posterarXiv:2301.12195
12
citations
#2441

11293 Cross-Class Feature Augmentation for Class Incremental Learning

Taehoon Kim, JaeYoo Park, Bohyung Han

AAAI 2024paper
12
citations
#2442

Generative Powers of Ten

Xiaojuan Wang, Janne Kontkanen, Brian Curless et al.

CVPR 2024highlightarXiv:2312.02149
12
citations
#2443

Multi-modal Crowd Counting via a Broker Modality

Haoliang Meng, Xiaopeng Hong, Chenhao Wang et al.

ECCV 2024posterarXiv:2407.07518
12
citations
#2444

Can OOD Object Detectors Learn from Foundation Models?

Jiahui Liu, Xin Wen, Shizhen Zhao et al.

ECCV 2024posterarXiv:2409.05162
12
citations
#2445

ZeroFlow: Scalable Scene Flow via Distillation

Kyle Vedder, Neehar Peri, Nathaniel Chodosh et al.

ICLR 2024oralarXiv:2305.10424
12
citations
#2446

Asymmetric Masked Distillation for Pre-Training Small Foundation Models

Zhiyu Zhao, Bingkun Huang, Sen Xing et al.

CVPR 2024posterarXiv:2311.03149
12
citations
#2447

H2O-SDF: Two-phase Learning for 3D Indoor Reconstruction using Object Surface Fields

Minyoung Park, MIRAE DO, Yeon Jae Shin et al.

ICLR 2024spotlightarXiv:2402.08138
12
citations
#2448

Continuous Treatment Effect Estimation Using Gradient Interpolation and Kernel Smoothing

Lokesh Nagalapatti, Akshay Iyer, Abir De et al.

AAAI 2024paperarXiv:2401.15447
12
citations
#2449

Modeling and Driving Human Body Soundfields through Acoustic Primitives

Chao Huang, Dejan Markovic, Chenliang Xu et al.

ECCV 2024posterarXiv:2407.13083
12
citations
#2450

COHO: Context-Sensitive City-Scale Hierarchical Urban Layout Generation

Liu He, Daniel Aliaga

ECCV 2024posterarXiv:2407.11294
12
citations
#2451

CAVEN: An Embodied Conversational Agent for Efficient Audio-Visual Navigation in Noisy Environments

Xiulong Liu, Sudipta Paul, Moitreya Chatterjee et al.

AAAI 2024paperarXiv:2306.04047
12
citations
#2452

Noisy Correspondence Learning with Self-Reinforcing Errors Mitigation

Zhuohang Dang, Minnan Luo, Chengyou Jia et al.

AAAI 2024paperarXiv:2312.16478
12
citations
#2453

DoughNet: A Visual Predictive Model for Topological Manipulation of Deformable Objects

Dominik Bauer, Zhenjia Xu, Shuran Song

ECCV 2024posterarXiv:2404.12524
12
citations
#2454

Adversarial Attacks on the Interpretation of Neuron Activation Maximization

Géraldin Nanfack, Alexander Fulleringer, Jonathan Marty et al.

AAAI 2024paperarXiv:2306.07397
12
citations
#2455

Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation

Seonghoon Yu, Paul Hongsuck Seo, Jeany Son

ECCV 2024posterarXiv:2407.07412
12
citations
#2456

Robust Nonparametric Regression under Poisoning Attack

Puning Zhao, Zhiguo Wan

AAAI 2024paperarXiv:2305.16771
12
citations
#2457

Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models

Luo Jiayun, Siddhesh Khandelwal, Leonid Sigal et al.

CVPR 2024posterarXiv:2311.17095
12
citations
#2458

Minimum-Norm Interpolation Under Covariate Shift

Neil Mallinar, Austin Zane, Spencer Frei et al.

ICML 2024posterarXiv:2404.00522
12
citations
#2459

Functional Diffusion

Biao Zhang, Peter Wonka

CVPR 2024posterarXiv:2311.15435
12
citations
#2460

MICDrop: Masking Image and Depth Features via Complementary Dropout for Domain-Adaptive Semantic Segmentation

Linyan Yang, Lukas Hoyer, Mark Weber et al.

ECCV 2024posterarXiv:2408.16478
12
citations
#2461

MGNet: Learning Correspondences via Multiple Graphs

Dai Luanyuan, Xiaoyu Du, Hanwang Zhang et al.

AAAI 2024paperarXiv:2401.04984
12
citations
#2462

Learning Task-Aware Language-Image Representation for Class-Incremental Object Detection

Hongquan Zhang, Bin-Bin Gao, Yi Zeng et al.

AAAI 2024paper
12
citations
#2463

Accelerating Neural Field Training via Soft Mining

Shakiba Kheradmand, Daniel Rebain, Gopal Sharma et al.

CVPR 2024posterarXiv:2312.00075
12
citations
#2464

Generalizable Fourier Augmentation for Unsupervised Video Object Segmentation

Huihui Song, Tiankang Su, Yuhui Zheng et al.

AAAI 2024paper
12
citations
#2465

AdjointDPM: Adjoint Sensitivity Method for Gradient Backpropagation of Diffusion Probabilistic Models

Jiachun Pan, Jiachun Pan, Jun Hao Liew et al.

ICLR 2024posterarXiv:2307.10711
12
citations
#2466

Performative Federated Learning: A Solution to Model-Dependent and Heterogeneous Distribution Shifts

Kun Jin, Tongxin Yin, Zhongzhu Chen et al.

AAAI 2024paperarXiv:2305.05090
12
citations
#2467

RICA^2: Rubric-Informed, Calibrated Assessment of Actions

Abrar Majeedi, Viswanatha Reddy Gajjala, Satya Sai Srinath Namburi GNVV et al.

ECCV 2024poster
12
citations
#2468

Federated Online Adaptation for Deep Stereo

Matteo Poggi, Fabio Tosi

CVPR 2024posterarXiv:2405.14873
12
citations
#2469

Learning Efficient and Robust Multi-Agent Communication via Graph Information Bottleneck

Shifei Ding, Wei Du, Ling Ding et al.

AAAI 2024paper
12
citations
#2470

Bidirectional Temporal Plan Graph: Enabling Switchable Passing Orders for More Efficient Multi-Agent Path Finding Plan Execution

Yifan Su, Rishi Veerapaneni, Jiaoyang Li

AAAI 2024paperarXiv:2401.00315
12
citations
#2471

Considering Nonstationary within Multivariate Time Series with Variational Hierarchical Transformer for Forecasting

Muyao Wang, Wenchao Chen, Bo Chen

AAAI 2024paperarXiv:2403.05406
12
citations
#2472

Unexplored Faces of Robustness and Out-of-Distribution: Covariate Shifts in Environment and Sensor Domains

Eunsu Baek, Keondo Park, Ji-yoon Kim et al.

CVPR 2024posterarXiv:2404.15882
12
citations
#2473

Another Way to the Top: Exploit Contextual Clustering in Learned Image Coding

Yichi Zhang, Zhihao Duan, Ming Lu et al.

AAAI 2024paperarXiv:2401.11615
12
citations
#2474

SAMFusion: Sensor-Adaptive Multimodal Fusion for 3D Object Detection in Adverse Weather

Edoardo Palladin, Roland Dietze, Praveen Narayanan et al.

ECCV 2024posterarXiv:2508.16408
12
citations
#2475

Kernel Diffusion: An Alternate Approach to Blind Deconvolution

Yash Sanghvi, Yiheng Chi, Stanley Chan

ECCV 2024posterarXiv:2312.02319
12
citations
#2476

CMTA: Cross-Modal Temporal Alignment for Event-guided Video Deblurring

Taewoo Kim, Hoonhee Cho, Kuk-Jin Yoon

ECCV 2024posterarXiv:2408.14930
12
citations
#2477

TexOct: Generating Textures of 3D Models with Octree-based Diffusion

Jialun Liu, Chenming Wu, Xinqi Liu et al.

CVPR 2024poster
12
citations
#2478

Improving Bird's Eye View Semantic Segmentation by Task Decomposition

Tianhao Zhao, Yongcan Chen, Yu Wu et al.

CVPR 2024posterarXiv:2404.01925
12
citations
#2479

On the hardness of learning under symmetries

Bobak Kiani, Thien Le, Hannah Lawrence et al.

ICLR 2024spotlightarXiv:2401.01869
12
citations
#2480

Perturbation-Invariant Adversarial Training for Neural Ranking Models: Improving the Effectiveness-Robustness Trade-Off

Yuansan Liu, Ruqing Zhang, Mingkun Zhang et al.

AAAI 2024paperarXiv:2312.10329
12
citations
#2481

Mixture of Efficient Diffusion Experts Through Automatic Interval and Sub-Network Selection

Alireza Ganjdanesh, Yan Kang, Yuchen Liu et al.

ECCV 2024posterarXiv:2409.15557
12
citations
#2482

Correcting Diffusion Generation through Resampling

Yujian Liu, Yang Zhang, Tommi Jaakkola et al.

CVPR 2024highlightarXiv:2312.06038
12
citations
#2483

Which Model Generated This Image? A Model-Agnostic Approach for Origin Attribution

Fengyuan Liu, Haochen Luo, Yiming Li et al.

ECCV 2024posterarXiv:2404.02697
12
citations
#2484

EX-Graph: A Pioneering Dataset Bridging Ethereum and X

Qian Wang, Zhen Zhang, Zemin Liu et al.

ICLR 2024posterarXiv:2310.01015
12
citations
#2485

Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations

Yongyuan Liang, Yanchao Sun, Ruijie Zheng et al.

ICLR 2024oralarXiv:2307.12062
12
citations
#2486

Towards Real-World Adverse Weather Image Restoration: Enhancing Clearness and Semantics with Vision-Language Models

Jiaqi Xu, Mengyang Wu, Xiaowei Hu et al.

ECCV 2024posterarXiv:2409.02101
12
citations
#2487

PointInfinity: Resolution-Invariant Point Diffusion Models

Zixuan Huang, Justin Johnson, Shoubhik Debnath et al.

CVPR 2024posterarXiv:2404.03566
12
citations
#2488

SINDER: Repairing the Singular Defects of DINOv2

Haoqi Wang, Tong Zhang, Mathieu Salzmann

ECCV 2024posterarXiv:2407.16826
12
citations
#2489

Lipsum-FT: Robust Fine-Tuning of Zero-Shot Models Using Random Text Guidance

Giung Nam, Byeongho Heo, Juho Lee

ICLR 2024posterarXiv:2404.00860
12
citations
#2490

Language-Informed Visual Concept Learning

Sharon Lee, Yunzhi Zhang, Shangzhe Wu et al.

ICLR 2024posterarXiv:2312.03587
12
citations
#2491

Discover and Mitigate Multiple Biased Subgroups in Image Classifiers

Zeliang Zhang, Mingqian Feng, Zhiheng Li et al.

CVPR 2024posterarXiv:2403.12777
12
citations
#2492

R-EDL: Relaxing Nonessential Settings of Evidential Deep Learning

Mengyuan Chen, Junyu Gao, Changsheng Xu

ICLR 2024spotlight
12
citations
#2493

Contrastive ground-level image and remote sensing pre-training improves representation learning for natural world imagery

Andy V Huynh, Lauren Gillespie, Jael Lopez-Saucedo et al.

ECCV 2024posterarXiv:2409.19439
12
citations
#2494

Weakly Supervised Monocular 3D Detection with a Single-View Image

Xueying Jiang, Sheng Jin, Lewei Lu et al.

CVPR 2024posterarXiv:2402.19144
12
citations
#2495

Universal Novelty Detection Through Adaptive Contrastive Learning

Hossein Mirzaei, Mojtaba Nafez, Mohammad Jafari et al.

CVPR 2024posterarXiv:2408.10798
12
citations
#2496

Defense Against Adversarial Attacks on No-Reference Image Quality Models with Gradient Norm Regularization

Yujia Liu, Chenxi Yang, Dingquan Li et al.

CVPR 2024posterarXiv:2403.11397
12
citations
#2497

Improving Feature Stability during Upsampling -- Spectral Artifacts and the Importance of Spatial Context

Shashank Agnihotri, Julia Grabinski, Margret Keuper

ECCV 2024posterarXiv:2311.17524
12
citations
#2498

Exploring Transformer Extrapolation

Zhen Qin, Yiran Zhong, Hui Deng

AAAI 2024paperarXiv:2307.10156
12
citations
#2499

FutureDepth: Learning to Predict the Future Improves Video Depth Estimation

Rajeev Yasarla, Manish Kumar Singh, Hong Cai et al.

ECCV 2024posterarXiv:2403.12953
12
citations
#2500

ChEX: Interactive Localization and Region Description in Chest X-rays

Philip Müller, Georgios Kaissis, Daniel Rueckert

ECCV 2024posterarXiv:2404.15770
12
citations
#2501

SoundingActions: Learning How Actions Sound from Narrated Egocentric Videos

Changan Chen, Kumar Ashutosh, Rohit Girdhar et al.

CVPR 2024posterarXiv:2404.05206
12
citations
#2502

Beyond Stationarity: Convergence Analysis of Stochastic Softmax Policy Gradient Methods

Sara Klein, Simon Weissmann, Leif Döring

ICLR 2024posterarXiv:2310.02671
12
citations
#2503

Symbolic Regression Enhanced Decision Trees for Classification Tasks

Kei Sen Fong, Mehul Motani

AAAI 2024paper
12
citations
#2504

Any-Stereo: Arbitrary Scale Disparity Estimation for Iterative Stereo Matching

Zhaohuai Liang, Changhe Li

AAAI 2024paper
12
citations
#2505

DreamDiffusion: High-Quality EEG-to-Image Generation with Temporal Masked Signal Modeling and CLIP Alignment

Yunpeng Bai, Xintao Wang, Yanpei Cao et al.

ECCV 2024poster
12
citations
#2506

C2C: Component-to-Composition Learning for Zero-Shot Compositional Action Recognition

Rongchang Li, Zhenhua Feng, Tianyang Xu et al.

ECCV 2024posterarXiv:2407.06113
12
citations
#2507

Multi-Attribute Interactions Matter for 3D Visual Grounding

Can Xu, Yuehui Han, Rui Xu et al.

CVPR 2024poster
12
citations
#2508

CuVLER: Enhanced Unsupervised Object Discoveries through Exhaustive Self-Supervised Transformers

Shahaf Arica, Or Rubin, Sapir Gershov et al.

CVPR 2024posterarXiv:2403.07700
12
citations
#2509

Neural Causal Abstractions

Kevin Xia, Elias Bareinboim

AAAI 2024paperarXiv:2401.02602
12
citations
#2510

Double-Layer Hybrid-Label Identification Feature Selection for Multi-View Multi-Label Learning

Pingting Hao, Kunpeng Liu, Wanfu Gao

AAAI 2024paper
12
citations
#2511

Eliminating Warping Shakes for Unsupervised Online Video Stitching

Lang Nie, Chunyu Lin, Kang Liao et al.

ECCV 2024posterarXiv:2403.06378
12
citations
#2512

CrossGLG: LLM Guides One-shot Skeleton-based 3D Action Recognition in a Cross-level Manner

Tingbing Yan, Wenzheng Zeng, Yang Xiao et al.

ECCV 2024posterarXiv:2403.10082
12
citations
#2513

A Generalized Shuffle Framework for Privacy Amplification: Strengthening Privacy Guarantees and Enhancing Utility

Chen E, Yang Cao, Ge Yifei

AAAI 2024paperarXiv:2312.14388
12
citations
#2514

One Step Closer to Unbiased Aleatoric Uncertainty Estimation

Wang Zhang, Ziwen Martin Ma, Subhro Das et al.

AAAI 2024paperarXiv:2312.10469
12
citations
#2515

PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation

Shilin Yan, Xiaohao Xu, Renrui Zhang et al.

ECCV 2024posterarXiv:2309.12303
12
citations
#2516

Data-efficient Large Vision Models through Sequential Autoregression

Zhiwei Hao, Jianyuan Guo, Chengcheng Wang et al.

ICML 2024posterarXiv:2402.04841
12
citations
#2517

LINGO-Space: Language-Conditioned Incremental Grounding for Space

Dohyun Kim, Nayoung Oh, Deokmin Hwang et al.

AAAI 2024paperarXiv:2402.01183
11
citations
#2518

Large Content And Behavior Models To Understand, Simulate, And Optimize Content And Behavior

Ashmit Khandelwal, Aditya Agrawal, Aanisha Bhattacharyya et al.

ICLR 2024spotlightarXiv:2309.00359
11
citations
#2519

IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation

Yuanhao Zhai, Kevin Lin, Linjie Li et al.

ECCV 2024posterarXiv:2407.10937
11
citations
#2520

Learning Implicit Representation for Reconstructing Articulated Objects

Hao Zhang, Fang Li, Samyak Rawlekar et al.

ICLR 2024posterarXiv:2401.08809
11
citations
#2521

Quantifying Task Priority for Multi-Task Optimization

Wooseong Jeong, Kuk-Jin Yoon

CVPR 2024posterarXiv:2406.02996
11
citations
#2522

LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection

Sanmin Kim, Youngseok Kim, Sihwan Hwang et al.

ECCV 2024posterarXiv:2407.10164
11
citations
#2523

FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models

Wei WU, Qingnan Fan, Shuai Qin et al.

ECCV 2024posterarXiv:2404.11895
11
citations
#2524

Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models

Xiaoyu Zhu, Hao Zhou, Pengfei Xing et al.

ECCV 2024posterarXiv:2407.13642
11
citations
#2525

ViC-MAE: Self-Supervised Representation Learning from Images and Video with Contrastive Masked Autoencoders

Jefferson Hernandez, Ruben Villegas, Vicente Ordonez

ECCV 2024posterarXiv:2303.12001
11
citations
#2526

MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion

Lehong Wu, Lilang Lin, Jiahang Zhang et al.

ECCV 2024posterarXiv:2409.10473
11
citations
#2527

OmniSSR: Zero-shot Omnidirectional Image Super-Resolution using Stable Diffusion Model

Runyi Li, Xuhan SHENG, Weiqi Li et al.

ECCV 2024posterarXiv:2404.10312
11
citations
#2528

TimeCraft: Navigate Weakly-Supervised Temporal Grounded Video Question Answering via Bi-directional Reasoning

Huabin Liu, Xiao Ma, Cheng Zhong et al.

ECCV 2024poster
11
citations
#2529

Timestep-Aware Correction for Quantized Diffusion Models

Yuzhe YAO, Feng Tian, Jun Chen et al.

ECCV 2024posterarXiv:2407.03917
11
citations
#2530

Butterfly Effects of SGD Noise: Error Amplification in Behavior Cloning and Autoregression

Adam Block, Dylan Foster, Akshay Krishnamurthy et al.

ICLR 2024posterarXiv:2310.11428
11
citations
#2531

AT4CTR: Auxiliary Match Tasks for Enhancing Click-Through Rate Prediction

Qi Liu, Xuyang Hou, Defu Lian et al.

AAAI 2024paperarXiv:2312.06683
11
citations
#2532

Temporal Correlation Vision Transformer for Video Person Re-Identification

Pengfei Wu, Le Wang, Sanping Zhou et al.

AAAI 2024paper
11
citations
#2533

Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities

Yiyuan Zhang, Xiaohan Ding, Kaixiong Gong et al.

CVPR 2024posterarXiv:2401.14405
11
citations
#2534

NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model

Zhongqun Zhang, Hengfei Wang, Ziwei Yu et al.

ECCV 2024posterarXiv:2407.12727
11
citations
#2535

MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment

Anurag Das, Xinting Hu, Li Jiang et al.

ECCV 2024posterarXiv:2407.21654
11
citations
#2536

Inf-DiT: Upsampling any-resolution image with memory-efficient diffusion transformer.

Zhuoyi Yang, Heyang Jiang, Wenyi Hong et al.

ECCV 2024posterarXiv:2405.04312
11
citations
#2537

DiffusionPen: Towards Controlling the Style of Handwritten Text Generation

KONSTANTINA NIKOLAIDOU, George Retsinas, Giorgos Sfikas et al.

ECCV 2024posterarXiv:2409.06065
11
citations
#2538

Deblur e-NeRF: NeRF from Motion-Blurred Events under High-speed or Low-light Conditions

Weng Fei Low, Gim Hee Lee

ECCV 2024posterarXiv:2409.17988
11
citations
#2539

Physical Backdoor: Towards Temperature-based Backdoor Attacks in the Physical World

Wen Yin, Jian Lou, Pan Zhou et al.

CVPR 2024posterarXiv:2404.19417
11
citations
#2540

A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives

Simone Alberto Peirone, Francesca Pistilli, Antonio Alliegro et al.

CVPR 2024posterarXiv:2403.03037
11
citations
#2541

EDformer: Transformer-Based Event Denoising Across Varied Noise Levels

Bin Jiang, Bo Xiong, Bohan Qu et al.

ECCV 2024poster
11
citations
#2542

SDGMNet: Statistic-Based Dynamic Gradient Modulation for Local Descriptor Learning

Yuxin Deng, Jiayi Ma

AAAI 2024paperarXiv:2106.04434
11
citations
#2543

StraightPCF: Straight Point Cloud Filtering

Dasith de Silva Edirimuni, Xuequan Lu, Gang Li et al.

CVPR 2024posterarXiv:2405.08322
11
citations
#2544

Dataset Quantization with Active Learning based Adaptive Sampling

Zhenghao Zhao, Yuzhang Shang, Junyi Wu et al.

ECCV 2024posterarXiv:2407.07268
11
citations
#2545

DTMFormer: Dynamic Token Merging for Boosting Transformer-Based Medical Image Segmentation

Zhehao Wang, Xian Lin, Nannan Wu et al.

AAAI 2024paper
11
citations
#2546

Self-Adaptive Reality-Guided Diffusion for Artifact-Free Super-Resolution

Qingping Zheng, Ling Zheng, Yuanfan Guo et al.

CVPR 2024posterarXiv:2403.16643
11
citations
#2547

Finsler-Laplace-Beltrami Operators with Application to Shape Analysis

Simon Weber, Thomas Dagès, Maolin Gao et al.

CVPR 2024posterarXiv:2404.03999
11
citations
#2548

KFD-NeRF: Rethinking Dynamic NeRF with Kalman Filter

Yifan Zhan, Zhuoxiao Li, Muyao Niu et al.

ECCV 2024posterarXiv:2407.13185
11
citations
#2549

FedST: Federated Style Transfer Learning for Non-IID Image Segmentation

Boyuan Ma, Yin Xiang, Jing Tan et al.

AAAI 2024paper
11
citations
#2550

Hybrid Video Diffusion Models with 2D Triplane and 3D Wavelet Representation

Kihong Kim, Haneol Lee, Jihye Park et al.

ECCV 2024posterarXiv:2402.13729
11
citations
#2551

Workflow Discovery from Dialogues in the Low Data Regime

David Vazquez, Stefania Raimondo, Christopher Pal et al.

ICLR 2024poster
11
citations
#2552

Lewis's Signaling Game as beta-VAE For Natural Word Lengths and Segments

Ryo Ueda, TADAHIRO TANIGUCHI

ICLR 2024posterarXiv:2311.04453
11
citations
#2553

Benchmarking Spurious Bias in Few-Shot Image Classifiers

Guangtao Zheng, Wenqian Ye, Aidong Zhang

ECCV 2024posterarXiv:2409.02882
11
citations
#2554

CarFormer: Self-Driving with Learned Object-Centric Representations

Shadi Hamdan, Fatma Guney

ECCV 2024posterarXiv:2407.15843
11
citations
#2555

BlenderAlchemy: Editing 3D Graphics with Vision-Language Models

Ian Huang, Guandao Yang, Leonidas Guibas

ECCV 2024posterarXiv:2404.17672
11
citations
#2556

DP-SGD Without Clipping: The Lipschitz Neural Network Way

Louis Béthune, Thomas Massena, Thibaut Boissin et al.

ICLR 2024poster
11
citations
#2557

Bi-Causal: Group Activity Recognition via Bidirectional Causality

Youliang Zhang, Wenxuan Liu, danni xu et al.

CVPR 2024poster
11
citations
#2558

Union Subgraph Neural Networks

Jiaxing Xu, Aihu Zhang, Qingtian Bian et al.

AAAI 2024paperarXiv:2305.15747
11
citations
#2559

Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding

Ruihuang Li, Zhengqiang ZHANG, Chenhang He et al.

ECCV 2024posterarXiv:2407.09781
11
citations
#2560

UNEX-RL: Reinforcing Long-Term Rewards in Multi-Stage Recommender Systems with UNidirectional EXecution

Gengrui Zhang, Xiaoshuang Chen, Yao WANG et al.

AAAI 2024paperarXiv:2401.06470
11
citations
#2561

Towards Understanding and Improving Adversarial Robustness of Vision Transformers

Samyak Jain, Tanima Dutta

CVPR 2024poster
11
citations
#2562

Sparse Beats Dense: Rethinking Supervision in Radar-Camera Depth Completion

Huadong Li, Minhao Jing, Jin Wang et al.

ECCV 2024posterarXiv:2312.00844
11
citations
#2563

JointSQ: Joint Sparsification-Quantization for Distributed Learning

Weiying Xie, Haowei Li, Ma Jitao et al.

CVPR 2024poster
11
citations
#2564

Real-time Holistic Robot Pose Estimation with Unknown States

Shikun Ban, Juling Fan, Xiaoxuan Ma et al.

ECCV 2024posterarXiv:2402.05655
11
citations
#2565

Stabilizing Contrastive RL: Techniques for Robotic Goal Reaching from Offline Data

Chongyi Zheng, Benjamin Eysenbach, Homer Walke et al.

ICLR 2024spotlightarXiv:2306.03346
11
citations
#2566

FLHetBench: Benchmarking Device and State Heterogeneity in Federated Learning

Junyuan Zhang, Shuang Zeng, Miao Zhang et al.

CVPR 2024poster
11
citations
#2567

Prompt Risk Control: A Rigorous Framework for Responsible Deployment of Large Language Models

Thomas Zollo, Todd Morrill, Zhun Deng et al.

ICLR 2024posterarXiv:2311.13628
11
citations
#2568

TF-FAS: Twofold-Element Fine-Grained Semantic Guidance for Generalizable Face Anti-Spoofing

Xudong Wang, Ke-Yue Zhang, Taiping Yao et al.

ECCV 2024poster
11
citations
#2569

Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion

Yu Cao, Shaogang Gong

ECCV 2024posterarXiv:2407.07249
11
citations
#2570

SyncMask: Synchronized Attentional Masking for Fashion-centric Vision-Language Pretraining

Chull Hwan Song, Taebaek Hwang, Jooyoung Yoon et al.

CVPR 2024posterarXiv:2404.01156
11
citations
#2571

Multi-Step Denoising Scheduled Sampling: Towards Alleviating Exposure Bias for Diffusion Models

Zhiyao Ren, Yibing Zhan, Liang Ding et al.

AAAI 2024paper
11
citations
#2572

Fairness-aware Vision Transformer via Debiased Self-Attention

Yao Qiang, Chengyin Li, Prashant Khanduri et al.

ECCV 2024posterarXiv:2301.13803
11
citations
#2573

MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers

Haoyu Ma, Shahin Mahdizadehaghdam, Bichen Wu et al.

CVPR 2024posterarXiv:2312.12468
11
citations
#2574

BiPer: Binary Neural Networks using a Periodic Function

Edwin Vargas, Claudia Correa, Carlos Hinojosa et al.

CVPR 2024posterarXiv:2404.01278
11
citations
#2575

A Plug-and-Play Image Registration Network

JUNHAO HU, Weijie Gan, Zhixin Sun et al.

ICLR 2024posterarXiv:2310.04297
11
citations
#2576

InstructPix2NeRF: Instructed 3D Portrait Editing from a Single Image

Jianhui Li, Shilong Liu, Zidong Liu et al.

ICLR 2024posterarXiv:2311.02826
11
citations
#2577

Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation

Shuangrui Ding, Rui Qian, Haohang Xu et al.

ECCV 2024posterarXiv:2311.17893
11
citations
#2578

Synergistic Global-space Camera and Human Reconstruction from Videos

Yizhou Zhao, Tuanfeng Y. Wang, Bhiksha Raj et al.

CVPR 2024posterarXiv:2405.14855
11
citations
#2579

Unleashing the Potential of the Semantic Latent Space in Diffusion Models for Image Dehazing

Zizheng Yang, Hu Yu, Bing Li et al.

ECCV 2024posterarXiv:2509.20091
11
citations
#2580

SHERL: Synthesizing High Accuracy and Efficient Memory for Resource-Limited Transfer Learning

Haiwen Diao, Bo Wan, XU JIA et al.

ECCV 2024posterarXiv:2407.07523
11
citations
#2581

Compressing Image-to-Image Translation GANs Using Local Density Structures on Their Learned Manifold

Alireza Ganjdanesh, Shangqian Gao, Hirad Alipanah et al.

AAAI 2024paperarXiv:2312.14776
11
citations
#2582

1497 Once and for All: Universal Transferable Adversarial Perturbation against Deep Hashing-Based Facial Image Retrieval

Long Tang, Dengpan Ye, Yunna Lv et al.

AAAI 2024paper
11
citations
#2583

FedLF: Layer-Wise Fair Federated Learning

Zibin Pan, Chi Li, Fangchen Yu et al.

AAAI 2024paper
11
citations
#2584

The Lottery Ticket Hypothesis in Denoising: Towards Semantic-Driven Initialization

Jiafeng Mao, Xueting Wang, Kiyoharu Aizawa

ECCV 2024posterarXiv:2312.08872
11
citations
#2585

Clockwork Diffusion: Efficient Generation With Model-Step Distillation

Amirhossein Habibian, Amir Ghodrati, Noor Fathima et al.

CVPR 2024highlightarXiv:2312.08128
11
citations
#2586

Synergistic Anchored Contrastive Pre-training for Few-Shot Relation Extraction

Da Luo, Yanglei Gan, Rui Hou et al.

AAAI 2024paperarXiv:2312.12021
11
citations
#2587

BLADE: Box-Level Supervised Amodal Segmentation through Directed Expansion

Zhaochen Liu, Zhixuan Li, Tingting Jiang

AAAI 2024paperarXiv:2401.01642
11
citations
#2588

From Activation to Initialization: Scaling Insights for Optimizing Neural Fields

Hemanth Saratchandran, Sameera Ramasinghe, Simon Lucey

CVPR 2024highlightarXiv:2403.19205
11
citations
#2589

QLABGrad: A Hyperparameter-Free and Convergence-Guaranteed Scheme for Deep Learning

Fang-Xiang Wu, Minghan Fu

AAAI 2024paperarXiv:2302.00252
11
citations
#2590

Skeleton-based Group Activity Recognition via Spatial-Temporal Panoramic Graph

Zhengcen Li, Xinle Chang, Yueran Li et al.

ECCV 2024posterarXiv:2407.19497
11
citations
#2591

Class-Agnostic Object Counting with Text-to-Image Diffusion Model

Xiaofei Hui, Qian Wu, Hossein Rahmani et al.

ECCV 2024poster
11
citations
#2592

Privacy-Preserving Optics for Enhancing Protection in Face De-Identification

Jhon Lopez, Carlos Hinojosa, Henry Arguello et al.

CVPR 2024posterarXiv:2404.00777
11
citations
#2593

Dynamic Layer Tying for Parameter-Efficient Transformers

Tamir David-Hay, Lior Wolf

ICLR 2024poster
11
citations
#2594

Flattening the Parent Bias: Hierarchical Semantic Segmentation in the Poincaré Ball

Simon Weber, Barış Zöngür, Nikita Araslanov et al.

CVPR 2024posterarXiv:2404.03778
11
citations
#2595

DreamStruct: Understanding Slides and User Interfaces via Synthetic Data Generation

Yi-Hao Peng, Faria Huq, Yue Jiang et al.

ECCV 2024posterarXiv:2410.00201
11
citations
#2596

Optimal Sample Complexity of Contrastive Learning

Noga Alon, Dmitrii Avdiukhin, Dor Elboim et al.

ICLR 2024spotlightarXiv:2312.00379
11
citations
#2597

RoadPainter: Points Are Ideal Navigators for Topology transformER

Zhongxing Ma, Liang Shuang, Yongkun Wen et al.

ECCV 2024posterarXiv:2407.15349
11
citations
#2598

Continuous Piecewise-Affine Based Motion Model for Image Animation

Hexiang Wang, Fengqi Liu, Qianyu Zhou et al.

AAAI 2024paperarXiv:2401.09146
11
citations
#2599

Uncertainty Regularized Evidential Regression

Kai Ye, Tiejin Chen, Hua Wei et al.

AAAI 2024paperarXiv:2401.01484
11
citations
#2600

Diff-Reg: Diffusion Model in Doubly Stochastic Matrix Space for Registration Problem

Qianliang Wu, Haobo Jiang, Lei Luo et al.

ECCV 2024poster
11
citations