Most Cited 2024 "parameterized environment configurations" Papers

12,324 papers found • Page 13 of 62

Filters:Most Cited 2024 parameterized environment configurations Clear all

Conference

AAAI 2025 (3,028)COLM 2025 (418)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NEURIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,558)oral (1,594)spotlight (1,421)highlight (975)

#2401

Physical-Based Event Camera Simulator

Haiqian Han, Jiacheng Lyu, Jianing Li et al.

ECCV 2024poster

citations

#2402

FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior

Zhekai Chen, Wen Wang, Zhen Yang et al.

ECCV 2024posterarXiv:2407.04947

citations

#2403

Robust Test-Time Adaptation for Zero-Shot Prompt Tuning

Ding-Chu Zhang, Zhi Zhou, Yufeng Li

AAAI 2024paper

citations

#2404

Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis

Zicheng Zhang, RUOBING ZHENG, Bonan Li et al.

CVPR 2024posterarXiv:2402.17364

citations

#2405

RGNet: A Unified Clip Retrieval and Grounding Network for Long Videos

Tanveer Hannan, Mohaiminul Islam, Thomas Seidl et al.

ECCV 2024posterarXiv:2312.06729

citations

#2406

Pareto Deep Long-Tailed Recognition: A Conflict-Averse Solution

Zhipeng Zhou, Liu Liu, Peilin Zhao et al.

ICLR 2024oral

citations

#2407

UpFusion: Novel View Diffusion from Unposed Sparse View Observations

Bharath Raj Nagoor Kani, Hsin-Ying Lee, Sergey Tulyakov et al.

ECCV 2024posterarXiv:2312.06661

citations

#2408

Multi-Label Cluster Discrimination for Visual Representation Learning

Xiang An, Kaicheng Yang, Xiangzi Dai et al.

ECCV 2024posterarXiv:2407.17331

citations

#2409

ConcaveQ: Non-monotonic Value Function Factorization via Concave Representations in Deep Multi-Agent Reinforcement Learning

Huiqun Li, Hanhan Zhou, Yifei Zou et al.

AAAI 2024paperarXiv:2312.15555

citations

#2410

Weisfeiler and Lehman Go Paths: Learning Topological Features via Path Complexes

Quang Truong, Peter Chin

AAAI 2024paperarXiv:2308.06838

citations

#2411

Real Appearance Modeling for More General Deepfake Detection

Jiahe Tian, Yu Cai, Xi Wang et al.

ECCV 2024poster

citations

#2412

PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance

Aoming Liu, Zhong Li, Zhang Chen et al.

ECCV 2024posterarXiv:2408.02157

citations

#2413

Discriminative Sample-Guided and Parameter-Efficient Feature Space Adaptation for Cross-Domain Few-Shot Learning

Rashindrie Perera, Saman Halgamuge

CVPR 2024posterarXiv:2403.04492

citations

#2414

Task-Disruptive Background Suppression for Few-Shot Segmentation

Suho Park, SuBeen Lee, Sangeek Hyun et al.

AAAI 2024paperarXiv:2312.15894

citations

#2415

Temporally Consistent Stereo Matching

Jiaxi Zeng, Chengtang Yao, Yuwei Wu et al.

ECCV 2024posterarXiv:2407.11950

citations

#2416

D3: A Methodological Exploration of Domain Division, Modeling, and Balance in Multi-Domain Recommendations

Pengyue Jia, Yichao Wang, Shanru LIN et al.

AAAI 2024paper

citations

#2417

Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation

Zhengyuan Xie, Haiquan Lu, Jia-wen Xiao et al.

ECCV 2024posterarXiv:2407.14142

citations

#2418

OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection

Jinghua Hou, Tong Wang, Xiaoqing Ye et al.

ECCV 2024posterarXiv:2407.10753

citations

#2419

PairAug: What Can Augmented Image-Text Pairs Do for Radiology?

Yutong Xie, Qi Chen, Sinuo Wang et al.

CVPR 2024posterarXiv:2404.04960

citations

#2420

Unsupervised Gaze Representation Learning from Multi-view Face Images

Yiwei Bao, Feng Lu

CVPR 2024poster

citations

#2421

S²MVTC: a Simple yet Efficient Scalable Multi-View Tensor Clustering

Zhen Long, Qiyuan Wang, Yazhou Ren et al.

CVPR 2024poster

citations

#2422

Mitigating Background Shift in Class-Incremental Semantic Segmentation

gilhan Park, WonJun Moon, SuBeen Lee et al.

ECCV 2024posterarXiv:2407.11859

citations

#2423

DeTra: A Unified Model for Object Detection and Trajectory Forecasting

Sergio Casas, Ben T Agro, Jiageng Mao et al.

ECCV 2024posterarXiv:2406.04426

citations

#2424

Learning Video Context as Interleaved Multimodal Sequences

Qinghong Lin, Pengchuan Zhang, Difei Gao et al.

ECCV 2024posterarXiv:2407.21757

citations

#2425

Adaptive Self-training Framework for Fine-grained Scene Graph Generation

Kibum Kim, Kanghoon Yoon, Yeonjun In et al.

ICLR 2024posterarXiv:2401.09786

citations

#2426

Mastering Symbolic Operations: Augmenting Language Models with Compiled Neural Networks

Yixuan Weng, Minjun Zhu, Fei Xia et al.

ICLR 2024posterarXiv:2304.01665

citations

#2427

Generalized Planning for the Abstraction and Reasoning Corpus

Chao Lei, Nir Lipovetzky, Krista A. Ehinger

AAAI 2024paperarXiv:2401.07426

citations

#2428

Mitigating the Curse of Dimensionality for Certified Robustness via Dual Randomized Smoothing

Song Xia, Yi Yu, Jiang Xudong et al.

ICLR 2024posterarXiv:2404.09586

citations

#2429

Total-Decom: Decomposed 3D Scene Reconstruction with Minimal Interaction

Xiaoyang Lyu, Chirui Chang, Peng Dai et al.

CVPR 2024highlightarXiv:2403.19314

citations

#2430

UCIP: A Universal Framework for Compressed Image Super-Resolution using Dynamic Prompt

Xin Li, Bingchen Li, Yeying Jin et al.

ECCV 2024posterarXiv:2407.13108

citations

#2431

∞-Brush: Controllable Large Image Synthesis with Diffusion Models in Infinite Dimensions

Minh Quan Le, Alexandros Graikos, Srikar Yellapragada et al.

ECCV 2024posterarXiv:2407.14709

citations

#2432

Towards Reliable Evaluation and Fast Training of Robust Semantic Segmentation Models

Francesco Croce, Naman D. Singh, Matthias Hein

ECCV 2024posterarXiv:2306.12941

citations

#2433

CamoTeacher: Dual-Rotation Consistency Learning for Semi-Supervised Camouflaged Object Detection

Xunfa Lai, Zhiyu Yang, Jie Hu et al.

ECCV 2024posterarXiv:2408.08050

citations

#2434

M3SOT: Multi-Frame, Multi-Field, Multi-Space 3D Single Object Tracking

Jiaming Liu, Yue Wu, Maoguo Gong et al.

AAAI 2024paperarXiv:2312.06117

citations

#2435

Explorative Inbetweening of Time and Space

Haiwen Feng, Zheng Ding, Zhihao Xia et al.

ECCV 2024posterarXiv:2403.14611

citations

#2436

Closed-Loop Unsupervised Representation Disentanglement with $\beta$-VAE Distillation and Diffusion Probabilistic Feedback

Xin Jin, Bohan Li, Baao Xie et al.

ECCV 2024poster

citations

#2437

BRIDGE: Bridging Gaps in Image Captioning Evaluation with Stronger Visual Cues

Sara Sarto, Marcella Cornia, Lorenzo Baraldi et al.

ECCV 2024posterarXiv:2407.20341

citations

#2438

Generalizability of Adversarial Robustness Under Distribution Shifts

Bernard Ghanem, Kumail Alhamoud, Hasan Hammoud et al.

ICLR 2024poster

citations

#2439

ContextSeg: Sketch Semantic Segmentation by Querying the Context with Attention

Jiawei Wang, Changjian Li

CVPR 2024posterarXiv:2311.16682

citations

#2440

BAFFLE: A Baseline of Backpropagation-Free Federated Learning

Haozhe Feng, Tianyu Pang, Chao Du et al.

ECCV 2024posterarXiv:2301.12195

citations

#2441

11293 Cross-Class Feature Augmentation for Class Incremental Learning

Taehoon Kim, JaeYoo Park, Bohyung Han

AAAI 2024paper

citations

#2442

Generative Powers of Ten

Xiaojuan Wang, Janne Kontkanen, Brian Curless et al.

CVPR 2024highlightarXiv:2312.02149

citations

#2443

Multi-modal Crowd Counting via a Broker Modality

Haoliang Meng, Xiaopeng Hong, Chenhao Wang et al.

ECCV 2024posterarXiv:2407.07518

citations

#2444

Can OOD Object Detectors Learn from Foundation Models?

Jiahui Liu, Xin Wen, Shizhen Zhao et al.

ECCV 2024posterarXiv:2409.05162

citations

#2445

ZeroFlow: Scalable Scene Flow via Distillation

Kyle Vedder, Neehar Peri, Nathaniel Chodosh et al.

ICLR 2024oralarXiv:2305.10424

citations

#2446

Asymmetric Masked Distillation for Pre-Training Small Foundation Models

Zhiyu Zhao, Bingkun Huang, Sen Xing et al.

CVPR 2024posterarXiv:2311.03149

citations

#2447

H2O-SDF: Two-phase Learning for 3D Indoor Reconstruction using Object Surface Fields

Minyoung Park, MIRAE DO, Yeon Jae Shin et al.

ICLR 2024spotlightarXiv:2402.08138

citations

#2448

Continuous Treatment Effect Estimation Using Gradient Interpolation and Kernel Smoothing

Lokesh Nagalapatti, Akshay Iyer, Abir De et al.

AAAI 2024paperarXiv:2401.15447

citations

#2449

Modeling and Driving Human Body Soundfields through Acoustic Primitives

Chao Huang, Dejan Markovic, Chenliang Xu et al.

ECCV 2024posterarXiv:2407.13083

citations

#2450

COHO: Context-Sensitive City-Scale Hierarchical Urban Layout Generation

Liu He, Daniel Aliaga

ECCV 2024posterarXiv:2407.11294

citations

#2451

CAVEN: An Embodied Conversational Agent for Efficient Audio-Visual Navigation in Noisy Environments

Xiulong Liu, Sudipta Paul, Moitreya Chatterjee et al.

AAAI 2024paperarXiv:2306.04047

citations

#2452

Noisy Correspondence Learning with Self-Reinforcing Errors Mitigation

Zhuohang Dang, Minnan Luo, Chengyou Jia et al.

AAAI 2024paperarXiv:2312.16478

citations

#2453

DoughNet: A Visual Predictive Model for Topological Manipulation of Deformable Objects

Dominik Bauer, Zhenjia Xu, Shuran Song

ECCV 2024posterarXiv:2404.12524

citations

#2454

Adversarial Attacks on the Interpretation of Neuron Activation Maximization

Géraldin Nanfack, Alexander Fulleringer, Jonathan Marty et al.

AAAI 2024paperarXiv:2306.07397

citations

#2455

Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation

Seonghoon Yu, Paul Hongsuck Seo, Jeany Son

ECCV 2024posterarXiv:2407.07412

citations

#2456

Robust Nonparametric Regression under Poisoning Attack

Puning Zhao, Zhiguo Wan

AAAI 2024paperarXiv:2305.16771

citations

#2457

Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models

Luo Jiayun, Siddhesh Khandelwal, Leonid Sigal et al.

CVPR 2024posterarXiv:2311.17095

citations

#2458

Minimum-Norm Interpolation Under Covariate Shift

Neil Mallinar, Austin Zane, Spencer Frei et al.

ICML 2024posterarXiv:2404.00522

citations

#2459

Functional Diffusion

Biao Zhang, Peter Wonka

CVPR 2024posterarXiv:2311.15435

citations

#2460

MICDrop: Masking Image and Depth Features via Complementary Dropout for Domain-Adaptive Semantic Segmentation

Linyan Yang, Lukas Hoyer, Mark Weber et al.

ECCV 2024posterarXiv:2408.16478

citations

#2461

MGNet: Learning Correspondences via Multiple Graphs

Dai Luanyuan, Xiaoyu Du, Hanwang Zhang et al.

AAAI 2024paperarXiv:2401.04984

citations

#2462

Learning Task-Aware Language-Image Representation for Class-Incremental Object Detection

Hongquan Zhang, Bin-Bin Gao, Yi Zeng et al.

AAAI 2024paper

citations

#2463

Accelerating Neural Field Training via Soft Mining

Shakiba Kheradmand, Daniel Rebain, Gopal Sharma et al.

CVPR 2024posterarXiv:2312.00075

citations

#2464

Generalizable Fourier Augmentation for Unsupervised Video Object Segmentation

Huihui Song, Tiankang Su, Yuhui Zheng et al.

AAAI 2024paper

citations

#2465

AdjointDPM: Adjoint Sensitivity Method for Gradient Backpropagation of Diffusion Probabilistic Models

Jiachun Pan, Jiachun Pan, Jun Hao Liew et al.

ICLR 2024posterarXiv:2307.10711

citations

#2466

Performative Federated Learning: A Solution to Model-Dependent and Heterogeneous Distribution Shifts

Kun Jin, Tongxin Yin, Zhongzhu Chen et al.

AAAI 2024paperarXiv:2305.05090

citations

#2467

RICA^2: Rubric-Informed, Calibrated Assessment of Actions

Abrar Majeedi, Viswanatha Reddy Gajjala, Satya Sai Srinath Namburi GNVV et al.

ECCV 2024poster

citations

#2468

Federated Online Adaptation for Deep Stereo

Matteo Poggi, Fabio Tosi

CVPR 2024posterarXiv:2405.14873

citations

#2469

Learning Efficient and Robust Multi-Agent Communication via Graph Information Bottleneck

Shifei Ding, Wei Du, Ling Ding et al.

AAAI 2024paper

citations

#2470

Bidirectional Temporal Plan Graph: Enabling Switchable Passing Orders for More Efficient Multi-Agent Path Finding Plan Execution

Yifan Su, Rishi Veerapaneni, Jiaoyang Li

AAAI 2024paperarXiv:2401.00315

citations

#2471

Considering Nonstationary within Multivariate Time Series with Variational Hierarchical Transformer for Forecasting

Muyao Wang, Wenchao Chen, Bo Chen

AAAI 2024paperarXiv:2403.05406

citations

#2472

Unexplored Faces of Robustness and Out-of-Distribution: Covariate Shifts in Environment and Sensor Domains

Eunsu Baek, Keondo Park, Ji-yoon Kim et al.

CVPR 2024posterarXiv:2404.15882

citations

#2473

Another Way to the Top: Exploit Contextual Clustering in Learned Image Coding

Yichi Zhang, Zhihao Duan, Ming Lu et al.

AAAI 2024paperarXiv:2401.11615

citations

#2474

SAMFusion: Sensor-Adaptive Multimodal Fusion for 3D Object Detection in Adverse Weather

Edoardo Palladin, Roland Dietze, Praveen Narayanan et al.

ECCV 2024posterarXiv:2508.16408

citations

#2475

Kernel Diffusion: An Alternate Approach to Blind Deconvolution

Yash Sanghvi, Yiheng Chi, Stanley Chan

ECCV 2024posterarXiv:2312.02319

citations

#2476

CMTA: Cross-Modal Temporal Alignment for Event-guided Video Deblurring

Taewoo Kim, Hoonhee Cho, Kuk-Jin Yoon

ECCV 2024posterarXiv:2408.14930

citations

#2477

TexOct: Generating Textures of 3D Models with Octree-based Diffusion

Jialun Liu, Chenming Wu, Xinqi Liu et al.

CVPR 2024poster

citations

#2478

Improving Bird's Eye View Semantic Segmentation by Task Decomposition

Tianhao Zhao, Yongcan Chen, Yu Wu et al.

CVPR 2024posterarXiv:2404.01925

citations

#2479

On the hardness of learning under symmetries

Bobak Kiani, Thien Le, Hannah Lawrence et al.

ICLR 2024spotlightarXiv:2401.01869

citations

#2480

Perturbation-Invariant Adversarial Training for Neural Ranking Models: Improving the Effectiveness-Robustness Trade-Off

Yuansan Liu, Ruqing Zhang, Mingkun Zhang et al.

AAAI 2024paperarXiv:2312.10329

citations

#2481

Mixture of Efficient Diffusion Experts Through Automatic Interval and Sub-Network Selection

Alireza Ganjdanesh, Yan Kang, Yuchen Liu et al.

ECCV 2024posterarXiv:2409.15557

citations

#2482

Correcting Diffusion Generation through Resampling

Yujian Liu, Yang Zhang, Tommi Jaakkola et al.

CVPR 2024highlightarXiv:2312.06038

citations

#2483

Which Model Generated This Image? A Model-Agnostic Approach for Origin Attribution

Fengyuan Liu, Haochen Luo, Yiming Li et al.

ECCV 2024posterarXiv:2404.02697

citations

#2484

EX-Graph: A Pioneering Dataset Bridging Ethereum and X

Qian Wang, Zhen Zhang, Zemin Liu et al.

ICLR 2024posterarXiv:2310.01015

citations

#2485

Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations

Yongyuan Liang, Yanchao Sun, Ruijie Zheng et al.

ICLR 2024oralarXiv:2307.12062

citations

#2486

Towards Real-World Adverse Weather Image Restoration: Enhancing Clearness and Semantics with Vision-Language Models

Jiaqi Xu, Mengyang Wu, Xiaowei Hu et al.

ECCV 2024posterarXiv:2409.02101

citations

#2487

PointInfinity: Resolution-Invariant Point Diffusion Models

Zixuan Huang, Justin Johnson, Shoubhik Debnath et al.

CVPR 2024posterarXiv:2404.03566

citations

#2488

SINDER: Repairing the Singular Defects of DINOv2

Haoqi Wang, Tong Zhang, Mathieu Salzmann

ECCV 2024posterarXiv:2407.16826

citations

#2489

Lipsum-FT: Robust Fine-Tuning of Zero-Shot Models Using Random Text Guidance

Giung Nam, Byeongho Heo, Juho Lee

ICLR 2024posterarXiv:2404.00860

citations

#2490

Language-Informed Visual Concept Learning

Sharon Lee, Yunzhi Zhang, Shangzhe Wu et al.

ICLR 2024posterarXiv:2312.03587

citations

#2491

Discover and Mitigate Multiple Biased Subgroups in Image Classifiers

Zeliang Zhang, Mingqian Feng, Zhiheng Li et al.

CVPR 2024posterarXiv:2403.12777

citations

#2492

R-EDL: Relaxing Nonessential Settings of Evidential Deep Learning

Mengyuan Chen, Junyu Gao, Changsheng Xu

ICLR 2024spotlight

citations

#2493

Contrastive ground-level image and remote sensing pre-training improves representation learning for natural world imagery

Andy V Huynh, Lauren Gillespie, Jael Lopez-Saucedo et al.

ECCV 2024posterarXiv:2409.19439

citations

#2494

Weakly Supervised Monocular 3D Detection with a Single-View Image

Xueying Jiang, Sheng Jin, Lewei Lu et al.

CVPR 2024posterarXiv:2402.19144

citations

#2495

Universal Novelty Detection Through Adaptive Contrastive Learning

Hossein Mirzaei, Mojtaba Nafez, Mohammad Jafari et al.

CVPR 2024posterarXiv:2408.10798

citations

#2496

Defense Against Adversarial Attacks on No-Reference Image Quality Models with Gradient Norm Regularization

Yujia Liu, Chenxi Yang, Dingquan Li et al.

CVPR 2024posterarXiv:2403.11397

citations

#2497

Improving Feature Stability during Upsampling -- Spectral Artifacts and the Importance of Spatial Context

Shashank Agnihotri, Julia Grabinski, Margret Keuper

ECCV 2024posterarXiv:2311.17524

citations

#2498

Exploring Transformer Extrapolation

Zhen Qin, Yiran Zhong, Hui Deng

AAAI 2024paperarXiv:2307.10156

citations

#2499

FutureDepth: Learning to Predict the Future Improves Video Depth Estimation

Rajeev Yasarla, Manish Kumar Singh, Hong Cai et al.

ECCV 2024posterarXiv:2403.12953

citations

#2500

ChEX: Interactive Localization and Region Description in Chest X-rays

Philip Müller, Georgios Kaissis, Daniel Rueckert

ECCV 2024posterarXiv:2404.15770

citations

#2501

SoundingActions: Learning How Actions Sound from Narrated Egocentric Videos

Changan Chen, Kumar Ashutosh, Rohit Girdhar et al.

CVPR 2024posterarXiv:2404.05206

citations

#2502

Beyond Stationarity: Convergence Analysis of Stochastic Softmax Policy Gradient Methods

Sara Klein, Simon Weissmann, Leif Döring

ICLR 2024posterarXiv:2310.02671

citations

#2503

Symbolic Regression Enhanced Decision Trees for Classification Tasks

Kei Sen Fong, Mehul Motani

AAAI 2024paper

citations

#2504

Any-Stereo: Arbitrary Scale Disparity Estimation for Iterative Stereo Matching

Zhaohuai Liang, Changhe Li

AAAI 2024paper

citations

#2505

DreamDiffusion: High-Quality EEG-to-Image Generation with Temporal Masked Signal Modeling and CLIP Alignment

Yunpeng Bai, Xintao Wang, Yanpei Cao et al.

ECCV 2024poster

citations

#2506

C2C: Component-to-Composition Learning for Zero-Shot Compositional Action Recognition

Rongchang Li, Zhenhua Feng, Tianyang Xu et al.

ECCV 2024posterarXiv:2407.06113

citations

#2507

Multi-Attribute Interactions Matter for 3D Visual Grounding

Can Xu, Yuehui Han, Rui Xu et al.

CVPR 2024poster

citations

#2508

CuVLER: Enhanced Unsupervised Object Discoveries through Exhaustive Self-Supervised Transformers

Shahaf Arica, Or Rubin, Sapir Gershov et al.

CVPR 2024posterarXiv:2403.07700

citations

#2509

Neural Causal Abstractions

Kevin Xia, Elias Bareinboim

AAAI 2024paperarXiv:2401.02602

citations

#2510

Double-Layer Hybrid-Label Identification Feature Selection for Multi-View Multi-Label Learning

Pingting Hao, Kunpeng Liu, Wanfu Gao

AAAI 2024paper

citations

#2511

Eliminating Warping Shakes for Unsupervised Online Video Stitching

Lang Nie, Chunyu Lin, Kang Liao et al.

ECCV 2024posterarXiv:2403.06378

citations

#2512

CrossGLG: LLM Guides One-shot Skeleton-based 3D Action Recognition in a Cross-level Manner

Tingbing Yan, Wenzheng Zeng, Yang Xiao et al.

ECCV 2024posterarXiv:2403.10082

citations

#2513

A Generalized Shuffle Framework for Privacy Amplification: Strengthening Privacy Guarantees and Enhancing Utility

Chen E, Yang Cao, Ge Yifei

AAAI 2024paperarXiv:2312.14388

citations

#2514

One Step Closer to Unbiased Aleatoric Uncertainty Estimation

Wang Zhang, Ziwen Martin Ma, Subhro Das et al.

AAAI 2024paperarXiv:2312.10469

citations

#2515

PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation

Shilin Yan, Xiaohao Xu, Renrui Zhang et al.

ECCV 2024posterarXiv:2309.12303

citations

#2516

Data-efficient Large Vision Models through Sequential Autoregression

Zhiwei Hao, Jianyuan Guo, Chengcheng Wang et al.

ICML 2024posterarXiv:2402.04841

citations

#2517

LINGO-Space: Language-Conditioned Incremental Grounding for Space

Dohyun Kim, Nayoung Oh, Deokmin Hwang et al.

AAAI 2024paperarXiv:2402.01183

citations

#2518

Large Content And Behavior Models To Understand, Simulate, And Optimize Content And Behavior

Ashmit Khandelwal, Aditya Agrawal, Aanisha Bhattacharyya et al.

ICLR 2024spotlightarXiv:2309.00359

citations

#2519

IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation

Yuanhao Zhai, Kevin Lin, Linjie Li et al.

ECCV 2024posterarXiv:2407.10937

citations

#2520

Learning Implicit Representation for Reconstructing Articulated Objects

Hao Zhang, Fang Li, Samyak Rawlekar et al.

ICLR 2024posterarXiv:2401.08809

citations

#2521

Quantifying Task Priority for Multi-Task Optimization

Wooseong Jeong, Kuk-Jin Yoon

CVPR 2024posterarXiv:2406.02996

citations

#2522

LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection

Sanmin Kim, Youngseok Kim, Sihwan Hwang et al.

ECCV 2024posterarXiv:2407.10164

citations

#2523

FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models

Wei WU, Qingnan Fan, Shuai Qin et al.

ECCV 2024posterarXiv:2404.11895

citations

#2524

Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models

Xiaoyu Zhu, Hao Zhou, Pengfei Xing et al.

ECCV 2024posterarXiv:2407.13642

citations

#2525

ViC-MAE: Self-Supervised Representation Learning from Images and Video with Contrastive Masked Autoencoders

Jefferson Hernandez, Ruben Villegas, Vicente Ordonez

ECCV 2024posterarXiv:2303.12001

citations

#2526

MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion

Lehong Wu, Lilang Lin, Jiahang Zhang et al.

ECCV 2024posterarXiv:2409.10473

citations

#2527

OmniSSR: Zero-shot Omnidirectional Image Super-Resolution using Stable Diffusion Model

Runyi Li, Xuhan SHENG, Weiqi Li et al.

ECCV 2024posterarXiv:2404.10312

citations

#2528

TimeCraft: Navigate Weakly-Supervised Temporal Grounded Video Question Answering via Bi-directional Reasoning

Huabin Liu, Xiao Ma, Cheng Zhong et al.

ECCV 2024poster

citations

#2529

Timestep-Aware Correction for Quantized Diffusion Models

Yuzhe YAO, Feng Tian, Jun Chen et al.

ECCV 2024posterarXiv:2407.03917

citations

#2530

Butterfly Effects of SGD Noise: Error Amplification in Behavior Cloning and Autoregression

Adam Block, Dylan Foster, Akshay Krishnamurthy et al.

ICLR 2024posterarXiv:2310.11428

citations

#2531

AT4CTR: Auxiliary Match Tasks for Enhancing Click-Through Rate Prediction

Qi Liu, Xuyang Hou, Defu Lian et al.

AAAI 2024paperarXiv:2312.06683

citations

#2532

Temporal Correlation Vision Transformer for Video Person Re-Identification

Pengfei Wu, Le Wang, Sanping Zhou et al.

AAAI 2024paper

citations

#2533

Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities

Yiyuan Zhang, Xiaohan Ding, Kaixiong Gong et al.

CVPR 2024posterarXiv:2401.14405

citations

#2534

NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model

Zhongqun Zhang, Hengfei Wang, Ziwei Yu et al.

ECCV 2024posterarXiv:2407.12727

citations

#2535

MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment

Anurag Das, Xinting Hu, Li Jiang et al.

ECCV 2024posterarXiv:2407.21654

citations

#2536

Inf-DiT: Upsampling any-resolution image with memory-efficient diffusion transformer.

Zhuoyi Yang, Heyang Jiang, Wenyi Hong et al.

ECCV 2024posterarXiv:2405.04312

citations

#2537

DiffusionPen: Towards Controlling the Style of Handwritten Text Generation

KONSTANTINA NIKOLAIDOU, George Retsinas, Giorgos Sfikas et al.

ECCV 2024posterarXiv:2409.06065

citations

#2538

Deblur e-NeRF: NeRF from Motion-Blurred Events under High-speed or Low-light Conditions

Weng Fei Low, Gim Hee Lee

ECCV 2024posterarXiv:2409.17988

citations

#2539

Physical Backdoor: Towards Temperature-based Backdoor Attacks in the Physical World

Wen Yin, Jian Lou, Pan Zhou et al.

CVPR 2024posterarXiv:2404.19417

citations

#2540

A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives

Simone Alberto Peirone, Francesca Pistilli, Antonio Alliegro et al.

CVPR 2024posterarXiv:2403.03037

citations

#2541

EDformer: Transformer-Based Event Denoising Across Varied Noise Levels

Bin Jiang, Bo Xiong, Bohan Qu et al.

ECCV 2024poster

citations

#2542

SDGMNet: Statistic-Based Dynamic Gradient Modulation for Local Descriptor Learning

Yuxin Deng, Jiayi Ma

AAAI 2024paperarXiv:2106.04434

citations

#2543

StraightPCF: Straight Point Cloud Filtering

Dasith de Silva Edirimuni, Xuequan Lu, Gang Li et al.

CVPR 2024posterarXiv:2405.08322

citations

#2544

Dataset Quantization with Active Learning based Adaptive Sampling

Zhenghao Zhao, Yuzhang Shang, Junyi Wu et al.

ECCV 2024posterarXiv:2407.07268

citations

#2545

DTMFormer: Dynamic Token Merging for Boosting Transformer-Based Medical Image Segmentation

Zhehao Wang, Xian Lin, Nannan Wu et al.

AAAI 2024paper

citations

#2546

Self-Adaptive Reality-Guided Diffusion for Artifact-Free Super-Resolution

Qingping Zheng, Ling Zheng, Yuanfan Guo et al.

CVPR 2024posterarXiv:2403.16643

citations

#2547

Finsler-Laplace-Beltrami Operators with Application to Shape Analysis

Simon Weber, Thomas Dagès, Maolin Gao et al.

CVPR 2024posterarXiv:2404.03999

citations

#2548

KFD-NeRF: Rethinking Dynamic NeRF with Kalman Filter

Yifan Zhan, Zhuoxiao Li, Muyao Niu et al.

ECCV 2024posterarXiv:2407.13185

citations

#2549

FedST: Federated Style Transfer Learning for Non-IID Image Segmentation

Boyuan Ma, Yin Xiang, Jing Tan et al.

AAAI 2024paper

citations

#2550

Hybrid Video Diffusion Models with 2D Triplane and 3D Wavelet Representation

Kihong Kim, Haneol Lee, Jihye Park et al.

ECCV 2024posterarXiv:2402.13729

citations

#2551

Workflow Discovery from Dialogues in the Low Data Regime

David Vazquez, Stefania Raimondo, Christopher Pal et al.

ICLR 2024poster

citations

#2552

Lewis's Signaling Game as beta-VAE For Natural Word Lengths and Segments

Ryo Ueda, TADAHIRO TANIGUCHI

ICLR 2024posterarXiv:2311.04453

citations

#2553

Benchmarking Spurious Bias in Few-Shot Image Classifiers

Guangtao Zheng, Wenqian Ye, Aidong Zhang

ECCV 2024posterarXiv:2409.02882

citations

#2554

CarFormer: Self-Driving with Learned Object-Centric Representations

Shadi Hamdan, Fatma Guney

ECCV 2024posterarXiv:2407.15843

citations

#2555

BlenderAlchemy: Editing 3D Graphics with Vision-Language Models

Ian Huang, Guandao Yang, Leonidas Guibas

ECCV 2024posterarXiv:2404.17672

citations

#2556

DP-SGD Without Clipping: The Lipschitz Neural Network Way

Louis Béthune, Thomas Massena, Thibaut Boissin et al.

ICLR 2024poster

citations

#2557

Bi-Causal: Group Activity Recognition via Bidirectional Causality

Youliang Zhang, Wenxuan Liu, danni xu et al.

CVPR 2024poster

citations

#2558

Union Subgraph Neural Networks

Jiaxing Xu, Aihu Zhang, Qingtian Bian et al.

AAAI 2024paperarXiv:2305.15747

citations

#2559

Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding

Ruihuang Li, Zhengqiang ZHANG, Chenhang He et al.

ECCV 2024posterarXiv:2407.09781

citations

#2560

UNEX-RL: Reinforcing Long-Term Rewards in Multi-Stage Recommender Systems with UNidirectional EXecution

Gengrui Zhang, Xiaoshuang Chen, Yao WANG et al.

AAAI 2024paperarXiv:2401.06470

citations

#2561

Towards Understanding and Improving Adversarial Robustness of Vision Transformers

Samyak Jain, Tanima Dutta

CVPR 2024poster

citations

#2562

Sparse Beats Dense: Rethinking Supervision in Radar-Camera Depth Completion

Huadong Li, Minhao Jing, Jin Wang et al.

ECCV 2024posterarXiv:2312.00844

citations

#2563

JointSQ: Joint Sparsification-Quantization for Distributed Learning

Weiying Xie, Haowei Li, Ma Jitao et al.

CVPR 2024poster

citations

#2564

Real-time Holistic Robot Pose Estimation with Unknown States

Shikun Ban, Juling Fan, Xiaoxuan Ma et al.

ECCV 2024posterarXiv:2402.05655

citations

#2565

Stabilizing Contrastive RL: Techniques for Robotic Goal Reaching from Offline Data

Chongyi Zheng, Benjamin Eysenbach, Homer Walke et al.

ICLR 2024spotlightarXiv:2306.03346

citations

#2566

FLHetBench: Benchmarking Device and State Heterogeneity in Federated Learning

Junyuan Zhang, Shuang Zeng, Miao Zhang et al.

CVPR 2024poster

citations

#2567

Prompt Risk Control: A Rigorous Framework for Responsible Deployment of Large Language Models

Thomas Zollo, Todd Morrill, Zhun Deng et al.

ICLR 2024posterarXiv:2311.13628

citations

#2568

TF-FAS: Twofold-Element Fine-Grained Semantic Guidance for Generalizable Face Anti-Spoofing

Xudong Wang, Ke-Yue Zhang, Taiping Yao et al.

ECCV 2024poster

citations

#2569

Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion

Yu Cao, Shaogang Gong

ECCV 2024posterarXiv:2407.07249

citations

#2570

SyncMask: Synchronized Attentional Masking for Fashion-centric Vision-Language Pretraining

Chull Hwan Song, Taebaek Hwang, Jooyoung Yoon et al.

CVPR 2024posterarXiv:2404.01156

citations

#2571

Multi-Step Denoising Scheduled Sampling: Towards Alleviating Exposure Bias for Diffusion Models

Zhiyao Ren, Yibing Zhan, Liang Ding et al.

AAAI 2024paper

citations

#2572

Fairness-aware Vision Transformer via Debiased Self-Attention

Yao Qiang, Chengyin Li, Prashant Khanduri et al.

ECCV 2024posterarXiv:2301.13803

citations

#2573

MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers

Haoyu Ma, Shahin Mahdizadehaghdam, Bichen Wu et al.

CVPR 2024posterarXiv:2312.12468

citations

#2574

BiPer: Binary Neural Networks using a Periodic Function

Edwin Vargas, Claudia Correa, Carlos Hinojosa et al.

CVPR 2024posterarXiv:2404.01278

citations

#2575

A Plug-and-Play Image Registration Network

JUNHAO HU, Weijie Gan, Zhixin Sun et al.

ICLR 2024posterarXiv:2310.04297

citations

#2576

InstructPix2NeRF: Instructed 3D Portrait Editing from a Single Image

Jianhui Li, Shilong Liu, Zidong Liu et al.

ICLR 2024posterarXiv:2311.02826

citations

#2577

Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation

Shuangrui Ding, Rui Qian, Haohang Xu et al.

ECCV 2024posterarXiv:2311.17893

citations

#2578

Synergistic Global-space Camera and Human Reconstruction from Videos

Yizhou Zhao, Tuanfeng Y. Wang, Bhiksha Raj et al.

CVPR 2024posterarXiv:2405.14855

citations

#2579

Unleashing the Potential of the Semantic Latent Space in Diffusion Models for Image Dehazing

Zizheng Yang, Hu Yu, Bing Li et al.

ECCV 2024posterarXiv:2509.20091

citations

#2580

SHERL: Synthesizing High Accuracy and Efficient Memory for Resource-Limited Transfer Learning

Haiwen Diao, Bo Wan, XU JIA et al.

ECCV 2024posterarXiv:2407.07523

citations

#2581

Compressing Image-to-Image Translation GANs Using Local Density Structures on Their Learned Manifold

Alireza Ganjdanesh, Shangqian Gao, Hirad Alipanah et al.

AAAI 2024paperarXiv:2312.14776

citations

#2582

1497 Once and for All: Universal Transferable Adversarial Perturbation against Deep Hashing-Based Facial Image Retrieval

Long Tang, Dengpan Ye, Yunna Lv et al.

AAAI 2024paper

citations

#2583

FedLF: Layer-Wise Fair Federated Learning

Zibin Pan, Chi Li, Fangchen Yu et al.

AAAI 2024paper

citations

#2584

The Lottery Ticket Hypothesis in Denoising: Towards Semantic-Driven Initialization

Jiafeng Mao, Xueting Wang, Kiyoharu Aizawa

ECCV 2024posterarXiv:2312.08872

citations

#2585

Clockwork Diffusion: Efficient Generation With Model-Step Distillation

Amirhossein Habibian, Amir Ghodrati, Noor Fathima et al.

CVPR 2024highlightarXiv:2312.08128

citations

#2586

Synergistic Anchored Contrastive Pre-training for Few-Shot Relation Extraction

Da Luo, Yanglei Gan, Rui Hou et al.

AAAI 2024paperarXiv:2312.12021

citations

#2587

BLADE: Box-Level Supervised Amodal Segmentation through Directed Expansion

Zhaochen Liu, Zhixuan Li, Tingting Jiang

AAAI 2024paperarXiv:2401.01642

citations

#2588

From Activation to Initialization: Scaling Insights for Optimizing Neural Fields

Hemanth Saratchandran, Sameera Ramasinghe, Simon Lucey

CVPR 2024highlightarXiv:2403.19205

citations

#2589

QLABGrad: A Hyperparameter-Free and Convergence-Guaranteed Scheme for Deep Learning

Fang-Xiang Wu, Minghan Fu

AAAI 2024paperarXiv:2302.00252

citations

#2590

Skeleton-based Group Activity Recognition via Spatial-Temporal Panoramic Graph

Zhengcen Li, Xinle Chang, Yueran Li et al.

ECCV 2024posterarXiv:2407.19497

citations

#2591

Class-Agnostic Object Counting with Text-to-Image Diffusion Model

Xiaofei Hui, Qian Wu, Hossein Rahmani et al.

ECCV 2024poster

citations

#2592

Privacy-Preserving Optics for Enhancing Protection in Face De-Identification

Jhon Lopez, Carlos Hinojosa, Henry Arguello et al.

CVPR 2024posterarXiv:2404.00777

citations

#2593

Dynamic Layer Tying for Parameter-Efficient Transformers

Tamir David-Hay, Lior Wolf

ICLR 2024poster

citations

#2594

Flattening the Parent Bias: Hierarchical Semantic Segmentation in the Poincaré Ball

Simon Weber, Barış Zöngür, Nikita Araslanov et al.

CVPR 2024posterarXiv:2404.03778

citations

#2595

DreamStruct: Understanding Slides and User Interfaces via Synthetic Data Generation

Yi-Hao Peng, Faria Huq, Yue Jiang et al.

ECCV 2024posterarXiv:2410.00201

citations

#2596

Optimal Sample Complexity of Contrastive Learning

Noga Alon, Dmitrii Avdiukhin, Dor Elboim et al.

ICLR 2024spotlightarXiv:2312.00379

citations

#2597

RoadPainter: Points Are Ideal Navigators for Topology transformER

Zhongxing Ma, Liang Shuang, Yongkun Wen et al.

ECCV 2024posterarXiv:2407.15349

citations

#2598

Continuous Piecewise-Affine Based Motion Model for Image Animation

Hexiang Wang, Fengqi Liu, Qianyu Zhou et al.

AAAI 2024paperarXiv:2401.09146

citations

#2599

Uncertainty Regularized Evidential Regression

Kai Ye, Tiejin Chen, Hua Wei et al.

AAAI 2024paperarXiv:2401.01484

citations

#2600

Diff-Reg: Diffusion Model in Doubly Stochastic Matrix Space for Registration Problem

Qianliang Wu, Haobo Jiang, Lei Luo et al.

ECCV 2024poster

citations

← Previous

1...11 12 13 14 15...62