Most Cited 2024 "conditional score networks" Papers

12,324 papers found • Page 40 of 62

#7801

Diffusion Posterior Sampling is Computationally Intractable

Shivam Gupta, Ajil Jalal, Aditya Parulekar et al.

ICML 2024 • arXiv:2402.12727
#7802

PerceptAnon: Exploring the Human Perception of Image Anonymization Beyond Pseudonymization for GDPR

Kartik Patwari, Chen-Nee Chuah, Lingjuan Lyu et al.

ICML 2024
#7803

Do Topological Characteristics Help in Knowledge Distillation?

Jungeun Kim, Junwon You, Dongjin Lee et al.

ICML 2024
#7804

Stochastic Optimization with Arbitrary Recurrent Data Sampling

William Powell, Hanbaek Lyu

ICML 2024 • arXiv:2401.07694
#7805

Partially Stochastic Infinitely Deep Bayesian Neural Networks

Sergio Calvo Ordoñez, Matthieu Meunier, Francesco Piatti et al.

ICML 2024
#7806

Neuro-Visualizer: A Novel Auto-Encoder-Based Loss Landscape Visualization Method With an Application in Knowledge-Guided Machine Learning

Mohannad Elhamod, Anuj Karpatne

ICML 2024
#7807

Discovering Bias in Latent Space: An Unsupervised Debiasing Approach

Dyah Adila, Shuai Zhang, Boran Han et al.

ICML 2024 • arXiv:2406.03631
#7808

Centralized Selection with Preferences in the Presence of Biases

L. Elisa Celis, Amit Kumar, Nisheeth K. Vishnoi et al.

ICML 2024 • arXiv:2409.04897
#7809

Generalizing Knowledge Graph Embedding with Universal Orthogonal Parameterization

Rui Li, Chaozhuo Li, Yanming Shen et al.

ICML 2024 • arXiv:2405.08540
#7810

DySLIM: Dynamics Stable Learning by Invariant Measure for Chaotic Systems

Yair Schiff, Zhong Yi Wan, Jeffrey Parker et al.

ICML 2024 • arXiv:2402.04467
#7811

Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibration

Zhongzhi Yu, Zheng Wang, Yonggan Fu et al.

ICML 2024 • arXiv:2406.15765
#7812

BAT: Learning to Reason about Spatial Sounds with Large Language Models

Zhisheng Zheng, Puyuan Peng, Ziyang Ma et al.

ICML 2024 • arXiv:2402.01591
#7813

Rethinking Transformers in Solving POMDPs

Chenhao Lu, Ruizhe Shi, Yuyao Liu et al.

ICML 2024 • arXiv:2405.17358
#7814

Symmetric Replay Training: Enhancing Sample Efficiency in Deep Reinforcement Learning for Combinatorial Optimization

Hyeonah Kim, Minsu Kim, Sungsoo Ahn et al.

ICML 2024 • arXiv:2306.01276
#7815

From Biased Selective Labels to Pseudo-Labels: An Expectation-Maximization Framework for Learning from Biased Decisions

Trenton Chang, Jenna Wiens

ICML 2024 • arXiv:2406.18865
#7816

Embodied CoT Distillation From LLM To Off-the-shelf Agents

Wonje Choi, Woo Kyung Kim, Minjong Yoo et al.

ICML 2024 • arXiv:2412.11499
#7817

A General Framework for Sequential Decision-Making under Adaptivity Constraints

Nuoya Xiong, Zhaoran Wang, Zhuoran Yang

ICML 2024 • arXiv:2306.14468
#7818

How Does Goal Relabeling Improve Sample Efficiency?

Sirui Zheng, Chenjia Bai, Zhuoran Yang et al.

ICML 2024
#7819

Theory of Consistency Diffusion Models: Distribution Estimation Meets Fast Sampling

Zehao Dou, Minshuo Chen, Mengdi Wang et al.

ICML 2024
#7820

RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis

Yao Mu, Junting Chen, Qing-Long Zhang et al.

ICML 2024 • arXiv:2402.16117
#7821

Enhancing Adversarial Robustness in SNNs with Sparse Gradients

Yujia Liu, Tong Bu, Ding Jianhao et al.

ICML 2024 • arXiv:2405.20355
#7822

Layerwise Change of Knowledge in Neural Networks

Xu Cheng, Lei Cheng, Zhaoran Peng et al.

ICML 2024 • arXiv:2409.08712
#7823

Analysis for Abductive Learning and Neural-Symbolic Reasoning Shortcuts

Xiao-Wen Yang, Wen-Da Wei, Jie-Jing Shao et al.

ICML 2024
#7824

On Computational Limits of Modern Hopfield Models: A Fine-Grained Complexity Analysis

Jerry Yao-Chieh Hu, Thomas Lin, Zhao Song et al.

ICML 2024 • arXiv:2402.04520
#7825

Use Your INSTINCT: INSTruction optimization for LLMs usIng Neural bandits Coupled with Transformers

Xiaoqiang Lin, Zhaoxuan Wu, Zhongxiang Dai et al.

ICML 2024 • arXiv:2310.02905
#7826

EMC$^2$: Efficient MCMC Negative Sampling for Contrastive Learning with Global Convergence

Chung-Yiu Yau, Hoi To Wai, Parameswaran Raman et al.

ICML 2024 • arXiv:2404.10575
#7827

Smoothness Adaptive Hypothesis Transfer Learning

Haotian Lin, Matthew Reimherr

ICML 2024 • arXiv:2402.14966
#7828

Envisioning Outlier Exposure by Large Language Models for Out-of-Distribution Detection

Chentao Cao, Zhun Zhong, Zhanke Zhou et al.

ICML 2024 • arXiv:2406.00806
#7829

Tilt your Head: Activating the Hidden Spatial-Invariance of Classifiers

Johann Schmidt, Sebastian Stober

ICML 2024 • arXiv:2405.03730
#7830

WISER: Weak Supervision and Supervised Representation Learning to Improve Drug Response Prediction in Cancer

Kumar Shubham, Aishwarya Jayagopal, Syed Danish et al.

ICML 2024 • arXiv:2405.04078
#7831

Interacting Diffusion Processes for Event Sequence Forecasting

Mai Zeng, Florence Regol, Mark Coates

ICML 2024 • oral • arXiv:2310.17800
#7832

Recurrent Early Exits for Federated Learning with Heterogeneous Clients

Royson Lee, Javier Fernandez-Marques, Xu Hu et al.

ICML 2024 • arXiv:2405.14791
#7833

On Interpolating Experts and Multi-Armed Bandits

Houshuang Chen, Yuchen He, Chihao Zhang

ICML 2024 • arXiv:2307.07264
#7834

Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforcement Learning

Xinran Li, Zifan Liu, Shibo Chen et al.

ICML 2024 • arXiv:2405.18110
#7835

Revitalizing Multivariate Time Series Forecasting: Learnable Decomposition with Inter-Series Dependencies and Intra-Series Variations Modeling

Guoqi Yu, Jing Zou, Xiaowei Hu et al.

ICML 2024 • arXiv:2402.12694
#7836

NeuralIndicator: Implicit Surface Reconstruction from Neural Indicator Priors

Shi-Sheng Huang, Guo Chen, Li-heng Chen et al.

ICML 2024
#7837

On the Calibration of Human Pose Estimation

Kerui Gu, Rongyu Chen, Xuanlong Yu et al.

ICML 2024 • arXiv:2311.17105
#7838

MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI

Kaining Ying, Fanqing Meng, Jin Wang et al.

ICML 2024 • arXiv:2404.16006
#7839

Size-invariance Matters: Rethinking Metrics and Losses for Imbalanced Multi-object Salient Object Detection

Feiran Li, Qianqian Xu, Shilong Bao et al.

ICML 2024 • spotlight • arXiv:2405.09782
#7840

The Surprising Effectiveness of Skip-Tuning in Diffusion Sampling

Jiajun Ma, Shuchen Xue, Tianyang Hu et al.

ICML 2024 • arXiv:2402.15170
#7841

RVI-SAC: Average Reward Off-Policy Deep Reinforcement Learning

Yukinari Hisaki, Isao Ono

ICML 2024 • arXiv:2408.01972
#7842

Smooth Tchebycheff Scalarization for Multi-Objective Optimization

Xi Lin, Xiaoyuan Zhang, Zhiyuan Yang et al.

ICML 2024 • arXiv:2402.19078
#7843

DFD: Distilling the Feature Disparity Differently for Detectors

Kang Liu, Yingyi Zhang, Jingyun Zhang et al.

ICML 2024
#7844

Evolution of Heuristics: Towards Efficient Automatic Algorithm Design Using Large Language Model

Fei Liu, Tong Xialiang, Mingxuan Yuan et al.

ICML 2024 • arXiv:2401.02051
#7845

In-Context Unlearning: Language Models as Few-Shot Unlearners

Martin Pawelczyk, Seth Neel, Himabindu Lakkaraju

ICML 2024
#7846

Reward Model Learning vs. Direct Policy Optimization: A Comparative Analysis of Learning from Human Preferences

Andi Nika, Debmalya Mandal, Parameswaran Kamalaruban et al.

ICML 2024 • arXiv:2403.01857
#7847

Agent-Specific Effects: A Causal Effect Propagation Analysis in Multi-Agent MDPs

Stelios Triantafyllou, Aleksa Sukovic, Debmalya Mandal et al.

ICML 2024 • arXiv:2310.11334
#7848

KernelSHAP-IQ: Weighted Least Square Optimization for Shapley Interactions

Fabian Fumagalli, Maximilian Muschalik, Patrick Kolpaczki et al.

ICML 2024
#7849

Reference Neural Operators: Learning the Smooth Dependence of Solutions of PDEs on Geometric Deformations

Ze Cheng, Zhongkai Hao, Wang Xiaoqiang et al.

ICML 2024 • arXiv:2405.17509
#7850

Auto-Linear Phenomenon in Subsurface Imaging

Yinan Feng, Yinpeng Chen, Peng Jin et al.

ICML 2024 • arXiv:2305.13314
#7851

Accelerating Parallel Sampling of Diffusion Models

Zhiwei Tang, Jiasheng Tang, Hao Luo et al.

ICML 2024 • arXiv:2402.09970
#7852

From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems

Jianliang He, Siyu Chen, Fengzhuo Zhang et al.

ICML 2024 • arXiv:2405.19883
#7853

Acquiring Diverse Skills using Curriculum Reinforcement Learning with Mixture of Experts

Onur Celik, Aleksandar Taranovic, Gerhard Neumann

ICML 2024 • arXiv:2403.06966
#7854

DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models (Exemplified as A Video Agent)

Zongxin Yang, Guikun Chen, Xiaodi Li et al.

ICML 2024 • oral • arXiv:2401.08392
#7855

Distributed Bilevel Optimization with Communication Compression

Yutong He, Jie Hu, Xinmeng Huang et al.

ICML 2024 • arXiv:2405.18858
#7856

Inherent Trade-Offs between Diversity and Stability in Multi-Task Benchmarks

Guanhua Zhang, Moritz Hardt

ICML 2024 • oral • arXiv:2405.01719
#7857

On the Minimal Degree Bias in Generalization on the Unseen for non-Boolean Functions

Denys Pushkin, Raphaël Berthier, Emmanuel Abbe

ICML 2024 • arXiv:2406.06354
#7858

On the Weight Dynamics of Deep Normalized Networks

Christian H.X. Ali Mehmeti-Göpel, Michael Wand

ICML 2024 • arXiv:2306.00700
#7859

Jacobian Regularizer-based Neural Granger Causality

Wanqi Zhou, Shuanghao Bai, Shujian Yu et al.

ICML 2024 • arXiv:2405.08779
#7860

Diffusion Tempering Improves Parameter Estimation with Probabilistic Integrators for Ordinary Differential Equations

Jonas Beck, Nathanael Bosch, Michael Deistler et al.

ICML 2024 • arXiv:2402.12231
#7861

Projecting Molecules into Synthesizable Chemical Spaces

Shitong Luo, Wenhao Gao, Zuofan Wu et al.

ICML 2024 • arXiv:2406.04628
#7862

A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts

Kuang-Huei Lee, Xinyun Chen, Hiroki Furuta et al.

ICML 2024 • arXiv:2402.09727
#7863

Energy-based Backdoor Defense without Task-Specific Samples and Model Retraining

Yudong Gao, Honglong Chen, Peng Sun et al.

ICML 2024
#7864

Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Yichao Fu, Peter Bailis, Ion Stoica et al.

ICML 2024 • arXiv:2402.02057
#7865

Don’t Label Twice: Quantity Beats Quality when Comparing Binary Classifiers on a Budget

Florian Dorner, Moritz Hardt

ICML 2024
#7866

Causal Inference from Competing Treatments

Ana-Andreea Stoica, Vivian Y. Nastl, Moritz Hardt

ICML 2024 • arXiv:2406.03422
#7867

Denoising Autoregressive Representation Learning

Yazhe Li, Jorg Bornschein, Ting Chen

ICML 2024 • arXiv:2403.05196
#7868

On a Neural Implementation of Brenier's Polar Factorization

Nina Vesseron, Marco Cuturi

ICML 2024 • spotlight • arXiv:2403.03071
#7869

Causally Motivated Personalized Federated Invariant Learning with Shortcut-Averse Information-Theoretic Regularization

Xueyang Tang, Song Guo, Jingcai Guo et al.

ICML 2024
#7870

Privacy Attacks in Decentralized Learning

Abdellah El Mrini, Edwige Cyffers, Aurélien Bellet

ICML 2024 • arXiv:2402.10001
#7871

Membership Inference Attacks on Diffusion Models via Quantile Regression

Shuai Tang, Steven Wu, Sergul Aydore et al.

ICML 2024 • arXiv:2312.05140
#7872

Hybrid Neural Representations for Spherical Data

Hyomin Kim, Yunhui Jang, Jaeho Lee et al.

ICML 2024 • oral • arXiv:2402.05965
#7873

4D Contrastive Superflows are Dense 3D Representation Learners

Xiang Xu, Lingdong Kong, Hui Shuai et al.

ECCV 2024 • arXiv:2407.06190
#7874

Premise Order Matters in Reasoning with Large Language Models

Xinyun Chen, Ryan Chi, Xuezhi Wang et al.

ICML 2024 • arXiv:2402.08939
#7875

Weakly Supervised 3D Object Detection via Multi-Level Visual Guidance

Kuan-Chih Huang, Yi-Hsuan Tsai, Ming-Hsuan Yang

ECCV 2024 • arXiv:2312.07530
#7876

Learning to Generate Conditional Tri-plane for 3D-aware Expression Controllable Portrait Animation

Taekyung Ki, Dongchan Min, Gyeongsu Chae

ECCV 2024 • arXiv:2404.00636
#7877

Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing

Vadim Titov, Madina Khalmatova, Alexandra Ivanova et al.

ECCV 2024 • arXiv:2409.01322
#7878

Disentangling Masked Autoencoders for Unsupervised Domain Generalization

An Zhang, Han Wang, Xiang Wang et al.

ECCV 2024 • arXiv:2407.07544
#7879

BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos

Pilhyeon Lee, Hyeran Byun

ECCV 2024 • arXiv:2312.00083
#7880

MarineInst: A Foundation Model for Marine Image Analysis with Instance Visual Description

Ziqiang Zheng, Yiwei Chen, Huimin Zeng et al.

ECCV 2024
#7881

BRAVE: Broadening the visual encoding of vision-language models

Oguzhan Fatih Kar, Alessio Tonioni, Petra Poklukar et al.

ECCV 2024 • arXiv:2404.07204
#7882

SplatFields: Neural Gaussian Splats for Sparse 3D and 4D Reconstruction

Marko Mihajlovic, Sergey Prokudin, Siyu Tang et al.

ECCV 2024 • arXiv:2409.11211
#7883

CPT-VR: Improving Surface Rendering via Closest Point Transform with View-Reflection Appearance

Zhipeng Hu, Yongqiang Zhang, Chen Liu et al.

ECCV 2024
#7884

MapDistill: Boosting Efficient Camera-based HD Map Construction via Camera-LiDAR Fusion Model Distillation

Xiaoshuai Hao, Ruikai Li, Hui Zhang et al.

ECCV 2024 • arXiv:2407.11682
#7885

High-Resolution and Few-shot View Synthesis from Asymmetric Dual-lens Inputs

Ruikang Xu, Mingde Yao, Yue Li et al.

ECCV 2024
#7886

HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects

Xintao Lv, Liang Xu, Yichao Yan et al.

ECCV 2024 • arXiv:2407.12371
#7887

GKGNet: Group K-Nearest Neighbor based Graph Convolutional Network for Multi-Label Image Recognition

Ruijie Yao, Sheng Jin, Lumin Xu et al.

ECCV 2024 • arXiv:2308.14378
#7888

Merlin: Empowering Multimodal LLMs with Foresight Minds

En Yu, Liang Zhao, Yana Wei et al.

ECCV 2024
#7889

E.T. the Exceptional Trajectory: Text-to-camera-trajectory generation with character awareness

Robin Courant, Nicolas Dufour, Xi Wang et al.

ECCV 2024 • arXiv:2407.01516
#7890

SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs

Yang Miao, Francis Engelmann, Olga Vysotska et al.

ECCV 2024 • arXiv:2404.00469
#7891

Spectral Subsurface Scattering for Material Classification

Haejoon Lee, Aswin C. Sankaranarayanan

ECCV 2024
#7892

Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation

Yunhao Gou, Kai Chen, Zhili Liu et al.

ECCV 2024 • arXiv:2403.09572
#7893

Textual-Visual Logic Challenge: Understanding and Reasoning in Text-to-Image Generation

Peixi Xiong, Michael A Kozuch, Nilesh Jain

ECCV 2024
#7894

ShareGPT4V: Improving Large Multi-Modal Models with Better Captions

Lin Chen, Jinsong Li, Xiaoyi Dong et al.

ECCV 2024 • arXiv:2311.12793
#7895

QUAR-VLA: Vision-Language-Action Model for Quadruped Robots

Pengxiang Ding, Han Zhao, Wenjie Zhang et al.

ECCV 2024 • arXiv:2312.14457
#7896

Cross-Input Certified Training for Universal Perturbations

Changming Xu, Gagandeep Singh

ECCV 2024 • arXiv:2405.09176
#7897

Rethinking and Improving Visual Prompt Selection for In-Context Learning Segmentation Framework

Wei Suo, Lanqing Lai, Mengyang Sun et al.

ECCV 2024
#7898

LiDAR-Event Stereo Fusion with Hallucinations

Luca Bartolomei, Matteo Poggi, Andrea Conti et al.

ECCV 2024 • arXiv:2408.04633
#7899

MMBENCH: Is Your Multi-Modal Model an All-around Player?

Yuan Liu, Haodong Duan, Yuanhan Zhang et al.

ECCV 2024 • arXiv:2307.06281
#7900

Implicit Filtering for Learning Neural Signed Distance Functions from 3D Point Clouds

Shengtao Li, Ge Gao, Yudong Liu et al.

ECCV 2024 • arXiv:2407.13342
#7901

Unsupervised Exposure Correction

Ruodai Cui, Li Niu, Guosheng Hu

ECCV 2024 • arXiv:2507.17252
#7902

SceneScript: Reconstructing Scenes With An Autoregressive Structured Language Model

Armen Avetisyan, Christopher Xie, Henry Howard-Jenkins et al.

ECCV 2024 • arXiv:2403.13064
#7903

GlobalPointer: Large-Scale Plane Adjustment with Bi-Convex Relaxation

Bangyan Liao, Zhenjun Zhao, Lu Chen et al.

ECCV 2024 • arXiv:2407.13537
#7904

3D Congealing: 3D-Aware Image Alignment in the Wild

Yunzhi Zhang, Zizhang Li, Amit Raj et al.

ECCV 2024 • arXiv:2404.02125
#7905

Semi-Supervised Teacher-Reference-Student Architecture for Action Quality Assessment

Wulian Yun, Mengshi Qi, Fei Peng et al.

ECCV 2024 • arXiv:2407.19675
#7906

Occluded Gait Recognition with Mixture of Experts: An Action Detection Perspective

Panjian Huang, Yunjie Peng, Saihui Hou et al.

ECCV 2024
#7907

Rejection Sampling IMLE: Designing Priors for Better Few-Shot Image Synthesis

Chirag Vashist, Shichong Peng, Ke Li

ECCV 2024 • arXiv:2409.17439
#7908

Correspondences of the Third Kind: Camera Pose Estimation from Object Reflection

Kohei Yamashita, Vincent Lepetit, Ko Nishino

ECCV 2024 • arXiv:2312.04527
#7909

Robust Fitting on a Gate Quantum Computer

Frances Yang, Michele Sasdelli, Tat-Jun Chin

ECCV 2024 • arXiv:2409.02006
#7910

Defect Spectrum: A Granular Look of Large-scale Defect Datasets with Rich Semantics

Shuai Yang, ZhiFei Chen, Pengguang Chen et al.

ECCV 2024 • arXiv:2310.17316
#7911

RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation

Luis Li, Hubert P. H. Shum, Toby P Breckon

ECCV 2024 • arXiv:2407.10159
#7912

3D Single-object Tracking in Point Clouds with High Temporal Variation

Qiao Wu, Kun Sun, Pei An et al.

ECCV 2024 • arXiv:2408.02049
#7913

Self-supervised Shape Completion via Involution and Implicit Correspondences

Mengya Liu, Ajad Chhatkuli, Janis Postels et al.

ECCV 2024 • arXiv:2409.15939
#7914

LoA-Trans: Enhancing Visual Grounding by Location-Aware Transformers

Ziling Huang, Shin’ichi Satoh

ECCV 2024
#7915

Energy-induced Explicit quantification for Multi-modality MRI fusion

Xiaoming Qi, Yuan Zhang, Tong Wang et al.

ECCV 2024
#7916

GTPT: Group-based Token Pruning Transformer for Efficient Human Pose Estimation

Haonan Wang, Jie Liu, Jie Tang et al.

ECCV 2024 • arXiv:2407.10756
#7917

Salience-Based Adaptive Masking: Revisiting Token Dynamics for Enhanced Pre-training

Hyesong Choi, Hyejin Park, Kwang Moo Yi et al.

ECCV 2024 • arXiv:2404.08327
#7918

ScanReason: Empowering 3D Visual Grounding with Reasoning Capabilities

Chenming Zhu, Tai Wang, Wenwei Zhang et al.

ECCV 2024 • arXiv:2407.01525
#7919

See and Think: Embodied Agent in Virtual Environment

Zhonghan Zhao, Xuan Wang, Wenhao Chai et al.

ECCV 2024 • arXiv:2311.15209
#7920

Scalar Function Topology Divergence: Comparing Topology of 3D Objects

Ilya Trofimov, Daria Voronkova, Eduard Tulchinskii et al.

ECCV 2024 • arXiv:2407.08364
#7921

Omni-Recon: Harnessing Image-based Rendering for General-Purpose Neural Radiance Fields

Yonggan Fu, Huaizhi Qu, Zhifan Ye et al.

ECCV 2024 • arXiv:2403.11131
#7922

ReLoo: Reconstructing Humans Dressed in Loose Garments from Monocular Video in the Wild

Chen Guo, Tianjian Jiang, Manuel Kaufmann et al.

ECCV 2024 • arXiv:2409.15269
#7923

Convex Relaxations for Manifold-Valued Markov Random Fields with Approximation Guarantees

Robin Kenis, Emanuel Laude, Panagiotis Patrinos

ECCV 2024
#7924

SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding

Baoxiong Jia, Yixin Chen, Huangyue Yu et al.

ECCV 2024 • arXiv:2401.09340
#7925

SENC: Handling Self-collision in Neural Cloth Simulation

Zhouyingcheng Liao, Sinan Wang, Taku Komura

ECCV 2024 • arXiv:2407.12479
#7926

m&m’s: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks

Zixian Ma, Weikai Huang, Jieyu Zhang et al.

ECCV 2024 • arXiv:2403.11085
#7927

Plain-Det: A Plain Multi-Dataset Object Detector

Cheng Shi, Yuchen Zhu, Sibei Yang

ECCV 2024 • arXiv:2407.10083
#7928

Integrating Markov Blanket Discovery into Causal Representation Learning for Domain Generalization

Naiyu Yin, Hanjing Wang, Yue Yu et al.

ECCV 2024
#7929

Local All-Pair Correspondence for Point Tracking

Seokju Cho, Jiahui Huang, Jisu Nam et al.

ECCV 2024 • arXiv:2407.15420
#7930

DCDM: Diffusion-Conditioned-Diffusion Model for Scene Text Image Super-Resolution

Shrey Singh, Prateek Keserwani, Masakazu Iwamura et al.

ECCV 2024
#7931

Pathology-knowledge Enhanced Multi-instance Prompt Learning for Few-shot Whole Slide Image Classification

Linhao Qu, Dingkang Yang, Dan Huang et al.

ECCV 2024 • arXiv:2407.10814
#7932

AutoEval-Video: An Automatic Benchmark for Assessing Large Vision Language Models in Open-Ended Video Question Answering

Xiuyuan Chen, Yuan Lin, Yuchen Zhang et al.

ECCV 2024 • arXiv:2311.14906
#7933

Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation

Seung Hyun Lee, Yinxiao Li, Junjie Ke et al.

ECCV 2024 • arXiv:2401.05675
#7934

TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks

Jinjie Mai, Wenxuan Zhu, Sara Rojas Martinez et al.

ECCV 2024 • arXiv:2408.10739
#7935

MyVLM: Personalizing VLMs for User-Specific Queries

Yuval Alaluf, Elad Richardson, Sergey Tulyakov et al.

ECCV 2024 • arXiv:2403.14599
#7936

Diffusion-Generated Pseudo-Observations for High-Quality Sparse-View Reconstruction

Xinhang Liu, Jiaben Chen, Shiu-Hong Kao et al.

ECCV 2024 • arXiv:2305.15171
#7937

Collaborative Control for Geometry-Conditioned PBR Image Generation

Shimon Vainer, Mark Boss, Mathias Parger et al.

ECCV 2024 • arXiv:2402.05919
#7938

Look Around and Learn: Self-Training Object Detection by Exploration

Gianluca Scarpellini, Stefano Rosa, Pietro Morerio et al.

ECCV 2024 • arXiv:2302.03566
#7939

Co-synthesis of Histopathology Nuclei Image-Label Pairs using a Context-Conditioned Joint Diffusion Model

Seonghui Min, Hyun-Jic Oh, Won-Ki Jeong

ECCV 2024 • arXiv:2407.14434
#7940

SpaceJAM: a Lightweight and Regularization-free Method for Fast Joint Alignment of Images

Nir Barel, Ron Aharon Shapira Weber, Nir Mualem et al.

ECCV 2024 • arXiv:2407.11850
#7941

DreamDrone: Text-to-Image Diffusion Models are Zero-shot Perpetual View Generators

Hanyang Kong, Dongze Lian, Michael Bi Mi et al.

ECCV 2024 • arXiv:2312.08746
#7942

WaSt-3D: Wasserstein-2 Distance for Scene-to-Scene Stylization on 3D Gaussians

Dmytro Kotovenko, Olga Grebenkova, Nikolaos Sarafianos et al.

ECCV 2024 • arXiv:2409.17917
#7943

Label-anticipated Event Disentanglement for Audio-Visual Video Parsing

Jinxing Zhou, Dan Guo, Yuxin Mao et al.

ECCV 2024 • arXiv:2407.08126
#7944

Think before Placement: Common Sense Enhanced Transformer for Object Placement

Yaxuan Qin, Jiayu Xu, Ruiping Wang et al.

ECCV 2024
#7945

GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing

Jing Wu, Jiawang Bian, Xinghui Li et al.

ECCV 2024 • arXiv:2403.08733
#7946

Camera Height Doesn't Change: Unsupervised Training for Metric Monocular Road-Scene Depth Estimation

Genki Kinoshita, Ko Nishino

ECCV 2024 • arXiv:2312.04530
#7947

AEDNet: Adaptive Embedding and Multiview-Aware Disentanglement for Point Cloud Completion

Zhiheng Fu, Longguang Wang, Lian Xu et al.

ECCV 2024
#7948

GAURA: Generalizable Approach for Unified Restoration and Rendering of Arbitrary Views

Vinayak Gupta, Rongali Simhachala Venkata Girish, Mukund Varma T et al.

ECCV 2024 • arXiv:2407.08221
#7949

Efficient Bias Mitigation Without Privileged Information

Mateo Espinosa Zarlenga, Swami Sankaranarayanan, Jerone Andrews et al.

ECCV 2024 • arXiv:2409.17691
#7950

Towards Open-Ended Visual Recognition with Large Language Models

Qihang Yu, Xiaohui Shen, Liang-Chieh Chen

ECCV 2024 • arXiv:2311.08400
#7951

Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation

Omer Dahary, Or Patashnik, Kfir Aberman et al.

ECCV 2024 • arXiv:2403.16990
#7952

RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models

Bowen Zhang, Yiji Cheng, Chunyu Wang et al.

ECCV 2024 • arXiv:2407.06938
#7953

IRGen: Generative Modeling for Image Retrieval

Yidan Zhang, Ting Zhang, Dong Chen et al.

ECCV 2024 • arXiv:2303.10126
#7954

LayeredFlow: A Real-World Benchmark for Non-Lambertian Multi-Layer Optical Flow

Hongyu Wen, Erich Liang, Jia Deng

ECCV 2024 • arXiv:2409.05688
#7955

Adaptive Parametric Activation

Konstantinos P Alexandridis, Jiankang Deng, Anh Nguyen et al.

ECCV 2024
#7956

Scaling Backwards: Minimal Synthetic Pre-training?

Ryo Nakamura, Ryu Tadokoro, Ryosuke Yamada et al.

ECCV 2024 • arXiv:2408.00677
#7957

Towards Multi-modal Transformers in Federated Learning

Guangyu Sun, Matias Mendieta, Aritra Dutta et al.

ECCV 2024 • arXiv:2404.12467
#7958

FisherRF: Active View Selection and Mapping with Radiance Fields using Fisher Information

Wen Jiang, Boshu Lei, Kostas Daniilidis

ECCV 2024
#7959

General and Task-Oriented Video Segmentation

Mu Chen, Liulei Li, Wenguan Wang et al.

ECCV 2024 • arXiv:2407.06540
#7960

Soft Shadow Diffusion (SSD): Physics-inspired Learning for 3D Computational Periscopy

Fadlullah Raji, John Murray-Bruce

ECCV 2024 • arXiv:2601.12257
#7961

Learning 3D-aware GANs from Unposed Images with Template Feature Field

Xinya Chen, Hanlei Guo, Yanrui Bin et al.

ECCV 2024 • arXiv:2404.05705
#7962

Human Hair Reconstruction with Strand-Aligned 3D Gaussians

Egor Zakharov, Vanessa Sklyarova, Michael J. Black et al.

ECCV 2024 • arXiv:2409.14778
#7963

SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoders

Sheng-Wei Li, Zi-Xiang Wei, Wei-Jie Jack Chen et al.

ECCV 2024 • arXiv:2407.13460
#7964

CIC-BART-SSA: Controllable Image Captioning with Structured Semantic Augmentation

Kalliopi Basioti, Mohamed A Abdelsalam, Federico Fancellu et al.

ECCV 2024 • arXiv:2407.11393
#7965

Rethinking Image Super Resolution from Training Data Perspectives

Go Ohtani, Ryu Tadokoro, Ryosuke Yamada et al.

ECCV 2024
#7966

MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection

Kuo Wang, Lechao Cheng, Weikai Chen et al.

ECCV 2024 • arXiv:2407.21465
#7967

Learning to Robustly Reconstruct Dynamic Scenes from Low-light Spike Streams

Liwen Hu, Gang Ding, Mianzhi Liu et al.

ECCV 2024
#7968

COIN: Control-Inpainting Diffusion Prior for Human and Camera Motion Estimation

Jiefeng Li, Ye Yuan, Davis Rempe et al.

ECCV 2024 • arXiv:2408.16426
#7969

Analytic-Splatting: Anti-Aliased 3D Gaussian Splatting via Analytic Integration

Zhihao Liang, Qi Zhang, Wenbo Hu et al.

ECCV 2024 • arXiv:2403.11056
#7970

Uni3DL: A Unified Model for 3D Vision-Language Understanding

Xiang Li, Jian Ding, Zhaoyang Chen et al.

ECCV 2024
#7971

G3R: Gradient Guided Generalizable Reconstruction

Yun Chen, Jingkang Wang, Ze Yang et al.

ECCV 2024 • arXiv:2409.19405
#7972

T-MAE: Temporal Masked Autoencoders for Point Cloud Representation Learning

Weijie Wei, Fatemeh Karimi Nejadasl, Theo Gevers et al.

ECCV 2024 • arXiv:2312.10217
#7973

Invertible Neural Warp for NeRF

Shin-Fang Chng, Ravi Garg, Hemanth Saratchandran et al.

ECCV 2024 • arXiv:2407.12354
#7974

Efficient and Versatile Robust Fine-Tuning of Zero-shot Models

Sungyeon Kim, Boseung Jeong, Donghyun Kim et al.

ECCV 2024 • arXiv:2408.05749
#7975

MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo

Tianqi Liu, Guangcong Wang, Shoukang Hu et al.

ECCV 2024 • arXiv:2405.12218
#7976

MambaIR: A Simple Baseline for Image Restoration with State-Space Model

Hang Guo, Jinmin Li, Tao Dai et al.

ECCV 2024 • arXiv:2402.15648
#7977

I Can't Believe It's Not Scene Flow!

Ishan Khatri, Kyle Vedder, Neehar Peri et al.

ECCV 2024 • arXiv:2403.04739
#7978

Bi-directional Contextual Attention for 3D Dense Captioning

Minjung Kim, Hyung Suk Lim, Soonyoung Lee et al.

ECCV 2024 • arXiv:2408.06662
#7979

Scalable Group Choreography via Variational Phase Manifold Learning

Nhat Le, Khoa Do, Xuan Bui et al.

ECCV 2024 • arXiv:2407.18839
#7980

TPA3D: Triplane Attention for Fast Text-to-3D Generation

Bin-Shih Wu, Hong-En Chen, Sheng-Yu Huang et al.

ECCV 2024 • arXiv:2312.02647
#7981

Augmented Neural Fine-tuning for Efficient Backdoor Purification

Md Nazmul Karim, Abdullah Al Arafat, Umar Khalid et al.

ECCV 2024 • arXiv:2407.10052
#7982

Retrieval Robust to Object Motion Blur

Rong Zou, Marc Pollefeys, Denys Rozumnyi

ECCV 2024 • arXiv:2404.18025
#7983

Rethinking Deep Unrolled Model for Accelerated MRI Reconstruction

Bingyu Xin, Meng Ye, Leon Axel et al.

ECCV 2024
#7984

Learn to Preserve and Diversify: Parameter-Efficient Group with Orthogonal Regularization for Domain Generalization

Jiajun Hu, Jian Zhang, Lei Qi et al.

ECCV 2024 • arXiv:2407.15085
#7985

SkyMask: Attack-agnostic Robust Federated Learning with Fine-grained Learnable Masks

Peishen Yan, Hao Wang, Tao Song et al.

ECCV 2024 • arXiv:2312.12484
#7986

SHINE: Saliency-aware HIerarchical NEgative Ranking for Compositional Temporal Grounding

Zixu Cheng, Yujiang Pu, Shaogang Gong et al.

ECCV 2024 • arXiv:2407.05118
#7987

How Video Meetings Change Your Expression

Sumit Sarin, Utkarsh Mall, Purva Tendulkar et al.

ECCV 2024 • arXiv:2406.00955
#7988

Audio-driven Talking Face Generation with Stabilized Synchronization Loss

Dogucan Yaman, Fevziye Irem Eyiokur Yaman, Leonard Bärmann et al.

ECCV 2024 • arXiv:2307.09368
#7989

Train Till You Drop: Towards Stable and Robust Source-free Unsupervised 3D Domain Adaptation

Bjoern Michele, Alexandre Boulch, Tuan Hung Vu et al.

ECCV 2024 • arXiv:2409.04409
#7990

L-DiffER: Single Image Reflection Removal with Language-based Diffusion Model

Yuchen Hong, Haofeng Zhong, Shuchen Weng et al.

ECCV 2024
#7991

AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shield Prompting

Yu Wang, Xiaogeng Liu, Yu Li et al.

ECCV 2024 • arXiv:2403.09513
#7992

LetsMap: Unsupervised Representation Learning for Label-Efficient Semantic BEV Mapping

Nikhil Gosala, Kürsat Petek, B Ravi Kiran et al.

ECCV 2024
#7993

Blind image deblurring with noise-robust kernel estimation

Chanseok Lee, Jeongsol Kim, Seungmin Lee et al.

ECCV 2024
#7994

Free-Viewpoint Video of Outdoor Sports Using a Drone

Zhengdong Hong

ECCV 2024
#7995

Binomial Self-compensation for Motion Error in Dynamic 3D Scanning

Geyou Zhang, Ce Zhu, Kai Liu

ECCV 2024 • arXiv:2404.06693
#7996

Momentum Auxiliary Network for Supervised Local Learning

Junhao Su, Changpeng Cai, Feiyu Zhu et al.

ECCV 2024 • arXiv:2407.05623
#7997

Cocktail Universal Adversarial Attack on Deep Neural Networks

Shaoxin Li, Xiaofeng Liao, Xin Che et al.

ECCV 2024
#7998

ColorMAE: Exploring data-independent masking strategies in Masked AutoEncoders

Carlos Hinojosa, Shuming Liu, Bernard Ghanem

ECCV 2024 • arXiv:2407.13036
#7999

Resilience of Entropy Model in Distributed Neural Networks

Milin Zhang, Mohammad Abdi, Shahriar Rifat et al.

ECCV 2024 • arXiv:2403.00942
#8000

VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding

Yue Fan, Xiaojian Ma, Rujie Wu et al.

ECCV 2024 • arXiv:2403.11481