Most Cited 2024 "hdr imaging" Papers

12,324 papers found • Page 15 of 62

#2801

Towards Variable and Coordinated Holistic Co-Speech Motion Generation

Yifei Liu, Qiong Cao, Yandong Wen et al.

CVPR 2024arXiv:2404.00368
30
citations
#2802

Hunting Attributes: Context Prototype-Aware Learning for Weakly Supervised Semantic Segmentation

feilong tang, Zhongxing Xu, Zhaojun QU et al.

CVPR 2024arXiv:2403.07630
30
citations
#2803

Language Models as Semantic Indexers

Bowen Jin, Hansi Zeng, Guoyin Wang et al.

ICML 2024arXiv:2310.07815
30
citations
#2804

Multi-Space Alignments Towards Universal LiDAR Segmentation

Youquan Liu, Lingdong Kong, Xiaoyang Wu et al.

CVPR 2024arXiv:2405.01538
30
citations
#2805

Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation

Bingfeng Zhang, Siyue Yu, Yunchao Wei et al.

CVPR 2024highlightarXiv:2406.11189
30
citations
#2806

Beyond Text: Frozen Large Language Models in Visual Signal Comprehension

Lei Zhu, Fangyun Wei, Yanye Lu

CVPR 2024arXiv:2403.07874
30
citations
#2807

Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework

Ziyao Huang, Fan Tang, Yong Zhang et al.

CVPR 2024arXiv:2403.16510
30
citations
#2808

Early Neuron Alignment in Two-layer ReLU Networks with Small Initialization

Hancheng Min, Enrique Mallada, Rene Vidal

ICLR 2024arXiv:2307.12851
30
citations
#2809

Turning Dust into Gold: Distilling Complex Reasoning Capabilities from LLMs by Leveraging Negative Data

Yiwei Li, Peiwen Yuan, Shaoxiong Feng et al.

AAAI 2024paperarXiv:2312.12832
30
citations
#2810

Revisiting the Power of Prompt for Visual Tuning

Yuzhu Wang, Lechao Cheng, Chaowei Fang et al.

ICML 2024spotlightarXiv:2402.02382
30
citations
#2811

CoDi: Conditional Diffusion Distillation for Higher-Fidelity and Faster Image Generation

Kangfu Mei, Mauricio Delbracio, Hossein Talebi et al.

CVPR 2024arXiv:2310.01407
30
citations
#2812

VideoCon: Robust Video-Language Alignment via Contrast Captions

Hritik Bansal, Yonatan Bitton, Idan Szpektor et al.

CVPR 2024arXiv:2311.10111
30
citations
#2813

Learning to Reject with a Fixed Predictor: Application to Decontextualization

Christopher Mohri, Daniel Andor, Eunsol Choi et al.

ICLR 2024arXiv:2301.09044
30
citations
#2814

Diff-BGM: A Diffusion Model for Video Background Music Generation

Sizhe Li, Yiming Qin, Minghang Zheng et al.

CVPR 2024arXiv:2405.11913
30
citations
#2815

EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation

Chanyoung Kim, Woojung Han, Dayun Ju et al.

CVPR 2024highlightarXiv:2403.01482
30
citations
#2816

Understanding In-Context Learning from Repetitions

Jianhao (Elliott) Yan, Jin Xu, Chiyu Song et al.

ICLR 2024arXiv:2310.00297
30
citations
#2817

Feature emergence via margin maximization: case studies in algebraic tasks

Depen Morwani, Benjamin Edelman, Costin-Andrei Oncescu et al.

ICLR 2024spotlightarXiv:2311.07568
30
citations
#2818

Attention-Driven Training-Free Efficiency Enhancement of Diffusion Models

Hongjie Wang, Difan Liu, Yan Kang et al.

CVPR 2024arXiv:2405.05252
30
citations
#2819

CosPGD: an efficient white-box adversarial attack for pixel-wise prediction tasks

Shashank Agnihotri, Steffen Jung, Margret Keuper

ICML 2024arXiv:2302.02213
30
citations
#2820

MapDistill: Boosting Efficient Camera-based HD Map Construction via Camera-LiDAR Fusion Model Distillation

Xiaoshuai Hao, Ruikai Li, Hui Zhang et al.

ECCV 2024arXiv:2407.11682
30
citations
#2821

PosterLlama: Bridging Design Ability of Langauge Model to Content-Aware Layout Generation

Jaejung Seol, Seojun Kim, Jaejun Yoo

ECCV 2024arXiv:2404.00995
30
citations
#2822

Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making

Jeonghye Kim, Su Young Lee, Woojun Kim et al.

ICLR 2024spotlightarXiv:2310.03022
30
citations
#2823

Diffusion-Generated Pseudo-Observations for High-Quality Sparse-View Reconstruction

Xinhang Liu, Jiaben Chen, Shiu-Hong Kao et al.

ECCV 2024arXiv:2305.15171
30
citations
#2824

Style Blind Domain Generalized Semantic Segmentation via Covariance Alignment and Semantic Consistence Contrastive Learning

Woo-Jin Ahn, Geun-Yeong Yang, Hyunduck Choi et al.

CVPR 2024arXiv:2403.06122
30
citations
#2825

TimeX++: Learning Time-Series Explanations with Information Bottleneck

Zichuan Liu, Tianchun Wang, Jimeng Shi et al.

ICML 2024arXiv:2405.09308
30
citations
#2826

iKUN: Speak to Trackers without Retraining

Yunhao Du, Cheng Lei, Zhicheng Zhao et al.

CVPR 2024arXiv:2312.16245
30
citations
#2827

CRA-PCN: Point Cloud Completion with Intra- and Inter-level Cross-Resolution Transformers

Yi Rong, Haoran Zhou, Lixin Yuan et al.

AAAI 2024paperarXiv:2401.01552
30
citations
#2828

Point2CAD: Reverse Engineering CAD Models from 3D Point Clouds

Yujia Liu, Anton Obukhov, Jan D. Wegner et al.

CVPR 2024highlightarXiv:2312.04962
30
citations
#2829

Understanding Addition in Transformers

Philip Quirke, Fazl Barez

ICLR 2024arXiv:2310.13121
30
citations
#2830

Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation

Fu-Yun Wang, Xiaoshi Wu, Zhaoyang Huang et al.

ECCV 2024arXiv:2403.13745
30
citations
#2831

Data Poisoning based Backdoor Attacks to Contrastive Learning

Jinghuai Zhang, Hongbin Liu, Jinyuan Jia et al.

CVPR 2024arXiv:2211.08229
30
citations
#2832

Multi-Scale Representations by Varying Window Attention for Semantic Segmentation

Haotian Yan, Ming Wu, Chuang Zhang

ICLR 2024arXiv:2404.16573
30
citations
#2833

Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs

Muhammad Jehanzeb Mirza, Leonid Karlinsky, Wei Lin et al.

ECCV 2024arXiv:2403.11755
30
citations
#2834

Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF

Han Shen, Zhuoran Yang, Tianyi Chen

ICML 2024arXiv:2402.06886
30
citations
#2835

WiMANS: A Benchmark Dataset for WiFi-based Multi-user Activity Sensing

Shuokang Huang, Kaihan Li, Di You et al.

ECCV 2024arXiv:2402.09430
30
citations
#2836

Achieving Human Parity in Content-Grounded Datasets Generation

Asaf Yehudai, Boaz Carmeli, Yosi Mass et al.

ICLR 2024arXiv:2401.14367
30
citations
#2837

EventBind: Learning a Unified Representation to Bind Them All for Event-based Open-world Understanding

jiazhou zhou, Xu Zheng, Yuanhuiyi Lyu et al.

ECCV 2024arXiv:2308.03135
30
citations
#2838

EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models

Jingyuan Yang, Jiawei Feng, Hui Huang

CVPR 2024arXiv:2401.04608
30
citations
#2839

Long-Short-Range Message-Passing: A Physics-Informed Framework to Capture Non-Local Interaction for Scalable Molecular Dynamics Simulation

Yunyang Li, Yusong Wang, Lin Huang et al.

ICLR 2024arXiv:2304.13542
30
citations
#2840

Holo-Relighting: Controllable Volumetric Portrait Relighting from a Single Image

Yiqun Mei, Yu Zeng, He Zhang et al.

CVPR 2024arXiv:2403.09632
30
citations
#2841

LLark: A Multimodal Instruction-Following Language Model for Music

Josh Gardner, Simon Durand, Daniel Stoller et al.

ICML 2024arXiv:2310.07160
30
citations
#2842

Privacy Backdoors: Stealing Data with Corrupted Pretrained Models

Shanglun Feng, Florian Tramer

ICML 2024arXiv:2404.00473
30
citations
#2843

DP-AdamBC: Your DP-Adam Is Actually DP-SGD (Unless You Apply Bias Correction)

Qiaoyue Tang, Frederick Shpilevskiy, Mathias Lécuyer

AAAI 2024paperarXiv:2312.14334
30
citations
#2844

Mocap Everyone Everywhere: Lightweight Motion Capture With Smartwatches and a Head-Mounted Camera

Jiye Lee, Hanbyul Joo

CVPR 2024arXiv:2401.00847
30
citations
#2845

CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model

Aoran Xiao, Weihao Xuan, Heli Qi et al.

ECCV 2024arXiv:2402.03631
30
citations
#2846

Equivariant Deep Weight Space Alignment

Aviv Navon, Aviv Shamsian, Ethan Fetaya et al.

ICML 2024arXiv:2310.13397
30
citations
#2847

Seeing Motion at Nighttime with an Event Camera

Haoyue Liu, Shihan Peng, Lin Zhu et al.

CVPR 2024arXiv:2404.11884
30
citations
#2848

THRONE: An Object-based Hallucination Benchmark for the Free-form Generations of Large Vision-Language Models

Prannay Kaul, Zhizhong Li, Hao Yang et al.

CVPR 2024arXiv:2405.05256
30
citations
#2849

DIFFTACTILE: A Physics-based Differentiable Tactile Simulator for Contact-rich Robotic Manipulation

Zilin Si, Gu Zhang, Qingwei Ben et al.

ICLR 2024arXiv:2403.08716
30
citations
#2850

FreeDrag: Feature Dragging for Reliable Point-based Image Editing

Pengyang Ling, Lin Chen, Pan Zhang et al.

CVPR 2024arXiv:2307.04684
30
citations
#2851

Rethinking Multi-domain Generalization with A General Learning Objective

Zhaorui Tan, Xi Yang, Kaizhu Huang

CVPR 2024arXiv:2402.18853
30
citations
#2852

UMBRAE: Unified Multimodal Brain Decoding

Weihao Xia, Raoul de Charette, Cengiz Oztireli et al.

ECCV 2024arXiv:2404.07202
30
citations
#2853

Boosting Adversarial Transferability across Model Genus by Deformation-Constrained Warping

Qinliang Lin, Cheng Luo, Zenghao Niu et al.

AAAI 2024paperarXiv:2402.03951
30
citations
#2854

SHAP-EDITOR: Instruction-Guided Latent 3D Editing in Seconds

Minghao Chen, Junyu Xie, Iro Laina et al.

CVPR 2024arXiv:2312.09246
30
citations
#2855

VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks

Xiangxiang Chu, Jianlin Su, Bo Zhang et al.

ECCV 2024arXiv:2403.00522
30
citations
#2856

Automated Statistical Model Discovery with Language Models

Michael Li, Emily Fox, Noah Goodman

ICML 2024arXiv:2402.17879
30
citations
#2857

Token-Specific Watermarking with Enhanced Detectability and Semantic Coherence for Large Language Models

Mingjia Huo, Sai Ashish Somayajula, Youwei Liang et al.

ICML 2024arXiv:2402.18059
30
citations
#2858

Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling

Rui Liu, Yifan Hu, Yi Ren et al.

AAAI 2024paperarXiv:2312.11947
30
citations
#2859

Backdoor Federated Learning by Poisoning Backdoor-Critical Layers

Haomin Zhuang, Mingxian Yu, Hao Wang et al.

ICLR 2024arXiv:2308.04466
30
citations
#2860

A Conditional Denoising Diffusion Probabilistic Model for Point Cloud Upsampling

Wentao Qu, Yuantian Shao, Lingwu Meng et al.

CVPR 2024arXiv:2312.02719
30
citations
#2861

Disentangled Diffusion-Based 3D Human Pose Estimation with Hierarchical Spatial and Temporal Denoiser

Qingyuan Cai, Xuecai Hu, Saihui Hou et al.

AAAI 2024paperarXiv:2403.04444
30
citations
#2862

VP3D: Unleashing 2D Visual Prompt for Text-to-3D Generation

Yang Chen, Yingwei Pan, haibo yang et al.

CVPR 2024arXiv:2403.17001
30
citations
#2863

Universal Segmentation at Arbitrary Granularity with Language Instruction

Yong Liu, Cairong Zhang, Yitong Wang et al.

CVPR 2024arXiv:2312.01623
30
citations
#2864

LEOD: Label-Efficient Object Detection for Event Cameras

Ziyi Wu, Mathias Gehrig, Qing Lyu et al.

CVPR 2024arXiv:2311.17286
30
citations
#2865

Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation

Wenhao Li, Mengyuan Liu, Hong Liu et al.

CVPR 2024highlightarXiv:2311.12028
30
citations
#2866

N2F2: Hierarchical Scene Understanding with Nested Neural Feature Fields

Yash Bhalgat, Iro Laina, Joao F Henriques et al.

ECCV 2024arXiv:2403.10997
30
citations
#2867

TopoGCL: Topological Graph Contrastive Learning

Yuzhou Chen, Jose Frias, Yulia Gel

AAAI 2024paperarXiv:2406.17251
30
citations
#2868

Graph Positional and Structural Encoder

Semih Cantürk, Renming Liu, Olivier Lapointe-Gagné et al.

ICML 2024arXiv:2307.07107
30
citations
#2869

Unifying Multi-Modal Uncertainty Modeling and Semantic Alignment for Text-to-Image Person Re-identification

Zhiwei Zhao, Bin Liu, Yan Lu et al.

AAAI 2024paper
30
citations
#2870

CapHuman: Capture Your Moments in Parallel Universes

Chao Liang, Fan Ma, Linchao Zhu et al.

CVPR 2024arXiv:2402.00627
30
citations
#2871

LAMM: Label Alignment for Multi-Modal Prompt Learning

Jingsheng Gao, Jiacheng Ruan, Suncheng Xiang et al.

AAAI 2024paperarXiv:2312.08212
30
citations
#2872

See More Details: Efficient Image Super-Resolution by Experts Mining

Eduard Zamfir, Zongwei Wu, Nancy Mehta et al.

ICML 2024arXiv:2402.03412
30
citations
#2873

Transformer-VQ: Linear-Time Transformers via Vector Quantization

Lucas D. Lingle

ICLR 2024arXiv:2309.16354
30
citations
#2874

RetroBridge: Modeling Retrosynthesis with Markov Bridges

Ilia Igashov, Arne Schneuing, Marwin Segler et al.

ICLR 2024spotlightarXiv:2308.16212
30
citations
#2875

GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Model

Ling Li, Yu Ye, Bingchuan Jiang et al.

ICML 2024arXiv:2406.18572
30
citations
#2876

BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos

Pilhyeon Lee, Hyeran Byun

ECCV 2024arXiv:2312.00083
30
citations
#2877

Closing the Curious Case of Neural Text Degeneration

Matthew Finlayson, John Hewitt, Alexander Koller et al.

ICLR 2024arXiv:2310.01693
30
citations
#2878

EventDance: Unsupervised Source-free Cross-modal Adaptation for Event-based Object Recognition

Xu Zheng, Addison, Lin Wang

CVPR 2024arXiv:2403.14082
30
citations
#2879

Autoregressive Omni-Aware Outpainting for Open-Vocabulary 360-Degree Image Generation

Zhuqiang Lu, Kun Hu, Chaoyue Wang et al.

AAAI 2024paperarXiv:2309.03467
30
citations
#2880

Soft Prompt Generation for Domain Generalization

Shuanghao Bai, Yuedi Zhang, Wanqi Zhou et al.

ECCV 2024arXiv:2404.19286
30
citations
#2881

Motion-adaptive Separable Collaborative Filters for Blind Motion Deblurring

Chengxu Liu, Xuan Wang, Xiangyu Xu et al.

CVPR 2024arXiv:2404.13153
30
citations
#2882

DragAPart: Learning a Part-Level Motion Prior for Articulated Objects

Ruining Li, Chuanxia Zheng, Christian Rupprecht et al.

ECCV 2024arXiv:2403.15382
30
citations
#2883

Prompt-Enhanced Multiple Instance Learning for Weakly Supervised Video Anomaly Detection

Junxi Chen, Liang Li, Li Su et al.

CVPR 2024
29
citations
#2884

Learning Energy Decompositions for Partial Inference in GFlowNets

Hyosoon Jang, Minsu Kim, Sungsoo Ahn

ICLR 2024arXiv:2310.03301
29
citations
#2885

ContraNovo: A Contrastive Learning Approach to Enhance De Novo Peptide Sequencing

Zhi Jin, Sheng Xu, Xiang Zhang et al.

AAAI 2024paperarXiv:2312.11584
29
citations
#2886

DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks

Jiaxin Zhang, Dezhi Peng, Chongyu Liu et al.

CVPR 2024arXiv:2405.04408
29
citations
#2887

ASID: Active Exploration for System Identification in Robotic Manipulation

Marius Memmel, Andrew Wagenmaker, Chuning Zhu et al.

ICLR 2024arXiv:2404.12308
29
citations
#2888

I-MedSAM: Implicit Medical Image Segmentation with Segment Anything

Xiaobao Wei, Jiajun Cao, Yizhu Jin et al.

ECCV 2024arXiv:2311.17081
29
citations
#2889

Task-Driven Exploration: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection

Jin Yang, Ping Wei, Huan Li et al.

CVPR 2024arXiv:2404.09263
29
citations
#2890

Learning to Scale Logits for Temperature-Conditional GFlowNets

Minsu Kim, Joohwan Ko, Taeyoung Yun et al.

ICML 2024arXiv:2310.02823
29
citations
#2891

Learning Natural Consistency Representation for Face Forgery Video Detection

Daichi Zhang, Zihao Xiao, Shikun Li et al.

ECCV 2024arXiv:2407.10550
29
citations
#2892

Conditional Variational Autoencoder for Sign Language Translation with Cross-Modal Alignment

Rui Zhao, Liang Zhang, Biao Fu et al.

AAAI 2024paperarXiv:2312.15645
29
citations
#2893

Domain-Rectifying Adapter for Cross-Domain Few-Shot Segmentation

Jiapeng Su, Qi Fan, Wenjie Pei et al.

CVPR 2024arXiv:2404.10322
29
citations
#2894

Slow and Steady Wins the Race: Maintaining Plasticity with Hare and Tortoise Networks

Hojoon Lee, Hyeonseo Cho, Hyunseung Kim et al.

ICML 2024arXiv:2406.02596
29
citations
#2895

DiffClass: Diffusion-Based Class Incremental Learning

Zichong Meng, Jie Zhang, Changdi Yang et al.

ECCV 2024arXiv:2403.05016
29
citations
#2896

NARUTO: Neural Active Reconstruction from Uncertain Target Observations

Ziyue Feng, Huangying Zhan, Zheng Chen et al.

CVPR 2024arXiv:2402.18771
29
citations
#2897

Vision Transformer Off-the-Shelf: A Surprising Baseline for Few-Shot Class-Agnostic Counting

Zhicheng Wang, Liwen Xiao, Zhiguo Cao et al.

AAAI 2024paperarXiv:2305.04440
29
citations
#2898

Boosting Neural Representations for Videos with a Conditional Decoder

XINJIE ZHANG, Ren Yang, Dailan He et al.

CVPR 2024highlightarXiv:2402.18152
29
citations
#2899

Boosting the Power of Small Multimodal Reasoning Models to Match Larger Models with Self-Consistency Training

Cheng Tan, Jingxuan Wei, Zhangyang Gao et al.

ECCV 2024arXiv:2311.14109
29
citations
#2900

Batched Low-Rank Adaptation of Foundation Models

Yeming Wen, Swarat Chaudhuri

ICLR 2024arXiv:2312.05677
29
citations
#2901

Higher-Order Graph Convolutional Network with Flower-Petals Laplacians on Simplicial Complexes

Yiming Huang, Yujie Zeng, Qiang Wu et al.

AAAI 2024paperarXiv:2309.12971
29
citations
#2902

Decomposing Disease Descriptions for Enhanced Pathology Detection: A Multi-Aspect Vision-Language Pre-training Framework

Vu Minh Hieu Phan, Yutong Xie, Yuankai Qi et al.

CVPR 2024arXiv:2403.07636
29
citations
#2903

Multimodal Patient Representation Learning with Missing Modalities and Labels

Zhenbang Wu, Anant Dadu, Nicholas Tustison et al.

ICLR 2024
29
citations
#2904

Posterior Sampling Based on Gradient Flows of the MMD with Negative Distance Kernel

Paul Hagemann, Johannes Hertrich, Fabian Altekrüger et al.

ICLR 2024arXiv:2310.03054
29
citations
#2905

Neural Refinement for Absolute Pose Regression with Feature Synthesis

Shuai Chen, Yash Bhalgat, Xinghui Li et al.

CVPR 2024arXiv:2303.10087
29
citations
#2906

Embracing Events and Frames with Hierarchical Feature Refinement Network for Object Detection

Hu Cao, Zehua Zhang, Yan Xia et al.

ECCV 2024arXiv:2407.12582
29
citations
#2907

Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Compositional Understanding

Le Zhang, Rabiul Awal, Aishwarya Agrawal

CVPR 2024arXiv:2306.08832
29
citations
#2908

Scaling physics-informed hard constraints with mixture-of-experts

Nithin Chalapathi, Yiheng Du, Aditi Krishnapriyan

ICLR 2024oralarXiv:2402.13412
29
citations
#2909

Personalized Federated Domain-Incremental Learning based on Adaptive Knowledge Matching

Yichen Li, Wenchao Xu, Haozhao Wang et al.

ECCV 2024arXiv:2407.05005
29
citations
#2910

Insect-Foundation: A Foundation Model and Large-scale 1M Dataset for Visual Insect Understanding

Hoang-Quan Nguyen, Thanh-Dat Truong, Xuan-Bac Nguyen et al.

CVPR 2024highlightarXiv:2311.15206
29
citations
#2911

Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions

Fabio Tosi, Pierluigi Zama Ramirez, Matteo Poggi

ECCV 2024arXiv:2407.16698
29
citations
#2912

A Simple Baseline for Efficient Hand Mesh Reconstruction

zhishan zhou, shihao zhou, Zhi Lv et al.

CVPR 2024arXiv:2403.01813
29
citations
#2913

HaloQuest: A Visual Hallucination Dataset for Advancing Multimodal Reasoning

Zhecan Wang, Garrett Bingham, Adams Wei Yu et al.

ECCV 2024arXiv:2407.15680
29
citations
#2914

Video Editing via Factorized Diffusion Distillation

Uriel Singer, Amit Zohar, Yuval Kirstain et al.

ECCV 2024arXiv:2403.09334
29
citations
#2915

Four Ways to Improve Verbo-visual Fusion for Dense 3D Visual Grounding

Ozan Unal, Christos Sakaridis, Suman Saha et al.

ECCV 2024arXiv:2309.04561
29
citations
#2916

Arbitrary-Scale Image Generation and Upsampling using Latent Diffusion Model and Implicit Neural Decoder

Jinseok Kim, Tae-Kyun Kim

CVPR 2024arXiv:2403.10255
29
citations
#2917

Towards Scalable and Versatile Weight Space Learning

Konstantin Schürholt, Michael Mahoney, Damian Borth

ICML 2024arXiv:2406.09997
29
citations
#2918

Dataset Distillation by Automatic Training Trajectories

Dai Liu, Jindong Gu, Hu Cao et al.

ECCV 2024arXiv:2407.14245
29
citations
#2919

Learning the greatest common divisor: explaining transformer predictions

François Charton

ICLR 2024spotlightarXiv:2308.15594
29
citations
#2920

Biased Temporal Convolution Graph Network for Time Series Forecasting with Missing Values

Xiaodan Chen, Xiucheng Li, Bo Liu et al.

ICLR 2024oral
29
citations
#2921

Self-Supervised Facial Representation Learning with Facial Region Awareness

Zheng Gao, Ioannis Patras

CVPR 2024arXiv:2403.02138
29
citations
#2922

Selecting Large Language Model to Fine-tune via Rectified Scaling Law

Haowei Lin, Baizhou Huang, Haotian Ye et al.

ICML 2024arXiv:2402.02314
29
citations
#2923

MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders

Baijiong Lin, Weisen Jiang, Pengguang Chen et al.

ECCV 2024arXiv:2407.02228
29
citations
#2924

Visual-Augmented Dynamic Semantic Prototype for Generative Zero-Shot Learning

Wenjin Hou, Shiming Chen, Shuhuang Chen et al.

CVPR 2024arXiv:2404.14808
29
citations
#2925

Object-Aware Inversion and Reassembly for Image Editing

Zhen Yang, Ganggui Ding, Wen Wang et al.

ICLR 2024arXiv:2310.12149
29
citations
#2926

Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models

Pablo Marcos-Manchón, Roberto Alcover-Couso, Juan SanMiguel et al.

CVPR 2024arXiv:2403.14291
29
citations
#2927

Hybrid Inverse Reinforcement Learning

Juntao Ren, Gokul Swamy, Steven Wu et al.

ICML 2024oralarXiv:2402.08848
29
citations
#2928

Chinese Spelling Correction as Rephrasing Language Model

Linfeng Liu, Hongqiu Wu, Hai Zhao

AAAI 2024paperarXiv:2308.08796
29
citations
#2929

Auto-Train-Once: Controller Network Guided Automatic Network Pruning from Scratch

Xidong Wu, Shangqian Gao, Zeyu Zhang et al.

CVPR 2024arXiv:2403.14729
29
citations
#2930

PolyVoice: Language Models for Speech to Speech Translation

Qianqian Dong, Zhiying Huang, Qiao Tian et al.

ICLR 2024arXiv:2306.02982
29
citations
#2931

Sparse Semi-DETR: Sparse Learnable Queries for Semi-Supervised Object Detection

Tahira Shehzadi, Khurram Azeem Hashmi, Didier Stricker et al.

CVPR 2024arXiv:2404.01819
29
citations
#2932

Parallelizing non-linear sequential models over the sequence length

Yi Heng Lim, Qi Zhu, Joshua Selfridge et al.

ICLR 2024arXiv:2309.12252
29
citations
#2933

A Generalized Neural Diffusion Framework on Graphs

10011 Yibo Li, Xiao Wang, Hongrui Liu et al.

AAAI 2024paperarXiv:2312.08616
29
citations
#2934

On the Stability of Expressive Positional Encodings for Graphs

Yinan Huang, William Lu, Joshua Robinson et al.

ICLR 2024arXiv:2310.02579
29
citations
#2935

Energy-Based Concept Bottleneck Models: Unifying Prediction, Concept Intervention, and Probabilistic Interpretations

Xinyue Xu, Yi Qin, Lu Mi et al.

ICLR 2024arXiv:2401.14142
29
citations
#2936

Nuvo: Neural UV Mapping for Unruly 3D Representations

Pratul Srinivasan, Stephan J Garbin, Dor Verbin et al.

ECCV 2024arXiv:2312.05283
29
citations
#2937

BAGEL: Bootstrapping Agents by Guiding Exploration with Language

Shikhar Murty, Christopher Manning, Peter Shaw et al.

ICML 2024arXiv:2403.08140
29
citations
#2938

Elite360D: Towards Efficient 360 Depth Estimation via Semantic- and Distance-Aware Bi-Projection Fusion

Hao Ai, Addison, Lin Wang

CVPR 2024arXiv:2403.16376
29
citations
#2939

UMIE: Unified Multimodal Information Extraction with Instruction Tuning

Lin Sun, Kai Zhang, Qingyuan Li et al.

AAAI 2024paperarXiv:2401.03082
29
citations
#2940

Do Efficient Transformers Really Save Computation?

Kai Yang, Jan Ackermann, Zhenyu He et al.

ICML 2024arXiv:2402.13934
29
citations
#2941

Copula Conformal prediction for multi-step time series prediction

Sophia Sun, Rose Yu

ICLR 2024oral
29
citations
#2942

UniTabE: A Universal Pretraining Protocol for Tabular Foundation Model in Data Science

Yazheng Yang, Yuqi Wang, Guang Liu et al.

ICLR 2024arXiv:2307.09249
29
citations
#2943

Entropic Open-Set Active Learning

Bardia Safaei, Vibashan VS, Celso de Melo et al.

AAAI 2024paperarXiv:2312.14126
29
citations
#2944

Cross-Covariate Gait Recognition: A Benchmark

Shinan Zou, Chao Fan, Jianbo Xiong et al.

AAAI 2024paperarXiv:2312.14404
29
citations
#2945

I'M HOI: Inertia-aware Monocular Capture of 3D Human-Object Interactions

Chengfeng Zhao, Juze Zhang, Jiashen Du et al.

CVPR 2024arXiv:2312.08869
29
citations
#2946

PhyloGFN: Phylogenetic inference with generative flow networks

MING YANG ZHOU, Zichao Yan, Elliot Layne et al.

ICLR 2024arXiv:2310.08774
29
citations
#2947

All in One Framework for Multimodal Re-identification in the Wild

He Li, Mang Ye, Ming Zhang et al.

CVPR 2024arXiv:2405.04741
29
citations
#2948

PPAD: Iterative Interactions of Prediction and Planning for End-to-end Autonomous Driving

Zhili Chen, Maosheng Ye, Shuangjie Xu et al.

ECCV 2024arXiv:2311.08100
29
citations
#2949

WHAC: World-grounded Humans and Cameras

Wanqi Yin, Zhongang Cai, Chen Wei et al.

ECCV 2024arXiv:2403.12959
29
citations
#2950

Ghost on the Shell: An Expressive Representation of General 3D Shapes

Zhen Liu, Yao Feng, Yuliang Xiu et al.

ICLR 2024arXiv:2310.15168
29
citations
#2951

Adapting to Length Shift: FlexiLength Network for Trajectory Prediction

Yi Xu, Yun Fu

CVPR 2024arXiv:2404.00742
29
citations
#2952

The Devil is in the Fine-Grained Details: Evaluating Open-Vocabulary Object Detectors for Fine-Grained Understanding

Lorenzo Bianchi, Fabio Carrara, Nicola Messina et al.

CVPR 2024highlightarXiv:2311.17518
29
citations
#2953

Zero-1-to-3: Domain-Level Zero-Shot Cognitive Diagnosis via One Batch of Early-Bird Students towards Three Diagnostic Objectives

Weibo Gao, Qi Liu, Hao Wang et al.

AAAI 2024paperarXiv:2312.13434
29
citations
#2954

How Smooth Is Attention?

Valérie Castin, Pierre Ablin, Gabriel Peyré

ICML 2024arXiv:2312.14820
29
citations
#2955

Contrastive Learning is Spectral Clustering on Similarity Graph

Zhiquan Tan, Yifan Zhang, Jingqin Yang et al.

ICLR 2024arXiv:2303.15103
29
citations
#2956

WANDR: Intention-guided Human Motion Generation

Markos Diomataris, Nikos Athanasiou, Omid Taheri et al.

CVPR 2024arXiv:2404.15383
29
citations
#2957

UDiFF: Generating Conditional Unsigned Distance Fields with Optimal Wavelet Diffusion

Junsheng Zhou, Weiqi Zhang, Baorui Ma et al.

CVPR 2024arXiv:2404.06851
29
citations
#2958

HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects

Xintao Lv, Liang Xu, Yichao Yan et al.

ECCV 2024arXiv:2407.12371
29
citations
#2959

Adapt Before Comparison: A New Perspective on Cross-Domain Few-Shot Segmentation

Jonas Herzog

CVPR 2024arXiv:2402.17614
29
citations
#2960

TERD: A Unified Framework for Safeguarding Diffusion Models Against Backdoors

Yichuan Mo, Hui Huang, Mingjie Li et al.

ICML 2024arXiv:2409.05294
29
citations
#2961

Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model

Xu He, Qiaochu Huang, Zhensong Zhang et al.

CVPR 2024arXiv:2404.01862
29
citations
#2962

Human Hair Reconstruction with Strand-Aligned 3D Gaussians

Egor Zakharov, Vanessa Sklyarova, Michael J. Black et al.

ECCV 2024arXiv:2409.14778
29
citations
#2963

LaneCPP: Continuous 3D Lane Detection using Physical Priors

Maximilian Pittner, Joel Janai, Alexandru Paul Condurache

CVPR 2024arXiv:2406.08381
28
citations
#2964

Offline and Online Optical Flow Enhancement for Deep Video Compression

Chuanbo Tang, Xihua Sheng, Zhuoyuan Li et al.

AAAI 2024paperarXiv:2307.05092
28
citations
#2965

SeqGPT: An Out-of-the-Box Large Language Model for Open Domain Sequence Understanding

Tianyu Yu, Chengyue Jiang, Chao Lou et al.

AAAI 2024paperarXiv:2308.10529
28
citations
#2966

VEGS: View Extrapolation of Urban Scenes in 3D Gaussian Splatting using Learned Priors

Sungwon Hwang, Min-Jung Kim, Taewoong Kang et al.

ECCV 2024arXiv:2407.02945
28
citations
#2967

Sat2Scene: 3D Urban Scene Generation from Satellite Images with Diffusion

Zuoyue Li, Zhenqiang Li, Zhaopeng Cui et al.

CVPR 2024highlightarXiv:2401.10786
28
citations
#2968

Inherently Interpretable Time Series Classification via Multiple Instance Learning

Joseph Early, Gavin Cheung, Kurt Cutajar et al.

ICLR 2024spotlightarXiv:2311.10049
28
citations
#2969

Nearest is Not Dearest: Towards Practical Defense against Quantization-conditioned Backdoor Attacks

Boheng Li, Yishuo Cai, Haowei Li et al.

CVPR 2024arXiv:2405.12725
28
citations
#2970

Exploring the Potential of Large Foundation Models for Open-Vocabulary HOI Detection

Ting Lei, Shaofeng Yin, Yang Liu

CVPR 2024arXiv:2404.06194
28
citations
#2971

MatchU: Matching Unseen Objects for 6D Pose Estimation from RGB-D Images

Junwen Huang, Hao Yu, Kuan-Ting Yu et al.

CVPR 2024arXiv:2403.01517
28
citations
#2972

Make-A-Shape: a Ten-Million-scale 3D Shape Model

Ka-Hei Hui, Aditya Sanghi, Arianna Rampini et al.

ICML 2024arXiv:2401.11067
28
citations
#2973

Label Propagation for Zero-shot Classification with Vision-Language Models

Vladan Stojnić, Yannis Kalantidis, Giorgos Tolias

CVPR 2024arXiv:2404.04072
28
citations
#2974

The LLM Surgeon

Tycho van der Ouderaa, Markus Nagel, Mart van Baalen et al.

ICLR 2024arXiv:2312.17244
28
citations
#2975

4D Contrastive Superflows are Dense 3D Representation Learners

Xiang Xu, Lingdong Kong, Hui Shuai et al.

ECCV 2024arXiv:2407.06190
28
citations
#2976

How do Large Language Models Navigate Conflicts between Honesty and Helpfulness?

Ryan Liu, Theodore R Sumers, Ishita Dasgupta et al.

ICML 2024arXiv:2402.07282
28
citations
#2977

Are Transformers with One Layer Self-Attention Using Low-Rank Weight Matrices Universal Approximators?

Tokio Kajitsuka, Issei Sato

ICLR 2024arXiv:2307.14023
28
citations
#2978

Image Compression for Machine and Human Vision With Spatial-Frequency Adaptation

han li, Shaohui Li, Shuangrui Ding et al.

ECCV 2024arXiv:2407.09853
28
citations
#2979

Learning to Intervene on Concept Bottlenecks

David Steinmann, Wolfgang Stammer, Felix Friedrich et al.

ICML 2024arXiv:2308.13453
28
citations
#2980

Generalized Schrödinger Bridge Matching

Guan-Horng Liu, Yaron Lipman, Maximilian Nickel et al.

ICLR 2024arXiv:2310.02233
28
citations
#2981

Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models

Xavi Suau, Pieter Delobelle, Katherine Metcalf et al.

ICML 2024arXiv:2407.12824
28
citations
#2982

Graph Disentangled Contrastive Learning with Personalized Transfer for Cross-Domain Recommendation

Jing Liu, Lele Sun, Wei-zhi Nie et al.

AAAI 2024paper
28
citations
#2983

NILUT: Conditional Neural Implicit 3D Lookup Tables for Image Enhancement

Marcos Conde, Javier Vazquez-Corral, Michael Brown et al.

AAAI 2024paperarXiv:2306.11920
28
citations
#2984

DiffAssemble: A Unified Graph-Diffusion Model for 2D and 3D Reassembly

Gianluca Scarpellini, Stefano Fiorini, Francesco Giuliari et al.

CVPR 2024arXiv:2402.19302
28
citations
#2985

3D Paintbrush: Local Stylization of 3D Shapes with Cascaded Score Distillation

Dale Decatur, Itai Lang, Kfir Aberman et al.

CVPR 2024arXiv:2311.09571
28
citations
#2986

T2IShield: Defending Against Backdoors on Text-to-Image Diffusion Models

Zhongqi Wang, Jie Zhang, Shiguang Shan et al.

ECCV 2024arXiv:2407.04215
28
citations
#2987

SPIN: Simultaneous Perception Interaction and Navigation

Shagun Uppal, Ananye Agarwal, Haoyu Xiong et al.

CVPR 2024arXiv:2405.07991
28
citations
#2988

Exploring Conditional Multi-Modal Prompts for Zero-shot HOI Detection

Ting Lei, Shaofeng Yin, Yuxin Peng et al.

ECCV 2024arXiv:2408.02484
28
citations
#2989

IDEAL: Influence-Driven Selective Annotations Empower In-Context Learners in Large Language Models

Shaokun Zhang, Xiaobo Xia, Zhaoqing Wang et al.

ICLR 2024arXiv:2310.10873
28
citations
#2990

MVIP-NeRF: Multi-view 3D Inpainting on NeRF Scenes via Diffusion Prior

Honghua Chen, Chen Change Loy, Xingang Pan

CVPR 2024arXiv:2405.02859
28
citations
#2991

Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints

Jian Chen, Ruiyi Zhang, Yufan Zhou et al.

ICLR 2024arXiv:2402.04754
28
citations
#2992

Energy-guided Entropic Neural Optimal Transport

Petr Mokrov, Alexander Korotin, Alexander Kolesov et al.

ICLR 2024arXiv:2304.06094
28
citations
#2993

Masked Structural Growth for 2x Faster Language Model Pre-training

Yiqun Yao, Zheng Zhang, Jing Li et al.

ICLR 2024arXiv:2305.02869
28
citations
#2994

Analyzing Feed-Forward Blocks in Transformers through the Lens of Attention Maps

Goro Kobayashi, Tatsuki Kuribayashi, Sho Yokoi et al.

ICLR 2024spotlightarXiv:2302.00456
28
citations
#2995

NOLA: Compressing LoRA using Linear Combination of Random Basis

Soroush Abbasi Koohpayegani, K L Navaneet, Parsa Nooralinejad et al.

ICLR 2024oralarXiv:2310.02556
28
citations
#2996

Trackastra: Transformer-based cell tracking for live-cell microscopy

Benjamin Gallusser, Weigert Martin

ECCV 2024arXiv:2405.15700
28
citations
#2997

FMA-Net: Flow-Guided Dynamic Filtering and Iterative Feature Refinement with Multi-Attention for Joint Video Super-Resolution and Deblurring

Geunhyuk Youk, Jihyong Oh, Munchurl Kim

CVPR 2024arXiv:2401.03707
28
citations
#2998

Critic-Guided Decision Transformer for Offline Reinforcement Learning

Yuanfu Wang, Chao Yang, Ying Wen et al.

AAAI 2024paperarXiv:2312.13716
28
citations
#2999

SphereHead: Stable 3D Full-head Synthesis with Spherical Tri-plane Representation

Heyuan Li, Ce Chen, Tianhao Shi et al.

ECCV 2024arXiv:2404.05680
28
citations
#3000

APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference

Bowen Zhao, Hannaneh Hajishirzi, Qingqing Cao

ICML 2024arXiv:2401.12200
28
citations