Poster Papers

LipSim: A Provably Robust Perceptual Similarity Metric

Sara Ghazanfari, Alexandre Araujo, Prashanth Krishnamurthy et al.

ICLR 2024 • arXiv:2310.18274 • 13 citations

Lipsum-FT: Robust Fine-Tuning of Zero-Shot Models Using Random Text Guidance

Giung Nam, Byeongho Heo, Juho Lee

ICLR 2024 • arXiv:2404.00860 • 13 citations

LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading

Yochai Yemini, Aviv Shamsian, Lior Bracha et al.

ICLR 2024 • arXiv:2306.03258 • 24 citations

LISA: Reasoning Segmentation via Large Language Model

Xin Lai, Zhuotao Tian, Yukang Chen et al.

CVPR 2024 • arXiv:2308.00692 • 742 citations

LISO: Lidar-only Self-Supervised 3D Object Detection

Stefan Baur, Frank Moosmann, Andreas Geiger

ECCV 2024 • arXiv:2403.07071 • 24 citations

Listenable Maps for Audio Classifiers

Francesco Paissan, Mirco Ravanelli, Cem Subakan

ICML 2024 • arXiv:2403.13086 • 13 citations

Listening to the noise: Blind Denoising with Gibbs Diffusion

David Heurtel-Depeiges, Charles Margossian, Ruben Ohana et al.

ICML 2024 • arXiv:2402.19455 • 4 citations

Listen, Think, and Understand

Yuan Gong, Hongyin Luo, Alexander Liu et al.

ICLR 2024 • arXiv:2305.10790 • 224 citations

Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation

Bolin Lai, Fiona Ryan, Wenqi Jia et al.

ECCV 2024 • arXiv:2305.03907 • 19 citations

Listwise Reward Estimation for Offline Preference-based Reinforcement Learning

Heewoong Choi, Sangwon Jung, Hongjoon Ahn et al.

ICML 2024 • arXiv:2408.04190 • 11 citations

LITA: Language Instructed Temporal-Localization Assistant

De-An Huang, Shijia Liao, Subhashree Radhakrishnan et al.

ECCV 2024 • arXiv:2403.19046 • 108 citations

LitCab: Lightweight Language Model Calibration over Short- and Long-form Responses

Xin Liu, Muhammad Khalifa, Lu Wang

ICLR 2024 • arXiv:2310.19208 • 36 citations

LiteSAM is Actually what you Need for segment Everything

Jianhai Fu, Yuanjie Yu, Ningchuan Li et al.

ECCV 2024

LiveHPS++: Robust and Coherent Motion Capture in Dynamic Free Environment

Yiming Ren, Xiao Han, Yichen Yao et al.

ECCV 2024 • arXiv:2407.09833 • 5 citations

LivePhoto: Real Image Animation with Text-guided Motion Control

Xi Chen, Zhiheng Liu, Mengting Chen et al.

ECCV 2024 • arXiv:2312.02928 • 46 citations

LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding Reasoning and Planning

Sijin Chen, Xin Chen, Chi Zhang et al.

CVPR 2024

LLaFS: When Large Language Models Meet Few-Shot Segmentation

Lanyun Zhu, Tianrun Chen, Deyi Ji et al.

CVPR 2024 • arXiv:2311.16926 • 78 citations

LLaGA: Large Language and Graph Assistant

Runjin Chen, Tong Zhao, Ajay Jaiswal et al.

ICML 2024 • arXiv:2402.08170 • 148 citations

LLaMA-Adapter: Efficient Fine-tuning of Large Language Models with Zero-initialized Attention

Renrui Zhang, Jiaming Han, Chris Liu et al.

ICLR 2024

LLaMA-Excitor: General Instruction Tuning via Indirect Feature Interaction

Bo Zou, Chao Yang, Yu Qiao et al.

CVPR 2024 • arXiv:2404.00913 • 8 citations

LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

Yanwei Li, Chengyao Wang, Jiaya Jia

ECCV 2024 • arXiv:2311.17043 • 499 citations

LLark: A Multimodal Instruction-Following Language Model for Music

Josh Gardner, Simon Durand, Daniel Stoller et al.

ICML 2024 • arXiv:2310.07160 • 30 citations

LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models

Hao Zhang, Hongyang Li, Feng Li et al.

ECCV 2024 • arXiv:2312.02949 • 114 citations

LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents

Shilong Liu, Hao Cheng, Haotian Liu et al.

ECCV 2024 • arXiv:2311.05437 • 200 citations

LLaVA-UHD: an LMM Perceiving any Aspect Ratio and High-Resolution Images

Zonghao Guo, Ruyi Xu, Yuan Yao et al.

ECCV 2024 • arXiv:2403.11703 • 174 citations

Llemma: An Open Language Model for Mathematics

Zhangir Azerbayev, Hailey Schoelkopf, Keiran Paster et al.

ICLR 2024 • arXiv:2310.10631 • 402 citations

LLM4SGG: Large Language Models for Weakly Supervised Scene Graph Generation

Kibum Kim, Kanghoon Yoon, Jaehyeong Jeon et al.

CVPR 2024 • arXiv:2310.10404 • 32 citations

LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific Discovery

Pingchuan Ma, Johnson Tsun-Hsuan Wang, Minghao Guo et al.

ICML 2024 • arXiv:2405.09783 • 67 citations

LLM as Copilot for Coarse-grained Vision-and-Language Navigation

Yanyuan Qiao, Qianyi Liu, Jiajun Liu et al.

ECCV 2024

LLM as Dataset Analyst: Subpopulation Structure Discovery with Large Language Model

Yulin Luo, Ruichuan An, Bocheng Zou et al.

ECCV 2024 • arXiv:2405.02363 • 45 citations

LLM-Assisted Code Cleaning For Training Accurate Code Generators

Naman Jain, Tianjun Zhang, Wei-Lin Chiang et al.

ICLR 2024 • arXiv:2311.14904 • 44 citations

LLM Augmented LLMs: Expanding Capabilities through Composition

Rachit Bansal, Bidisha Samanta, Siddharth Dalmia et al.

ICLR 2024 • arXiv:2401.02412 • 50 citations

LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts

Hanan Gani, Shariq Bhat, Muzammal Naseer et al.

ICLR 2024 • arXiv:2310.10640 • 56 citations

LLMCarbon: Modeling the End-to-End Carbon Footprint of Large Language Models

Ahmad Faiz, Sotaro Kaneda, Ruhan Wang et al.

ICLR 2024 • arXiv:2309.14393 • 115 citations

LLMCO4MR: LLMs-aided Neural Combinatorial Optimization for Ancient Manuscript Restoration from Fragments with Case Studies on Dunhuang

Yuqing Zhang, Hangqi Li, Shengyu Zhang et al.

ECCV 2024 • 6 citations

LLM-CXR: Instruction-Finetuned LLM for CXR Image Understanding and Generation

Suhyeon Lee, Won Jun Kim, Jinho Chang et al.

ICLR 2024 • arXiv:2305.11490 • 75 citations

LLM-Empowered State Representation for Reinforcement Learning

Boyuan Wang, Yun Qu, Yuhang Jiang et al.

ICML 2024 • arXiv:2407.13237 • 24 citations

LLMGA: Multimodal Large Language Model based Generation Assistant

Bin Xia, Shiyin Wang, Yingfan Tao et al.

ECCV 2024 • arXiv:2311.16500 • 25 citations

LLMs are Good Action Recognizers

Haoxuan Qu, Yujun Cai, Jun Liu

CVPR 2024 • arXiv:2404.00532 • 45 citations

LLMs are Good Sign Language Translators

Jia Gong, Lin Geng Foo, Yixuan He et al.

CVPR 2024 • arXiv:2404.00925 • 73 citations

LLMs Meet VLMs: Boost Open Vocabulary Object Detection with Fine-grained Descriptors

Sheng Jin, Xueying Jiang, Jiaxing Huang et al.

ICLR 2024 • arXiv:2402.04630 • 40 citations

L-MAGIC: Language Model Assisted Generation of Images with Coherence

Zhipeng Cai, Matthias Mueller, Reiner Birkl et al.

CVPR 2024 • arXiv:2406.01843 • 7 citations

LMDrive: Closed-Loop End-to-End Driving with Large Language Models

Hao Shao, Yuxuan Hu, Letian Wang et al.

CVPR 2024 • arXiv:2312.07488 • 251 citations

LMT-GP: Combined Latent Mean-Teacher and Gaussian Process for Semi-supervised Low-light Image Enhancement

Ye Yu, Fengxin Chen, Jun Yu et al.

ECCV 2024 • arXiv:2408.16235 • 6 citations

LMUFormer: Low Complexity Yet Powerful Spiking Model With Legendre Memory Units

Zeyu Liu, Gourav Datta, Anni Li et al.

ICLR 2024 • arXiv:2402.04882 • 17 citations

LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation

Yushi Lan, Fangzhou Hong, Shuai Yang et al.

ECCV 2024 • arXiv:2403.12019 • 75 citations

LNL+K: Enhancing Learning with Noisy Labels Through Noise Source Knowledge Integration

Siqi Wang, Bryan Plummer

ECCV 2024 • arXiv:2306.11911 • 2 citations

LoA-Trans: Enhancing Visual Grounding by Location-Aware Transformers

Ziling Huang, Shin’ichi Satoh

ECCV 2024

Loc3Diff: Local Diffusion for 3D Human Head Synthesis and Editing

Yushi Lan, Feitong Tan, Qiangeng Xu et al.

ECCV 2024

Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation

Peng Jin, Hao Li, Zesen Cheng et al.

ECCV 2024 • arXiv:2407.10528 • 13 citations