All Papers

34,598 papers found • Page 578 of 692

LQMFormer: Language-aware Query Mask Transformer for Referring Image Segmentation

Nisarg Shah, Vibashan VS, Vishal M. Patel

CVPR 2024
13
citations

LRANet: Towards Accurate and Efficient Scene Text Detection with Low-Rank Approximation

Yuchen Su, Zhineng Chen, Zhiwen Shao et al.

AAAI 2024paperarXiv:2306.15142
17
citations

LRM: Large Reconstruction Model for Single Image to 3D

Yicong Hong, Kai Zhang, Jiuxiang Gu et al.

ICLR 2024arXiv:2311.04400
711
citations

LRR: Language-Driven Resamplable Continuous Representation against Adversarial Tracking Attacks

Jianlang Chen, Xuhong Ren, Qing Guo et al.

ICLR 2024oralarXiv:2404.06247
6
citations

LRS: Enhancing Adversarial Transferability through Lipschitz Regularized Surrogate

Tao Wu, Tie Luo, D. C. Wunsch

AAAI 2024paperarXiv:2312.13118
7
citations

LRSLAM: Low-rank Representation of Signed Distance Fields in Dense Visual SLAM System

Hongbeen Park, Minjeong Park, Giljoo Nam et al.

ECCV 2024arXiv:2506.10567
3
citations

LSEnet: Lorentz Structural Entropy Neural Network for Deep Graph Clustering

Li Sun, Zhenhao Huang, Hao Peng et al.

ICML 2024arXiv:2405.11801
20
citations

LSK3DNet: Towards Effective and Efficient 3D Perception with Large Sparse Kernels

Tuo Feng, Wenguan Wang, Fan Ma et al.

CVPR 2024arXiv:2403.15173
20
citations

LSTKC: Long Short

Term Knowledge Consolidation for Lifelong Person Re-identification - Kunlun Xu, Xu Zou, Jiahuan Zhou

AAAI 2024paper

LTA-PCS: Learnable Task-Agnostic Point Cloud Sampling

Jiaheng Liu, Jianhao Li, Kaisiyuan Wang et al.

CVPR 2024
10
citations

LTGC: Long-tail Recognition via Leveraging LLMs-driven Generated Content

Qihao Zhao, Yalun Dai, Hao Li et al.

CVPR 2024arXiv:2403.05854
35
citations

LTM: Lightweight Textured Mesh Extraction and Refinement of Large Unbounded Scenes for Efficient Storage and Real-time Rendering

Jaehoon Choi, Rajvi Shah, Qinbo Li et al.

CVPR 2024

LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching

Yixun Liang, Xin Yang, Jiantao Lin et al.

CVPR 2024highlightarXiv:2311.11284
282
citations

LUM-ViT: Learnable Under-sampling Mask Vision Transformer for Bandwidth Limited Optical Signal Acquisition

Lingfeng Liu, Dong Ni, Hangjie Yuan

ICLR 2024arXiv:2403.01412

LUT-GEMM: Quantized Matrix Multiplication based on LUTs for Efficient Inference in Large-Scale Generative Language Models

Gunho Park, baeseong park, Minsub Kim et al.

ICLR 2024arXiv:2206.09557
119
citations

LUWA Dataset: Learning Lithic Use-Wear Analysis on Microscopic Images

Jing Zhang, Irving Fang, Hao Wu et al.

CVPR 2024highlightarXiv:2403.13171
11
citations

Lyapunov-Stable Deep Equilibrium Models

Haoyu Chu, Shikui Wei, Ting Liu et al.

AAAI 2024paperarXiv:2304.12707
8
citations

Lyapunov-stable Neural Control for State and Output Feedback: A Novel Formulation

Lujie Yang, Hongkai Dai, Zhouxing Shi et al.

ICML 2024arXiv:2404.07956
34
citations

M2D2M: Multi-Motion Generation from Text with Discrete Diffusion Models

Seunggeun Chi, Hyung-gun Chi, Hengbo Ma et al.

ECCV 2024arXiv:2407.14502
17
citations

M^2Depth: Self-supervised Two-Frame Multi-camera Metric Depth Estimation

Yingshuang Zou, Yikang Ding, Xi Qiu et al.

ECCV 2024

M2Doc: A Multi-Modal Fusion Approach for Document Layout Analysis

Ning Zhang, Hiuyi Cheng, Jiayu Chen et al.

AAAI 2024paper
14
citations

M2SD:Multiple Mixing Self-Distillation for Few-Shot Class-Incremental Learning

Jinhao Lin, Ziheng Wu, Weifeng Lin et al.

AAAI 2024paper

M3C: A Framework towards Convergent, Flexible, and Unsupervised Learning of Mixture Graph Matching and Clustering

Jiaxin Lu, Zetian Jiang, Tianzhe Wang et al.

ICLR 2024arXiv:2310.18444
3
citations

M3DBench: Towards Omni 3D Assistant with Interleaved Multi-modal Instructions

Mingsheng Li, Xin Chen, Chi Zhang et al.

ECCV 2024
4
citations

M3D: Dataset Condensation by Minimizing Maximum Mean Discrepancy

Hansong Zhang, Shikun Li, Pengju Wang et al.

AAAI 2024paperarXiv:2312.15927
52
citations

M3SOT: Multi-Frame, Multi-Field, Multi-Space 3D Single Object Tracking

Jiaming Liu, Yue Wu, Maoguo Gong et al.

AAAI 2024paperarXiv:2312.06117
13
citations

M3-UDA: A New Benchmark for Unsupervised Domain Adaptive Fetal Cardiac Structure Detection

Bin Pu, Liwen Wang, Jiewen Yang et al.

CVPR 2024

MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion

Lehong Wu, Lilang Lin, Jiahang Zhang et al.

ECCV 2024arXiv:2409.10473
11
citations

MACE: Mass Concept Erasure in Diffusion Models

Shilin Lu, Zilan Wang, Leyang Li et al.

CVPR 2024arXiv:2403.06135
226
citations

Machine-Created Universal Language for Cross-Lingual Transfer

Yaobo Liang, Quanzhi Zhu, Junhe Zhao et al.

AAAI 2024paperarXiv:2305.13071
9
citations

Machine Learning

Powered Combinatorial Clock Auction - Ermis Nikiforos Soumalias, Jakob Weissteiner, Jakob Heiss et al.

AAAI 2024paperarXiv:2512.11133

Machine Unlearning for Image-to-Image Generative Models

Guihong Li, Hsiang Hsu, Chun-Fu Chen et al.

ICLR 2024arXiv:2402.00351
50
citations

Machine Vision Therapy: Multimodal Large Language Models Can Enhance Visual Robustness via Denoising In-Context Learning

Zhuo Huang, Chang Liu, Yinpeng Dong et al.

ICML 2024arXiv:2312.02546
23
citations

MADA: Meta-Adaptive Optimizers Through Hyper-Gradient Descent

Kaan Ozkara, Can Karakus, Parameswaran Raman et al.

ICML 2024arXiv:2401.08893
6
citations

MAD-DR: Map Compression for Visual Localization with Matchness Aware Descriptor Dimension Reduction

Qiang Wang

ECCV 2024

Made to Order: Discovering monotonic temporal changes via self-supervised video ordering

Charig Yang, Weidi Xie, Andrew ZISSERMAN

ECCV 2024arXiv:2404.16828
8
citations

MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for Accelerating Vision-Language Transformer

Jianjian Cao, Peng Ye, Shengze Li et al.

CVPR 2024arXiv:2403.02991
47
citations

Maestro: Uncovering Low-Rank Structures via Trainable Decomposition

Samuel Horváth, Stefanos Laskaridis, Shashank Rajput et al.

ICML 2024

MAFA: Managing False Negatives for Vision-Language Pre-training

Jaeseok Byun, Dohoon Kim, Taesup Moon

CVPR 2024arXiv:2312.06112
13
citations

MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing

Haoyu Zhao, Tianyi Lu, Jiaxi Gu et al.

ECCV 2024arXiv:2311.17338
19
citations

MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models

Justin Chih-Yao Chen, Swarnadeep Saha, Elias Stengel-Eskin et al.

ICML 2024arXiv:2402.01620
28
citations

MaGGIe: Masked Guided Gradual Human Instance Matting

Chuong Huynh, Seoung Wug Oh, Abhinav Shrivastava et al.

CVPR 2024arXiv:2404.16035
16
citations

Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors

Guocheng Qian, Jinjie Mai, Abdullah Hamdi et al.

ICLR 2024arXiv:2306.17843
430
citations

MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Zhongcong Xu, Jianfeng Zhang, Jun Hao Liew et al.

CVPR 2024arXiv:2311.16498
327
citations

MagiCapture: High-Resolution Multi-Concept Portrait Customization

9256 Junha Hyung, Jaeyo Shin, Jaegul Choo

AAAI 2024paperarXiv:2309.06895
25
citations

MagicDrive: Street View Generation with Diverse 3D Geometry Control

Ruiyuan Gao, Kai Chen, Enze Xie et al.

ICLR 2024arXiv:2310.02601
218
citations

MagicEraser: Erasing Any Objects via Semantics-Aware Control

FAN LI, Zixiao Zhang, Yi Huang et al.

ECCV 2024arXiv:2410.10207
13
citations

MAGICK: A Large-scale Captioned Dataset from Matting Generated Images using Chroma Keying

Ryan Burgert, Brian Price, Jason Kuen et al.

CVPR 2024

MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions

Kai Zhang, Yi Luan, Hexiang Hu et al.

ICML 2024arXiv:2403.19651
88
citations

MagicMirror: Fast and High-Quality Avatar Generation with Constrained Search Space

Armand Comas Massague, Di Qiu, Menglei Chai et al.

ECCV 2024
2
citations