Ser-Nam Lim
14
Papers
265
Total Citations
Papers (14)
On the Robustness of Large Multimodal Models Against Image Adversarial Attacks
CVPR 2024
80
citations
Jack of All Tasks Master of Many: Designing General-Purpose Coarse-to-Fine Vision-Language Model
CVPR 2024
50
citations
Few-Shot Object Detection with Foundation Models
CVPR 2024
50
citations
Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval
CVPR 2024
21
citations
Unveiling the Ignorance of MLLMs: Seeing Clearly, Answering Incorrectly
CVPR 2025
19
citations
Composing Object Relations and Attributes for Image-Text Matching
CVPR 2024
18
citations
DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses
ICCV 2025arXiv
12
citations
Fast Encoding and Decoding for Implicit Video Representation
ECCV 2024
7
citations
Model Reveals What to Cache: Profiling-Based Feature Reuse for Video Diffusion Models
ICCV 2025
6
citations
Towards Cross-modal Backward-compatible Representation Learning for Vision-Language Models
ICCV 2025arXiv
2
citations
Object Recognition as Next Token Prediction
CVPR 2024
0
citations
UniMODE: Unified Monocular 3D Object Detection
CVPR 2024
0
citations
MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding
CVPR 2024
0
citations
Generative Zero-Shot Composed Image Retrieval
CVPR 2025
0
citations