Ser-Nam Lim
45
Papers
265
Total Citations
Papers (45)
On the Robustness of Large Multimodal Models Against Image Adversarial Attacks
CVPR 2024
80
citations
Few-Shot Object Detection with Foundation Models
CVPR 2024
50
citations
Jack of All Tasks Master of Many: Designing General-Purpose Coarse-to-Fine Vision-Language Model
CVPR 2024
50
citations
Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval
CVPR 2024
21
citations
Unveiling the Ignorance of MLLMs: Seeing Clearly, Answering Incorrectly
CVPR 2025
19
citations
Composing Object Relations and Attributes for Image-Text Matching
CVPR 2024
18
citations
DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses
ICCV 2025arXiv
12
citations
Fast Encoding and Decoding for Implicit Video Representation
ECCV 2024
7
citations
Model Reveals What to Cache: Profiling-Based Feature Reuse for Video Diffusion Models
ICCV 2025
6
citations
Towards Cross-modal Backward-compatible Representation Learning for Vision-Language Models
ICCV 2025arXiv
2
citations
Towards Scalable Neural Representation for Diverse Videos
CVPR 2023arXiv
0
citations
Open Vocabulary Semantic Segmentation With Patch Aligned Contrastive Learning
CVPR 2023arXiv
0
citations
TIPI: Test Time Adaptation With Transformation Invariance
CVPR 2023
0
citations
Detecting Everything in the Open World: Towards Universal Object Detection
CVPR 2023arXiv
0
citations
Computationally Budgeted Continual Learning: What Does Matter?
CVPR 2023arXiv
0
citations
HNeRV: A Hybrid Neural Representation for Videos
CVPR 2023arXiv
0
citations
Enhancing Adversarial Example Transferability With an Intermediate Level Attack
ICCV 2019
0
citations
Cross-X Learning for Fine-Grained Visual Categorization
ICCV 2019
0
citations
Exploring Visual Engagement Signals for Representation Learning
ICCV 2021arXiv
0
citations
Joint Audio-Visual Deepfake Detection
ICCV 2021
0
citations
Deep Co-Training With Task Decomposition for Semi-Supervised Domain Adaptation
ICCV 2021arXiv
0
citations
Robustness and Generalization via Generative Adversarial Training
ICCV 2021arXiv
0
citations
Open-vocabulary Panoptic Segmentation with Embedding Modulation
ICCV 2023arXiv
0
citations
BT^2: Backward-compatible Training with Basis Transformation
ICCV 2023
0
citations
Rapid Adaptation in Online Continual Learning: Are We Evaluating It Right?
ICCV 2023arXiv
0
citations
Making an Invisibility Cloak: Real World Adversarial Attacks on Object Detectors
ECCV 2020
0
citations
Quantization Guided JPEG Artifact Correction
ECCV 2020
0
citations
Curriculum Manager for Source Selection in Multi-Source Domain Adaptation
ECCV 2020
0
citations
A Metric Learning Reality Check
ECCV 2020
0
citations
What makes fake images detectable? Understanding properties that generalize
ECCV 2020
0
citations
Teaching with Soft Label Smoothing for Mitigating Noisy Labels in Facial Expressions
ECCV 2022
0
citations
Totems: Physical Objects for Verifying Visual Integrity
ECCV 2022
0
citations
MTFormer: Multi-task Learning via Transformer and Cross-Task Reasoning
ECCV 2022
0
citations
Visual Prompt Tuning
ECCV 2022
0
citations
Generative Zero-Shot Composed Image Retrieval
CVPR 2025
0
citations
Object-Centric Unsupervised Image Captioning
ECCV 2022
0
citations
UniMODE: Unified Monocular 3D Object Detection
CVPR 2024
0
citations
MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding
CVPR 2024
0
citations
Object Recognition as Next Token Prediction
CVPR 2024
0
citations
One-Shot Domain Adaptation for Face Generation
CVPR 2020arXiv
0
citations
Intentonomy: A Dataset and Study Towards Human Intent Understanding
CVPR 2021arXiv
0
citations
Efficient Object Embedding for Spliced Image Retrieval
CVPR 2021arXiv
0
citations
On Feature Normalization and Data Augmentation
CVPR 2021arXiv
0
citations
ObjectFormer for Image Manipulation Detection and Localization
CVPR 2022arXiv
0
citations
AdaViT: Adaptive Vision Transformers for Efficient Image Recognition
CVPR 2022arXiv
0
citations