Wentao Liu
10
Papers
206
Total Citations
Papers (10)
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction
ICLR 2024
104
citations
Harmonizing Visual Representations for Unified Multimodal Understanding and Generation
ICCV 2025
33
citations
CLIM: Contrastive Language-Image Mosaic for Region Representation
AAAI 2024arXiv
24
citations
F-LMM: Grounding Frozen Large Multimodal Models
CVPR 2025arXiv
21
citations
AutoMMLab: Automatically Generating Deployable Models from Language Instructions for Computer Vision Tasks
AAAI 2025
14
citations
UniFS: Universal Few-shot Instance Perception with Point Representations
ECCV 2024
3
citations
NADER: Neural Architecture Design via Multi-Agent Collaboration
CVPR 2025arXiv
3
citations
ShotVL: Human-Centric Highlight Frame Retrieval via Language Queries
AAAI 2025
2
citations
Unsupervised Continual Domain Shift Learning with Multi-Prototype Modeling
CVPR 2025
2
citations
Leveraging Frame Affinity for sRGB-to-RAW Video De-rendering
CVPR 2024
0
citations