Stan Weixian Lei
7
Papers
123
Total Citations
Papers (7)
ShowUI: One Vision-Language-Action Model for GUI Visual Agent
CVPR 2025
123
citations
ViT-Lens: Towards Omni-modal Representations
CVPR 2024
0
citations
Generic Event Boundary Detection: A Benchmark for Event Segmentation
ICCV 2021arXiv
0
citations
Too Large; Data Reduction for Vision-Language Pre-Training
ICCV 2023arXiv
0
citations
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
ICCV 2023
0
citations
Learning to Learn: How to Continuously Teach Humans and Machines
ICCV 2023arXiv
0
citations
AssistQ: Affordance-Centric Question-Driven Task Completion for Egocentric Assistant
ECCV 2022
0
citations