Dawei Leng
6 papers
46 total citations
Papers (6)
WISA: World simulator assistant for physics-aware text-to-video generation
NeurIPS 2025 (arXiv)
34 citations
PlanGen: Towards Unified Layout Planning and Image Generation in Auto-Regressive Vision Language Models
ICCV 2025 (arXiv)
7 citations
Bridge Diffusion Model: Bridge Chinese Text-to-Image Diffusion Model with English Communities
AAAI 2025
4 citations
Prompt as Knowledge Bank: Boost Vision-language model via Structural Representation for zero-shot medical detection
ICLR 2025 (arXiv)
1 citation
LMM-Det: Make Large Multimodal Models Excel in Object Detection
ICCV 2025
0 citations
IAA: Inner-Adaptor Architecture Empowers Frozen Large Language Model with Multimodal Capabilities
AAAI 2025
0 citations