Shengqiong Wu
8
Papers
68
Total Citations
Papers (8)
Towards Semantic Equivalence of Tokenization in Multimodal LLM
ICLR 2025
57
citations
Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene
CVPR 2025
4
citations
JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation
NeurIPS 2025arXiv
4
citations
Universal Scene Graph Generation
CVPR 2025
3
citations
NExT-GPT: Any-to-Any Multimodal LLM
ICML 2024
0
citations
Combating Multimodal LLM Hallucination via Bottom-Up Holistic Reasoning
AAAI 2025
0
citations
Dysen-VDM: Empowering Dynamics-aware Text-to-Video Diffusion with LLMs
CVPR 2024
0
citations
Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition
ICML 2024
0
citations