Shengqiong Wu
10
Papers
68
Total Citations
Papers (10)
Towards Semantic Equivalence of Tokenization in Multimodal LLM
ICLR 2025
57
citations
Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene
CVPR 2025
4
citations
JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation
NeurIPS 2025arXiv
4
citations
Universal Scene Graph Generation
CVPR 2025
3
citations
Dysen-VDM: Empowering Dynamics-aware Text-to-Video Diffusion with LLMs
CVPR 2024
0
citations
Combating Multimodal LLM Hallucination via Bottom-Up Holistic Reasoning
AAAI 2025
0
citations
Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition
ICML 2024
0
citations
NExT-GPT: Any-to-Any Multimodal LLM
ICML 2024
0
citations
LasUIE: Unifying Information Extraction with Latent Adaptive Structure-aware Generative Language Model
NeurIPS 2022
0
citations
Imagine That! Abstract-to-Intricate Text-to-Image Synthesis with Scene Graph Hallucination Diffusion
NeurIPS 2023
0
citations