Enshen Zhou
5 Papers
933 Total Citations
Papers (5)
WorldSimBench: Towards Video Generation Models as World Simulators
ICML 2025
806 citations
MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active Perception
CVPR 2024
76 citations
RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics
NeurIPS 2025 (arXiv)
51 citations
Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection
CVPR 2025
0 citations
AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in Text-to-Image Generation
AAAI 2025
0 citations