Jieneng Chen
6
Papers
2
Total Citations
Papers (6)
Vision‑Language‑Vision Auto‑Encoder: Scalable Knowledge Distillation from Diffusion Models
NeurIPS 2025arXiv
2
citations
Spatial457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Mutimodal Models
CVPR 2025
0
citations
SpatialLLM: A Compound 3D-Informed Design towards Spatially-Intelligent Large Multimodal Models
CVPR 2025
0
citations
3DSRBench: A Comprehensive 3D Spatial Reasoning Benchmark
ICCV 2025
0
citations
Medical World Model
ICCV 2025
0
citations
ViTamin: Designing Scalable Vision Models in the Vision-Language Era
CVPR 2024
0
citations