Zhuoyang Zhang
4
Papers
207
Total Citations
Papers (4)
CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models
CVPR 2025
203
citations
DC-AR: Efficient Masked Autoregressive Image Generation with Deep Compression Hybrid Tokenizer
ICCV 2025arXiv
4
citations
NVILA: Efficient Frontier Visual Language Models
CVPR 2025
0
citations
One-2-3-45++: Fast Single Image to 3D Objects with Consistent Multi-View Generation and 3D Diffusion
CVPR 2024
0
citations