Pengxiang Ding
9
Papers
242
Total Citations
Papers (9)
Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference
AAAI 2025
106
citations
VLAS: Vision-Language-Action Model with Speech Instructions for Customized Robot Manipulation
ICLR 2025
37
citations
CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction
ICCV 2025
23
citations
ReinboT: Amplifying Robot Visual-Language Manipulation with Reinforcement Learning
ICML 2025
20
citations
SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning
NeurIPS 2025
18
citations
GEVRM: Goal-Expressive Video Generation Model For Robust Visual Manipulation
ICLR 2025
17
citations
PiTe: Pixel-Temporal Alignment for Large Video-Language Model
ECCV 2024arXiv
9
citations
Expressive Forecasting of 3D Whole-Body Human Motions
AAAI 2024arXiv
8
citations
Score-Based Diffusion Policy Compatible with Reinforcement Learning via Optimal Transport
ICML 2025
4
citations