Jiale Cao
6
Papers
137
Total Citations
Papers (6)
SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation
CVPR 2024
90
citations
VideoGLaMM : A Large Multimodal Model for Pixel-Level Visual Grounding in Videos
CVPR 2025
30
citations
Glad: A Streaming Scene Generator for Autonomous Driving
ICLR 2025
11
citations
Wavelet and Prototype Augmented Query-based Transformer for Pixel-level Surface Defect Detection
CVPR 2025
3
citations
SSLFusion: Scale and Space Aligned Latent Fusion Model for Multimodal 3D Object Detection
AAAI 2025
3
citations
CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation
ICCV 2025
0
citations