Weijia Li

14
Papers
139
Total Citations

Papers (14)

LEGION: Learning to Ground and Explain for Synthetic Image Detection

ICCV 2025
32
citations

Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network

ECCV 2024, arXiv
31
citations

UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios

AAAI 2025, arXiv
26
citations

SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation

CVPR 2024, arXiv
25
citations

Where am I? Cross-View Geo-localization with Natural Language Descriptions

ICCV 2025, arXiv
16
citations

Efficient Multi-modal Large Language Models via Progressive Consistency Distillation

NeurIPS 2025, arXiv
8
citations

Can Large Multimodal Models Understand Agricultural Scenes? Benchmarking with AgroMind

NeurIPS 2025, arXiv
1
citation

BLINK-Twice: You see, but do you observe? A Reasoning Benchmark on Visual Perception

NeurIPS 2025, arXiv
0
citations

Leveraging BEV Paradigm for Ground-to-Aerial Image Synthesis

ICCV 2025, arXiv
0
citations

VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis

AAAI 2025, arXiv
0
citations

Scene4U: Hierarchical Layered 3D Scene Reconstruction from Single Panoramic Image for Your Immerse Exploration

CVPR 2025, arXiv
0
citations

Building Bridges across Spatial and Temporal Resolutions: Reference-Based Super-Resolution via Change Priors and Conditional Diffusion Model

CVPR 2024
0
citations

AutoOS: Make Your OS More Powerful by Exploiting Large Language Models

ICML 2024
0
citations

3D Building Reconstruction from Monocular Remote Sensing Images with Multi-level Supervisions

CVPR 2024, arXiv
0
citations