Xuming He
38
Papers
0
Total Citations
Papers (38)
GeoDistill: Geometry-Guided Self-Distillation for Weakly Supervised Cross-View Localization
ICCV 2025
0
citations
NoisyGRPO: Incentivizing Multimodal CoT Reasoning via Noise Injection and Bayesian Estimation
NeurIPS 2025
0
citations
Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation
AAAI 2025
0
citations
Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training
AAAI 2024
0
citations
DSGG: Dense Relation Transformer for an End-to-end Scene Graph Generation
CVPR 2024
0
citations
From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models
CVPR 2024
0
citations
Learning by Correction: Efficient Tuning Task for Zero-Shot Generative Vision-Language Reasoning
CVPR 2024
0
citations
Indoor Scene Structure Analysis for Single Image Depth Estimation
CVPR 2015
0
citations
Multiclass Semantic Video Segmentation With Object-Level Active Inference
CVPR 2015
0
citations
Separating Objects and Clutter in Indoor Scenes
CVPR 2015
0
citations
Learning to Co-Generate Object Proposals With a Deep Structured Network
CVPR 2016
0
citations
Predicting Salient Face in Multiple-Face Videos
CVPR 2017
0
citations
Indoor Scene Parsing With Instance Segmentation, Semantic Labeling and Support Relationship Inference
CVPR 2017
0
citations
Boundary-Aware Instance Segmentation
CVPR 2017 (arXiv)
0
citations
One-Shot Action Localization by Learning Sequence Matching Network
CVPR 2018
0
citations
Geometry-Aware Deep Network for Single-Image Novel View Synthesis
CVPR 2018 (arXiv)
0
citations
SemStyle: Learning to Generate Stylised Image Captions Using Unaligned Text
CVPR 2018 (arXiv)
0
citations
Distribution Alignment: A Unified Framework for Long-Tail Visual Recognition
CVPR 2021 (arXiv)
0
citations
Bipartite Graph Network With Adaptive Message Passing for Unbiased Scene Graph Generation
CVPR 2021 (arXiv)
0
citations
DER: Dynamically Expandable Representation for Class Incremental Learning
CVPR 2021 (arXiv)
0
citations
Relation-aware Instance Refinement for Weakly Supervised Visual Grounding
CVPR 2021 (arXiv)
0
citations
General Incremental Learning With Domain-Aware Categorical Representations
CVPR 2022 (arXiv)
0
citations
SGTR: End-to-End Scene Graph Generation With Transformer
CVPR 2022 (arXiv)
0
citations
HOICLIP: Efficient Knowledge Transfer for HOI Detection With Vision-Language Models
CVPR 2023 (arXiv)
0
citations
Structural Kernel Learning for Large Scale Multiclass Object Co-Detection
ICCV 2015
0
citations
Deep Free-Form Deformation Network for Object-Mask Registration
ICCV 2017
0
citations
Dynamic Context Correspondence Network for Semantic Alignment
ICCV 2019
0
citations
Pose-Aware Multi-Level Feature Network for Human Object Interaction Detection
ICCV 2019
0
citations
GNeRF: GAN-Based Neural Radiance Field Without Posed Camera
ICCV 2021 (arXiv)
0
citations
Class-relation Knowledge Distillation for Novel Class Discovery
ICCV 2023 (arXiv)
0
citations
Grounded Image Text Matching with Mismatched Relation Reasoning
ICCV 2023 (arXiv)
0
citations
Human-centric Scene Understanding for 3D Large-scale Scenarios
ICCV 2023 (arXiv)
0
citations
Part-aware Prototype Network for Few-shot Semantic Segmentation
ECCV 2020
0
citations
Learning Semantic Correspondence with Sparse Annotations
ECCV 2022
0
citations
Generative Negative Text Replay for Continual Vision-Language Pretraining
ECCV 2022
0
citations
Dynamic Grained Encoder for Vision Transformers
NeurIPS 2021
0
citations
ATTA: Anomaly-aware Test-Time Adaptation for Out-of-Distribution Detection in Segmentation
NeurIPS 2023
0
citations
LatentGNN: Learning Efficient Non-local Relations for Visual Recognition
ICML 2019
0
citations