Xuming He

38
Papers
0
Total Citations

Papers (38)

GeoDistill: Geometry-Guided Self-Distillation for Weakly Supervised Cross-View Localization

ICCV 2025
0
citations

NoisyGRPO: Incentivizing Multimodal CoT Reasoning via Noise Injection and Bayesian Estimation

NeurIPS 2025
0
citations

Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation

AAAI 2025
0
citations

Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training

AAAI 2024
0
citations

DSGG: Dense Relation Transformer for an End-to-end Scene Graph Generation

CVPR 2024
0
citations

From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models

CVPR 2024
0
citations

Learning by Correction: Efficient Tuning Task for Zero-Shot Generative Vision-Language Reasoning

CVPR 2024
0
citations

Indoor Scene Structure Analysis for Single Image Depth Estimation

CVPR 2015
0
citations

Multiclass Semantic Video Segmentation With Object-Level Active Inference

CVPR 2015
0
citations

Separating Objects and Clutter in Indoor Scenes

CVPR 2015
0
citations

Learning to Co-Generate Object Proposals With a Deep Structured Network

CVPR 2016
0
citations

Predicting Salient Face in Multiple-Face Videos

CVPR 2017
0
citations

Indoor Scene Parsing With Instance Segmentation, Semantic Labeling and Support Relationship Inference

CVPR 2017
0
citations

Boundary-Aware Instance Segmentation

CVPR 2017arXiv
0
citations

One-Shot Action Localization by Learning Sequence Matching Network

CVPR 2018
0
citations

Geometry-Aware Deep Network for Single-Image Novel View Synthesis

CVPR 2018arXiv
0
citations

SemStyle: Learning to Generate Stylised Image Captions Using Unaligned Text

CVPR 2018arXiv
0
citations

Distribution Alignment: A Unified Framework for Long-Tail Visual Recognition

CVPR 2021arXiv
0
citations

Bipartite Graph Network With Adaptive Message Passing for Unbiased Scene Graph Generation

CVPR 2021arXiv
0
citations

DER: Dynamically Expandable Representation for Class Incremental Learning

CVPR 2021arXiv
0
citations

Relation-aware Instance Refinement for Weakly Supervised Visual Grounding

CVPR 2021arXiv
0
citations

General Incremental Learning With Domain-Aware Categorical Representations

CVPR 2022arXiv
0
citations

SGTR: End-to-End Scene Graph Generation With Transformer

CVPR 2022arXiv
0
citations

HOICLIP: Efficient Knowledge Transfer for HOI Detection With Vision-Language Models

CVPR 2023arXiv
0
citations

Structural Kernel Learning for Large Scale Multiclass Object Co-Detection

ICCV 2015
0
citations

Deep Free-Form Deformation Network for Object-Mask Registration

ICCV 2017
0
citations

Dynamic Context Correspondence Network for Semantic Alignment

ICCV 2019
0
citations

Pose-Aware Multi-Level Feature Network for Human Object Interaction Detection

ICCV 2019
0
citations

GNeRF: GAN-Based Neural Radiance Field Without Posed Camera

ICCV 2021arXiv
0
citations

Class-relation Knowledge Distillation for Novel Class Discovery

ICCV 2023arXiv
0
citations

Grounded Image Text Matching with Mismatched Relation Reasoning

ICCV 2023arXiv
0
citations

Human-centric Scene Understanding for 3D Large-scale Scenarios

ICCV 2023arXiv
0
citations

Part-aware Prototype Network for Few-shot Semantic Segmentation

ECCV 2020
0
citations

Learning Semantic Correspondence with Sparse Annotations

ECCV 2022
0
citations

Generative Negative Text Replay for Continual Vision-Language Pretraining

ECCV 2022
0
citations

Dynamic Grained Encoder for Vision Transformers

NeurIPS 2021
0
citations

ATTA: Anomaly-aware Test-Time Adaptation for Out-of-Distribution Detection in Segmentation

NeurIPS 2023
0
citations

LatentGNN: Learning Efficient Non-local Relations for Visual Recognition

ICML 2019
0
citations