Yizhou Yu

52
Papers
67
Total Citations

Papers (52)

SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks

AAAI 2025
24
citations

SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation

CVPR 2025arXiv
22
citations

OVER-NAV: Elevating Iterative Vision-and-Language Navigation with Open-Vocabulary Detection and StructurEd Representation

CVPR 2024
15
citations

Autoregressive Sequence Modeling for 3D Medical Image Representation

AAAI 2025
3
citations

Vision Function Layer in Multimodal LLMs

NeurIPS 2025
3
citations

Visual Saliency Based on Multiscale Deep Features

CVPR 2015
0
citations

Deep Contrast Learning for Salient Object Detection

CVPR 2016
0
citations

Borrowing Treasures From the Wealthy: Deep Transfer Learning Through Selective Joint Fine-Tuning

CVPR 2017arXiv
0
citations

Instance-Level Salient Object Segmentation

CVPR 2017arXiv
0
citations

Multi-Evidence Filtering and Fusion for Multi-Label Classification, Object Detection and Semantic Segmentation Based on Weakly Supervised Learning

CVPR 2018arXiv
0
citations

Weakly Supervised Complementary Parts Models for Fine-Grained Image Classification From the Bottom Up

CVPR 2019
0
citations

Cross-Modal Relationship Inference for Grounding Referring Expressions

CVPR 2019
0
citations

Multi-Source Weak Supervision for Saliency Detection

CVPR 2019
0
citations

Cascaded Generative and Discriminative Learning for Microcalcification Detection in Breast Mammograms

CVPR 2019
0
citations

Cross-View Correspondence Reasoning Based on Bipartite Graph Convolutional Network for Mammogram Mass Detection

CVPR 2020
0
citations

Graph-Structured Referring Expression Reasoning in the Wild

CVPR 2020arXiv
0
citations

I3Net: Implicit Instance-Invariant Network for Adapting One-Stage Object Detectors

CVPR 2021arXiv
0
citations

Scene-Intuitive Agent for Remote Embodied Visual Grounding

CVPR 2021arXiv
0
citations

Bottom-Up Shift and Reasoning for Referring Image Segmentation

CVPR 2021
0
citations

Cross-Domain Adaptive Clustering for Semi-Supervised Domain Adaptation

CVPR 2021arXiv
0
citations

Refer-It-in-RGBD: A Bottom-Up Approach for 3D Visual Grounding in RGBD Images

CVPR 2021
0
citations

Coarse-To-Fine Domain Adaptive Semantic Segmentation With Photometric Alignment and Category-Center Regularization

CVPR 2021arXiv
0
citations

Attribute Surrogates Learning and Spectral Tokens Pooling in Transformers for Few-Shot Learning

CVPR 2022arXiv
0
citations

Scale-Equivalent Distillation for Semi-Supervised Object Detection

CVPR 2022arXiv
0
citations

Compound Domain Generalization via Meta-Knowledge Encoding

CVPR 2022arXiv
0
citations

MISC210K: A Large-Scale Dataset for Multi-Instance Semantic Correspondence

CVPR 2023
0
citations

Improved Distribution Matching for Dataset Condensation

CVPR 2023
0
citations

Harvesting Discriminative Meta Objects With Deep CNN Features for Scene Classification

ICCV 2015
0
citations

Piecewise Flat Embedding for Image Segmentation

ICCV 2015
0
citations

HD-CNN: Hierarchical Deep Convolutional Neural Networks for Large Scale Visual Recognition

ICCV 2015
0
citations

High-Resolution Shape Completion Using Deep Neural Networks for Global Structure and Local Geometry Inference

ICCV 2017arXiv
0
citations

Dynamic Graph Attention for Referring Expression Comprehension

ICCV 2019
0
citations

Motion Guided Attention for Video Salient Object Detection

ICCV 2019
0
citations

Align, Attend and Locate: Chest X-Ray Diagnosis via Contrast Induced Attention Network With Limited Supervision

ICCV 2019
0
citations

Multi-Scale Matching Networks for Semantic Correspondence

ICCV 2021arXiv
0
citations

Preservational Learning Improves Self-Supervised Medical Image Models by Reconstructing Diverse Contexts

ICCV 2021arXiv
0
citations

GraphFPN: Graph Feature Pyramid Network for Object Detection

ICCV 2021arXiv
0
citations

Dual Bipartite Graph Learning: A General Approach for Domain Adaptive Object Detection

ICCV 2021
0
citations

EGC: Image Generation and Classification via a Diffusion Energy-Based Model

ICCV 2023arXiv
0
citations

Activate and Reject: Towards Safe Domain Generalization under Category Shift

ICCV 2023
0
citations

Propagating Over Phrase Relations for One-Stage Visual Grounding

ECCV 2020
0
citations

One-Shot Medical Landmark Localization by Edge-Guided Transform and Noisy Landmark Refinement

ECCV 2022
0
citations

Neighborhood Collective Estimation for Noisy Label Identification and Correction

ECCV 2022
0
citations

Centrality and Consistency: Two-Stage Clean Samples Identification for Learning with Instance-Dependent Noisy Labels

ECCV 2022
0
citations

ME-PCN: Point Completion Conditioned on Mask Emptiness

ICCV 2021
0
citations

OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels

CVPR 2025
0
citations

Rethinking Discrete Tokens: Treating Them as Conditions for Continuous Autoregressive Image Synthesis

ICCV 2025
0
citations

FedDiv: Collaborative Noise Filtering for Federated Learning with Noisy Labels

AAAI 2024arXiv
0
citations

RegionGPT: Towards Region Understanding Vision Language Model

CVPR 2024
0
citations

Transductive Zero-Shot Learning with Visual Structure Constraint

NeurIPS 2019
0
citations

Mix and Reason: Reasoning over Semantic Topology with Data Mixing for Domain Generalization

NeurIPS 2022
0
citations

CODA: Generalizing to Open and Unseen Domains with Compaction and Disambiguation

NeurIPS 2023
0
citations