Yizhou Yu
52
Papers
67
Total Citations
Papers (52)
SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks
AAAI 2025
24
citations
SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation
CVPR 2025arXiv
22
citations
OVER-NAV: Elevating Iterative Vision-and-Language Navigation with Open-Vocabulary Detection and StructurEd Representation
CVPR 2024
15
citations
Autoregressive Sequence Modeling for 3D Medical Image Representation
AAAI 2025
3
citations
Vision Function Layer in Multimodal LLMs
NeurIPS 2025
3
citations
Visual Saliency Based on Multiscale Deep Features
CVPR 2015
0
citations
Deep Contrast Learning for Salient Object Detection
CVPR 2016
0
citations
Borrowing Treasures From the Wealthy: Deep Transfer Learning Through Selective Joint Fine-Tuning
CVPR 2017arXiv
0
citations
Instance-Level Salient Object Segmentation
CVPR 2017arXiv
0
citations
Multi-Evidence Filtering and Fusion for Multi-Label Classification, Object Detection and Semantic Segmentation Based on Weakly Supervised Learning
CVPR 2018arXiv
0
citations
Weakly Supervised Complementary Parts Models for Fine-Grained Image Classification From the Bottom Up
CVPR 2019
0
citations
Cross-Modal Relationship Inference for Grounding Referring Expressions
CVPR 2019
0
citations
Multi-Source Weak Supervision for Saliency Detection
CVPR 2019
0
citations
Cascaded Generative and Discriminative Learning for Microcalcification Detection in Breast Mammograms
CVPR 2019
0
citations
Cross-View Correspondence Reasoning Based on Bipartite Graph Convolutional Network for Mammogram Mass Detection
CVPR 2020
0
citations
Graph-Structured Referring Expression Reasoning in the Wild
CVPR 2020arXiv
0
citations
I3Net: Implicit Instance-Invariant Network for Adapting One-Stage Object Detectors
CVPR 2021arXiv
0
citations
Scene-Intuitive Agent for Remote Embodied Visual Grounding
CVPR 2021arXiv
0
citations
Bottom-Up Shift and Reasoning for Referring Image Segmentation
CVPR 2021
0
citations
Cross-Domain Adaptive Clustering for Semi-Supervised Domain Adaptation
CVPR 2021arXiv
0
citations
Refer-It-in-RGBD: A Bottom-Up Approach for 3D Visual Grounding in RGBD Images
CVPR 2021
0
citations
Coarse-To-Fine Domain Adaptive Semantic Segmentation With Photometric Alignment and Category-Center Regularization
CVPR 2021arXiv
0
citations
Attribute Surrogates Learning and Spectral Tokens Pooling in Transformers for Few-Shot Learning
CVPR 2022arXiv
0
citations
Scale-Equivalent Distillation for Semi-Supervised Object Detection
CVPR 2022arXiv
0
citations
Compound Domain Generalization via Meta-Knowledge Encoding
CVPR 2022arXiv
0
citations
MISC210K: A Large-Scale Dataset for Multi-Instance Semantic Correspondence
CVPR 2023
0
citations
Improved Distribution Matching for Dataset Condensation
CVPR 2023
0
citations
Harvesting Discriminative Meta Objects With Deep CNN Features for Scene Classification
ICCV 2015
0
citations
Piecewise Flat Embedding for Image Segmentation
ICCV 2015
0
citations
HD-CNN: Hierarchical Deep Convolutional Neural Networks for Large Scale Visual Recognition
ICCV 2015
0
citations
High-Resolution Shape Completion Using Deep Neural Networks for Global Structure and Local Geometry Inference
ICCV 2017arXiv
0
citations
Dynamic Graph Attention for Referring Expression Comprehension
ICCV 2019
0
citations
Motion Guided Attention for Video Salient Object Detection
ICCV 2019
0
citations
Align, Attend and Locate: Chest X-Ray Diagnosis via Contrast Induced Attention Network With Limited Supervision
ICCV 2019
0
citations
Multi-Scale Matching Networks for Semantic Correspondence
ICCV 2021arXiv
0
citations
Preservational Learning Improves Self-Supervised Medical Image Models by Reconstructing Diverse Contexts
ICCV 2021arXiv
0
citations
GraphFPN: Graph Feature Pyramid Network for Object Detection
ICCV 2021arXiv
0
citations
Dual Bipartite Graph Learning: A General Approach for Domain Adaptive Object Detection
ICCV 2021
0
citations
EGC: Image Generation and Classification via a Diffusion Energy-Based Model
ICCV 2023arXiv
0
citations
Activate and Reject: Towards Safe Domain Generalization under Category Shift
ICCV 2023
0
citations
Propagating Over Phrase Relations for One-Stage Visual Grounding
ECCV 2020
0
citations
One-Shot Medical Landmark Localization by Edge-Guided Transform and Noisy Landmark Refinement
ECCV 2022
0
citations
Neighborhood Collective Estimation for Noisy Label Identification and Correction
ECCV 2022
0
citations
Centrality and Consistency: Two-Stage Clean Samples Identification for Learning with Instance-Dependent Noisy Labels
ECCV 2022
0
citations
ME-PCN: Point Completion Conditioned on Mask Emptiness
ICCV 2021
0
citations
OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels
CVPR 2025
0
citations
Rethinking Discrete Tokens: Treating Them as Conditions for Continuous Autoregressive Image Synthesis
ICCV 2025
0
citations
FedDiv: Collaborative Noise Filtering for Federated Learning with Noisy Labels
AAAI 2024arXiv
0
citations
RegionGPT: Towards Region Understanding Vision Language Model
CVPR 2024
0
citations
Transductive Zero-Shot Learning with Visual Structure Constraint
NeurIPS 2019
0
citations
Mix and Reason: Reasoning over Semantic Topology with Data Mixing for Domain Generalization
NeurIPS 2022
0
citations
CODA: Generalizing to Open and Unseen Domains with Compaction and Disambiguation
NeurIPS 2023
0
citations