Yan Huang

39
Papers
47
Total Citations

Papers (39)

HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object Detection

AAAI 2025
14
citations

Zero-Shot Low-Light Image Enhancement via Latent Diffusion Models

AAAI 2025
11
citations

Free Lunch for Gait Recognition: A Novel Relation Descriptor

ECCV 2024
10
citations

Open-Vocabulary Octree-Graph for 3D Scene Understanding

ICCV 2025
6
citations

Enhanced Visual-Semantic Interaction with Tailored Prompts for Pedestrian Attribute Recognition

CVPR 2025
3
citations

Learning Fine-Grained Alignment for Aerial Vision-Dialog Navigation

AAAI 2025
2
citations

EC-Flow: Enabling Versatile Robotic Manipulation from Action-Unlabeled Videos via Embodiment-Centric Flow

ICCV 2025
1
citations

Investigating Compositional Challenges in Vision-Language Models for Visual Grounding

CVPR 2024
0
citations

Sparse Coding for Classification via Discrimination Ensemble

CVPR 2016
0
citations

Instance-Aware Image and Sentence Matching With Selective Multimodal LSTM

CVPR 2017arXiv
0
citations

See the Forest for the Trees: Joint Spatial and Temporal Recurrent Neural Networks for Video-Based Person Re-Identification

CVPR 2017
0
citations

Mask-Guided Contrastive Attention Model for Person Re-Identification

CVPR 2018
0
citations

Aligning Infinite-Dimensional Covariance Matrices in Reproducing Kernel Hilbert Spaces for Domain Adaptation

CVPR 2018
0
citations

M3: Multimodal Memory Modelling for Video Captioning

CVPR 2018
0
citations

Language-Driven Temporal Activity Localization: A Semantic Matching Reinforcement Learning Model

CVPR 2019
0
citations

Box-Driven Class-Wise Region Masking and Filling Rate Guided Loss for Weakly Supervised Semantic Segmentation

CVPR 2019
0
citations

Local Relationship Learning With Person-Specific Shape Regularization for Facial Action Unit Detection

CVPR 2019
0
citations

Rethinking the Heatmap Regression for Bottom-Up Human Pose Estimation

CVPR 2021arXiv
0
citations

Dynamic Texture Recognition via Orthogonal Tensor Dictionary Learning

ICCV 2015
0
citations

Conditional High-Order Boltzmann Machine: A Supervised Learning Model for Relation Learning

ICCV 2015
0
citations

ACMM: Aligned Cross-Modal Memory for Few-Shot Image and Sentence Matching

ICCV 2019
0
citations

SBSGAN: Suppression of Inter-Domain Background Shift for Person Re-Identification

ICCV 2019
0
citations

Clothing Status Awareness for Long-Term Person Re-Identification

ICCV 2021
0
citations

PlanarTrack: A Large-scale Challenging Benchmark for Planar Object Tracking

ICCV 2023arXiv
0
citations

Towards Part-aware Monocular 3D Human Pose Estimation: An Architecture Search Approach

ECCV 2020
0
citations

Prediction and Recovery for Adaptive Low-Resolution Person Re-Identification

ECCV 2020
0
citations

Bidirectional Recurrent Convolutional Networks for Multi-Frame Super-Resolution

NeurIPS 2015
0
citations

Learning Semantic Concepts and Order for Image and Sentence Matching

CVPR 2018arXiv
0
citations

PRVQL: Progressive Knowledge-guided Refinement for Robust Egocentric Visual Query Localization

ICCV 2025
0
citations

DATA: Domain-And-Time Alignment for High-Quality Feature Fusion in Collaborative Perception

ICCV 2025
0
citations

TDeLTA: A Light-Weight and Robust Table Detection Method Based on Learning Text Arrangement

AAAI 2024arXiv
0
citations

Selective and Orthogonal Feature Activation for Pedestrian Attribute Recognition

AAAI 2024
0
citations

Context-Guided Spatio-Temporal Video Grounding

CVPR 2024
0
citations

Attribute-Guided Pedestrian Retrieval: Bridging Person Re-ID with Internal Attribute Variability

CVPR 2024
0
citations

RetGK: Graph Kernels based on Return Probabilities of Random Walks

NeurIPS 2018
0
citations

Unfolding the Alternating Optimization for Blind Super Resolution

NeurIPS 2020
0
citations

Landmark-RxR: Solving Vision-and-Language Navigation with Fine-Grained Alignment Supervision

NeurIPS 2021
0
citations

MACK: Multimodal Aligned Conceptual Knowledge for Unpaired Image-text Matching

NeurIPS 2022
0
citations

Frequency-Enhanced Data Augmentation for Vision-and-Language Navigation

NeurIPS 2023
0
citations