Ming Tang

31
Papers
367
Total Citations

Papers (31)

AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models

AAAI 2024arXiv
240
citations

Fluctuation-Based Adaptive Structured Pruning for Large Language Models

AAAI 2024arXiv
96
citations

Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models

ECCV 2024
30
citations

MUG: Pseudo Labeling Augmented Audio-Visual Mamba Network for Audio-Visual Video Parsing

ICCV 2025
1
citations

FOCUS: Unified Vision-Language Modeling for Interactive Editing Driven by Referential Segmentation

NeurIPS 2025arXiv
0
citations

Fractional Deep Reinforcement Learning for Age-Minimal Mobile Edge Computing

AAAI 2024arXiv
0
citations

Self-Supervised Representation Learning from Arbitrary Scenarios

CVPR 2024
0
citations

High-Speed Tracking With Multi-Kernel Correlation Filters

CVPR 2018arXiv
0
citations

Semantic Alignment: Finding Semantically Consistent Ground-Truth for Facial Landmark Detection

CVPR 2019
0
citations

Part-Aware Context Network for Human Parsing

CVPR 2020
0
citations

Adaptive Class Suppression Loss for Long-Tail Object Detection

CVPR 2021arXiv
0
citations

Improving Multiple Object Tracking With Single Object Tracking

CVPR 2021
0
citations

C2AM Loss: Chasing a Better Decision Boundary for Long-Tail Object Detection

CVPR 2022
0
citations

UniVIP: A Unified Framework for Self-Supervised Visual Pre-Training

CVPR 2022arXiv
0
citations

ZBS: Zero-Shot Background Subtraction via Instance-Level Background Modeling and Foreground Selection

CVPR 2023arXiv
0
citations

Multi-Kernel Correlation Filter for Visual Tracking

ICCV 2015
0
citations

Fast-deepKCF Without Boundary Effect

ICCV 2019
0
citations

High-Performance Discriminative Tracking With Transformers

ICCV 2021
0
citations

Identity-Guided Human Semantic Parsing for Person Re-Identification

ECCV 2020
0
citations

Learning Feature Embeddings for Discriminant Model based Tracking

ECCV 2020
0
citations

Large Batch Optimization for Object Detection: Training COCO in 12 Minutes

ECCV 2020
0
citations

Adaptive Variance Based Label Distribution Learning For Facial Age Estimation

ECCV 2020
0
citations

Blended Grammar Network for Human Parsing

ECCV 2020
0
citations

Regularizing Vector Embedding in Bottom-Up Human Pose Estimation

ECCV 2022
0
citations

PASS: Part-Aware Self-Supervised Pre-training for Person Re-identification

ECCV 2022
0
citations

UniVAD: A Training-free Unified Model for Few-shot Visual Anomaly Detection

CVPR 2025
0
citations

PhysVLM: Enabling Visual Language Models to Understand Robotic Physical Reachability

CVPR 2025
0
citations

Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring

ICCV 2025
0
citations

VPR-Cloak: A First Look at Privacy Cloak Against Visual Place Recognition

ICCV 2025
0
citations

MST: Masked Self-Supervised Transformer for Visual Representation

NeurIPS 2021
0
citations

Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks

NeurIPS 2022
0
citations