Tao Wang

38
Papers
73
Total Citations

Papers (38)

Zero-Shot Aerial Object Detection with Visual Description Regularization

AAAI 2024arXiv
18
citations

FRIH: Fine-Grained Region-Aware Image Harmonization

AAAI 2024arXiv
16
citations

VSFormer: Visual-Spatial Fusion Transformer for Correspondence Pruning

AAAI 2024arXiv
15
citations

HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model

ICCV 2025
9
citations

Foundations of Top-$k$ Decoding for Language Models

NeurIPS 2025arXiv
8
citations

A Hubness Perspective on Representation Learning for Graph-Based Multi-View Clustering

CVPR 2025
3
citations

StickMotion: Generating 3D Human Motions by Drawing a Stickman

CVPR 2025
3
citations

Trend-Aware Supervision: On Learning Invariance for Semi-supervised Facial Action Unit Intensity Estimation

AAAI 2024arXiv
1
citations

Distilling Object Detectors With Fine-Grained Feature Imitation

CVPR 2019
0
citations

Few-Shot Adaptive Faster R-CNN

CVPR 2019
0
citations

Central Similarity Quantization for Efficient Image and Video Retrieval

CVPR 2020arXiv
0
citations

Revisiting Knowledge Distillation via Label Smoothing Regularization

CVPR 2020
0
citations

Overcoming Classifier Imbalance for Long-Tail Object Detection With Balanced Group Softmax

CVPR 2020arXiv
0
citations

Ultra-High-Definition Image Dehazing via Multi-Guided Bilateral Learning

CVPR 2021
0
citations

PoseTriplet: Co-Evolving 3D Human Pose Estimation, Imitation, and Hallucination Under Self-Supervision

CVPR 2022arXiv
0
citations

Learning To Detect and Segment for Open Vocabulary Object Detection

CVPR 2023arXiv
0
citations

Deformable Surface Tracking by Graph Matching

ICCV 2019
0
citations

PnP-DETR: Towards Efficient Visual Analysis With Transformers

ICCV 2021
0
citations

End-to-End Video Instance Segmentation via Spatial-Temporal Graph Neural Networks

ICCV 2021
0
citations

Tokens-to-Token ViT: Training Vision Transformers From Scratch on ImageNet

ICCV 2021arXiv
0
citations

Real-Time Image Enhancer via Learnable Spatial-Aware 3D Lookup Tables

ICCV 2021arXiv
0
citations

Multi-Scale Separable Network for Ultra-High-Definition Video Deblurring

ICCV 2021
0
citations

Ultra-High-Definition Image HDR Reconstruction via Collaborative Bilateral Learning

ICCV 2021
0
citations

The Devil is in Classification: A Simple Framework for Long-tail Instance Segmentation

ECCV 2020
0
citations

On Mitigating Hard Clusters for Face Clustering

ECCV 2022
0
citations

BézierPalm: A Free Lunch for Palmprint Recognition

ECCV 2022
0
citations

Towards Real-World HDRTV Reconstruction: A Data Synthesis-Based Approach

ECCV 2022
0
citations

Learning Combinatorial Solver for Graph Matching

CVPR 2020
0
citations

MOERL: When Mixture-of-Experts Meet Reinforcement Learning for Adverse Weather Image Restoration

ICCV 2025
0
citations

MaterialMVP: Illumination-Invariant Material Generation via Multi-view PBR Diffusion

ICCV 2025
0
citations

SALS: Sparse Attention in Latent Space for KV Cache Compression

NeurIPS 2025
0
citations

Rethinking the Representation in Federated Unsupervised Learning with Non-IID Data

CVPR 2024
0
citations

SynSP: Synergy of Smoothness and Precision in Pose Sequences Refinement

CVPR 2024
0
citations

Mollification Effects of Policy Gradient Methods

ICML 2024
0
citations

Controlled Decoding from Language Models

ICML 2024
0
citations

Rethinking Image Restoration for Object Detection

NeurIPS 2022
0
citations

Fractal Landscapes in Policy Optimization

NeurIPS 2023
0
citations

Punctuation-level Attack: Single-shot and Single Punctuation Can Fool Text Models

NeurIPS 2023
0
citations