Feng Zheng

37
Papers
116
Total Citations

Papers (37)

Enabling Deep Residual Networks for Weakly Supervised Object Detection

ECCV 2020
49
citations

LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos

CVPR 2025, arXiv
32
citations

MMAD: A Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection

ICLR 2025, arXiv
16
citations

Beyond Prototypes: Semantic Anchor Regularization for Better Representation Learning

AAAI 2024, arXiv
13
citations

OptiScene: LLM-driven Indoor Scene Layout Generation via Scaled Human-aligned Data Synthesis and Multi-Stage Preference Optimization

NeurIPS 2025, arXiv
4
citations

Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-driven Diffusion

ECCV 2024, arXiv
1
citation

MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning

NeurIPS 2025, arXiv
1
citation

A₀: An Affordance-Aware Hierarchical Model for General Robotic Manipulation

ICCV 2025
0
citations

Block Image Compressive Sensing with Local and Global Information Interaction

AAAI 2024
0
citations

Unsupervised Continual Anomaly Detection with Contrastively-Learned Prompt

AAAI 2024, arXiv
0
citations

Depth-Aware Concealed Crop Detection in Dense Agricultural Scenes

CVPR 2024
0
citations

Salience-Guided Cascaded Suppression Network for Person Re-Identification

CVPR 2020
0
citations

One-Shot Adversarial Attacks on Visual Tracking With Dual Attention

CVPR 2020
0
citations

Noise-Aware Fully Webly Supervised Object Detection

CVPR 2020
0
citations

Learning 3D Shape Feature for Texture-Insensitive Person Re-Identification

CVPR 2021
0
citations

Brain Image Synthesis With Unsupervised Multivariate Canonical CSCℓ₄Net

CVPR 2021
0
citations

Class-Aware Contrastive Semi-Supervised Learning

CVPR 2022, arXiv
0
citations

Meta Distribution Alignment for Generalizable Person Re-Identification

CVPR 2022
0
citations

Unified Multivariate Gaussian Mixture for Efficient Neural Image Compression

CVPR 2022, arXiv
0
citations

Accelerating Vision-Language Pretraining With Free Language Modeling

CVPR 2023, arXiv
0
citations

Resource-Efficient RGBD Aerial Tracking

CVPR 2023
0
citations

Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline

CVPR 2023, arXiv
0
citations

Saliency-Associated Object Tracking

ICCV 2021, arXiv
0
citations

Seminar Learning for Click-Level Weakly Supervised Semantic Segmentation

ICCV 2021, arXiv
0
citations

FREE: Feature Refinement for Generalized Zero-Shot Learning

ICCV 2021, arXiv
0
citations

DepthTrack: Unveiling the Power of RGBD Tracking

ICCV 2021
0
citations

End-to-End Dense Video Captioning With Parallel Decoding

ICCV 2021, arXiv
0
citations

Learning Cross-Modal Affinity for Referring Video Object Segmentation Targeting Limited Samples

ICCV 2023, arXiv
0
citations

Set-level Guidance Attack: Boosting Adversarial Transferability of Vision-Language Pre-training Models

ICCV 2023, arXiv
0
citations

Transferable Decoding with Visual Entities for Zero-Shot Image Captioning

ICCV 2023, arXiv
0
citations

Knowledge-Aware Prompt Tuning for Generalizable Vision-Language Models

ICCV 2023, arXiv
0
citations

S2Contact: Graph-Based Network for 3D Hand-Object Contact Estimation with Semi-Supervised Learning

ECCV 2022
0
citations

Towards Generic 3D Tracking in RGBD Videos: Benchmark and Baseline

ECCV 2022
0
citations

Generalized Brain Image Synthesis with Transferable Convolutional Sparse Coding Networks

ECCV 2022
0
citations

Multi-task Additive Models for Robust Estimation and Automatic Structure Discovery

NeurIPS 2020
0
citations

SoftPatch: Unsupervised Anomaly Detection with Noisy Data

NeurIPS 2022, arXiv
0
citations

Real3D-AD: A Dataset of Point Cloud Anomaly Detection

NeurIPS 2023, arXiv
0
citations