Feng Zheng
10
Papers
67
Total Citations
Papers (10)
LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos
CVPR 2025, arXiv
32
citations
MMAD: A Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection
ICLR 2025
16
citations
Beyond Prototypes: Semantic Anchor Regularization for Better Representation Learning
AAAI 2024, arXiv
13
citations
OptiScene: LLM-driven Indoor Scene Layout Generation via Scaled Human-aligned Data Synthesis and Multi-Stage Preference Optimization
NeurIPS 2025
4
citations
MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning
NeurIPS 2025, arXiv
1
citation
Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-driven Diffusion
ECCV 2024
1
citation
Depth-Aware Concealed Crop Detection in Dense Agricultural Scenes
CVPR 2024
0
citations
A₀: An Affordance-Aware Hierarchical Model for General Robotic Manipulation
ICCV 2025
0
citations
Block Image Compressive Sensing with Local and Global Information Interaction
AAAI 2024
0
citations
Unsupervised Continual Anomaly Detection with Contrastively-Learned Prompt
AAAI 2024
0
citations