Feng Zheng

10

Papers

67

Total Citations

Papers (10)

LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos

MMAD: A Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection

Beyond Prototypes: Semantic Anchor Regularization for Better Representation Learning

OptiScene: LLM-driven Indoor Scene Layout Generation via Scaled Human-aligned Data Synthesis and Multi-Stage Preference Optimization

MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning

NeurIPS 2025arXiv

Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-driven Diffusion

Depth-Aware Concealed Crop Detection in Dense Agricultural Scenes

A₀ : An Affordance-Aware Hierarchical Model for General Robotic Manipulation

Block Image Compressive Sensing with Local and Global Information Interaction

Unsupervised Continual Anomaly Detection with Contrastively-Learned Prompt