Zhenyu Zhang
14
Papers
82
Total Citations
Papers (14)
Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking
AAAI 2025
38
citations
STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution
ICCV 2025
22
citations
Debiasing Multimodal Large Language Models via Noise-Aware Preference Optimization
CVPR 2025
7
citations
Anywhere: A Multi-Agent Framework for User-Guided, Reliable, and Diverse Foreground-Conditioned Image Generation
AAAI 2025
7
citations
AltNeRF: Learning Robust Neural Radiance Field via Alternating Depth-Pose Optimization
AAAI 2024
4
citations
Describe, Don’t Dictate: Semantic Image Editing with Natural Language Intent
ICCV 2025
2
citations
StrandHead: Text to Hair-Disentangled 3D Head Avatars Using Human-Centric Priors
ICCV 2025
1
citations
ReCAP: Recursive Context-Aware Reasoning and Planning for Large Language Model Agents
NeurIPS 2025
1
citations
Sparse Cocktail: Every Sparse Pattern Every Sparse Ratio All At Once
ICML 2024
0
citations
CaM: Cache Merging for Memory-efficient LLMs Inference
ICML 2024
0
citations
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
ICML 2024
0
citations
Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity
ICML 2024
0
citations
Tri-Perspective View Decomposition for Geometry-Aware Depth Completion
CVPR 2024
0
citations
Get More with LESS: Synthesizing Recurrence with KV Cache Compression for Efficient LLM Inference
ICML 2024
0
citations