Zhuo Chen
18
Papers
225
Total Citations
Papers (18)
ELLA-V: Stable Neural Codec Language Modeling with Alignment-Guided Sequence Reordering
AAAI 2025
64
citations
Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-Modal Structured Representations
AAAI 2024arXiv
49
citations
Language Model Can Listen While Speaking
AAAI 2025
47
citations
DiTAR: Diffusion Transformer Autoregressive Modeling for Speech Generation
ICML 2025
36
citations
Multiple Heads are Better than One: Mixture of Modality Knowledge Experts for Entity Representation Learning
ICLR 2025
10
citations
Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation
ICCV 2025arXiv
9
citations
3D-Aware Face Editing via Warping-Guided Latent Direction Learning
CVPR 2024
6
citations
AniSDF: Fused-Granularity Neural Surfaces with Anisotropic Encoding for High-Fidelity 3D Reconstruction
ICLR 2025
3
citations
Infer the Whole from a Glimpse of a Part: Keypoint-Based Knowledge Graph for Vehicle Re-Identification
AAAI 2025
1
citations
TENG: Time-Evolving Natural Gradient for Solving PDEs With Deep Neural Nets Toward Machine Precision
ICML 2024
0
citations
One-for-More: Continual Diffusion Model for Anomaly Detection
CVPR 2025
0
citations
Theoretical Insights in Model Inversion Robustness and Conditional Entropy Maximization for Collaborative Inference Systems
CVPR 2025
0
citations
Dataset Distillation as Data Compression: A Rate-Utility Perspective
ICCV 2025
0
citations
AMDANet: Attention-Driven Multi-Perspective Discrepancy Alignment for RGB-Infrared Image Fusion and Segmentation
ICCV 2025
0
citations
K-ON: Stacking Knowledge on the Head Layer of Large Language Model
AAAI 2025
0
citations
Tokenization, Fusion, and Augmentation: Towards Fine-grained Multi-modal Entity Representation
AAAI 2025
0
citations
UniMix: Towards Domain Adaptive and Generalizable LiDAR Semantic Segmentation in Adverse Weather
CVPR 2024
0
citations
Scaling Mesh Generation via Compressive Tokenization
CVPR 2025
0
citations