Zhuo Chen

18
Papers
225
Total Citations

Papers (18)

ELLA-V: Stable Neural Codec Language Modeling with Alignment-Guided Sequence Reordering

AAAI 2025
64
citations

Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-Modal Structured Representations

AAAI 2024, arXiv
49
citations

Language Model Can Listen While Speaking

AAAI 2025
47
citations

DiTAR: Diffusion Transformer Autoregressive Modeling for Speech Generation

ICML 2025
36
citations

Multiple Heads are Better than One: Mixture of Modality Knowledge Experts for Entity Representation Learning

ICLR 2025
10
citations

Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation

ICCV 2025, arXiv
9
citations

3D-Aware Face Editing via Warping-Guided Latent Direction Learning

CVPR 2024
6
citations

AniSDF: Fused-Granularity Neural Surfaces with Anisotropic Encoding for High-Fidelity 3D Reconstruction

ICLR 2025
3
citations

Infer the Whole from a Glimpse of a Part: Keypoint-Based Knowledge Graph for Vehicle Re-Identification

AAAI 2025
1
citation

TENG: Time-Evolving Natural Gradient for Solving PDEs With Deep Neural Nets Toward Machine Precision

ICML 2024
0
citations

One-for-More: Continual Diffusion Model for Anomaly Detection

CVPR 2025
0
citations

Theoretical Insights in Model Inversion Robustness and Conditional Entropy Maximization for Collaborative Inference Systems

CVPR 2025
0
citations

Dataset Distillation as Data Compression: A Rate-Utility Perspective

ICCV 2025
0
citations

AMDANet: Attention-Driven Multi-Perspective Discrepancy Alignment for RGB-Infrared Image Fusion and Segmentation

ICCV 2025
0
citations

K-ON: Stacking Knowledge on the Head Layer of Large Language Model

AAAI 2025
0
citations

Tokenization, Fusion, and Augmentation: Towards Fine-grained Multi-modal Entity Representation

AAAI 2025
0
citations

UniMix: Towards Domain Adaptive and Generalizable LiDAR Semantic Segmentation in Adverse Weather

CVPR 2024
0
citations

Scaling Mesh Generation via Compressive Tokenization

CVPR 2025
0
citations