Yu Zhang

71 papers, 1,223 total citations

Papers (71)

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

ICLR 2024
554
citations

Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data

NeurIPS 2017
368
citations

SplattingAvatar: Realistic Real-Time Human Avatars with Mesh-Embedded Gaussian Splatting

CVPR 2024
165
citations

EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions

CVPR 2025
44
citations

MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders

ECCV 2024
25
citations

HiPose: Hierarchical Binary Surface Encoding and Correspondence Pruning for RGB-D 6DoF Object Pose Estimation

CVPR 2024
18
citations

TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching

AAAI 2025
16
citations

Focus on Local: Finding Reliable Discriminative Regions for Visual Place Recognition

AAAI 2025
12
citations

BHViT: Binarized Hybrid Vision Transformer

CVPR 2025
6
citations

SpatialSplat: Efficient Semantic 3D from Sparse Unposed Images

ICCV 2025
6
citations

Filling Memory Gaps: Enhancing Continual Semantic Parsing via SQL Syntax Variance-Guided LLMs Without Real Data Replay

AAAI 2025
4
citations

MoPFormer: Motion-Primitive Transformer for Wearable-Sensor Activity Recognition

NeurIPS 2025
2
citations

Object-level Correlation for Few-Shot Segmentation

ICCV 2025
2
citations

Open Your Eyes: Vision Enhances Message Passing Neural Networks in Link Prediction

ICML 2025
1
citation

Semantic Object Segmentation via Detection in Weakly Labeled Video

CVPR 2015
0
citations

3D Reconstruction in the Presence of Glasses by Acoustic and Stereo Fusion

CVPR 2015
0
citations

Exploit Bounding Box Annotations for Multi-Label Object Recognition

CVPR 2016
0
citations

What Is and What Is Not a Salient Object? Learning Salient Object Detector by Ensembling Linear Exemplar Regressors

CVPR 2017
0
citations

Causes and Corrections for Bimodal Multi-Path Scanning With Structured Light

CVPR 2019
0
citations

Structure-Preserving Stereoscopic View Synthesis With Multi-Scale Adversarial Correlation Matching

CVPR 2019
0
citations

Learning Event-Based Motion Deblurring

CVPR 2020
0
citations

Scalability in Perception for Autonomous Driving: Waymo Open Dataset

CVPR 2020
0
citations

Informative and Consistent Correspondence Mining for Cross-Domain Weakly Supervised Object Detection

CVPR 2021
0
citations

Sparse Multi-Path Corrections in Fringe Projection Profilometry

CVPR 2021
0
citations

Balanced and Hierarchical Relation Learning for One-Shot Object Detection

CVPR 2022
0
citations

AutoMine: An Unmanned Mine Dataset

CVPR 2022
0
citations

Bi-LRFusion: Bi-Directional LiDAR-Radar Fusion for 3D Dynamic Object Detection

CVPR 2023
0
citations

PEAL: Prior-Embedded Explicit Attention Learning for Low-Overlap Point Cloud Registration

CVPR 2023
0
citations

Leveraging per Image-Token Consistency for Vision-Language Pre-Training

CVPR 2023
0
citations

Range-Nullspace Video Frame Interpolation With Focalized Motion Estimation

CVPR 2023
0
citations

Supervision by Fusion: Towards Unsupervised Learning of Deep Salient Object Detector

ICCV 2017
0
citations

Multi-Class Part Parsing With Joint Boundary-Semantic Awareness

ICCV 2019
0
citations

Training Weakly Supervised Video Frame Interpolation With Events

ICCV 2021
0
citations

Personalized Image Semantic Segmentation

ICCV 2021
0
citations

E2NeRF: Event Enhanced Neural Radiance Fields from Blurry Images

ICCV 2023
0
citations

Learning Trajectory-Word Alignments for Video-Language Tasks

ICCV 2023
0
citations

Adaptive Positional Encoding for Bundle-Adjusting Neural Radiance Fields

ICCV 2023
0
citations

Multi-view Self-supervised Disentanglement for General Image Denoising

ICCV 2023
0
citations

Deep Image Clustering with Category-Style Representation

ECCV 2020
0
citations

Learning to See in the Dark with Events

ECCV 2020
0
citations

PCR-CG: Point Cloud Registration via Deep Explicit Color and Geometry

ECCV 2022
0
citations

Deep Bayesian Video Frame Interpolation

ECCV 2022
0
citations

Dense Cross-Query-and-Support Attention Weighted Mask Aggregation for Few-Shot Segmentation

ECCV 2022
0
citations

An Efficient Person Clustering Algorithm for Open Checkout-Free Groceries

ECCV 2022
0
citations

Selectivity or Invariance: Boundary-Aware Salient Object Detection

ICCV 2019
0
citations

EnvPoser: Environment-aware Realistic Human Motion Estimation from Sparse Observations with Uncertainty Modeling

CVPR 2025
0
citations

PLAN: Proactive Low-Rank Allocation for Continual Learning

ICCV 2025
0
citations

HomoMatcher: Achieving Dense Feature Matching with Semi-Dense Efficiency by Homography Estimation

AAAI 2025
0
citations

Adaptive Wavelet-Positional Encoding for High-Frequency Information Learning in Implicit Neural Representation

AAAI 2025
0
citations

Multi-Label Ranking Loss Minimization for Matrix Completion

AAAI 2025
0
citations

SDAC: A Multimodal Synthetic Dataset for Anomaly and Corner Case Detection in Autonomous Driving

AAAI 2024
0
citations

Seed-Guided Fine-Grained Entity Typing in Science and Engineering Domains

AAAI 2024
0
citations

Memory-Efficient Reversible Spiking Neural Networks

AAAI 2024
0
citations

Dynamic Inertial Poser (DynaIP): Part-Based Motion Dynamics Learning for Enhanced Human Pose Estimation with Sparse Inertial Sensors

CVPR 2024
0
citations

NC-SDF: Enhancing Indoor Scene Reconstruction Using Neural SDFs with View-Dependent Normal Compensation

CVPR 2024
0
citations

CogDPM: Diffusion Probabilistic Models via Cognitive Predictive Coding

ICML 2024
0
citations

Rethinking Guidance Information to Utilize Unlabeled Samples: A Label Encoding Perspective

ICML 2024
0
citations

MLIP: Efficient Multi-Perspective Language-Image Pretraining with Exhaustive Data Utilization

ICML 2024
0
citations

Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis

NeurIPS 2018
0
citations

Learning to Multitask

NeurIPS 2018
0
citations

Multi-Objective Meta Learning

NeurIPS 2021
0
citations

Effective Meta-Regularization by Kernelized Proximal Regularization

NeurIPS 2021
0
citations

Generating Training Data with Language Models: Towards Zero-Shot Language Understanding

NeurIPS 2022
0
citations

Dynamic Sparse Network for Time Series Classification: Learning What to “See”

NeurIPS 2022
0
citations

Fine-Grained Cross-View Geo-Localization Using a Correlation-Aware Homography Estimator

NeurIPS 2023
0
citations

Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference

NeurIPS 2023
0
citations

CluB: Cluster Meets BEV for LiDAR-Based 3D Object Detection

NeurIPS 2023
0
citations

Interpreting Unsupervised Anomaly Detection in Security via Rule Extraction

NeurIPS 2023
0
citations

MG-ViT: A Multi-Granularity Method for Compact and Efficient Vision Transformers

NeurIPS 2023
0
citations

Transfer Learning via Learning to Transfer

ICML 2018
0
citations

Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis

ICML 2018
0
citations