Yu Zhang
71
Papers
1,223
Total Citations
Papers (71)
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
ICLR 2024
554
citations
Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data
NeurIPS 2017arXiv
368
citations
SplattingAvatar: Realistic Real-Time Human Avatars with Mesh-Embedded Gaussian Splatting
CVPR 2024
165
citations
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
CVPR 2025
44
citations
MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders
ECCV 2024
25
citations
HiPose: Hierarchical Binary Surface Encoding and Correspondence Pruning for RGB-D 6DoF Object Pose Estimation
CVPR 2024
18
citations
TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching
AAAI 2025
16
citations
Focus on Local: Finding Reliable Discriminative Regions for Visual Place Recognition
AAAI 2025
12
citations
BHViT: Binarized Hybrid Vision Transformer
CVPR 2025
6
citations
SpatialSplat: Efficient Semantic 3D from Sparse Unposed Images
ICCV 2025
6
citations
Filling Memory Gaps: Enhancing Continual Semantic Parsing via SQL Syntax Variance-Guided LLMs Without Real Data Replay
AAAI 2025
4
citations
MoPFormer: Motion-Primitive Transformer for Wearable-Sensor Activity Recognition
NeurIPS 2025
2
citations
Object-level Correlation for Few-Shot Segmentation
ICCV 2025arXiv
2
citations
Open Your Eyes: Vision Enhances Message Passing Neural Networks in Link Prediction
ICML 2025
1
citations
Semantic Object Segmentation via Detection in Weakly Labeled Video
CVPR 2015
0
citations
3D Reconstruction in the Presence of Glasses by Acoustic and Stereo Fusion
CVPR 2015
0
citations
Exploit Bounding Box Annotations for Multi-Label Object Recognition
CVPR 2016
0
citations
What Is and What Is Not a Salient Object? Learning Salient Object Detector by Ensembling Linear Exemplar Regressors
CVPR 2017
0
citations
Causes and Corrections for Bimodal Multi-Path Scanning With Structured Light
CVPR 2019
0
citations
Structure-Preserving Stereoscopic View Synthesis With Multi-Scale Adversarial Correlation Matching
CVPR 2019
0
citations
Learning Event-Based Motion Deblurring
CVPR 2020arXiv
0
citations
Scalability in Perception for Autonomous Driving: Waymo Open Dataset
CVPR 2020arXiv
0
citations
Informative and Consistent Correspondence Mining for Cross-Domain Weakly Supervised Object Detection
CVPR 2021
0
citations
Sparse Multi-Path Corrections in Fringe Projection Profilometry
CVPR 2021
0
citations
Balanced and Hierarchical Relation Learning for One-Shot Object Detection
CVPR 2022
0
citations
AutoMine: An Unmanned Mine Dataset
CVPR 2022
0
citations
Bi-LRFusion: Bi-Directional LiDAR-Radar Fusion for 3D Dynamic Object Detection
CVPR 2023
0
citations
PEAL: Prior-Embedded Explicit Attention Learning for Low-Overlap Point Cloud Registration
CVPR 2023
0
citations
Leveraging per Image-Token Consistency for Vision-Language Pre-Training
CVPR 2023arXiv
0
citations
Range-Nullspace Video Frame Interpolation With Focalized Motion Estimation
CVPR 2023
0
citations
Supervision by Fusion: Towards Unsupervised Learning of Deep Salient Object Detector
ICCV 2017
0
citations
Multi-Class Part Parsing With Joint Boundary-Semantic Awareness
ICCV 2019
0
citations
Training Weakly Supervised Video Frame Interpolation With Events
ICCV 2021
0
citations
Personalized Image Semantic Segmentation
ICCV 2021arXiv
0
citations
E2NeRF: Event Enhanced Neural Radiance Fields from Blurry Images
ICCV 2023
0
citations
Learning Trajectory-Word Alignments for Video-Language Tasks
ICCV 2023arXiv
0
citations
Adaptive Positional Encoding for Bundle-Adjusting Neural Radiance Fields
ICCV 2023
0
citations
Multi-view Self-supervised Disentanglement for General Image Denoising
ICCV 2023arXiv
0
citations
Deep Image Clustering with Category-Style Representation
ECCV 2020
0
citations
Learning to See in the Dark with Events
ECCV 2020
0
citations
PCR-CG: Point Cloud Registration via Deep Explicit Color and Geometry
ECCV 2022
0
citations
Deep Bayesian Video Frame Interpolation
ECCV 2022
0
citations
Dense Cross-Query-and-Support Attention Weighted Mask Aggregation for Few-Shot Segmentation
ECCV 2022
0
citations
An Efficient Person Clustering Algorithm for Open Checkout-Free Groceries
ECCV 2022
0
citations
Selectivity or Invariance: Boundary-Aware Salient Object Detection
ICCV 2019
0
citations
EnvPoser: Environment-aware Realistic Human Motion Estimation from Sparse Observations with Uncertainty Modeling
CVPR 2025
0
citations
PLAN: Proactive Low-Rank Allocation for Continual Learning
ICCV 2025
0
citations
HomoMatcher: Achieving Dense Feature Matching with Semi-Dense Efficiency by Homography Estimation
AAAI 2025
0
citations
Adaptive Wavelet-Positional Encoding for High-Frequency Information Learning in Implicit Neural Representation
AAAI 2025
0
citations
Multi-Label Ranking Loss Minimization for Matrix Completion
AAAI 2025
0
citations
SDAC: A Multimodal Synthetic Dataset for Anomaly and Corner Case Detection in Autonomous Driving
AAAI 2024
0
citations
Seed-Guided Fine-Grained Entity Typing in Science and Engineering Domains
AAAI 2024arXiv
0
citations
Memory-Efficient Reversible Spiking Neural Networks
AAAI 2024
0
citations
Dynamic Inertial Poser (DynaIP): Part-Based Motion Dynamics Learning for Enhanced Human Pose Estimation with Sparse Inertial Sensors
CVPR 2024
0
citations
NC-SDF: Enhancing Indoor Scene Reconstruction Using Neural SDFs with View-Dependent Normal Compensation
CVPR 2024
0
citations
CogDPM: Diffusion Probabilistic Models via Cognitive Predictive Coding
ICML 2024
0
citations
Rethinking Guidance Information to Utilize Unlabeled Samples: A Label Encoding Perspective
ICML 2024
0
citations
MLIP: Efficient Multi-Perspective Language-Image Pretraining with Exhaustive Data Utilization
ICML 2024
0
citations
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
NeurIPS 2018
0
citations
Learning to Multitask
NeurIPS 2018
0
citations
Multi-Objective Meta Learning
NeurIPS 2021
0
citations
Effective Meta-Regularization by Kernelized Proximal Regularization
NeurIPS 2021
0
citations
Generating Training Data with Language Models: Towards Zero-Shot Language Understanding
NeurIPS 2022
0
citations
Dynamic Sparse Network for Time Series Classification: Learning What to “See”
NeurIPS 2022
0
citations
Fine-Grained Cross-View Geo-Localization Using a Correlation-Aware Homography Estimator
NeurIPS 2023
0
citations
Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference
NeurIPS 2023
0
citations
CluB: Cluster Meets BEV for LiDAR-Based 3D Object Detection
NeurIPS 2023
0
citations
Interpreting Unsupervised Anomaly Detection in Security via Rule Extraction
NeurIPS 2023
0
citations
MG-ViT: A Multi-Granularity Method for Compact and Efficient Vision Transformers
NeurIPS 2023
0
citations
Transfer Learning via Learning to Transfer
ICML 2018
0
citations
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
ICML 2018
0
citations