Yujiu Yang
39
Papers
320
Total Citations
Papers (39)
Improving Video Generation with Human Feedback
NeurIPS 2025
106
citations
CoSeR: Bridging Image and Language for Cognitive Super-Resolution
CVPR 2024
71
citations
IDOL: Instant Photorealistic 3D Human Creation from a Single Image
CVPR 2025arXiv
36
citations
Spurious Feature Diversification Improves Out-of-distribution Generalization
ICLR 2024
33
citations
Universal Segmentation at Arbitrary Granularity with Language Instruction
CVPR 2024
30
citations
Scalable Image Tokenization with Index Backpropagation Quantization
ICCV 2025
16
citations
InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models
ICCV 2025
12
citations
IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model
ICLR 2025
7
citations
CoHD: A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation
ICCV 2025
5
citations
HyperSeg: Hybrid Segmentation Assistant with Fine-grained Visual Perceiver
CVPR 2025
4
citations
Accelerating Neural Network Optimization Through an Automated Control Theory Lens
CVPR 2022
0
citations
Seeing What You Miss: Vision-Language Pre-Training With Semantic Completion Learning
CVPR 2023arXiv
0
citations
3D GAN Inversion With Facial Symmetry Prior
CVPR 2023arXiv
0
citations
GLeaD: Improving GANs With a Generator-Leading Task
CVPR 2023arXiv
0
citations
RIFormer: Keep Your Vision Backbone Effective but Removing Token Mixer
CVPR 2023
0
citations
MAP: Multimodal Uncertainty-Aware Vision-Language Pre-Training Model
CVPR 2023arXiv
0
citations
Global Knowledge Calibration for Fast Open-Vocabulary Segmentation
ICCV 2023arXiv
0
citations
Masked Autoencoders Are Stronger Knowledge Distillers
ICCV 2023
0
citations
ToonTalker: Cross-Domain Face Reenactment
ICCV 2023arXiv
0
citations
UniKD: Universal Knowledge Distillation for Mimicking Homogeneous or Heterogeneous Object Detectors
ICCV 2023
0
citations
Sparse Adversarial Attack via Perturbation Factorization
ECCV 2020
0
citations
High-Fidelity GAN Inversion with Padding Space
ECCV 2022
0
citations
StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGAN
ECCV 2022
0
citations
Learning Quality-Aware Dynamic Memory for Video Object Segmentation
ECCV 2022
0
citations
Global Spectral Filter Memory Network for Video Object Segmentation
ECCV 2022
0
citations
Learning Adaptive Warping for Real-World Rolling Shutter Correction
CVPR 2022arXiv
0
citations
DnLUT: Ultra-Efficient Color Image Denoising via Channel-Aware Lookup Tables
CVPR 2025
0
citations
ProReflow: Progressive Reflow with Decomposed Velocity
CVPR 2025
0
citations
Advancing Visual Large Language Model for Multi-granular Versatile Perception
ICCV 2025
0
citations
Rolling Shutter Correction with Intermediate Distortion Flow Estimation
CVPR 2024
0
citations
Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection
CVPR 2024
0
citations
Incremental Residual Concept Bottleneck Models
CVPR 2024
0
citations
Compressing Convolutional Neural Networks via Factorized Convolutional Filters
CVPR 2019
0
citations
TediGAN: Text-Guided Diverse Face Image Generation and Manipulation
CVPR 2021arXiv
0
citations
Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation
CVPR 2021arXiv
0
citations
Adder Attention for Vision Transformer
NeurIPS 2021
0
citations
Rethinking Alignment in Video Super-Resolution Transformers
NeurIPS 2022
0
citations
SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation
NeurIPS 2023
0
citations
Assessor360: Multi-sequence Network for Blind Omnidirectional Image Quality Assessment
NeurIPS 2023
0
citations