Yujiu Yang

39
Papers
319
Total Citations

Papers (39)

Improving Video Generation with Human Feedback

NeurIPS 2025
106
citations

CoSeR: Bridging Image and Language for Cognitive Super-Resolution

CVPR 2024
71
citations

IDOL: Instant Photorealistic 3D Human Creation from a Single Image

CVPR 2025
35
citations

Spurious Feature Diversification Improves Out-of-distribution Generalization

ICLR 2024
33
citations

Universal Segmentation at Arbitrary Granularity with Language Instruction

CVPR 2024
30
citations

Scalable Image Tokenization with Index Backpropagation Quantization

ICCV 2025
16
citations

InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models

ICCV 2025
12
citations

IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model

ICLR 2025
7
citations

CoHD: A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation

ICCV 2025
5
citations

HyperSeg: Hybrid Segmentation Assistant with Fine-grained Visual Perceiver

CVPR 2025
4
citations

Accelerating Neural Network Optimization Through an Automated Control Theory Lens

CVPR 2022
0
citations

Seeing What You Miss: Vision-Language Pre-Training With Semantic Completion Learning

CVPR 2023arXiv
0
citations

3D GAN Inversion With Facial Symmetry Prior

CVPR 2023arXiv
0
citations

GLeaD: Improving GANs With a Generator-Leading Task

CVPR 2023arXiv
0
citations

RIFormer: Keep Your Vision Backbone Effective but Removing Token Mixer

CVPR 2023
0
citations

MAP: Multimodal Uncertainty-Aware Vision-Language Pre-Training Model

CVPR 2023arXiv
0
citations

Global Knowledge Calibration for Fast Open-Vocabulary Segmentation

ICCV 2023arXiv
0
citations

Masked Autoencoders Are Stronger Knowledge Distillers

ICCV 2023
0
citations

ToonTalker: Cross-Domain Face Reenactment

ICCV 2023arXiv
0
citations

UniKD: Universal Knowledge Distillation for Mimicking Homogeneous or Heterogeneous Object Detectors

ICCV 2023
0
citations

Sparse Adversarial Attack via Perturbation Factorization

ECCV 2020
0
citations

High-Fidelity GAN Inversion with Padding Space

ECCV 2022
0
citations

StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGAN

ECCV 2022
0
citations

Learning Quality-Aware Dynamic Memory for Video Object Segmentation

ECCV 2022
0
citations

Global Spectral Filter Memory Network for Video Object Segmentation

ECCV 2022
0
citations

Learning Adaptive Warping for Real-World Rolling Shutter Correction

CVPR 2022arXiv
0
citations

DnLUT: Ultra-Efficient Color Image Denoising via Channel-Aware Lookup Tables

CVPR 2025
0
citations

ProReflow: Progressive Reflow with Decomposed Velocity

CVPR 2025
0
citations

Advancing Visual Large Language Model for Multi-granular Versatile Perception

ICCV 2025
0
citations

Rolling Shutter Correction with Intermediate Distortion Flow Estimation

CVPR 2024
0
citations

Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection

CVPR 2024
0
citations

Incremental Residual Concept Bottleneck Models

CVPR 2024
0
citations

Compressing Convolutional Neural Networks via Factorized Convolutional Filters

CVPR 2019
0
citations

TediGAN: Text-Guided Diverse Face Image Generation and Manipulation

CVPR 2021arXiv
0
citations

Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation

CVPR 2021arXiv
0
citations

Adder Attention for Vision Transformer

NeurIPS 2021
0
citations

Rethinking Alignment in Video Super-Resolution Transformers

NeurIPS 2022
0
citations

SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation

NeurIPS 2023
0
citations

Assessor360: Multi-sequence Network for Blind Omnidirectional Image Quality Assessment

NeurIPS 2023
0
citations