Sifei Liu

47 Papers
374 Total Citations

Papers (47)

Learning Affinity via Spatial Propagation Networks

NeurIPS 2017, arXiv
300
citations

Describe Anything: Detailed Localized Image and Video Captioning

ICCV 2025
49
citations

BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations

CVPR 2025
11
citations

Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks

CVPR 2025, arXiv
9
citations

Parallel Sequence Modeling via Generalized Spatial Propagation Network

CVPR 2025, arXiv
3
citations

3D-Spatial Multimodal Memory

ICLR 2025
2
citations

A Unified Approach for Text- and Image-guided 4D Scene Generation

CVPR 2024
0
citations

HOIDiffusion: Generating Realistic 3D Hand-Object Interaction Data

CVPR 2024
0
citations

Communication-Efficient Collaborative Perception via Information Filling with Codebook

CVPR 2024
0
citations

RGBD Objects in the Wild: Scaling Real-World 3D Object Learning from RGB-D Videos

CVPR 2024
0
citations

Compositional Text-to-Image Generation with Dense Blob Representations

ICML 2024
0
citations

Multi-Objective Convolutional Learning for Face Labeling

CVPR 2015
0
citations

Generative Face Completion

CVPR 2017, arXiv
0
citations

Learning Dual Convolutional Neural Networks for Low-Level Vision

CVPR 2018, arXiv
0
citations

SCOPS: Self-Supervised Co-Part Segmentation

CVPR 2019
0
citations

Learning Linear Transformations for Fast Image and Video Style Transfer

CVPR 2019
0
citations

Putting Humans in a Scene: Learning Affordance in 3D Indoor Environments

CVPR 2019
0
citations

Self-Supervised Viewpoint Learning From Image Collections

CVPR 2020, arXiv
0
citations

Semi-Supervised 3D Hand-Object Poses Estimation With Interactions in Time

CVPR 2021, arXiv
0
citations

Synthesizing Long-Term 3D Human Motion and Interaction in 3D Scenes

CVPR 2021, arXiv
0
citations

Learning to Track Instances without Video Annotations

CVPR 2021, arXiv
0
citations

Learning Continuous Image Representation With Local Implicit Image Function

CVPR 2021, arXiv
0
citations

CoordGAN: Self-Supervised Dense Correspondences Emerge From GANs

CVPR 2022, arXiv
0
citations

GroupViT: Semantic Segmentation Emerges From Text Supervision

CVPR 2022, arXiv
0
citations

Zero-Shot Pose Transfer for Unrigged Stylized 3D Characters

CVPR 2023, arXiv
0
citations

Self-Supervised Super-Plane for Neural 3D Reconstruction

CVPR 2023
0
citations

Affordance Diffusion: Synthesizing Hand-Object Interactions

CVPR 2023, arXiv
0
citations

Unsupervised Domain Adaptation for Face Recognition in Unlabeled Videos

ICCV 2017, arXiv
0
citations

Learning Propagation for Arbitrarily-Structured Data

ICCV 2019
0
citations

Video Autoencoder: Self-Supervised Disentanglement of Static 3D Structure and Motion

ICCV 2021, arXiv
0
citations

Self-Supervised Object Detection via Generative Image Synthesis

ICCV 2021, arXiv
0
citations

Video Matting via Consistency-Regularized Graph Neural Networks

ICCV 2021
0
citations

Self-supervised Single-view 3D Reconstruction via Semantic Consistency

ECCV 2020
0
citations

Autoregressive 3D Shape Generation via Canonical Mapping

ECCV 2022
0
citations

Scraping Textures from Natural Images for Synthesis and Editing

ECCV 2022
0
citations

Open-Vocabulary Panoptic Segmentation With Text-to-Image Diffusion Models

CVPR 2023, arXiv
0
citations

Scaling Vision Pre-Training to 4K Resolution

CVPR 2025
0
citations

NVILA: Efficient Frontier Visual Language Models

CVPR 2025
0
citations

Token-Efficient VLM: High-Resolution Image Understanding via Dynamic Region Proposal

ICCV 2025
0
citations

COLMAP-Free 3D Gaussian Splatting

CVPR 2024
0
citations

RegionGPT: Towards Region Understanding Vision Language Model

CVPR 2024
0
citations

Context-aware Synthesis and Placement of Object Instances

NeurIPS 2018
0
citations

Joint-task Self-supervised Learning for Temporal Correspondence

NeurIPS 2019
0
citations

Online Adaptation for Consistent Mesh Reconstruction in the Wild

NeurIPS 2020
0
citations

Coupled Segmentation and Edge Learning via Dynamic Graph Propagation

NeurIPS 2021
0
citations

Learning 3D Dense Correspondence via Canonical Point Autoencoder

NeurIPS 2021
0
citations

Generalizable One-shot 3D Neural Head Avatar

NeurIPS 2023
0
citations