Sifei Liu
47
Papers
374
Total Citations
Papers (47)
Learning Affinity via Spatial Propagation Networks
NeurIPS 2017, arXiv
300
citations
Describe Anything: Detailed Localized Image and Video Captioning
ICCV 2025
49
citations
BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations
CVPR 2025
11
citations
Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks
CVPR 2025, arXiv
9
citations
Parallel Sequence Modeling via Generalized Spatial Propagation Network
CVPR 2025, arXiv
3
citations
3D-Spatial MultiModal Memory
ICLR 2025
2
citations
A Unified Approach for Text- and Image-guided 4D Scene Generation
CVPR 2024
0
citations
HOIDiffusion: Generating Realistic 3D Hand-Object Interaction Data
CVPR 2024
0
citations
Communication-Efficient Collaborative Perception via Information Filling with Codebook
CVPR 2024
0
citations
RGBD Objects in the Wild: Scaling Real-World 3D Object Learning from RGB-D Videos
CVPR 2024
0
citations
Compositional Text-to-Image Generation with Dense Blob Representations
ICML 2024
0
citations
Multi-Objective Convolutional Learning for Face Labeling
CVPR 2015
0
citations
Generative Face Completion
CVPR 2017, arXiv
0
citations
Learning Dual Convolutional Neural Networks for Low-Level Vision
CVPR 2018, arXiv
0
citations
SCOPS: Self-Supervised Co-Part Segmentation
CVPR 2019
0
citations
Learning Linear Transformations for Fast Image and Video Style Transfer
CVPR 2019
0
citations
Putting Humans in a Scene: Learning Affordance in 3D Indoor Environments
CVPR 2019
0
citations
Self-Supervised Viewpoint Learning From Image Collections
CVPR 2020, arXiv
0
citations
Semi-Supervised 3D Hand-Object Poses Estimation With Interactions in Time
CVPR 2021, arXiv
0
citations
Synthesizing Long-Term 3D Human Motion and Interaction in 3D Scenes
CVPR 2021, arXiv
0
citations
Learning to Track Instances without Video Annotations
CVPR 2021, arXiv
0
citations
Learning Continuous Image Representation With Local Implicit Image Function
CVPR 2021, arXiv
0
citations
CoordGAN: Self-Supervised Dense Correspondences Emerge From GANs
CVPR 2022, arXiv
0
citations
GroupViT: Semantic Segmentation Emerges From Text Supervision
CVPR 2022, arXiv
0
citations
Zero-Shot Pose Transfer for Unrigged Stylized 3D Characters
CVPR 2023, arXiv
0
citations
Self-Supervised Super-Plane for Neural 3D Reconstruction
CVPR 2023
0
citations
Affordance Diffusion: Synthesizing Hand-Object Interactions
CVPR 2023, arXiv
0
citations
Unsupervised Domain Adaptation for Face Recognition in Unlabeled Videos
ICCV 2017, arXiv
0
citations
Learning Propagation for Arbitrarily-Structured Data
ICCV 2019
0
citations
Video Autoencoder: Self-Supervised Disentanglement of Static 3D Structure and Motion
ICCV 2021, arXiv
0
citations
Self-Supervised Object Detection via Generative Image Synthesis
ICCV 2021, arXiv
0
citations
Video Matting via Consistency-Regularized Graph Neural Networks
ICCV 2021
0
citations
Self-supervised Single-view 3D Reconstruction via Semantic Consistency
ECCV 2020
0
citations
Autoregressive 3D Shape Generation via Canonical Mapping
ECCV 2022
0
citations
Scraping Textures from Natural Images for Synthesis and Editing
ECCV 2022
0
citations
Open-Vocabulary Panoptic Segmentation With Text-to-Image Diffusion Models
CVPR 2023, arXiv
0
citations
Scaling Vision Pre-Training to 4K Resolution
CVPR 2025
0
citations
NVILA: Efficient Frontier Visual Language Models
CVPR 2025
0
citations
Token-Efficient VLM: High-Resolution Image Understanding via Dynamic Region Proposal
ICCV 2025
0
citations
COLMAP-Free 3D Gaussian Splatting
CVPR 2024
0
citations
RegionGPT: Towards Region Understanding Vision Language Model
CVPR 2024
0
citations
Context-aware Synthesis and Placement of Object Instances
NeurIPS 2018
0
citations
Joint-task Self-supervised Learning for Temporal Correspondence
NeurIPS 2019
0
citations
Online Adaptation for Consistent Mesh Reconstruction in the Wild
NeurIPS 2020
0
citations
Coupled Segmentation and Edge Learning via Dynamic Graph Propagation
NeurIPS 2021
0
citations
Learning 3D Dense Correspondence via Canonical Point Autoencoder
NeurIPS 2021
0
citations
Generalizable One-shot 3D Neural Head Avatar
NeurIPS 2023
0
citations