Jianfei Cai

67
Papers
532
Total Citations

Papers (67)

Learning Progressive Joint Propagation for Human Motion Prediction

ECCV 2020
187
citations

HAC: Hash-grid Assisted Context for 3D Gaussian Splatting Compression

ECCV 2024
179
citations

Finding It at Another Side: A Viewpoint-Adapted Matching Encoder for Change Captioning

ECCV 2020
64
citations

DrVideo: Document Retrieval Based Long Video Understanding

CVPR 2025
39
citations

How Far Can We Compress Instant-NGP-Based NeRF?

CVPR 2024
32
citations

Diversified and Personalized Multi-rater Medical Image Segmentation

CVPR 2024
16
citations

Efficient Stitchable Task Adaptation

CVPR 2024
7
citations

Stitched ViTs are Flexible Vision Backbones

ECCV 2024arXiv
4
citations

McGrids: Monte Carlo-Driven Adaptive Grids for Iso-Surface Extraction

ECCV 2024
3
citations

Differentiable Convex Polyhedra Optimization from Multi-view Images

ECCV 2024
1
citations

A Generative Model for Depth-Based Robust 3D Facial Pose Tracking

CVPR 2017
0
citations

Object Co-Skeletonization With Co-Segmentation

CVPR 2017
0
citations

Look, Imagine and Match: Improving Textual-Visual Cross-Modal Retrieval With Generative Models

CVPR 2018arXiv
0
citations

Alive Caricature From 2D to 3D

CVPR 2018arXiv
0
citations

Pluralistic Image Completion

CVPR 2019
0
citations

Scene Graph Generation With External Knowledge and Image Reconstruction

CVPR 2019
0
citations

Auto-Encoding Scene Graphs for Image Captioning

CVPR 2019
0
citations

3D Hand Shape and Pose Estimation From a Single RGB Image

CVPR 2019
0
citations

Exploring Bottom-Up and Top-Down Cues With Attentive Learning for Webly Supervised Object Detection

CVPR 2020arXiv
0
citations

End-to-End 3D Point Cloud Instance Segmentation Without Detection

CVPR 2020
0
citations

The Spatially-Correlative Loss for Various Image Translation Tasks

CVPR 2021arXiv
0
citations

RSG: A Simple but Effective Module for Learning Imbalanced Datasets

CVPR 2021arXiv
0
citations

Causal Attention for Vision-Language Tasks

CVPR 2021arXiv
0
citations

GMFlow: Learning Optical Flow via Global Matching

CVPR 2022arXiv
0
citations

Bridging Global Context Interactions for High-Fidelity Image Completion

CVPR 2022arXiv
0
citations

ProposalCLIP: Unsupervised Open-Category Object Proposal Generation via Exploiting CLIP Cues

CVPR 2022arXiv
0
citations

Dynamic Focus-Aware Positional Queries for Semantic Segmentation

CVPR 2023arXiv
0
citations

MARLIN: Masked Autoencoder for Facial Video Representation LearnINg

CVPR 2023arXiv
0
citations

Transformer Scale Gate for Semantic Segmentation

CVPR 2023arXiv
0
citations

Stitchable Neural Networks

CVPR 2023arXiv
0
citations

JRDB-Pose: A Large-Scale Dataset for Multi-Person Pose Estimation and Tracking

CVPR 2023
0
citations

MMSS: Multi-Modal Sharable and Specific Feature Learning for RGB-D Object Recognition

ICCV 2015
0
citations

An Empirical Study of Language CNN for Image Captioning

ICCV 2017arXiv
0
citations

Exploiting Spatial-Temporal Relationships for 3D Pose Estimation via Graph Convolutional Networks

ICCV 2019
0
citations

Learning to Collocate Neural Modules for Image Captioning

ICCV 2019
0
citations

Skeleton-Aware 3D Human Shape Reconstruction From Point Clouds

ICCV 2019
0
citations

Unpaired Image Captioning via Scene Graph Alignments

ICCV 2019
0
citations

CSG-Stump: A Learning Friendly CSG-Like Representation for Interpretable Shape Parsing

ICCV 2021
0
citations

Domain-Invariant Disentangled Network for Generalizable Object Detection

ICCV 2021
0
citations

High-Resolution Optical Flow From 1D Attention and Correlation

ICCV 2021arXiv
0
citations

Learning Meta-Class Memory for Few-Shot Semantic Segmentation

ICCV 2021arXiv
0
citations

A Unified 3D Human Motion Synthesis Model via Conditional Variational Auto-Encoder

ICCV 2021
0
citations

Point-Cache: Test-time Dynamic and Hierarchical Cache for Robust and Generalizable Point Cloud Analysis

CVPR 2025
0
citations

Auto-Parsing Network for Image Captioning and Visual Question Answering

ICCV 2021arXiv
0
citations

ObjectSDF++: Improved Object-Compositional Neural Implicit Surfaces

ICCV 2023
0
citations

Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning

ICCV 2023arXiv
0
citations

Splitting vs. Merging: Mining Object Regions with Discrepancy and Intersection Loss for Weakly Supervised Semantic Segmentation

ECCV 2020
0
citations

ExtrudeNet: Unsupervised Inverse Sketch-and-Extrude for Shape Parsing

ECCV 2022
0
citations

Sem2NeRF: Converting Single-View Semantic Masks to Neural Radiance Fields

ECCV 2022
0
citations

Object-Compositional Neural Implicit Surfaces

ECCV 2022
0
citations

Dual Adaptive Transformations for Weakly Supervised Point Cloud Segmentation

ECCV 2022
0
citations

Multimodal Transformer with Variable-Length Memory for Vision-and-Language Navigation

ECCV 2022
0
citations

Scalable Vision Transformers With Hierarchical Pooling

ICCV 2021arXiv
0
citations

PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting

CVPR 2025
0
citations

VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Prior

ICCV 2025
0
citations

Generative Region-Language Pretraining for Open-Ended Object Detection

CVPR 2024
0
citations

Taming Stable Diffusion for Text to 360 Panorama Image Generation

CVPR 2024
0
citations

JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments

CVPR 2024
0
citations

Sharpness-Aware Data Generation for Zero-shot Quantization

ICML 2024
0
citations

Exploit Bounding Box Annotations for Multi-Label Object Recognition

CVPR 2016
0
citations

Modality and Component Aware Feature Fusion For RGB-D Scene Classification

CVPR 2016
0
citations

MIML-FCN+: Multi-Instance Multi-Label Learning via Fully Convolutional Networks With Privileged Information

CVPR 2017
0
citations

Self-Supervised Relationship Probing

NeurIPS 2020
0
citations

EcoFormer: Energy-Saving Attention with Linear Complexity

NeurIPS 2022
0
citations

Fast Vision Transformers with HiLo Attention

NeurIPS 2022
0
citations

MoVQ: Modulating Quantized Vectors for High-Fidelity Image Generation

NeurIPS 2022
0
citations

Generalized Robust Bayesian Committee Machine for Large-scale Gaussian Process Regression

ICML 2018
0
citations