Jianfei Cai

49 Papers
277 Total Citations

Papers (49)

HAC: Hash-grid Assisted Context for 3D Gaussian Splatting Compression

ECCV 2024 · arXiv
179
citations

DrVideo: Document Retrieval Based Long Video Understanding

CVPR 2025 · arXiv
39
citations

How Far Can We Compress Instant-NGP-Based NeRF?

CVPR 2024 · arXiv
32
citations

Diversified and Personalized Multi-rater Medical Image Segmentation

CVPR 2024 · arXiv
16
citations

Efficient Stitchable Task Adaptation

CVPR 2024 · arXiv
7
citations

McGrids: Monte Carlo-Driven Adaptive Grids for Iso-Surface Extraction

ECCV 2024 · arXiv
3
citations

Differentiable Convex Polyhedra Optimization from Multi-view Images

ECCV 2024 · arXiv
1
citations

Point-Cache: Test-time Dynamic and Hierarchical Cache for Robust and Generalizable Point Cloud Analysis

CVPR 2025 · arXiv
0
citations

PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting

CVPR 2025
0
citations

VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Prior

ICCV 2025 · arXiv
0
citations

Stitched ViTs are Flexible Vision Backbones

ECCV 2024 · arXiv
0
citations

Generative Region-Language Pretraining for Open-Ended Object Detection

CVPR 2024
0
citations

Taming Stable Diffusion for Text to 360 Panorama Image Generation

CVPR 2024
0
citations

JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments

CVPR 2024 · arXiv
0
citations

Sharpness-Aware Data Generation for Zero-shot Quantization

ICML 2024
0
citations

Exploring Bottom-Up and Top-Down Cues With Attentive Learning for Webly Supervised Object Detection

CVPR 2020 · arXiv
0
citations

End-to-End 3D Point Cloud Instance Segmentation Without Detection

CVPR 2020
0
citations

The Spatially-Correlative Loss for Various Image Translation Tasks

CVPR 2021 · arXiv
0
citations

RSG: A Simple but Effective Module for Learning Imbalanced Datasets

CVPR 2021 · arXiv
0
citations

Causal Attention for Vision-Language Tasks

CVPR 2021 · arXiv
0
citations

GMFlow: Learning Optical Flow via Global Matching

CVPR 2022 · arXiv
0
citations

Bridging Global Context Interactions for High-Fidelity Image Completion

CVPR 2022 · arXiv
0
citations

ProposalCLIP: Unsupervised Open-Category Object Proposal Generation via Exploiting CLIP Cues

CVPR 2022 · arXiv
0
citations

Dynamic Focus-Aware Positional Queries for Semantic Segmentation

CVPR 2023 · arXiv
0
citations

MARLIN: Masked Autoencoder for Facial Video Representation LearnINg

CVPR 2023 · arXiv
0
citations

Transformer Scale Gate for Semantic Segmentation

CVPR 2023 · arXiv
0
citations

Stitchable Neural Networks

CVPR 2023 · arXiv
0
citations

JRDB-Pose: A Large-Scale Dataset for Multi-Person Pose Estimation and Tracking

CVPR 2023
0
citations

CSG-Stump: A Learning Friendly CSG-Like Representation for Interpretable Shape Parsing

ICCV 2021
0
citations

Domain-Invariant Disentangled Network for Generalizable Object Detection

ICCV 2021
0
citations

High-Resolution Optical Flow From 1D Attention and Correlation

ICCV 2021 · arXiv
0
citations

Learning Meta-Class Memory for Few-Shot Semantic Segmentation

ICCV 2021 · arXiv
0
citations

A Unified 3D Human Motion Synthesis Model via Conditional Variational Auto-Encoder

ICCV 2021
0
citations

Scalable Vision Transformers With Hierarchical Pooling

ICCV 2021 · arXiv
0
citations

Auto-Parsing Network for Image Captioning and Visual Question Answering

ICCV 2021 · arXiv
0
citations

ObjectSDF++: Improved Object-Compositional Neural Implicit Surfaces

ICCV 2023
0
citations

Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning

ICCV 2023 · arXiv
0
citations

Learning Progressive Joint Propagation for Human Motion Prediction

ECCV 2020
0
citations

Finding It at Another Side: A Viewpoint-Adapted Matching Encoder for Change Captioning

ECCV 2020
0
citations

Splitting vs. Merging: Mining Object Regions with Discrepancy and Intersection Loss for Weakly Supervised Semantic Segmentation

ECCV 2020
0
citations

ExtrudeNet: Unsupervised Inverse Sketch-and-Extrude for Shape Parsing

ECCV 2022
0
citations

Sem2NeRF: Converting Single-View Semantic Masks to Neural Radiance Fields

ECCV 2022
0
citations

Object-Compositional Neural Implicit Surfaces

ECCV 2022
0
citations

Dual Adaptive Transformations for Weakly Supervised Point Cloud Segmentation

ECCV 2022
0
citations

Multimodal Transformer with Variable-Length Memory for Vision-and-Language Navigation

ECCV 2022
0
citations

Self-Supervised Relationship Probing

NeurIPS 2020
0
citations

EcoFormer: Energy-Saving Attention with Linear Complexity

NeurIPS 2022 · arXiv
0
citations

Fast Vision Transformers with HiLo Attention

NeurIPS 2022 · arXiv
0
citations

MoVQ: Modulating Quantized Vectors for High-Fidelity Image Generation

NeurIPS 2022 · arXiv
0
citations