Jianfei Cai
49 Papers
277 Total Citations
Papers (49)
HAC: Hash-grid Assisted Context for 3D Gaussian Splatting Compression
ECCV 2024 (arXiv)
179
citations
DrVideo: Document Retrieval Based Long Video Understanding
CVPR 2025 (arXiv)
39
citations
How Far Can We Compress Instant-NGP-Based NeRF?
CVPR 2024 (arXiv)
32
citations
Diversified and Personalized Multi-rater Medical Image Segmentation
CVPR 2024 (arXiv)
16
citations
Efficient Stitchable Task Adaptation
CVPR 2024 (arXiv)
7
citations
McGrids: Monte Carlo-Driven Adaptive Grids for Iso-Surface Extraction
ECCV 2024 (arXiv)
3
citations
Differentiable Convex Polyhedra Optimization from Multi-view Images
ECCV 2024 (arXiv)
1
citations
Point-Cache: Test-time Dynamic and Hierarchical Cache for Robust and Generalizable Point Cloud Analysis
CVPR 2025 (arXiv)
0
citations
PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting
CVPR 2025
0
citations
VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Prior
ICCV 2025 (arXiv)
0
citations
Stitched ViTs are Flexible Vision Backbones
ECCV 2024 (arXiv)
0
citations
Generative Region-Language Pretraining for Open-Ended Object Detection
CVPR 2024
0
citations
Taming Stable Diffusion for Text to 360 Panorama Image Generation
CVPR 2024
0
citations
JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments
CVPR 2024 (arXiv)
0
citations
Sharpness-Aware Data Generation for Zero-shot Quantization
ICML 2024
0
citations
Exploring Bottom-Up and Top-Down Cues With Attentive Learning for Webly Supervised Object Detection
CVPR 2020 (arXiv)
0
citations
End-to-End 3D Point Cloud Instance Segmentation Without Detection
CVPR 2020
0
citations
The Spatially-Correlative Loss for Various Image Translation Tasks
CVPR 2021 (arXiv)
0
citations
RSG: A Simple but Effective Module for Learning Imbalanced Datasets
CVPR 2021 (arXiv)
0
citations
Causal Attention for Vision-Language Tasks
CVPR 2021 (arXiv)
0
citations
GMFlow: Learning Optical Flow via Global Matching
CVPR 2022 (arXiv)
0
citations
Bridging Global Context Interactions for High-Fidelity Image Completion
CVPR 2022 (arXiv)
0
citations
ProposalCLIP: Unsupervised Open-Category Object Proposal Generation via Exploiting CLIP Cues
CVPR 2022 (arXiv)
0
citations
Dynamic Focus-Aware Positional Queries for Semantic Segmentation
CVPR 2023 (arXiv)
0
citations
MARLIN: Masked Autoencoder for Facial Video Representation LearnINg
CVPR 2023 (arXiv)
0
citations
Transformer Scale Gate for Semantic Segmentation
CVPR 2023 (arXiv)
0
citations
Stitchable Neural Networks
CVPR 2023 (arXiv)
0
citations
JRDB-Pose: A Large-Scale Dataset for Multi-Person Pose Estimation and Tracking
CVPR 2023
0
citations
CSG-Stump: A Learning Friendly CSG-Like Representation for Interpretable Shape Parsing
ICCV 2021
0
citations
Domain-Invariant Disentangled Network for Generalizable Object Detection
ICCV 2021
0
citations
High-Resolution Optical Flow From 1D Attention and Correlation
ICCV 2021 (arXiv)
0
citations
Learning Meta-Class Memory for Few-Shot Semantic Segmentation
ICCV 2021 (arXiv)
0
citations
A Unified 3D Human Motion Synthesis Model via Conditional Variational Auto-Encoder
ICCV 2021
0
citations
Scalable Vision Transformers With Hierarchical Pooling
ICCV 2021 (arXiv)
0
citations
Auto-Parsing Network for Image Captioning and Visual Question Answering
ICCV 2021 (arXiv)
0
citations
ObjectSDF++: Improved Object-Compositional Neural Implicit Surfaces
ICCV 2023
0
citations
Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning
ICCV 2023 (arXiv)
0
citations
Learning Progressive Joint Propagation for Human Motion Prediction
ECCV 2020
0
citations
Finding It at Another Side: A Viewpoint-Adapted Matching Encoder for Change Captioning
ECCV 2020
0
citations
Splitting vs. Merging: Mining Object Regions with Discrepancy and Intersection Loss for Weakly Supervised Semantic Segmentation
ECCV 2020
0
citations
ExtrudeNet: Unsupervised Inverse Sketch-and-Extrude for Shape Parsing
ECCV 2022
0
citations
Sem2NeRF: Converting Single-View Semantic Masks to Neural Radiance Fields
ECCV 2022
0
citations
Object-Compositional Neural Implicit Surfaces
ECCV 2022
0
citations
Dual Adaptive Transformations for Weakly Supervised Point Cloud Segmentation
ECCV 2022
0
citations
Multimodal Transformer with Variable-Length Memory for Vision-and-Language Navigation
ECCV 2022
0
citations
Self-Supervised Relationship Probing
NeurIPS 2020
0
citations
EcoFormer: Energy-Saving Attention with Linear Complexity
NeurIPS 2022 (arXiv)
0
citations
Fast Vision Transformers with HiLo Attention
NeurIPS 2022 (arXiv)
0
citations
MoVQ: Modulating Quantized Vectors for High-Fidelity Image Generation
NeurIPS 2022 (arXiv)
0
citations