Long Mai
19
Papers
1
Total Citations
Papers (19)
TAB: Transformer Attention Bottlenecks enable User Intervention and Debugging in Vision-Language Models
ICCV 2025
1
citations
Progressive Growing of Video Tokenizers for Temporally Compact Latent Spaces
ICCV 2025
0
citations
REGEN: Learning Compact Video Embedding with (Re-)Generative Decoder
ICCV 2025
0
citations
Kernel Fusion for Better Image Deblurring
CVPR 2015
0
citations
Composition-Preserving Deep Photo Aesthetics Assessment
CVPR 2016
0
citations
Video Frame Interpolation via Adaptive Convolution
CVPR 2017arXiv
0
citations
Spatial-Semantic Image Search by Visual Feature Synthesis
CVPR 2017
0
citations
Strike (With) a Pose: Neural Networks Are Easily Fooled by Strange Poses of Familiar Objects
CVPR 2019
0
citations
Structure-Guided Ranking Loss for Single Image Depth Prediction
CVPR 2020
0
citations
Context-Aware Group Captioning via Self-Attention and Contrastive Features
CVPR 2020arXiv
0
citations
Active Speakers in Context
CVPR 2020arXiv
0
citations
Learning To Recover 3D Scene Shape From a Single Image
CVPR 2021arXiv
0
citations
Boosting Monocular Depth Estimation Models to High-Resolution via Content-Adaptive Multi-Resolution Merging
CVPR 2021arXiv
0
citations
Motion-Adjustable Neural Implicit Video Representation
CVPR 2022
0
citations
Video Frame Interpolation via Adaptive Separable Convolution
ICCV 2017arXiv
0
citations
MultiSeg: Semantically Meaningful, Scale-Diverse Segmentations From Minimal User Input
ICCV 2019
0
citations
An Internal Learning Approach to Video Inpainting
ICCV 2019
0
citations
GaussianVideo: Efficient Video Representation via Hierarchical Gaussian Splatting
ICCV 2025
0
citations
BlockGAN: Learning 3D Object-aware Scene Representations from Unlabelled Images
NeurIPS 2020
0
citations