Long Mai

19
Papers
1
Total Citations

Papers (19)

TAB: Transformer Attention Bottlenecks enable User Intervention and Debugging in Vision-Language Models

ICCV 2025
1
citations

Progressive Growing of Video Tokenizers for Temporally Compact Latent Spaces

ICCV 2025
0
citations

REGEN: Learning Compact Video Embedding with (Re-)Generative Decoder

ICCV 2025
0
citations

Kernel Fusion for Better Image Deblurring

CVPR 2015
0
citations

Composition-Preserving Deep Photo Aesthetics Assessment

CVPR 2016
0
citations

Video Frame Interpolation via Adaptive Convolution

CVPR 2017arXiv
0
citations

Spatial-Semantic Image Search by Visual Feature Synthesis

CVPR 2017
0
citations

Strike (With) a Pose: Neural Networks Are Easily Fooled by Strange Poses of Familiar Objects

CVPR 2019
0
citations

Structure-Guided Ranking Loss for Single Image Depth Prediction

CVPR 2020
0
citations

Context-Aware Group Captioning via Self-Attention and Contrastive Features

CVPR 2020arXiv
0
citations

Active Speakers in Context

CVPR 2020arXiv
0
citations

Learning To Recover 3D Scene Shape From a Single Image

CVPR 2021arXiv
0
citations

Boosting Monocular Depth Estimation Models to High-Resolution via Content-Adaptive Multi-Resolution Merging

CVPR 2021arXiv
0
citations

Motion-Adjustable Neural Implicit Video Representation

CVPR 2022
0
citations

Video Frame Interpolation via Adaptive Separable Convolution

ICCV 2017arXiv
0
citations

MultiSeg: Semantically Meaningful, Scale-Diverse Segmentations From Minimal User Input

ICCV 2019
0
citations

An Internal Learning Approach to Video Inpainting

ICCV 2019
0
citations

GaussianVideo: Efficient Video Representation via Hierarchical Gaussian Splatting

ICCV 2025
0
citations

BlockGAN: Learning 3D Object-aware Scene Representations from Unlabelled Images

NeurIPS 2020
0
citations