Jose M. Alvarez
38 Papers
170 Total Citations
Papers (38)
Is Ego Status All You Need for Open-Loop End-to-End Autonomous Driving?
CVPR 2024
169
citations
MDP: Multidimensional Vision Model Pruning with Latency Constraint
CVPR 2025
1
citations
Joint Optimization of Neural Radiance Fields and Continuous Camera Motion from a Monocular Video
CVPR 2025
0
citations
PARC: A Quantitative Framework Uncovering the Symmetries within Vision Language Models
CVPR 2025
0
citations
Hydra-NeXt: Robust Closed-Loop Driving with Open-Loop Training
ICCV 2025
0
citations
BEVNeXt: Reviving Dense BEV Frameworks for 3D Object Detection
CVPR 2024
0
citations
Improving Distant 3D Object Detection Using 2D Box Supervision
CVPR 2024
0
citations
Mining Supervision for Dynamic Regions in Self-Supervised Monocular Depth Estimation
CVPR 2024
0
citations
Emotion Recognition in Context
CVPR 2017
0
citations
Cost Volume Pyramid Based Depth Inference for Multi-View Stereo
CVPR 2020
0
citations
Dreaming to Distill: Data-Free Knowledge Transfer via DeepInversion
CVPR 2020
0
citations
Optimal Quantization Using Scaled Codebook
CVPR 2021
0
citations
Self-Supervised Learning of Depth Inference for Multi-View Stereo
CVPR 2021
0
citations
See Through Gradients: Image Batch Recovery via GradInversion
CVPR 2021
0
citations
FreeSOLO: Learning To Segment Objects Without Annotations
CVPR 2022
0
citations
Non-Parametric Depth Distribution Modelling Based Depth Inference for Multi-View Stereo
CVPR 2022
0
citations
Not All Labels Are Equal: Rationalizing the Labeling Costs for Training Object Detection
CVPR 2022
0
citations
Panoptic SegFormer: Delving Deeper Into Panoptic Segmentation With Transformers
CVPR 2022
0
citations
A-ViT: Adaptive Tokens for Efficient Vision Transformer
CVPR 2022
0
citations
How Much More Data Do I Need? Estimating Requirements for Downstream Tasks
CVPR 2022
0
citations
Vision Transformers Are Good Mask Auto-Labelers
CVPR 2023
0
citations
Knowledge Distillation for 6D Pose Estimation by Aligning Distributions of Local Predictions
CVPR 2023
0
citations
VoxFormer: Sparse Voxel Transformer for Camera-Based 3D Semantic Scene Completion
CVPR 2023
0
citations
Bringing Background Into the Foreground: Making All Classes Equal in Weakly-Supervised Video Semantic Segmentation
ICCV 2017
0
citations
Domain-Adaptive Deep Network Compression
ICCV 2017
0
citations
Active Learning for Deep Object Detection via Probabilistic Modeling
ICCV 2021
0
citations
Fully Attentional Networks with Self-emerging Token Labeling
ICCV 2023
0
citations
FB-BEV: BEV Representation from Forward-Backward View Transformations
ICCV 2023
0
citations
Towards Viewpoint Robustness in Bird's Eye View Segmentation
ICCV 2023
0
citations
FocalFormer3D: Focusing on Hard Instance for 3D Object Detection
ICCV 2023
0
citations
Parametric Depth Based Feature Representation Learning for Object Detection and Segmentation in Bird's-Eye View
ICCV 2023
0
citations
When To Prune? A Policy Towards Early Structural Pruning
CVPR 2022
0
citations
OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning
CVPR 2025
0
citations
ExpandNets: Linear Over-parameterization to Train Compact Convolutional Networks
NeurIPS 2020
0
citations
Distilling Image Classifiers in Object Detectors
NeurIPS 2021
0
citations
SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers
NeurIPS 2021
0
citations
Structural Pruning via Latency-Saliency Knapsack
NeurIPS 2022
0
citations
Optimizing Data Collection for Machine Learning
NeurIPS 2022
0
citations