Jose M. Alvarez

38
Papers
170
Total Citations

Papers (38)

Is Ego Status All You Need for Open-Loop End-to-End Autonomous Driving?

CVPR 2024
169
citations

MDP: Multidimensional Vision Model Pruning with Latency Constraint

CVPR 2025
1
citations

Joint Optimization of Neural Radiance Fields and Continuous Camera Motion from a Monocular Video

CVPR 2025
0
citations

PARC: A Quantitative Framework Uncovering the Symmetries within Vision Language Models

CVPR 2025
0
citations

Hydra-NeXt: Robust Closed-Loop Driving with Open-Loop Training

ICCV 2025
0
citations

BEVNeXt: Reviving Dense BEV Frameworks for 3D Object Detection

CVPR 2024
0
citations

Improving Distant 3D Object Detection Using 2D Box Supervision

CVPR 2024
0
citations

Mining Supervision for Dynamic Regions in Self-Supervised Monocular Depth Estimation

CVPR 2024
0
citations

Emotion Recognition in Context

CVPR 2017
0
citations

Cost Volume Pyramid Based Depth Inference for Multi-View Stereo

CVPR 2020 (arXiv)
0
citations

Dreaming to Distill: Data-Free Knowledge Transfer via DeepInversion

CVPR 2020 (arXiv)
0
citations

Optimal Quantization Using Scaled Codebook

CVPR 2021
0
citations

Self-Supervised Learning of Depth Inference for Multi-View Stereo

CVPR 2021 (arXiv)
0
citations

See Through Gradients: Image Batch Recovery via GradInversion

CVPR 2021 (arXiv)
0
citations

FreeSOLO: Learning To Segment Objects Without Annotations

CVPR 2022 (arXiv)
0
citations

Non-Parametric Depth Distribution Modelling Based Depth Inference for Multi-View Stereo

CVPR 2022 (arXiv)
0
citations

Not All Labels Are Equal: Rationalizing the Labeling Costs for Training Object Detection

CVPR 2022 (arXiv)
0
citations

Panoptic SegFormer: Delving Deeper Into Panoptic Segmentation With Transformers

CVPR 2022 (arXiv)
0
citations

A-ViT: Adaptive Tokens for Efficient Vision Transformer

CVPR 2022
0
citations

How Much More Data Do I Need? Estimating Requirements for Downstream Tasks

CVPR 2022
0
citations

Vision Transformers Are Good Mask Auto-Labelers

CVPR 2023 (arXiv)
0
citations

Knowledge Distillation for 6D Pose Estimation by Aligning Distributions of Local Predictions

CVPR 2023 (arXiv)
0
citations

VoxFormer: Sparse Voxel Transformer for Camera-Based 3D Semantic Scene Completion

CVPR 2023 (arXiv)
0
citations

Bringing Background Into the Foreground: Making All Classes Equal in Weakly-Supervised Video Semantic Segmentation

ICCV 2017 (arXiv)
0
citations

Domain-Adaptive Deep Network Compression

ICCV 2017 (arXiv)
0
citations

Active Learning for Deep Object Detection via Probabilistic Modeling

ICCV 2021 (arXiv)
0
citations

Fully Attentional Networks with Self-emerging Token Labeling

ICCV 2023
0
citations

FB-BEV: BEV Representation from Forward-Backward View Transformations

ICCV 2023
0
citations

Towards Viewpoint Robustness in Bird's Eye View Segmentation

ICCV 2023
0
citations

FocalFormer3D: Focusing on Hard Instance for 3D Object Detection

ICCV 2023 (arXiv)
0
citations

Parametric Depth Based Feature Representation Learning for Object Detection and Segmentation in Bird's-Eye View

ICCV 2023
0
citations

When To Prune? A Policy Towards Early Structural Pruning

CVPR 2022 (arXiv)
0
citations

OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning

CVPR 2025
0
citations

ExpandNets: Linear Over-parameterization to Train Compact Convolutional Networks

NeurIPS 2020
0
citations

Distilling Image Classifiers in Object Detectors

NeurIPS 2021
0
citations

SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers

NeurIPS 2021
0
citations

Structural Pruning via Latency-Saliency Knapsack

NeurIPS 2022
0
citations

Optimizing Data Collection for Machine Learning

NeurIPS 2022
0
citations