Silvio Savarese
50
Papers
637
Total Citations
Papers (50)
Learning Transferrable Representations for Unsupervised Domain Adaptation
NeurIPS 2016
281
citations
ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding
CVPR 2024
192
citations
HIVE: Harnessing Human Feedback for Instructional Visual Editing
CVPR 2024
164
citations
Data-Driven 3D Voxel Patterns for Object Category Recognition
CVPR 2015
0
citations
Enriching Object Detection With 2D-3D Registration and Continuous Viewpoint Estimation
CVPR 2015
0
citations
Watch-n-Patch: Unsupervised Understanding of Actions and Relations
CVPR 2015
0
citations
DeLay: Robust Spatial Layout Estimation for Cluttered Indoor Scenes
CVPR 2016
0
citations
Social LSTM: Human Trajectory Prediction in Crowded Spaces
CVPR 2016
0
citations
3D Semantic Parsing of Large-Scale Indoor Spaces
CVPR 2016
0
citations
Deep Metric Learning via Lifted Structured Feature Embedding
CVPR 2016
0
citations
Structural-RNN: Deep Learning on Spatio-Temporal Graphs
CVPR 2016
0
citations
Feedback Networks
CVPR 2017arXiv
0
citations
Deep View Morphing
CVPR 2017arXiv
0
citations
Social Scene Understanding: End-To-End Multi-Person Action Localization and Collective Activity Recognition
CVPR 2017arXiv
0
citations
Demo2Vec: Reasoning Object Affordances From Online Videos
CVPR 2018
0
citations
Social GAN: Socially Acceptable Trajectories With Generative Adversarial Networks
CVPR 2018arXiv
0
citations
Taskonomy: Disentangling Task Transfer Learning
CVPR 2018arXiv
0
citations
Im2Pano3D: Extrapolating 360° Structure and Semantics Beyond the Field of View
CVPR 2018
0
citations
Adversarial Feature Augmentation for Unsupervised Domain Adaptation
CVPR 2018arXiv
0
citations
Deep Learning Under Privileged Information Using Heteroscedastic Dropout
CVPR 2018arXiv
0
citations
Gibson Env: Real-World Perception for Embodied Agents
CVPR 2018arXiv
0
citations
TopNet: Structural Point Cloud Decoder
CVPR 2019
0
citations
Scene Memory Transformer for Embodied Agents in Long-Horizon Tasks
CVPR 2019
0
citations
Generalized Intersection Over Union: A Metric and a Loss for Bounding Box Regression
CVPR 2019
0
citations
SoPhie: An Attentive GAN for Predicting Paths Compliant to Social and Physical Constraints
CVPR 2019
0
citations
4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks
CVPR 2019
0
citations
DenseFusion: 6D Object Pose Estimation by Iterative Dense Fusion
CVPR 2019
0
citations
Neural Task Graphs: Generalizing to Unseen Tasks From a Single Video Demonstration
CVPR 2019
0
citations
Topological Planning With Transformers for Vision-and-Language Navigation
CVPR 2021arXiv
0
citations
JRDB-Act: A Large-Scale Dataset for Spatio-Temporal Action, Social Group and Activity Detection
CVPR 2022
0
citations
ULIP: Learning a Unified Representation of Language, Images, and Point Clouds for 3D Understanding
CVPR 2023
0
citations
Procedure-Aware Pretraining for Instructional Video Understanding
CVPR 2023
0
citations
Unsupervised Semantic Parsing of Video Collections
ICCV 2015
0
citations
Action Recognition by Hierarchical Mid-Level Action Elements
ICCV 2015
0
citations
Learning to Track: Online Multi-Object Tracking by Decision Making
ICCV 2015
0
citations
Text2Data: Low-Resource Data Generation with Textual Control
AAAI 2025
0
citations
Lattice Long Short-Term Memory for Human Action Recognition
ICCV 2017arXiv
0
citations
Situational Fusion of Visual Representation for Visual Navigation
ICCV 2019
0
citations
3D Scene Graph: A Structure for Unified Semantics, 3D Space, and Camera
ICCV 2019
0
citations
TRiPOD: Human Trajectory and Pose Dynamics Forecasting in the Wild
ICCV 2021arXiv
0
citations
Generative Sparse Detection Networks for 3D Single-shot Object Detection
ECCV 2020
0
citations
Universal Correspondence Network
NeurIPS 2016arXiv
0
citations
Tracking the Untrackable: Learning to Track Multiple Cues With Long-Term Dependencies
ICCV 2017arXiv
0
citations
Unified Training of Universal Time Series Forecasting Transformers
ICML 2024
0
citations
A Coarse-to-Fine Model for 3D Pose Estimation and Sub-Category Recognition
CVPR 2015
0
citations
Generalizing to Unseen Domains via Adversarial Data Augmentation
NeurIPS 2018
0
citations
Regression Planning Networks
NeurIPS 2019
0
citations
Social-BiGAT: Multimodal Trajectory Forecasting using Bicycle-GAN and Graph Attention Networks
NeurIPS 2019
0
citations
CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning
NeurIPS 2022
0
citations
UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild
NeurIPS 2023
0
citations