Alexander G. Schwing
Papers: 47
Total Citations: 322
Papers (47)
Putting the Object Back into Video Object Segmentation
CVPR 2024
182
citations
MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds
CVPR 2025
80
citations
GoMAvatar: Efficient Animatable Human Modeling from Monocular Video Using Gaussians-on-Mesh
CVPR 2024
53
citations
LIFe-GoM: Generalizable Human Rendering with Learned Iterative Feedback Over Multi-Resolution Gaussians-on-Mesh
ICLR 2025
7
citations
Learning to Segment Under Various Forms of Weak Supervision
CVPR 2015
0
citations
Efficient Deep Learning for Stereo Matching
CVPR 2016
0
citations
Semantic Image Inpainting With Deep Generative Models
CVPR 2017 (arXiv)
0
citations
Creativity: Generating Diverse Questions Using Variational Autoencoders
CVPR 2017 (arXiv)
0
citations
Generative Modeling Using the Sliced Wasserstein Distance
CVPR 2018 (arXiv)
0
citations
Convolutional Image Captioning
CVPR 2018 (arXiv)
0
citations
Two Can Play This Game: Visual Dialog With Discriminative Question Generation and Answering
CVPR 2018 (arXiv)
0
citations
Unsupervised Textual Grounding: Linking Words to Image Concepts
CVPR 2018 (arXiv)
0
citations
Factor Graph Attention
CVPR 2019
0
citations
SAIL-VOS: Semantic Amodal Instance Level Video Object Segmentation - A Synthetic Dataset and Baselines
CVPR 2019
0
citations
Diverse Generation for Multi-Agent Sports Games
CVPR 2019
0
citations
Two Body Problem: Collaborative Visual Task Completion
CVPR 2019
0
citations
Max-Sliced Wasserstein Distance and Its Use for GANs
CVPR 2019
0
citations
Fast, Diverse and Accurate Image Captioning Guided by Part-Of-Speech
CVPR 2019
0
citations
A Simple Baseline for Audio-Visual Scene-Aware Dialog
CVPR 2019
0
citations
Agriculture-Vision: A Large Aerial Image Database for Agricultural Pattern Analysis
CVPR 2020
0
citations
Dynamic Neural Relational Inference
CVPR 2020
0
citations
Can We Learn Heuristics for Graphical Model Inference Using Reinforcement Learning?
CVPR 2020 (arXiv)
0
citations
Instance-Aware, Context-Focused, and Memory-Efficient Weakly Supervised Object Detection
CVPR 2020 (arXiv)
0
citations
Panoptic Segmentation Forecasting
CVPR 2021 (arXiv)
0
citations
SAIL-VOS 3D: A Synthetic Dataset and Baselines for Object Detection and 3D Mesh Reconstruction From Video Data
CVPR 2021 (arXiv)
0
citations
3D Spatial Recognition Without Spatially Labeled 3D
CVPR 2021 (arXiv)
0
citations
Total Variation Optimization Layers for Computer Vision
CVPR 2022 (arXiv)
0
citations
Joint Forecasting of Panoptic Segmentations With Difference Attention
CVPR 2022 (arXiv)
0
citations
Masked-Attention Mask Transformer for Universal Image Segmentation
CVPR 2022 (arXiv)
0
citations
Neural Volumetric Object Selection
CVPR 2022 (arXiv)
0
citations
SDFusion: Multimodal 3D Shape Completion, Reconstruction, and Generation
CVPR 2023 (arXiv)
0
citations
Context-Aware Relative Object Queries To Unify Video Instance and Panoptic Segmentation
CVPR 2023
0
citations
AutoFocusFormer: Image Segmentation off the Grid
CVPR 2023 (arXiv)
0
citations
Monocular Object Instance Segmentation and Depth Ordering With CNNs
ICCV 2015
0
citations
Assignment-Space-Based Multi-Object Tracking and Segmentation
ICCV 2021
0
citations
The Surprising Effectiveness of Visual Odometry Techniques for Embodied PointGoal Navigation
ICCV 2021 (arXiv)
0
citations
Interpretation of Emergent Communication in Heterogeneous Collaborative Embodied Agents
ICCV 2021 (arXiv)
0
citations
GridToPix: Training Embodied Agents With Minimal Supervision
ICCV 2021 (arXiv)
0
citations
UFO²: A Unified Framework towards Omni-supervised Object Detection
ECCV 2020
0
citations
Proposal-based Video Completion
ECCV 2020
0
citations
Generative Multiplane Images: Making a 2D GAN 3D-Aware
ECCV 2022
0
citations
Initialization and Alignment for Adversarial Texture Optimization
ECCV 2022
0
citations
MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
CVPR 2025
0
citations
XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
ECCV 2022
0
citations
RELOCATE: A Simple Training-Free Baseline for Visual Query Localization Using Region-Based Representations
CVPR 2025
0
citations
NeRFDeformer: NeRF Transformation from a Single View via 3D Scene Flows
CVPR 2024
0
citations
Rent3D: Floor-Plan Priors for Monocular Layout Estimation
CVPR 2015
0
citations