Alexander G. Schwing

47
Papers
322
Total Citations

Papers (47)

Putting the Object Back into Video Object Segmentation

CVPR 2024
182
citations

MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds

CVPR 2025
80
citations

GoMAvatar: Efficient Animatable Human Modeling from Monocular Video Using Gaussians-on-Mesh

CVPR 2024
53
citations

LIFe-GoM: Generalizable Human Rendering with Learned Iterative Feedback Over Multi-Resolution Gaussians-on-Mesh

ICLR 2025
7
citations

Learning to Segment Under Various Forms of Weak Supervision

CVPR 2015
0
citations

Efficient Deep Learning for Stereo Matching

CVPR 2016
0
citations

Semantic Image Inpainting With Deep Generative Models

CVPR 2017arXiv
0
citations

Creativity: Generating Diverse Questions Using Variational Autoencoders

CVPR 2017arXiv
0
citations

Generative Modeling Using the Sliced Wasserstein Distance

CVPR 2018arXiv
0
citations

Convolutional Image Captioning

CVPR 2018arXiv
0
citations

Two Can Play This Game: Visual Dialog With Discriminative Question Generation and Answering

CVPR 2018arXiv
0
citations

Unsupervised Textual Grounding: Linking Words to Image Concepts

CVPR 2018arXiv
0
citations

Factor Graph Attention

CVPR 2019
0
citations

SAIL-VOS: Semantic Amodal Instance Level Video Object Segmentation - A Synthetic Dataset and Baselines

CVPR 2019
0
citations

Diverse Generation for Multi-Agent Sports Games

CVPR 2019
0
citations

Two Body Problem: Collaborative Visual Task Completion

CVPR 2019
0
citations

Max-Sliced Wasserstein Distance and Its Use for GANs

CVPR 2019
0
citations

Fast, Diverse and Accurate Image Captioning Guided by Part-Of-Speech

CVPR 2019
0
citations

A Simple Baseline for Audio-Visual Scene-Aware Dialog

CVPR 2019
0
citations

Agriculture-Vision: A Large Aerial Image Database for Agricultural Pattern Analysis

CVPR 2020
0
citations

Dynamic Neural Relational Inference

CVPR 2020
0
citations

Can We Learn Heuristics for Graphical Model Inference Using Reinforcement Learning?

CVPR 2020arXiv
0
citations

Instance-Aware, Context-Focused, and Memory-Efficient Weakly Supervised Object Detection

CVPR 2020arXiv
0
citations

Panoptic Segmentation Forecasting

CVPR 2021arXiv
0
citations

SAIL-VOS 3D: A Synthetic Dataset and Baselines for Object Detection and 3D Mesh Reconstruction From Video Data

CVPR 2021arXiv
0
citations

3D Spatial Recognition Without Spatially Labeled 3D

CVPR 2021arXiv
0
citations

Total Variation Optimization Layers for Computer Vision

CVPR 2022arXiv
0
citations

Joint Forecasting of Panoptic Segmentations With Difference Attention

CVPR 2022arXiv
0
citations

Masked-Attention Mask Transformer for Universal Image Segmentation

CVPR 2022arXiv
0
citations

Neural Volumetric Object Selection

CVPR 2022arXiv
0
citations

SDFusion: Multimodal 3D Shape Completion, Reconstruction, and Generation

CVPR 2023arXiv
0
citations

Context-Aware Relative Object Queries To Unify Video Instance and Panoptic Segmentation

CVPR 2023
0
citations

AutoFocusFormer: Image Segmentation off the Grid

CVPR 2023arXiv
0
citations

Monocular Object Instance Segmentation and Depth Ordering With CNNs

ICCV 2015
0
citations

Assignment-Space-Based Multi-Object Tracking and Segmentation

ICCV 2021
0
citations

The Surprising Effectiveness of Visual Odometry Techniques for Embodied PointGoal Navigation

ICCV 2021arXiv
0
citations

Interpretation of Emergent Communication in Heterogeneous Collaborative Embodied Agents

ICCV 2021arXiv
0
citations

GridToPix: Training Embodied Agents With Minimal Supervision

ICCV 2021arXiv
0
citations

UFO²: A Unified Framework towards Omni-supervised Object Detection

ECCV 2020
0
citations

Proposal-based Video Completion

ECCV 2020
0
citations

Generative Multiplane Images: Making a 2D GAN 3D-Aware

ECCV 2022
0
citations

Initialization and Alignment for Adversarial Texture Optimization

ECCV 2022
0
citations

MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

CVPR 2025
0
citations

XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model

ECCV 2022
0
citations

RELOCATE: A Simple Training-Free Baseline for Visual Query Localization Using Region-Based Representations

CVPR 2025
0
citations

NeRFDeformer: NeRF Transformation from a Single View via 3D Scene Flows

CVPR 2024
0
citations

Rent3D: Floor-Plan Priors for Monocular Layout Estimation

CVPR 2015
0
citations