Alan Yuille

65
Papers
150
Total Citations

Papers (65)

Regional Homogeneity: Towards Learning Transferable Universal Adversarial Perturbations Against Defenses

ECCV 2020
58
citations

Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation

ICCV 2025arXiv
49
citations

RadGPT: Constructing 3D Image-Text Tumor Datasets

ICCV 2025
23
citations

Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction

ICCV 2025arXiv
16
citations

HISR: Hybrid Implicit Surface Representation for Photorealistic 3D Human Reconstruction

AAAI 2024arXiv
4
citations

VideoAuteur: Towards Long Narrative Video Generation

ICCV 2025arXiv
0
citations

Rejuvenating image-GPT as Strong Visual Representation Learners

ICML 2024
0
citations

Mask Guided Matting via Progressive Refinement Network

CVPR 2021arXiv
0
citations

Self-Supervised Pillar Motion Learning for Autonomous Driving

CVPR 2021arXiv
0
citations

VIP-DeepLab: Learning Visual Perception With Depth-Aware Video Panoptic Segmentation

CVPR 2021
0
citations

CReST: A Class-Rebalancing Self-Training Framework for Imbalanced Semi-Supervised Learning

CVPR 2021arXiv
0
citations

Progressive Stage-Wise Learning for Unsupervised Feature Representation Enhancement

CVPR 2021arXiv
0
citations

DetectoRS: Detecting Objects With Recursive Feature Pyramid and Switchable Atrous Convolution

CVPR 2021arXiv
0
citations

Robust Instance Segmentation Through Reasoning About Multi-Object Occlusion

CVPR 2021arXiv
0
citations

Deeply Shape-Guided Cascade for Instance Segmentation

CVPR 2021arXiv
0
citations

MaX-DeepLab: End-to-End Panoptic Segmentation With Mask Transformers

CVPR 2021
0
citations

Weakly Supervised Instance Segmentation for Videos With Temporal Mask Consistency

CVPR 2021arXiv
0
citations

SwapMix: Diagnosing and Regularizing the Over-Reliance on Visual Context in Visual Question Answering

CVPR 2022arXiv
0
citations

Amodal Segmentation Through Out-of-Task and Out-of-Distribution Generalization With a Bayesian Model

CVPR 2022arXiv
0
citations

Learning From Temporal Gradient for Semi-Supervised Action Recognition

CVPR 2022arXiv
0
citations

Masked Feature Prediction for Self-Supervised Visual Pre-Training

CVPR 2022arXiv
0
citations

A Simple Data Mixing Prior for Improving Self-Supervised Learning

CVPR 2022
0
citations

CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation

CVPR 2022
0
citations

Learning Part Segmentation Through Unsupervised Domain Adaptation From Synthetic Vehicles

CVPR 2022arXiv
0
citations

Point-Level Region Contrast for Object Detection Pre-Training

CVPR 2022arXiv
0
citations

Simulated Adversarial Testing of Face Recognition Models

CVPR 2022arXiv
0
citations

Lite Vision Transformer With Enhanced Self-Attention

CVPR 2022arXiv
0
citations

DeepFusion: Lidar-Camera Deep Fusion for Multi-Modal 3D Object Detection

CVPR 2022arXiv
0
citations

TransMix: Attend To Mix for Vision Transformers

CVPR 2022arXiv
0
citations

Recurrent Multimodal Interaction for Referring Image Segmentation

ICCV 2017arXiv
0
citations

SORT: Second-Order Response Transform for Visual Recognition

ICCV 2017arXiv
0
citations

Adversarial Examples for Semantic Segmentation and Object Detection

ICCV 2017arXiv
0
citations

Genetic CNN

ICCV 2017arXiv
0
citations

ScaleNet: Guiding Object Proposal Generation in Supermarkets and Beyond

ICCV 2017arXiv
0
citations

Multi-Stage Multi-Recursive-Input Fully Convolutional Networks for Neuronal Boundary Detection

ICCV 2017arXiv
0
citations

Exploring Simple 3D Multi-Object Tracking for Autonomous Driving

ICCV 2021arXiv
0
citations

Calibrating Concepts and Operations: Towards Symbolic Reasoning on Real Images

ICCV 2021
0
citations

A-SDF: Learning Disentangled Signed Distance Functions for Articulated Shape Representation

ICCV 2021
0
citations

3D-Aware Neural Body Fitting for Occlusion Robust 3D Human Pose Estimation

ICCV 2023arXiv
0
citations

CLIP-Driven Universal Model for Organ Segmentation and Tumor Detection

ICCV 2023arXiv
0
citations

Animal3D: A Comprehensive Dataset of 3D Animal Pose and Shape

ICCV 2023arXiv
0
citations

Diffusion Models as Masked Autoencoders

ICCV 2023arXiv
0
citations

CancerUniT: Towards a Single Unified Model for Effective Detection, Segmentation, and Diagnosis of Eight Major Cancers Using a Large Collection of CT Scans

ICCV 2023
0
citations

SMAUG: Sparse Masked Autoencoder for Efficient Video-Language Pre-Training

ICCV 2023arXiv
0
citations

Axial-DeepLab: Stand-Alone Axial-Attention for Panoptic Segmentation

ECCV 2020
0
citations

JSSR: A Joint Synthesis, Segmentation, and Registration System for 3D Multi-Modal Image Alignment of Large-scale Pathological CT Scans

ECCV 2020
0
citations

Object as Hotspots: An Anchor-Free 3D Object Detection Approach via Firing of Hotspots

ECCV 2020
0
citations

PatchAttack: A Black-box Texture-based Attack with Reinforcement Learning

ECCV 2020
0
citations

Explicit Occlusion Reasoning for Multi-Person 3D Human Pose Estimation

ECCV 2022
0
citations

"PartImageNet: A Large, High-Quality Dataset of Parts"

ECCV 2022
0
citations

OOD-CV: A Benchmark for Robustness to Out-of-Distribution Shifts of Individual Nuisances in Natural Images

ECCV 2022
0
citations

Robust Category-Level 6D Pose Estimation with Coarse-to-Fine Rendering of Neural Features

ECCV 2022
0
citations

In Defense of Image Pre-training for Spatiotemporal Recognition

ECCV 2022
0
citations

In Defense of Online Models for Video Instance Segmentation

ECCV 2022
0
citations

k-Means Mask Transformer

ECCV 2022
0
citations

CP2: Copy-Paste Contrastive Pretraining for Semantic Segmentation

ECCV 2022
0
citations

Coarse-to-Fine Incremental Few-Shot Learning

ECCV 2022
0
citations

Context-Enhanced Stereo Transformer

ECCV 2022
0
citations

Are Labels Necessary for Neural Architecture Search?

ECCV 2020
0
citations

Scaling 3D Compositional Models for Robust Classification and Pose Estimation

ICCV 2025
0
citations

3DSRBench: A Comprehensive 3D Spatial Reasoning Benchmark

ICCV 2025
0
citations

Medical World Model

ICCV 2025
0
citations

Scaling Tumor Segmentation: Best Lessons from Real and Synthetic Data

ICCV 2025
0
citations

Learning Deep Structured Models

ICML 2015
0
citations

Gradually Updated Neural Networks for Large-Scale Image Recognition

ICML 2018
0
citations