Yu-Xiong Wang

57
Papers
1,030
Total Citations

Papers (57)

Learning to Model the Tail

NeurIPS 2017
701
citations

Learning from Small Sample Sets by Combining Unsupervised Meta-Training with CNNs

NeurIPS 2016
79
citations

RandAR: Decoder-only Autoregressive Visual Generation in Random Orders

CVPR 2025
61
citations

Frozen Transformers in Language Models Are Effective Visual Encoder Layers

ICLR 2024
48
citations

Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion

CVPR 2024
25
citations

GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation

CVPR 2025
21
citations

Argus: Vision-Centric Reasoning with Grounded Chain-of-Thought

CVPR 2025
19
citations

RMem: Restricted Memory Banks Improve Video Object Segmentation

CVPR 2024
18
citations

TAMM: TriAdapter Multi-Modal Learning for 3D Shape Understanding

CVPR 2024
18
citations

ConsistDreamer: 3D-Consistent 2D Diffusion for High-Fidelity Scene Editing

CVPR 2024
15
citations

Region-Based Representations Revisited

CVPR 2024
14
citations

InterAct: Advancing Large-Scale Versatile 3D Human-Object Interaction Generation

CVPR 2025
7
citations

Refer to Any Segmentation Mask Group With Vision-Language Prompts

ICCV 2025
2
citations

AgMMU: A Comprehensive Agricultural Multimodal Understanding Benchmark

NeurIPS 2025
2
citations

Discovering Objects That Can Move

CVPR 2022
0
citations

Embracing Single Stride 3D Object Detector With Sparse Transformer

CVPR 2022
0
citations

Long-Tailed Recognition via Weight Balancing

CVPR 2022
0
citations

DIVeR: Real-Time and Accurate Neural Radiance Fields With Deterministic Integration for Volume Rendering

CVPR 2022
0
citations

Object Discovery From Motion-Guided Tokens

CVPR 2023
0
citations

BEV-Guided Multi-Modality Fusion for Driving Perception

CVPR 2023
0
citations

Standing Between Past and Future: Spatio-Temporal Modeling for Multi-Camera 3D Multi-Object Tracking

CVPR 2023
0
citations

NeuralEditor: Editing Neural Radiance Fields via Manipulating Point Clouds

CVPR 2023
0
citations

Contrastive Mean Teacher for Domain Adaptive Object Detectors

CVPR 2023
0
citations

Learning Compositional Representations for Few-Shot Recognition

ICCV 2019
0
citations

Meta-Learning to Detect Rare Objects

ICCV 2019
0
citations

On the Importance of Distractors for Few-Shot Classification

ICCV 2021
0
citations

Learning To Hallucinate Examples From Extrinsic and Intrinsic Supervision

ICCV 2021
0
citations

Pixel Contrastive-Consistent Semi-Supervised Semantic Segmentation

ICCV 2021
0
citations

Contrastive Learning Relies More on Spatial Inductive Bias Than Supervised Learning: An Empirical Study

ICCV 2023
0
citations

Video State-Changing Object Segmentation

ICCV 2023
0
citations

InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion

ICCV 2023
0
citations

MV-Map: Offboard HD-Map Generation with Multi-view Consistency

ICCV 2023
0
citations

Improving Equivariance in State-of-the-Art Supervised Depth and Normal Predictors

ICCV 2023
0
citations

Towards Streaming Perception

ECCV 2020
0
citations

PointTree: Transformation-Robust Point Cloud Encoder with Relaxed K-D Trees

ECCV 2022
0
citations

Diverse Human Motion Prediction Guided by Multi-level Spatial-Temporal Anchors

ECCV 2022
0
citations

Multi-task View Synthesis with Neural Radiance Fields

ICCV 2023
0
citations

InterMimic: Towards Universal Whole-Body Control for Physics-Based Human-Object Interactions

CVPR 2025
0
citations

Floating No More: Object-Ground Reconstruction from a Single Image

CVPR 2025
0
citations

ReferEverything: Towards Segmenting Everything We Can Speak of in Videos

ICCV 2025
0
citations

Situational Awareness Matters in 3D Vision Language Reasoning

CVPR 2024
0
citations

Language Agent Tree Search Unifies Reasoning, Acting, and Planning in Language Models

ICML 2024
0
citations

Offline Imitation from Observation via Primal Wasserstein State Occupancy Matching

ICML 2024
0
citations

ATraDiff: Accelerating Online Reinforcement Learning with Imaginary Trajectories

ICML 2024
0
citations

Model Recommendation: Generating Object Detectors From Few Samples

CVPR 2015
0
citations

Growing a Brain: Fine-Tuning by Increasing Model Capacity

CVPR 2017
0
citations

Low-Shot Learning From Imaginary Data

CVPR 2018
0
citations

Image Deformation Meta-Networks for One-Shot Learning

CVPR 2019
0
citations

Hallucination Improves Few-Shot Object Detection

CVPR 2021
0
citations

DAP: Detection-Aware Pre-Training With Weak Supervision

CVPR 2021
0
citations

CEIP: Combining Explicit and Implicit Priors for Reinforcement Learning with Demonstrations

NeurIPS 2022
0
citations

Continual Learning with Evolving Class Ontologies

NeurIPS 2022
0
citations

A Simple Solution for Offline Imitation from Observations and Examples with Possibly Incomplete Trajectories

NeurIPS 2023
0
citations

Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models

NeurIPS 2023
0
citations

YouTubePD: A Multimodal Benchmark for Parkinson’s Disease Analysis

NeurIPS 2023
0
citations

HASSOD: Hierarchical Adaptive Self-Supervised Object Detection

NeurIPS 2023
0
citations

ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields

NeurIPS 2023
0
citations