Jun Xiao

37
Papers
133
Total Citations

Papers (37)

HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation

ICML 2025
63
citations

Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards

CVPR 2025
19
citations

Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing

ICML 2025
18
citations

Let LRMs Break Free from Overthinking via Self-Braking Tuning

NeurIPS 2025
13
citations

Janus-Pro-R1: Advancing Collaborative Visual Comprehension and Generation via Reinforcement Learning

NeurIPS 2025
6
citations

Existence Is Chaos: Enhancing 3D Human Motion Prediction with Uncertainty Consideration

AAAI 2024arXiv
6
citations

DECap: Towards Generalized Explicit Caption Editing via Diffusion Mechanism

ECCV 2024
4
citations

MICAS: Multi-grained In-Context Adaptive Sampling for 3D Point Cloud Processing

CVPR 2025
2
citations

The Four Color Theorem for Cell Instance Segmentation

ICML 2025
1
citations

Decoding Correlation-Induced Misalignment in the Stable Diffusion Workflow for Text-to-Image Generation

ICCV 2025
1
citations

Counterfactual Samples Synthesizing for Robust Visual Question Answering

CVPR 2020arXiv
0
citations

End-to-End 3D Point Cloud Instance Segmentation Without Detection

CVPR 2020
0
citations

Human-Like Controllable Image Captioning With Verb-Specific Semantic Roles

CVPR 2021arXiv
0
citations

Classification-Then-Grounding: Reformulating Video Scene Graphs As Temporal Bipartite Graphs

CVPR 2022
0
citations

The Devil Is in the Labels: Noisy Label Correction for Robust Scene Graph Generation

CVPR 2022
0
citations

VectorFloorSeg: Two-Stream Graph Attention Network for Vectorized Roughcast Floorplan Segmentation

CVPR 2023
0
citations

Bit-Shrinking: Limiting Instantaneous Sharpness for Improving Post-Training Quantization

CVPR 2023
0
citations

Counterfactual Critic Multi-Agent Training for Scene Graph Generation

ICCV 2019
0
citations

Compositional Feature Augmentation for Unbiased Scene Graph Generation

ICCV 2023arXiv
0
citations

Informative Data Mining for One-Shot Cross-Domain Semantic Segmentation

ICCV 2023arXiv
0
citations

SSF: Accelerating Training of Spiking Neural Networks with Stabilized Spiking Flow

ICCV 2023
0
citations

Rethinking Data Augmentation for Robust Visual Question Answering

ECCV 2022
0
citations

Explicit Image Caption Editing

ECCV 2022
0
citations

Empowering Vector Graphics with Consistently Arbitrary Viewing and View-dependent Visibility

CVPR 2025
0
citations

D^3CTTA: Domain-Dependent Decorrelation for Continual Test-Time Adaption of 3D LiDAR Segmentation

CVPR 2025
0
citations

TAGA: Self-supervised Learning for Template-free Animatable Gaussian Articulated Model

CVPR 2025
0
citations

Activating Sparse Part Concepts for 3D Class Incremental Learning

CVPR 2025
0
citations

Fully Data-Driven Pseudo Label Estimation for Pointly-Supervised Panoptic Segmentation

AAAI 2024
0
citations

Towards Progressive Multi-Frequency Representation for Image Warping

CVPR 2024
0
citations

Distributionally Generative Augmentation for Fair Facial Attribute Classification

CVPR 2024
0
citations

SCA-CNN: Spatial and Channel-Wise Attention in Convolutional Networks for Image Captioning

CVPR 2017
0
citations

Zero-Shot Visual Recognition Using Semantics-Preserving Adversarial Embedding Networks

CVPR 2018arXiv
0
citations

Self-Supervised Spatiotemporal Learning via Video Clip Order Prediction

CVPR 2019
0
citations

SAViT: Structure-Aware Vision Transformer Pruning via Collaborative Optimization

NeurIPS 2022
0
citations

Two Heads are Better Than One: A Simple Exploration Framework for Efficient Multi-Agent Reinforcement Learning

NeurIPS 2023
0
citations

Zero-shot Visual Relation Detection via Composite Visual Cues from Large Language Models

NeurIPS 2023
0
citations

Decompose Novel into Known: Part Concept Learning For 3D Novel Class Discovery

NeurIPS 2023
0
citations