Jun Xiao
37
Papers
133
Total Citations
Papers (37)
HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation
ICML 2025
63
citations
Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards
CVPR 2025
19
citations
Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing
ICML 2025
18
citations
Let LRMs Break Free from Overthinking via Self-Braking Tuning
NeurIPS 2025
13
citations
Janus-Pro-R1: Advancing Collaborative Visual Comprehension and Generation via Reinforcement Learning
NeurIPS 2025
6
citations
Existence Is Chaos: Enhancing 3D Human Motion Prediction with Uncertainty Consideration
AAAI 2024arXiv
6
citations
DECap: Towards Generalized Explicit Caption Editing via Diffusion Mechanism
ECCV 2024
4
citations
MICAS: Multi-grained In-Context Adaptive Sampling for 3D Point Cloud Processing
CVPR 2025
2
citations
The Four Color Theorem for Cell Instance Segmentation
ICML 2025
1
citations
Decoding Correlation-Induced Misalignment in the Stable Diffusion Workflow for Text-to-Image Generation
ICCV 2025
1
citations
Counterfactual Samples Synthesizing for Robust Visual Question Answering
CVPR 2020arXiv
0
citations
End-to-End 3D Point Cloud Instance Segmentation Without Detection
CVPR 2020
0
citations
Human-Like Controllable Image Captioning With Verb-Specific Semantic Roles
CVPR 2021arXiv
0
citations
Classification-Then-Grounding: Reformulating Video Scene Graphs As Temporal Bipartite Graphs
CVPR 2022
0
citations
The Devil Is in the Labels: Noisy Label Correction for Robust Scene Graph Generation
CVPR 2022
0
citations
VectorFloorSeg: Two-Stream Graph Attention Network for Vectorized Roughcast Floorplan Segmentation
CVPR 2023
0
citations
Bit-Shrinking: Limiting Instantaneous Sharpness for Improving Post-Training Quantization
CVPR 2023
0
citations
Counterfactual Critic Multi-Agent Training for Scene Graph Generation
ICCV 2019
0
citations
Compositional Feature Augmentation for Unbiased Scene Graph Generation
ICCV 2023arXiv
0
citations
Informative Data Mining for One-Shot Cross-Domain Semantic Segmentation
ICCV 2023arXiv
0
citations
SSF: Accelerating Training of Spiking Neural Networks with Stabilized Spiking Flow
ICCV 2023
0
citations
Rethinking Data Augmentation for Robust Visual Question Answering
ECCV 2022
0
citations
Explicit Image Caption Editing
ECCV 2022
0
citations
Empowering Vector Graphics with Consistently Arbitrary Viewing and View-dependent Visibility
CVPR 2025
0
citations
D^3CTTA: Domain-Dependent Decorrelation for Continual Test-Time Adaption of 3D LiDAR Segmentation
CVPR 2025
0
citations
TAGA: Self-supervised Learning for Template-free Animatable Gaussian Articulated Model
CVPR 2025
0
citations
Activating Sparse Part Concepts for 3D Class Incremental Learning
CVPR 2025
0
citations
Fully Data-Driven Pseudo Label Estimation for Pointly-Supervised Panoptic Segmentation
AAAI 2024
0
citations
Towards Progressive Multi-Frequency Representation for Image Warping
CVPR 2024
0
citations
Distributionally Generative Augmentation for Fair Facial Attribute Classification
CVPR 2024
0
citations
SCA-CNN: Spatial and Channel-Wise Attention in Convolutional Networks for Image Captioning
CVPR 2017
0
citations
Zero-Shot Visual Recognition Using Semantics-Preserving Adversarial Embedding Networks
CVPR 2018arXiv
0
citations
Self-Supervised Spatiotemporal Learning via Video Clip Order Prediction
CVPR 2019
0
citations
SAViT: Structure-Aware Vision Transformer Pruning via Collaborative Optimization
NeurIPS 2022
0
citations
Two Heads are Better Than One: A Simple Exploration Framework for Efficient Multi-Agent Reinforcement Learning
NeurIPS 2023
0
citations
Zero-shot Visual Relation Detection via Composite Visual Cues from Large Language Models
NeurIPS 2023
0
citations
Decompose Novel into Known: Part Concept Learning For 3D Novel Class Discovery
NeurIPS 2023
0
citations