Kun Wang
27 Papers
159 Total Citations
Papers (27)
GoT: Unleashing Reasoning Capability of MLLM for Visual Generation and Editing
NeurIPS 2025
60
citations
PUMA: Empowering Unified MLLM with Multi-granular Visual Generation
ICCV 2025
17
citations
Graph Sparsification via Mixture of Graphs
ICLR 2025
17
citations
ViLLa: Video Reasoning Segmentation with Large Language Model
ICCV 2025
16
citations
Kill Two Birds with One Stone: Rethinking Data Augmentation for Deep Long-tailed Learning
ICLR 2024
15
citations
Completion as Enhancement: A Degradation-Aware Selective Image Guided Network for Depth Completion
CVPR 2025 (arXiv)
12
citations
Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video
AAAI 2025
8
citations
Auditing Meta-Cognitive Hallucinations in Reasoning Large Language Models
NeurIPS 2025
5
citations
AltNeRF: Learning Robust Neural Radiance Field via Alternating Depth-Pose Optimization
AAAI 2024 (arXiv)
4
citations
Breaking the Discretization Barrier of Continuous Physics Simulation Learning
NeurIPS 2025
4
citations
Unleashing Foundation Vision Models: Adaptive Transfer for Diverse Data-Limited Scientific Domains
NeurIPS 2025
1
citation
Siamese DETR
CVPR 2023 (arXiv)
0
citations
Chained Cascade Network for Object Detection
ICCV 2017
0
citations
Regularizing Nighttime Weirdness: Efficient Self-Supervised Monocular Depth Estimation in the Dark
ICCV 2021 (arXiv)
0
citations
LST-Net: Learning a Convolutional Neural Network with a Learnable Sparse Transform
ECCV 2020
0
citations
Adapting Object Detectors with Conditional Domain Normalization
ECCV 2020
0
citations
Multi-modal Masked Pre-training for Monocular Panoramic Depth Completion
ECCV 2022
0
citations
X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation
ECCV 2022
0
citations
RigNet: Repetitive Image Guided Network for Depth Completion
ECCV 2022
0
citations
Scene Graph Generation From Objects, Phrases and Region Captions
ICCV 2017 (arXiv)
0
citations
Earthfarsser: Versatile Spatio-Temporal Dynamical Systems Modeling in One Model
AAAI 2024
0
citations
Tri-Perspective View Decomposition for Geometry-Aware Depth Completion
CVPR 2024
0
citations
Driving-Video Dehazing with Non-Aligned Regularization for Safety Assistance
CVPR 2024
0
citations
Gradient-based Visual Explanation for Transformer-based CLIP
ICML 2024
0
citations
Exploring Forensic Dental Identification with Deep Learning
NeurIPS 2021
0
citations
Deciphering Spatio-Temporal Graph Forecasting: A Causal Lens and Treatment
NeurIPS 2023
0
citations
TMT-VIS: Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation
NeurIPS 2023
0
citations