Kun Wang

27
Papers
159
Total Citations

Papers (27)

GoT: Unleashing Reasoning Capability of MLLM for Visual Generation and Editing

NeurIPS 2025
60
citations

PUMA: Empowering Unified MLLM with Multi-granular Visual Generation

ICCV 2025
17
citations

Graph Sparsification via Mixture of Graphs

ICLR 2025
17
citations

ViLLa: Video Reasoning Segmentation with Large Language Model

ICCV 2025
16
citations

Kill Two Birds with One Stone: Rethinking Data Augmentation for Deep Long-tailed Learning

ICLR 2024
15
citations

Completion as Enhancement: A Degradation-Aware Selective Image Guided Network for Depth Completion

CVPR 2025arXiv
12
citations

Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video

AAAI 2025
8
citations

Auditing Meta-Cognitive Hallucinations in Reasoning Large Language Models

NeurIPS 2025
5
citations

AltNeRF: Learning Robust Neural Radiance Field via Alternating Depth-Pose Optimization

AAAI 2024arXiv
4
citations

Breaking the Discretization Barrier of Continuous Physics Simulation Learning

NeurIPS 2025
4
citations

Unleashing Foundation Vision Models: Adaptive Transfer for Diverse Data-Limited Scientific Domains

NeurIPS 2025
1
citations

Siamese DETR

CVPR 2023arXiv
0
citations

Chained Cascade Network for Object Detection

ICCV 2017
0
citations

Regularizing Nighttime Weirdness: Efficient Self-Supervised Monocular Depth Estimation in the Dark

ICCV 2021arXiv
0
citations

LST-Net: Learning a Convolutional Neural Network with a Learnable Sparse Transform

ECCV 2020
0
citations

Adapting Object Detectors with Conditional Domain Normalization

ECCV 2020
0
citations

Multi-modal Masked Pre-training for Monocular Panoramic Depth Completion

ECCV 2022
0
citations

X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation

ECCV 2022
0
citations

RigNet: Repetitive Image Guided Network for Depth Completion

ECCV 2022
0
citations

Scene Graph Generation From Objects, Phrases and Region Captions

ICCV 2017arXiv
0
citations

Earthfarsser: Versatile Spatio-Temporal Dynamical Systems Modeling in One Model

AAAI 2024
0
citations

Tri-Perspective View Decomposition for Geometry-Aware Depth Completion

CVPR 2024
0
citations

Driving-Video Dehazing with Non-Aligned Regularization for Safety Assistance

CVPR 2024
0
citations

Gradient-based Visual Explanation for Transformer-based CLIP

ICML 2024
0
citations

Exploring Forensic Dental Identification with Deep Learning

NeurIPS 2021
0
citations

Deciphering Spatio-Temporal Graph Forecasting: A Causal Lens and Treatment

NeurIPS 2023
0
citations

TMT-VIS: Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation

NeurIPS 2023
0
citations