Kun Wang

27

Papers

159

Total Citations

Papers (27)

GoT: Unleashing Reasoning Capability of MLLM for Visual Generation and Editing

PUMA: Empowering Unified MLLM with Multi-granular Visual Generation

Graph Sparsification via Mixture of Graphs

ViLLa: Video Reasoning Segmentation with Large Language Model

Kill Two Birds with One Stone: Rethinking Data Augmentation for Deep Long-tailed Learning

Completion as Enhancement: A Degradation-Aware Selective Image Guided Network for Depth Completion

Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video

Auditing Meta-Cognitive Hallucinations in Reasoning Large Language Models

AltNeRF: Learning Robust Neural Radiance Field via Alternating Depth-Pose Optimization

Breaking the Discretization Barrier of Continuous Physics Simulation Learning

Unleashing Foundation Vision Models: Adaptive Transfer for Diverse Data-Limited Scientific Domains

Siamese DETR

Chained Cascade Network for Object Detection

Regularizing Nighttime Weirdness: Efficient Self-Supervised Monocular Depth Estimation in the Dark

LST-Net: Learning a Convolutional Neural Network with a Learnable Sparse Transform

Adapting Object Detectors with Conditional Domain Normalization

Multi-modal Masked Pre-training for Monocular Panoramic Depth Completion

X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation

RigNet: Repetitive Image Guided Network for Depth Completion

Scene Graph Generation From Objects, Phrases and Region Captions

Earthfarsser: Versatile Spatio-Temporal Dynamical Systems Modeling in One Model

Tri-Perspective View Decomposition for Geometry-Aware Depth Completion

Driving-Video Dehazing with Non-Aligned Regularization for Safety Assistance

Gradient-based Visual Explanation for Transformer-based CLIP

Exploring Forensic Dental Identification with Deep Learning

Deciphering Spatio-Temporal Graph Forecasting: A Causal Lens and Treatment

TMT-VIS: Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation