Kai Xu

25
Papers
175
Total Citations

Papers (25)

DiffusionEdge: Diffusion Probabilistic Model for Crisp Edge Detection

AAAI 2024arXiv
79
citations

Scaling for Training Time and Post-hoc Out-of-distribution Detection Enhancement

ICLR 2024
61
citations

Learning Instance-Aware Correspondences for Robust Multi-Instance Point Cloud Registration in Cluttered Scenes

CVPR 2024
14
citations

Learning Cross-hand Policies of High-DOF Reaching and Grasping

ECCV 2024arXiv
7
citations

OnlineAnySeg: Online Zero-Shot 3D Segmentation by Visual Foundation Model Guided 2D Mask Merging

CVPR 2025
5
citations

Hierarchically-Structured Open-Vocabulary Indoor Scene Synthesis with Pre-trained Large Language Model

AAAI 2025
5
citations

Physical-aware Neural Radiance Fields for Efficient Exposure Correction

AAAI 2025
2
citations

Progressive Correspondence Regenerator for Robust 3D Registration

CVPR 2025
2
citations

Wave-MambaAD: Wavelet-driven State Space Model for Multi-class Unsupervised Anomaly Detection

ICCV 2025
0
citations

A Constrained Optimization Approach for Gaussian Splatting from Coarsely-posed Images and Noisy Lidar Point Clouds

ICCV 2025
0
citations

Diagnosing Pretrained Models for Out-of-distribution Detection

ICCV 2025
0
citations

MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval

CVPR 2024
0
citations

Enhancing Video Super-Resolution via Implicit Resampling-based Alignment

CVPR 2024
0
citations

Practical Hamiltonian Monte Carlo on Riemannian Manifolds via Relativity Theory

ICML 2024
0
citations

Learning High-Frequency Functions Made Easy with Sinusoidal Positional Encoding

ICML 2024
0
citations

GliDe with a CaPE: A Low-Hassle Method to Accelerate Speculative Decoding

ICML 2024
0
citations

RestorGS: Depth-aware Gaussian Splatting for Efficient 3D Scene Restoration

CVPR 2025
0
citations

Deep Demonstration Tracing: Learning Generalizable Imitator Policy for Runtime Imitation from a Single Demonstration

ICML 2024
0
citations

VideoDirector: Precise Video Editing via Text-to-Video Models

CVPR 2025
0
citations

VasTSD: Learning 3D Vascular Tree-state Space Diffusion Model for Angiography Synthesis

CVPR 2025
0
citations

ArticulatedGS: Self-supervised Digital Twin Modeling of Articulated Objects using 3D Gaussian Splatting

CVPR 2025
0
citations

CogNav: Cognitive Process Modeling for Object Goal Navigation with LLMs

ICCV 2025
0
citations

Self-supervised Learning of Hybrid Part-aware 3D Representations of 2D Gaussians and Superquadrics

ICCV 2025
0
citations

Curve-Aware Gaussian Splatting for 3D Parametric Curve Reconstruction

ICCV 2025
0
citations

MonoMobility: Zero-Shot 3D Mobility Analysis from Monocular Videos

ICCV 2025
0
citations