Xiangyang Ji

37
Papers
141
Total Citations

Papers (37)

ParCo: Part-Coordinating Text-to-Motion Synthesis

ECCV 2024, arXiv
43
citations

Open-Vocabulary Functional 3D Scene Graphs for Real-World Indoor Spaces

CVPR 2025
23
citations

Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning

AAAI 2025
23
citations

Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation

ECCV 2024
13
citations

EventGPT: Event Stream Understanding with Multimodal Large Language Models

CVPR 2025, arXiv
9
citations

Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments

ICML 2025
8
citations

GIVEPose: Gradual Intra-class Variation Elimination for RGB-based Category-Level Object Pose Estimation

CVPR 2025
4
citations

PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution

CVPR 2025
3
citations

KP-RED: Exploiting Semantic Keypoints for Joint 3D Shape Retrieval and Deformation

CVPR 2024
3
citations

PlugMark: A Plug-in Zero-Watermarking Framework for Diffusion Models

ICCV 2025
3
citations

FAFA: Frequency-Aware Flow-Aided Self-Supervision for Underwater Object Pose Estimation

ECCV 2024
2
citations

Joint Asymmetric Loss for Learning with Noisy Labels

ICCV 2025, arXiv
2
citations

Towards Understanding How Knowledge Evolves in Large Vision-Language Models

CVPR 2025
2
citations

Active Event-based Stereo Vision

CVPR 2025
1
citation

ConformalSAM: Unlocking the Potential of Foundational Segmentation Models in Semi-Supervised Semantic Segmentation with Conformal Prediction

ICCV 2025
1
citation

Know2Vec: A Black-Box Proxy for Neural Network Retrieval

AAAI 2025
1
citation

Learning Scale-Aware Spatio-temporal Implicit Representation for Event-based Motion Deblurring

ICML 2024
0
citations

The Pitfalls and Promise of Conformal Inference Under Adversarial Attacks

ICML 2024
0
citations

LLM-Empowered State Representation for Reinforcement Learning

ICML 2024
0
citations

Data-free Neural Representation Compression with Riemannian Neural Dynamics

ICML 2024
0
citations

DPFlow: Adaptive Optical Flow Estimation with a Dual-Pyramid Framework

CVPR 2025
0
citations

Kepler Codebook

ICML 2024
0
citations

UNOPose: Unseen Object Pose Estimation with an Unposed RGB-D Reference Image

CVPR 2025
0
citations

Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition

CVPR 2025
0
citations

Enhanced Event-based Dense Stereo via Cross-Sensor Knowledge Distillation

ICCV 2025
0
citations

DyGS-SLAM: Real-Time Accurate Localization and Gaussian Reconstruction for Dynamic Scenes

ICCV 2025
0
citations

Street Gaussians without 3D Object Tracker

ICCV 2025
0
citations

SHIFT: Smoothing Hallucinations by Information Flow Tuning for Multimodal Large Language Models

ICCV 2025
0
citations

Can We Achieve Efficient Diffusion Without Self-Attention? Distilling Self-Attention into Convolutions

ICCV 2025
0
citations

Bridging the Gap Between Ideal and Real-world Evaluation: Benchmarking AI-Generated Image Detection in Challenging Scenarios

ICCV 2025
0
citations

Adaptive Neighborhood-Constrained Q Learning for Offline Reinforcement Learning

NeurIPS 2025
0
citations

Delving into Cascaded Instability: A Lipschitz Continuity View on Image Restoration and Object Detection Synergy

NeurIPS 2025, arXiv
0
citations

Parallel Vertex Diffusion for Unified Visual Grounding

AAAI 2024, arXiv
0
citations

ShapeMatcher: Self-Supervised Joint Shape Canonicalization, Segmentation, Retrieval and Deformation

CVPR 2024
0
citations

MOHO: Learning Single-view Hand-held Object Reconstruction with Multi-view Occlusion-Aware Supervision

CVPR 2024
0
citations

FaceChain-SuDe: Building Derived Class to Inherit Category Attributes for One-shot Subject-Driven Generation

CVPR 2024
0
citations

SynFog: A Photo-realistic Synthetic Fog Dataset based on End-to-end Imaging Simulation for Advancing Real-World Defogging in Autonomous Driving

CVPR 2024
0
citations