Xiang Li

31
Papers
262
Total Citations

Papers (31)

Decoding Natural Images from EEG for Object Recognition

ICLR 2024
92
citations

ImageFolder: Autoregressive Image Generation with Folded Tokens

ICLR 2025
63
citations

SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer

CVPR 2025
32
citations

Multi-Sensor Object Anomaly Detection: Unifying Appearance, Geometry, and Internal Properties

CVPR 2025
20
citations

From Words to Worth: Newborn Article Impact Prediction with LLM

AAAI 2025
11
citations

Multi-clue Consistency Learning to Bridge Gaps Between General and Oriented Object in Semi-supervised Detection

AAAI 2025
7
citations

In-Hand 3D Object Reconstruction from a Monocular RGB Video

AAAI 2024arXiv
7
citations

Incorporating Geo-Diverse Knowledge into Prompting for Increased Geographical Robustness in Object Recognition

CVPR 2024
6
citations

Understanding Representation Dynamics of Diffusion Models via Low-Dimensional Modeling

NeurIPS 2025
6
citations

AltNeRF: Learning Robust Neural Radiance Field via Alternating Depth-Pose Optimization

AAAI 2024arXiv
4
citations

Symmetry Strikes Back: From Single-Image Symmetry Detection to 3D Generation

CVPR 2025
4
citations

REOBench: Benchmarking Robustness of Earth Observation Foundation Models

NeurIPS 2025
3
citations

Hierarchically Controlled Deformable 3D Gaussians for Talking Head Synthesis

AAAI 2025
2
citations

Handows: A Palm-Based Interactive Multi-Window Management System in Virtual Reality

ISMAR 2025
2
citations

DISTA-Net: Dynamic Closely-Spaced Infrared Small Target Unmixing

ICCV 2025
1
citations

Coupling-based Convergence Diagnostic and Stepsize Scheme for Stochastic Gradient Descent

AAAI 2025
1
citations

Distribution-aware Fairness Learning in Medical Image Segmentation From A Control-Theoretic Perspective

ICML 2025
1
citations

TreeEval: Benchmark-Free Evaluation of Large Language Models through Tree Planning

AAAI 2025
0
citations

SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models

ICCV 2025
0
citations

CrossKD: Cross-Head Knowledge Distillation for Object Detection

CVPR 2024
0
citations

PromptKD: Unsupervised Prompt Distillation for Vision-Language Models

CVPR 2024
0
citations

VA3: Virtually Assured Amplification Attack on Probabilistic Copyright Protection for Text-to-Image Generative Models

CVPR 2024
0
citations

QDFormer: Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition

CVPR 2024
0
citations

RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark

CVPR 2025
0
citations

InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption

CVPR 2025
0
citations

A General Framework for Learning from Weak Supervision

ICML 2024
0
citations

Completing Visual Objects via Bridging Generation and Segmentation

ICML 2024
0
citations

Advancing Textual Prompt Learning with Anchored Attributes

ICCV 2025
0
citations

Position: TrustLLM: Trustworthiness in Large Language Models

ICML 2024
0
citations

Backdoor Attacks on Neural Networks via One-Bit Flip

ICCV 2025
0
citations

Leveraging Large Language Models for Node Generation in Few-Shot Learning on Text-Attributed Graphs

AAAI 2025
0
citations