Yong Liu
49
Papers
580
Total Citations
Papers (49)
ToolACE: Winning the Points of LLM Function Calling
ICLR 2025
114
citations
Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models
CVPR 2024
108
citations
DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving
ICCV 2025
58
citations
Sundial: A Family of Highly Capable Time Series Foundation Models
ICML 2025
55
citations
REEF: Representation Encoding Fingerprints for Large Language Models
ICLR 2025
31
citations
Universal Segmentation at Arbitrary Granularity with Language Instruction
CVPR 2024
30
citations
LLaVA-KD: A Framework of Distilling Multimodal Large Language Models
ICCV 2025
22
citations
Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation
AAAI 2025
20
citations
MMAD: A Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection
ICLR 2025
16
citations
Beyond Prototypes: Semantic Anchor Regularization for Better Representation Learning
AAAI 2024
13
citations
Towards a Theoretical Understanding of Synthetic Data in LLM Post-Training: A Reverse-Bottleneck Perspective
ICLR 2025
13
citations
InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models
ICCV 2025
12
citations
Flash-VStream: Efficient Real-Time Understanding for Long Video Streams
ICCV 2025
11
citations
OLinear: A Linear Model for Time Series Forecasting in Orthogonally Transformed Domain
NeurIPS 2025
9
citations
A Multimodal, Multi-Task Adapting Framework for Video Action Recognition
AAAI 2024
8
citations
TIMotion: Temporal and Interactive Framework for Efficient Human-Human Motion Generation
CVPR 2025
7
citations
SAM2-LOVE: Segment Anything Model 2 in Language-aided Audio-Visual Scenes
CVPR 2025
7
citations
Understanding Fairness Surrogate Functions in Algorithmic Fairness
ICLR 2025
7
citations
LORS: Low-rank Residual Structure for Parameter-Efficient Network Stacking
CVPR 2024
7
citations
CoHD: A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation
ICCV 2025
5
citations
HyperSeg: Hybrid Segmentation Assistant with Fine-grained Visual Perceiver
CVPR 2025
4
citations
OvSW: Overcoming Silent Weights for Accurate Binary Neural Networks
ECCV 2024
4
citations
Understanding Model Ensemble in Transferable Adversarial Attack
ICML 2025
4
citations
Decentralized Federated Learning with Model Caching on Mobile Agents
AAAI 2025
4
citations
High-Dimensional Analysis for Generalized Nonlinear Regression: From Asymptotics to Algorithm
AAAI 2024
3
citations
Action Detail Matters: Refining Video Recognition with Local Action Queries
CVPR 2025
3
citations
Stepping Out of Similar Semantic Space for Open-Vocabulary Segmentation
ICCV 2025
2
citations
PatchScaler: An Efficient Patch-Independent Diffusion Model for Image Super-Resolution
ICCV 2025
1
citations
Stability and Generalization of Zeroth-Order Decentralized Stochastic Gradient Descent with Changing Topology
AAAI 2025
1
citations
GroundingFace: Fine-grained Face Understanding via Pixel Grounding Multimodal Large Language Model
CVPR 2025
1
citations
Unsupervised Continual Anomaly Detection with Contrastively-Learned Prompt
AAAI 2024
0
citations
ASWT-SGNN: Adaptive Spectral Wavelet Transform-Based Self-Supervised Graph Neural Network
AAAI 2024
0
citations
WaveNet: Tackling Non-stationary Graph Signals via Graph Spectral Wavelets
AAAI 2024
0
citations
Learning Multi-Scale Video-Text Correspondence for Weakly Supervised Temporal Article Grounding
AAAI 2024
0
citations
Perfect Alignment May be Poisonous to Graph Contrastive Learning
ICML 2024
0
citations
Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection
CVPR 2024
0
citations
AdaO2B: Adaptive Online to Batch Conversion for Out-of-Distribution Generalization
AAAI 2025
0
citations
MaxQ: Multi-Axis Query for N:M Sparsity Network
CVPR 2024
0
citations
Open-Vocabulary Segmentation with Semantic-Assisted Calibration
CVPR 2024
0
citations
VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering
AAAI 2025
0
citations
An Aggregation-Free Federated Learning for Tackling Data Heterogeneity
CVPR 2024
0
citations
SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal Visual Object Tracking
CVPR 2024
0
citations
Timer: Generative Pre-trained Transformers Are Large Time Series Models
ICML 2024
0
citations
Look Back for More: Harnessing Historical Sequential Updates for Personalized Federated Adapter Tuning
AAAI 2025
0
citations
Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving
AAAI 2025
0
citations
Concentration Inequalities for General Functions of Heavy-Tailed Random Variables
ICML 2024
0
citations
Algorithmic Stability Unleashed: Generalization Bounds with Unbounded Losses
ICML 2024
0
citations
IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Primary Word Emphasis
AAAI 2025
0
citations
LiT: Delving into a Simple Linear Diffusion Transformer for Image Generation
ICCV 2025
0
citations