Yong Liu

49 Papers
580 Total Citations

Papers (49)

ToolACE: Winning the Points of LLM Function Calling

ICLR 2025
114
citations

Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models

CVPR 2024
108
citations

DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving

ICCV 2025
58
citations

Sundial: A Family of Highly Capable Time Series Foundation Models

ICML 2025
55
citations

REEF: Representation Encoding Fingerprints for Large Language Models

ICLR 2025
31
citations

Universal Segmentation at Arbitrary Granularity with Language Instruction

CVPR 2024
30
citations

LLaVA-KD: A Framework of Distilling Multimodal Large Language Models

ICCV 2025
22
citations

Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation

AAAI 2025
20
citations

MMAD: A Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection

ICLR 2025
16
citations

Beyond Prototypes: Semantic Anchor Regularization for Better Representation Learning

AAAI 2024
13
citations

Towards a Theoretical Understanding of Synthetic Data in LLM Post-Training: A Reverse-Bottleneck Perspective

ICLR 2025
13
citations

InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models

ICCV 2025
12
citations

Flash-VStream: Efficient Real-Time Understanding for Long Video Streams

ICCV 2025
11
citations

OLinear: A Linear Model for Time Series Forecasting in Orthogonally Transformed Domain

NeurIPS 2025
9
citations

A Multimodal, Multi-Task Adapting Framework for Video Action Recognition

AAAI 2024
8
citations

TIMotion: Temporal and Interactive Framework for Efficient Human-Human Motion Generation

CVPR 2025
7
citations

SAM2-LOVE: Segment Anything Model 2 in Language-aided Audio-Visual Scenes

CVPR 2025
7
citations

Understanding Fairness Surrogate Functions in Algorithmic Fairness

ICLR 2025
7
citations

LORS: Low-rank Residual Structure for Parameter-Efficient Network Stacking

CVPR 2024
7
citations

CoHD: A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation

ICCV 2025
5
citations

HyperSeg: Hybrid Segmentation Assistant with Fine-grained Visual Perceiver

CVPR 2025
4
citations

OvSW: Overcoming Silent Weights for Accurate Binary Neural Networks

ECCV 2024
4
citations

Understanding Model Ensemble in Transferable Adversarial Attack

ICML 2025
4
citations

Decentralized Federated Learning with Model Caching on Mobile Agents

AAAI 2025
4
citations

High-Dimensional Analysis for Generalized Nonlinear Regression: From Asymptotics to Algorithm

AAAI 2024
3
citations

Action Detail Matters: Refining Video Recognition with Local Action Queries

CVPR 2025
3
citations

Stepping Out of Similar Semantic Space for Open-Vocabulary Segmentation

ICCV 2025
2
citations

PatchScaler: An Efficient Patch-Independent Diffusion Model for Image Super-Resolution

ICCV 2025
1
citations

Stability and Generalization of Zeroth-Order Decentralized Stochastic Gradient Descent with Changing Topology

AAAI 2025
1
citations

GroundingFace: Fine-grained Face Understanding via Pixel Grounding Multimodal Large Language Model

CVPR 2025
1
citations

Unsupervised Continual Anomaly Detection with Contrastively-Learned Prompt

AAAI 2024
0
citations

ASWT-SGNN: Adaptive Spectral Wavelet Transform-Based Self-Supervised Graph Neural Network

AAAI 2024
0
citations

WaveNet: Tackling Non-stationary Graph Signals via Graph Spectral Wavelets

AAAI 2024
0
citations

Learning Multi-Scale Video-Text Correspondence for Weakly Supervised Temporal Article Grounding

AAAI 2024
0
citations

Perfect Alignment May be Poisonous to Graph Contrastive Learning

ICML 2024
0
citations

Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection

CVPR 2024
0
citations

AdaO2B: Adaptive Online to Batch Conversion for Out-of-Distribution Generalization

AAAI 2025
0
citations

MaxQ: Multi-Axis Query for N:M Sparsity Network

CVPR 2024
0
citations

Open-Vocabulary Segmentation with Semantic-Assisted Calibration

CVPR 2024
0
citations

VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering

AAAI 2025
0
citations

An Aggregation-Free Federated Learning for Tackling Data Heterogeneity

CVPR 2024
0
citations

SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal Visual Object Tracking

CVPR 2024
0
citations

Timer: Generative Pre-trained Transformers Are Large Time Series Models

ICML 2024
0
citations

Look Back for More: Harnessing Historical Sequential Updates for Personalized Federated Adapter Tuning

AAAI 2025
0
citations

Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving

AAAI 2025
0
citations

Concentration Inequalities for General Functions of Heavy-Tailed Random Variables

ICML 2024
0
citations

Algorithmic Stability Unleashed: Generalization Bounds with Unbounded Losses

ICML 2024
0
citations

IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Primary Word Emphasis

AAAI 2025
0
citations

LiT: Delving into a Simple Linear Diffusion Transformer for Image Generation

ICCV 2025
0
citations