Jing Liu

64
Papers
576
Total Citations

Papers (64)

Learning Progressive Joint Propagation for Human Motion Prediction

ECCV 2020
187
citations

Temporal Adaptive RGBT Tracking with Modality Prompt

AAAI 2024 (arXiv)
71
citations

Text Prompt with Normality Guidance for Weakly Supervised Video Anomaly Detection

CVPR 2024
70
citations

EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models

ICLR 2024
69
citations

Open-Vocabulary Video Anomaly Detection

CVPR 2024
64
citations

AR-Diffusion: Asynchronous Video Generation with Auto-Regressive Diffusion

CVPR 2025
25
citations

Numerical Pruning for Efficient Autoregressive Models

AAAI 2025
22
citations

Signed Graph Neural Ordinary Differential Equation for Modeling Continuous-Time Dynamics

AAAI 2024 (arXiv)
15
citations

Needle In A Video Haystack: A Scalable Synthetic Evaluator for Video MLLMs

ICLR 2025
14
citations

ID-Patch: Robust ID Association for Group Photo Personalization

CVPR 2025
10
citations

Context-aware Dynamic Pruning for Speech Foundation Models

ICLR 2025
7
citations

Efficient Stitchable Task Adaptation

CVPR 2024
7
citations

COAP: Memory-Efficient Training with Correlation-Aware Gradient Projection

CVPR 2025
6
citations

AutoSGNN: Automatic Propagation Mechanism Discovery for Spectral Graph Neural Networks

AAAI 2025
6
citations

Breaking the Encoder Barrier for Seamless Video-Language Understanding

ICCV 2025
3
citations

SpFormer: Spatio-Temporal Modeling for Scanpaths with Transformer

AAAI 2024
0
citations

TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models

CVPR 2024
0
citations

SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models

CVPR 2024
0
citations

Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation

CVPR 2024
0
citations

Automated Loss function Search for Class-imbalanced Node Classification

ICML 2024
0
citations

A-Lamp: Adaptive Layout-Aware Multi-Patch Deep Convolutional Neural Network for Photo Aesthetic Assessment

CVPR 2017 (arXiv)
0
citations

Dual Attention Network for Scene Segmentation

CVPR 2019
0
citations

MSCap: Multi-Style Image Captioning With Unpaired Stylized Text

CVPR 2019
0
citations

Deep Incremental Hashing Network for Efficient Image Retrieval

CVPR 2019
0
citations

Normalized and Geometry-Aware Self-Attention Network for Image Captioning

CVPR 2020 (arXiv)
0
citations

AQD: Towards Accurate Quantized Object Detection

CVPR 2021 (arXiv)
0
citations

Video Event Restoration Based on Keyframes for Video Anomaly Detection

CVPR 2023 (arXiv)
0
citations

Boosting Verified Training for Robust Image Classifications via Abstraction

CVPR 2023 (arXiv)
0
citations

Dynamic Focus-Aware Positional Queries for Semantic Segmentation

CVPR 2023 (arXiv)
0
citations

MOSO: Decomposing MOtion, Scene and Object for Video Prediction

CVPR 2023 (arXiv)
0
citations

OmniAvatar: Geometry-Guided Controllable 3D Head Synthesis

CVPR 2023 (arXiv)
0
citations

Adaptive Context Network for Scene Parsing

ICCV 2019
0
citations

HAIR: Hierarchical Visual-Semantic Relational Reasoning for Video Question Answering

ICCV 2021
0
citations

Scalable Vision Transformers With Hierarchical Pooling

ICCV 2021 (arXiv)
0
citations

AIDE: A Vision-Driven Multi-View, Multi-Modal, Multi-Tasking Dataset for Assistive Driving Perception

ICCV 2023 (arXiv)
0
citations

BiViT: Extremely Compressed Binary Vision Transformers

ICCV 2023 (arXiv)
0
citations

March in Chat: Interactive Prompting for Remote Embodied Referring Expression

ICCV 2023 (arXiv)
0
citations

LoTE-Animal: A Long Time-span Dataset for Endangered Animal Behavior Understanding

ICCV 2023
0
citations

Deep Transferring Quantization

ECCV 2020
0
citations

Generative Low-bitwidth Data Free Quantization

ECCV 2020
0
citations

Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision

ECCV 2020
0
citations

Dynamic Local Aggregation Network with Adaptive Clusterer for Anomaly Detection

ECCV 2022
0
citations

Spatio-Temporal Domain Awareness for Multi-Agent Collaborative Perception

ICCV 2023 (arXiv)
0
citations

QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge

CVPR 2025
0
citations

Efficient Motion-Aware Video MLLM

CVPR 2025
0
citations

ZipVL: Accelerating Vision-Language Models through Dynamic Token Sparsity

ICCV 2025
0
citations

Learning Beyond Still Frames: Scaling Vision-Language Models with Video

ICCV 2025
0
citations

MotionCtrl: A Real-time Controllable Vision-Language-Motion Model

ICCV 2025
0
citations

Scaling Omni-modal Pretraining with Multimodal Context: Advancing Universal Representation Learning Across Modalities

ICCV 2025
0
citations

COSMO: Combination of Selective Memorization for Low-cost Vision-and-Language Navigation

ICCV 2025
0
citations

M2OST: Many-to-one Regression for Predicting Spatial Transcriptomics from Digital Pathology Images

AAAI 2025
0
citations

DiMSOD: A Diffusion-Based Framework for Multi-Modal Salient Object Detection

AAAI 2025
0
citations

TRAIL: Trust-Aware Client Scheduling for Semi-Decentralized Federated Learning

AAAI 2025
0
citations

FedCross: Intertemporal Federated Learning Under Evolutionary Games

AAAI 2025
0
citations

Forget to Flourish: Leveraging Machine-Unlearning on Pretrained Language Models for Privacy Leakage

AAAI 2025
0
citations

Channel Merging: Preserving Specialization for Merged Experts

AAAI 2025
0
citations

Graph Contrastive Learning with Joint Spectral Augmentation of Attribute and Topology

AAAI 2025
0
citations

Discrimination-aware Channel Pruning for Deep Neural Networks

NeurIPS 2018
0
citations

EcoFormer: Energy-Saving Attention with Linear Complexity

NeurIPS 2022
0
citations

CoPur: Certifiably Robust Collaborative Inference via Feature Purification

NeurIPS 2022
0
citations

PTQD: Accurate Post-Training Quantization for Diffusion Models

NeurIPS 2023
0
citations

How2comm: Communication-Efficient and Collaboration-Pragmatic Multi-Agent Perception

NeurIPS 2023
0
citations

VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

NeurIPS 2023
0
citations

GLOBER: Coherent Non-autoregressive Video Generation via GLOBal Guided Video DecodER

NeurIPS 2023
0
citations