Jing Wang
31
Papers
107
Total Citations
Papers (31)
WISA: World simulator assistant for physics-aware text-to-video generation
NeurIPS 2025
33
citations
Adaptive FSS: A Novel Few-Shot Segmentation Framework via Prototype Enhancement
AAAI 2024 (arXiv)
21
citations
SURER: Structure-Adaptive Unified Graph Neural Network for Multi-View Clustering
AAAI 2024
16
citations
Online Video Understanding: OVBench and VideoChat-Online
CVPR 2025 (arXiv)
9
citations
SWEA: Updating Factual Knowledge in Large Language Models via Subject Word Embedding Altering
AAAI 2025
7
citations
What Has Been Overlooked in Contrastive Source-Free Domain Adaptation: Leveraging Source-Informed Latent Augmentation within Neighborhood Context
ICLR 2025 (arXiv)
7
citations
Lay2Story: Extending Diffusion Transformers for Layout-Togglable Story Generation
ICCV 2025
5
citations
StreamForest: Efficient Online Video Understanding with Persistent Event Memory
NeurIPS 2025
3
citations
PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution
CVPR 2025
3
citations
CoDTS: Enhancing Sparsely Supervised Collaborative Perception with a Dual Teacher-Student Framework
AAAI 2025
1
citation
Exploring Active Learning in Meta-Learning: Enhancing Context Set Labeling
ECCV 2024
1
citation
AdaptCMVC: Robust Adaption to Incremental Views in Continual Multi-view Clustering
CVPR 2025
1
citation
An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion Models
CVPR 2025
0
citations
SAMPLE: Semantic Alignment through Temporal-Adaptive Multimodal Prompt Learning for Event-Based Open-Vocabulary Action Recognition
ICCV 2025
0
citations
MCAM: Multimodal Causal Analysis Model for Ego-Vehicle-Level Driving Video Understanding
ICCV 2025
0
citations
Redefining <Creative> in Dictionary: Towards an Enhanced Semantic Understanding of Creative Generation
CVPR 2025
0
citations
Prior-aware Dynamic Temporal Modeling Framework for Sequential 3D Hand Pose Estimation
ICCV 2025
0
citations
WAVE: Weight Templates for Adaptive Initialization of Variable-sized Models
CVPR 2025
0
citations
Learning with Adaptive Resource Allocation
ICML 2024
0
citations
Handling Heterogeneous Curvatures in Bandit LQR Control
ICML 2024
0
citations
Walk and Learn: Facial Attribute Representation Learning From Egocentric Video and Contextual Data
CVPR 2016
0
citations
Learning To Filter: Siamese Relation Network for Robust Tracking
CVPR 2021 (arXiv)
0
citations
Improving OCR-Based Image Captioning by Incorporating Geometrical Relationship
CVPR 2021
0
citations
Scene Text Retrieval via Joint Text Detection and Similarity Learning
CVPR 2021 (arXiv)
0
citations
Asymmetric Gained Deep Image Compression With Continuous Rate Adaptation
CVPR 2021
0
citations
From Two to One: A New Scene Text Recognizer With Visual Language Modeling Network
ICCV 2021 (arXiv)
0
citations
AlphaVC: High-Performance and Efficient Learned Video Compression
ECCV 2022
0
citations
Content-Oriented Learned Image Compression
ECCV 2022
0
citations
Med-DANet: Dynamic Architecture Network for Efficient Medical Volumetric Segmentation
ECCV 2022
0
citations
Detecting Tampered Scene Text in the Wild
ECCV 2022
0
citations
Provable Variable Selection for Streaming Features
ICML 2018
0
citations