Jing Wang

Papers: 31
Total Citations: 107

Papers (31)

WISA: World simulator assistant for physics-aware text-to-video generation

NeurIPS 2025
33
citations

Adaptive FSS: A Novel Few-Shot Segmentation Framework via Prototype Enhancement

AAAI 2024 (arXiv)
21
citations

SURER: Structure-Adaptive Unified Graph Neural Network for Multi-View Clustering

AAAI 2024
16
citations

Online Video Understanding: OVBench and VideoChat-Online

CVPR 2025 (arXiv)
9
citations

SWEA: Updating Factual Knowledge in Large Language Models via Subject Word Embedding Altering

AAAI 2025
7
citations

What Has Been Overlooked in Contrastive Source-Free Domain Adaptation: Leveraging Source-Informed Latent Augmentation within Neighborhood Context

ICLR 2025 (arXiv)
7
citations

Lay2Story: Extending Diffusion Transformers for Layout-Togglable Story Generation

ICCV 2025
5
citations

StreamForest: Efficient Online Video Understanding with Persistent Event Memory

NeurIPS 2025
3
citations

PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution

CVPR 2025
3
citations

CoDTS: Enhancing Sparsely Supervised Collaborative Perception with a Dual Teacher-Student Framework

AAAI 2025
1
citation

Exploring Active Learning in Meta-Learning: Enhancing Context Set Labeling

ECCV 2024
1
citation

AdaptCMVC: Robust Adaption to Incremental Views in Continual Multi-view Clustering

CVPR 2025
1
citation

An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion Models

CVPR 2025
0
citations

SAMPLE: Semantic Alignment through Temporal-Adaptive Multimodal Prompt Learning for Event-Based Open-Vocabulary Action Recognition

ICCV 2025
0
citations

MCAM: Multimodal Causal Analysis Model for Ego-Vehicle-Level Driving Video Understanding

ICCV 2025
0
citations

Redefining <Creative> in Dictionary: Towards an Enhanced Semantic Understanding of Creative Generation

CVPR 2025
0
citations

Prior-aware Dynamic Temporal Modeling Framework for Sequential 3D Hand Pose Estimation

ICCV 2025
0
citations

WAVE: Weight Templates for Adaptive Initialization of Variable-sized Models

CVPR 2025
0
citations

Learning with Adaptive Resource Allocation

ICML 2024
0
citations

Handling Heterogeneous Curvatures in Bandit LQR Control

ICML 2024
0
citations

Walk and Learn: Facial Attribute Representation Learning From Egocentric Video and Contextual Data

CVPR 2016
0
citations

Learning To Filter: Siamese Relation Network for Robust Tracking

CVPR 2021 (arXiv)
0
citations

Improving OCR-Based Image Captioning by Incorporating Geometrical Relationship

CVPR 2021
0
citations

Scene Text Retrieval via Joint Text Detection and Similarity Learning

CVPR 2021 (arXiv)
0
citations

Asymmetric Gained Deep Image Compression With Continuous Rate Adaptation

CVPR 2021
0
citations

From Two to One: A New Scene Text Recognizer With Visual Language Modeling Network

ICCV 2021 (arXiv)
0
citations

AlphaVC: High-Performance and Efficient Learned Video Compression

ECCV 2022
0
citations

Content-Oriented Learned Image Compression

ECCV 2022
0
citations

Med-DANet: Dynamic Architecture Network for Efficient Medical Volumetric Segmentation

ECCV 2022
0
citations

Detecting Tampered Scene Text in the Wild

ECCV 2022
0
citations

Provable Variable Selection for Streaming Features

ICML 2018
0
citations