Jian Wang

74
Papers
530
Total Citations

Papers (74)

SkySense: A Multi-Modal Remote Sensing Foundation Model Towards Universal Interpretation for Earth Observation Imagery

CVPR 2024
236
citations

Premise Selection for Theorem Proving by Deep Graph Embedding

NeurIPS 2017 (arXiv)
142
citations

RobustSAM: Segment Anything Robustly on Degraded Images

CVPR 2024
35
citations

Cooper: Coordinating Specialized Agents towards a Complex Dialogue Goal

AAAI 2024 (arXiv)
27
citations

KABB: Knowledge-Aware Bayesian Bandits for Dynamic Expert Coordination in Multi-Agent Systems

ICML 2025
26
citations

DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided Transformer

CVPR 2024
19
citations

Robust Communicative Multi-Agent Reinforcement Learning with Active Defense

AAAI 2024 (arXiv)
8
citations

Training-Free Text-Guided Image Editing with Visual Autoregressive Model

ICCV 2025 (arXiv)
7
citations

Ego4o: Egocentric Human Motion Capture and Understanding from Multi-Modal Input

CVPR 2025
5
citations

POT: Prototypical Optimal Transport for Weakly Supervised Semantic Segmentation

CVPR 2025
5
citations

Delving Deep into Engagement Prediction of Short Videos

ECCV 2024
5
citations

EcoMatcher: Efficient Clustering Oriented Matcher for Detector-free Image Matching

ECCV 2024
4
citations

SceneMI: Motion In-betweening for Modeling Human-Scene Interaction

ICCV 2025
3
citations

Discrete Curvature Graph Information Bottleneck

AAAI 2025
3
citations

Bring Your Rear Cameras for Egocentric 3D Human Pose Estimation

ICCV 2025
2
citations

Style Quantization for Data-Efficient GAN Training

CVPR 2025
2
citations

FRAME: Floor-aligned Representation for Avatar Motion from Egocentric Video

CVPR 2025
1
citations

DeepFLASH: An Efficient Network for Learning-Based Medical Image Registration

CVPR 2020 (arXiv)
0
citations

Unsupervised Multi-Source Domain Adaptation for Person Re-Identification

CVPR 2021 (arXiv)
0
citations

One Shot Face Swapping on Megapixels

CVPR 2021 (arXiv)
0
citations

Seeing in Extra Darkness Using a Deep-Red Flash

CVPR 2021
0
citations

Human-Object Interaction Detection via Disentangled Transformer

CVPR 2022 (arXiv)
0
citations

MixFormer: Mixing Features Across Windows and Dimensions

CVPR 2022 (arXiv)
0
citations

Estimating Egocentric 3D Human Pose in the Wild With External Weak Supervision

CVPR 2022 (arXiv)
0
citations

Training Object Detectors From Scratch: An Empirical Study in the Era of Vision Transformer

CVPR 2022
0
citations

Implicit Sample Extension for Unsupervised Person Re-Identification

CVPR 2022 (arXiv)
0
citations

3D Photo Stylization: Learning To Generate Stylized Novel Views From a Single Image

CVPR 2022 (arXiv)
0
citations

Energy-Efficient Adaptive 3D Sensing

CVPR 2023
0
citations

Scene-Aware Egocentric 3D Human Pose Estimation

CVPR 2023 (arXiv)
0
citations

PSVT: End-to-End Multi-Person 3D Pose and Shape Estimation With Progressive Video Transformers

CVPR 2023 (arXiv)
0
citations

Simultaneously Short- and Long-Term Temporal Modeling for Semi-Supervised Video Semantic Segmentation

CVPR 2023
0
citations

Photometric Stereo With Small Angular Variations

ICCV 2015
0
citations

Deep Metric Learning With Angular Loss

ICCV 2017 (arXiv)
0
citations

Reflectance Capture Using Univariate Sampling of BRDFs

ICCV 2017
0
citations

Micro-Baseline Structured Light

ICCV 2019
0
citations

Agile Depth Sensing Using Triangulation Light Curtains

ICCV 2019
0
citations

Mining Contextual Information Beyond Image for Semantic Segmentation

ICCV 2021 (arXiv)
0
citations

MFNet: Multi-Filter Directive Network for Weakly Supervised Salient Object Detection

ICCV 2021
0
citations

Estimating Egocentric 3D Human Pose in Global Space

ICCV 2021 (arXiv)
0
citations

KVQ: Boosting Video Quality Assessment via Saliency-guided Local Perception

CVPR 2025
0
citations

Group Pose: A Simple Baseline for End-to-End Multi-Person Pose Estimation

ICCV 2023 (arXiv)
0
citations

σ-Adaptive Decoupled Prototype for Few-Shot Object Detection

ICCV 2023
0
citations

Unified Pre-Training with Pseudo Texts for Text-To-Image Person Re-Identification

ICCV 2023 (arXiv)
0
citations

Uncertainty-guided Learning for Improving Image Manipulation Detection

ICCV 2023
0
citations

Graph-PCNN: Two Stage Human Pose Estimation with Graph Pose Refinement

ECCV 2020
0
citations

Action Quality Assessment with Temporal Parsing Transformer

ECCV 2022
0
citations

UnrealEgo: A New Dataset for Robust Egocentric 3D Human Motion Capture

ECCV 2022
0
citations

Seeing Far in the Dark with Patterned Flash

ECCV 2022
0
citations

UFO: Unified Feature Optimization

ECCV 2022
0
citations

Hierarchical Memory Learning for Fine-Grained Scene Graph Generation

ECCV 2022
0
citations

Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment

ICCV 2023 (arXiv)
0
citations

SkySense-O: Towards Open-World Remote Sensing Interpretation with Vision-Centric Visual-Language Modeling

CVPR 2025
0
citations

Ponimator: Unfolding Interactive Pose for Versatile Human-human Interaction Animation

ICCV 2025
0
citations

T2Bs: Text-to-Character Blendshapes via Video Generation

ICCV 2025
0
citations

TextMaster: A Unified Framework for Realistic Text Editing via Glyph-Style Dual-Control

ICCV 2025
0
citations

RAGDiffusion: Faithful Cloth Generation via External Knowledge Assimilation

ICCV 2025
0
citations

Class Token as Proxy: Optimal Transport-assisted Proxy Learning for Weakly Supervised Semantic Segmentation

ICCV 2025
0
citations

Similar Modality Enhancement and Action Consistency Learning for Weakly Supervised Temporal Action Localization

AAAI 2025
0
citations

Federated Recommendation with Explicitly Encoding Item Bias

AAAI 2025
0
citations

3D Human Pose Perception from Egocentric Stereo Videos

CVPR 2024
0
citations

Towards Better Vision-Inspired Vision-Language Models

CVPR 2024
0
citations

EventEgo3D: 3D Human Motion Capture from Egocentric Event Streams

CVPR 2024
0
citations

REWIND: Real-Time Egocentric Whole-Body Motion Diffusion with Exemplar-Based Identity Conditioning

CVPR 2024
0
citations

Exponential Spectral Pursuit: An Effective Initialization Method for Sparse Phase Retrieval

ICML 2024
0
citations

Mobile Attention: Mobile-Friendly Linear-Attention for Vision Transformers

ICML 2024
0
citations

MS³D: A RG Flow-Based Regularization for GAN Training with Limited Data

ICML 2024
0
citations

Re-Identification Supervised Texture Generation

CVPR 2019
0
citations

Watch out! Motion is Blurring the Vision of Your Deep Neural Networks

NeurIPS 2020
0
citations

Group Contextual Encoding for 3D Point Clouds

NeurIPS 2020
0
citations

RTFormer: Efficient Design for Real-Time Semantic Segmentation with Transformer

NeurIPS 2022
0
citations

Geo-SIC: Learning Deformable Geometric Shapes in Deep Image Classifiers

NeurIPS 2022
0
citations

Singular Value Fine-tuning: Few-shot Segmentation requires Few-parameters Fine-tuning

NeurIPS 2022
0
citations

A Unified Conditional Framework for Diffusion-based Image Restoration

NeurIPS 2023
0
citations

HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception

NeurIPS 2023
0
citations