Jian Wang
74 Papers
530 Total Citations
Papers (74)
SkySense: A Multi-Modal Remote Sensing Foundation Model Towards Universal Interpretation for Earth Observation Imagery
CVPR 2024
236
citations
Premise Selection for Theorem Proving by Deep Graph Embedding
NeurIPS 2017 (arXiv)
142
citations
RobustSAM: Segment Anything Robustly on Degraded Images
CVPR 2024
35
citations
Cooper: Coordinating Specialized Agents towards a Complex Dialogue Goal
AAAI 2024 (arXiv)
27
citations
KABB: Knowledge-Aware Bayesian Bandits for Dynamic Expert Coordination in Multi-Agent Systems
ICML 2025
26
citations
DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided Transformer
CVPR 2024
19
citations
Robust Communicative Multi-Agent Reinforcement Learning with Active Defense
AAAI 2024 (arXiv)
8
citations
Training-Free Text-Guided Image Editing with Visual Autoregressive Model
ICCV 2025 (arXiv)
7
citations
Ego4o: Egocentric Human Motion Capture and Understanding from Multi-Modal Input
CVPR 2025
5
citations
POT: Prototypical Optimal Transport for Weakly Supervised Semantic Segmentation
CVPR 2025
5
citations
Delving Deep into Engagement Prediction of Short Videos
ECCV 2024
5
citations
EcoMatcher: Efficient Clustering Oriented Matcher for Detector-free Image Matching
ECCV 2024
4
citations
SceneMI: Motion In-betweening for Modeling Human-Scene Interaction
ICCV 2025
3
citations
Discrete Curvature Graph Information Bottleneck
AAAI 2025
3
citations
Bring Your Rear Cameras for Egocentric 3D Human Pose Estimation
ICCV 2025
2
citations
Style Quantization for Data-Efficient GAN Training
CVPR 2025
2
citations
FRAME: Floor-aligned Representation for Avatar Motion from Egocentric Video
CVPR 2025
1
citations
DeepFLASH: An Efficient Network for Learning-Based Medical Image Registration
CVPR 2020 (arXiv)
0
citations
Unsupervised Multi-Source Domain Adaptation for Person Re-Identification
CVPR 2021 (arXiv)
0
citations
One Shot Face Swapping on Megapixels
CVPR 2021 (arXiv)
0
citations
Seeing in Extra Darkness Using a Deep-Red Flash
CVPR 2021
0
citations
Human-Object Interaction Detection via Disentangled Transformer
CVPR 2022 (arXiv)
0
citations
MixFormer: Mixing Features Across Windows and Dimensions
CVPR 2022 (arXiv)
0
citations
Estimating Egocentric 3D Human Pose in the Wild With External Weak Supervision
CVPR 2022 (arXiv)
0
citations
Training Object Detectors From Scratch: An Empirical Study in the Era of Vision Transformer
CVPR 2022
0
citations
Implicit Sample Extension for Unsupervised Person Re-Identification
CVPR 2022 (arXiv)
0
citations
3D Photo Stylization: Learning To Generate Stylized Novel Views From a Single Image
CVPR 2022 (arXiv)
0
citations
Energy-Efficient Adaptive 3D Sensing
CVPR 2023
0
citations
Scene-Aware Egocentric 3D Human Pose Estimation
CVPR 2023 (arXiv)
0
citations
PSVT: End-to-End Multi-Person 3D Pose and Shape Estimation With Progressive Video Transformers
CVPR 2023 (arXiv)
0
citations
Simultaneously Short- and Long-Term Temporal Modeling for Semi-Supervised Video Semantic Segmentation
CVPR 2023
0
citations
Photometric Stereo With Small Angular Variations
ICCV 2015
0
citations
Deep Metric Learning With Angular Loss
ICCV 2017 (arXiv)
0
citations
Reflectance Capture Using Univariate Sampling of BRDFs
ICCV 2017
0
citations
Micro-Baseline Structured Light
ICCV 2019
0
citations
Agile Depth Sensing Using Triangulation Light Curtains
ICCV 2019
0
citations
Mining Contextual Information Beyond Image for Semantic Segmentation
ICCV 2021 (arXiv)
0
citations
MFNet: Multi-Filter Directive Network for Weakly Supervised Salient Object Detection
ICCV 2021
0
citations
Estimating Egocentric 3D Human Pose in Global Space
ICCV 2021 (arXiv)
0
citations
KVQ: Boosting Video Quality Assessment via Saliency-guided Local Perception
CVPR 2025
0
citations
Group Pose: A Simple Baseline for End-to-End Multi-Person Pose Estimation
ICCV 2023 (arXiv)
0
citations
σ-Adaptive Decoupled Prototype for Few-Shot Object Detection
ICCV 2023
0
citations
Unified Pre-Training with Pseudo Texts for Text-To-Image Person Re-Identification
ICCV 2023 (arXiv)
0
citations
Uncertainty-guided Learning for Improving Image Manipulation Detection
ICCV 2023
0
citations
Graph-PCNN: Two Stage Human Pose Estimation with Graph Pose Refinement
ECCV 2020
0
citations
Action Quality Assessment with Temporal Parsing Transformer
ECCV 2022
0
citations
UnrealEgo: A New Dataset for Robust Egocentric 3D Human Motion Capture
ECCV 2022
0
citations
Seeing Far in the Dark with Patterned Flash
ECCV 2022
0
citations
UFO: Unified Feature Optimization
ECCV 2022
0
citations
Hierarchical Memory Learning for Fine-Grained Scene Graph Generation
ECCV 2022
0
citations
Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment
ICCV 2023 (arXiv)
0
citations
SkySense-O: Towards Open-World Remote Sensing Interpretation with Vision-Centric Visual-Language Modeling
CVPR 2025
0
citations
Ponimator: Unfolding Interactive Pose for Versatile Human-human Interaction Animation
ICCV 2025
0
citations
T2Bs: Text-to-Character Blendshapes via Video Generation
ICCV 2025
0
citations
TextMaster: A Unified Framework for Realistic Text Editing via Glyph-Style Dual-Control
ICCV 2025
0
citations
RAGDiffusion: Faithful Cloth Generation via External Knowledge Assimilation
ICCV 2025
0
citations
Class Token as Proxy: Optimal Transport-assisted Proxy Learning for Weakly Supervised Semantic Segmentation
ICCV 2025
0
citations
Similar Modality Enhancement and Action Consistency Learning for Weakly Supervised Temporal Action Localization
AAAI 2025
0
citations
Federated Recommendation with Explicitly Encoding Item Bias
AAAI 2025
0
citations
3D Human Pose Perception from Egocentric Stereo Videos
CVPR 2024
0
citations
Towards Better Vision-Inspired Vision-Language Models
CVPR 2024
0
citations
EventEgo3D: 3D Human Motion Capture from Egocentric Event Streams
CVPR 2024
0
citations
REWIND: Real-Time Egocentric Whole-Body Motion Diffusion with Exemplar-Based Identity Conditioning
CVPR 2024
0
citations
Exponential Spectral Pursuit: An Effective Initialization Method for Sparse Phase Retrieval
ICML 2024
0
citations
Mobile Attention: Mobile-Friendly Linear-Attention for Vision Transformers
ICML 2024
0
citations
MS$^3$D: A RG Flow-Based Regularization for GAN Training with Limited Data
ICML 2024
0
citations
Re-Identification Supervised Texture Generation
CVPR 2019
0
citations
Watch out! Motion is Blurring the Vision of Your Deep Neural Networks
NeurIPS 2020
0
citations
Group Contextual Encoding for 3D Point Clouds
NeurIPS 2020
0
citations
RTFormer: Efficient Design for Real-Time Semantic Segmentation with Transformer
NeurIPS 2022
0
citations
Geo-SIC: Learning Deformable Geometric Shapes in Deep Image Classifiers
NeurIPS 2022
0
citations
Singular Value Fine-tuning: Few-shot Segmentation requires Few-parameters Fine-tuning
NeurIPS 2022
0
citations
A Unified Conditional Framework for Diffusion-based Image Restoration
NeurIPS 2023
0
citations
HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception
NeurIPS 2023
0
citations