Jue Wang

59
Papers
30
Total Citations

Papers (59)

FoldToken: Learning Protein Language via Vector Quantization and Beyond

AAAI 2025
20
citations

Text-Guided Video Masked Autoencoder

ECCV 2024
7
citations

FloE: On-the-Fly MoE Inference on Memory-constrained GPU

ICML 2025
3
citations

Coherent Parametric Contours for Interactive Video Object Segmentation

CVPR 2016
0
citations

Automatic Fence Segmentation in Videos of Dynamic Scenes

CVPR 2016
0
citations

Deep Video Deblurring for Hand-Held Cameras

CVPR 2017
0
citations

Video Representation Learning Using Discriminative Pooling

CVPR 2018 (arXiv)
0
citations

DocUNet: Document Image Unwarping via a Stacked U-Net

CVPR 2018
0
citations

Scale-Recurrent Network for Deep Image Deblurring

CVPR 2018 (arXiv)
0
citations

GIF2Video: Color Dequantization and Temporal Interpolation of GIF Images

CVPR 2019
0
citations

GeoNet: Deep Geodesic Networks for Point Cloud Analysis

CVPR 2019
0
citations

Audio Visual Scene-Aware Dialog

CVPR 2019
0
citations

UPFlow: Upsampling Pyramid for Unsupervised Optical Flow Learning

CVPR 2021 (arXiv)
0
citations

Vx2Text: End-to-End Learning of Video-Based Text Generation From Multimodal Inputs

CVPR 2021 (arXiv)
0
citations

Motion-Aware Contrastive Video Representation Learning via Foreground-Background Merging

CVPR 2022 (arXiv)
0
citations

Long-Short Temporal Contrastive Learning of Video Transformers

CVPR 2022 (arXiv)
0
citations

FENeRF: Face Editing in Neural Radiance Fields

CVPR 2022 (arXiv)
0
citations

Deformable Video Transformer

CVPR 2022 (arXiv)
0
citations

Hallucinated Neural Radiance Fields in the Wild

CVPR 2022 (arXiv)
0
citations

Deblur-NeRF: Neural Radiance Fields From Blurry Images

CVPR 2022
0
citations

Multi-Robot Active Mapping via Neural Bipartite Graph Matching

CVPR 2022 (arXiv)
0
citations

LAS-AT: Adversarial Training With Learnable Attack Strategy

CVPR 2022
0
citations

Unsupervised Pre-Training for Temporal Action Localization Tasks

CVPR 2022 (arXiv)
0
citations

Exploring Denoised Cross-Video Contrast for Weakly-Supervised Temporal Action Localization

CVPR 2022
0
citations

High-Fidelity GAN Inversion for Image Attribute Editing

CVPR 2022 (arXiv)
0
citations

Self-Supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection

CVPR 2022 (arXiv)
0
citations

Patch-Based 3D Natural Scene Generation From a Single Example

CVPR 2023 (arXiv)
0
citations

CodeTalker: Speech-Driven 3D Facial Animation With Discrete Motion Prior

CVPR 2023 (arXiv)
0
citations

Fine-Grained Face Swapping via Regional GAN Inversion

CVPR 2023 (arXiv)
0
citations

ACR: Attention Collaboration-Based Regressor for Arbitrary Two-Hand Reconstruction

CVPR 2023 (arXiv)
0
citations

Learning Anchor Transformations for 3D Garment Animation

CVPR 2023 (arXiv)
0
citations

Skinned Motion Retargeting With Residual Perception of Motion Semantics & Geometry

CVPR 2023 (arXiv)
0
citations

UV Volumes for Real-Time Rendering of Editable Free-View Human Performance

CVPR 2023 (arXiv)
0
citations

Selective Structured State-Spaces for Long-Form Video Understanding

CVPR 2023 (arXiv)
0
citations

Zero-Order Reverse Filtering

ICCV 2017 (arXiv)
0
citations

Detail-Revealing Deep Video Super-Resolution

ICCV 2017 (arXiv)
0
citations

Semi-Supervised Skin Detection by Network With Mutual Guidance

ICCV 2019
0
citations

Not All Parts Are Created Equal: 3D Pose Estimation by Modeling Bi-Directional Dependencies of Body Parts

ICCV 2019
0
citations

GODS: Generalized One-Class Discriminative Subspaces for Anomaly Detection

ICCV 2019
0
citations

Disentangled Image Matting

ICCV 2019
0
citations

Motion-Guided Masking for Spatiotemporal Representation Learning

ICCV 2023 (arXiv)
0
citations

Ultra High-Resolution Image Inpainting with Patch-Based Content Consistency Adapter

ICCV 2025
0
citations

Practical Deep Raw Image Denoising on Mobile Devices

ECCV 2020
0
citations

Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards

ECCV 2020
0
citations

Prior-Guided Adversarial Initialization for Fast Adversarial Training

ECCV 2022
0
citations

Spatial-Separated Curve Rendering Network for Efficient and High-Resolution Image Harmonization

ECCV 2022
0
citations

Towards Accurate Active Camera Localization

ECCV 2022
0
citations

StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGAN

ECCV 2022
0
citations

LocVTP: Video-Text Pre-training for Temporal Localization

ECCV 2022
0
citations

Content-Aware Unsupervised Deep Homography Estimation

ECCV 2020
0
citations

Soft Prompt Recovers Compressed LLMs, Transferably

ICML 2024
0
citations

Blind Optical Aberration Correction by Exploring Geometric and Visual Priors

CVPR 2015
0
citations

Revitalizing CNN Attention via Transformers in Self-Supervised Visual Representation Learning

NeurIPS 2021
0
citations

VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

NeurIPS 2022
0
citations

Stability Analysis and Generalization Bounds of Adversarial Training

NeurIPS 2022
0
citations

AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition

NeurIPS 2022
0
citations

OST: Improving Generalization of DeepFake Detection via One-Shot Test-Time Training

NeurIPS 2022
0
citations

One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations

NeurIPS 2022
0
citations

Boosting the Transferability of Adversarial Attacks with Reverse Adversarial Perturbation

NeurIPS 2022
0
citations