Jie Song

58
Papers
214
Total Citations

Papers (58)

SiTH: Single-view Textured Human Reconstruction with Image-Conditioned Diffusion

CVPR 2024
69
citations

4D-DRESS: A 4D Dataset of Real-World Human Clothing With Semantic Annotations

CVPR 2024
43
citations

SpikePoint: An Efficient Point-based Spiking Neural Network for Event Cameras Action Recognition

ICLR 2024
32
citations

Training-Free Pretrained Model Merging

CVPR 2024
24
citations

MultiPly: Reconstruction of Multiple People from Monocular Video in the Wild

CVPR 2024
13
citations

PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation

CVPR 2025arXiv
10
citations

GauSTAR: Gaussian Surface Tracking and Reconstruction

CVPR 2025
7
citations

Dataset Ownership Verification in Contrastive Pre-trained Models

ICLR 2025
4
citations

MagicHOI: Leveraging 3D Priors for Accurate Hand-object Reconstruction from Short Monocular Video Clips

ICCV 2025
3
citations

Holistic Semantic Representation for Navigational Trajectory Generation

AAAI 2025
3
citations

MoGA: 3D Generative Avatar Prior for Monocular Gaussian Avatar Reconstruction

ICCV 2025
2
citations

D^2-DPM: Dual Denoising for Quantized Diffusion Probabilistic Models

AAAI 2025
1
citations

Agent-Aware Training for Agent-Agnostic Action Advising in Deep Reinforcement Learning

AAAI 2025
1
citations

Dataset Ownership Verification for Pre-trained Masked Models

ICCV 2025
1
citations

SrSv: Integrating Sequential Rollouts with Sequential Value Estimation for Multi-agent Reinforcement Learning

AAAI 2025
1
citations

Bootstrapping ViTs: Towards Liberating Vision Transformers From Pre-Training

CVPR 2022arXiv
0
citations

Meta-Attention for ViT-Backed Continual Learning

CVPR 2022arXiv
0
citations

D-Grasp: Physically Plausible Dynamic Grasp Synthesis for Hand-Object Interactions

CVPR 2022
0
citations

PINA: Learning a Personalized Implicit Neural Avatar From a Single RGB-D Video Sequence

CVPR 2022
0
citations

gDNA: Towards Generative Detailed Neural Avatars

CVPR 2022arXiv
0
citations

Learning Locally Editable Virtual Humans

CVPR 2023arXiv
0
citations

X-Avatar: Expressive Human Avatars

CVPR 2023
0
citations

InstantAvatar: Learning Avatars From Monocular Video in 60 Seconds

CVPR 2023arXiv
0
citations

Vid2Avatar: 3D Avatar Reconstruction From Videos in the Wild via Self-Supervised Scene Decomposition

CVPR 2023arXiv
0
citations

Generalization Matters: Loss Minima Flattening via Parameter Hybridization for Efficient Online Knowledge Distillation

CVPR 2023arXiv
0
citations

Hi4D: 4D Instance Segmentation of Close Human Interaction

CVPR 2023arXiv
0
citations

Customizing Student Networks From Heterogeneous Teachers via Adaptive Knowledge Amalgamation

ICCV 2019
0
citations

Monocular Neural Image Based Rendering With Continuous View Control

ICCV 2019
0
citations

ODHSR: Online Dense 3D Reconstruction of Humans and Scenes from Monocular Videos

CVPR 2025
0
citations

Self-Born Wiring for Neural Trees

ICCV 2021
0
citations

Shape-Aware Multi-Person Pose Estimation From Multi-View Images

ICCV 2021arXiv
0
citations

EM-POSE: 3D Human Pose Estimation From Sparse Electromagnetic Trackers

ICCV 2021
0
citations

ModelGiF: Gradient Fields for Model Functional Distance

ICCV 2023arXiv
0
citations

EMDB: The Electromagnetic Database of Global 3D Human Pose and Shape in the Wild

ICCV 2023arXiv
0
citations

Evaluation and Improvement of Interpretability for Self-Explainable Part-Prototype Networks

ICCV 2023arXiv
0
citations

Human from Blur: Human Pose Tracking from Blurry Images

ICCV 2023arXiv
0
citations

Human Body Model Fitting by Learned Gradient Descent

ECCV 2020
0
citations

Category Level Object Pose Estimation via Neural Analysis-by-Synthesis

ECCV 2020
0
citations

Learning with Recoverable Forgetting

ECCV 2022
0
citations

Attention Diversification for Domain Generalization

ECCV 2022
0
citations

End-to-End Learning for Graph Decomposition

ICCV 2019
0
citations

Training Data Provenance Verification: Did Your Model Use Synthetic Data from My Generative Model for Training?

CVPR 2025
0
citations

Capturing head avatar with hand contacts from a monocular video

ICCV 2025
0
citations

Boosting MLLM Reasoning with Text-Debiased Hint-GRPO

ICCV 2025
0
citations

Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation

AAAI 2025
0
citations

Association Pattern-enhanced Molecular Representation Learning

AAAI 2025
0
citations

Cooperative Policy Agreement: Learning Diverse Policy for Offline MARL

AAAI 2025
0
citations

Thin-Slicing Network: A Deep Structured Model for Pose Estimation in Videos

CVPR 2017arXiv
0
citations

Cross-Modal Deep Variational Hand Pose Estimation

CVPR 2018arXiv
0
citations

Transductive Unbiased Embedding for Zero-Shot Learning

CVPR 2018arXiv
0
citations

DEPARA: Deep Attribution Graph for Deep Knowledge Transferability

CVPR 2020arXiv
0
citations

Tree-Like Decision Distillation

CVPR 2021
0
citations

Training Generative Adversarial Networks in One Stage

CVPR 2021arXiv
0
citations

Label Matching Semi-Supervised Object Detection

CVPR 2022
0
citations

Slimmable Domain Adaptation

CVPR 2022
0
citations

Deep Model Transferability from Attribution Maps

NeurIPS 2019
0
citations

Mosaicking to Distill: Knowledge Distillation from Out-of-Domain Data

NeurIPS 2021
0
citations

Lookaround Optimizer: $k$ steps around, 1 step average

NeurIPS 2023
0
citations