Tao Yu

51
Papers
572
Total Citations

Papers (51)

Generative Representational Instruction Tuning

ICLR 2025
212
citations

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

ICML 2025
165
citations

Fluctuation-Based Adaptive Structured Pruning for Large Language Models

AAAI 2024 (arXiv)
96
citations

OmniSeg3D: Omniversal 3D Segmentation via Hierarchical Contrastive Learning

CVPR 2024
66
citations

PSHuman: Photorealistic Single-image 3D Human Reconstruction using Cross-Scale Multiview Diffusion and Explicit Remeshing

CVPR 2025
12
citations

GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration

CVPR 2025
10
citations

MotionPRO: Exploring the Role of Pressure in Human MoCap and Beyond

CVPR 2025
5
citations

Shadow Cones: A Generalized Framework for Partial Order Embeddings

ICLR 2024
3
citations

ImViD: Immersive Volumetric Videos for Enhanced VR Engagement

CVPR 2025
2
citations

View Transformation Robustness for Multi-View 3D Object Reconstruction with Reconstruction Error-Guided View Selection

AAAI 2025
1
citation

Robust 3D Self-Portraits in Seconds

CVPR 2020 (arXiv)
0
citations

4D Association Graph for Realtime Multi-Person Motion Capture Using Multiple Video Cameras

CVPR 2020 (arXiv)
0
citations

Deep Implicit Templates for 3D Shape Representation

CVPR 2021 (arXiv)
0
citations

POSEFusion: Pose-Guided Selective Fusion for Single-View Human Volumetric Capture

CVPR 2021 (arXiv)
0
citations

Function4D: Real-Time Human Volumetric Capture From Very Sparse Consumer RGBD Sensors

CVPR 2021 (arXiv)
0
citations

DoubleField: Bridging the Neural Surface and Radiance Fields for High-Fidelity Human Reconstruction and Rendering

CVPR 2022 (arXiv)
0
citations

FaceVerse: A Fine-Grained and Detail-Controllable 3D Face Morphable Model From a Hybrid Dataset

CVPR 2022 (arXiv)
0
citations

Interacting Attention Graph for Single Image Two-Hand Reconstruction

CVPR 2022 (arXiv)
0
citations

Structured Local Radiance Fields for Human Avatar Modeling

CVPR 2022 (arXiv)
0
citations

Learning Visibility Field for Detailed 3D Human Reconstruction and Relighting

CVPR 2023 (arXiv)
0
citations

ZBS: Zero-Shot Background Subtraction via Instance-Level Background Modeling and Foreground Selection

CVPR 2023 (arXiv)
0
citations

Task Residual for Tuning Vision-Language Models

CVPR 2023 (arXiv)
0
citations

DeepHuman: 3D Human Reconstruction From a Single Image

ICCV 2019
0
citations

DeepMultiCap: Performance Capture of Multiple Characters Using Sparse Multiview Cameras

ICCV 2021 (arXiv)
0
citations

Lightweight Multi-Person Total Motion Capture Using Sparse Multi-View Cameras

ICCV 2021 (arXiv)
0
citations

PARF: Primitive-Aware Radiance Fusion for Indoor Scene Novel View Synthesis

ICCV 2023
0
citations

RobustFusion: Human Volumetric Capture with Data-driven Visual Cues using a RGBD Camera

ECCV 2020
0
citations

NormalGAN: Learning Detailed 3D Human from a Single RGB-D Image

ECCV 2020
0
citations

Learning Disentangled Feature Representation for Hybrid-distorted Image Restoration

ECCV 2020
0
citations

HuMMan: Multi-modal 4D Human Dataset for Versatile Sensing and Modeling

ECCV 2022
0
citations

GIMO: Gaze-Informed Human Motion Prediction in Context

ECCV 2022
0
citations

Geometry-Aware Single-Image Full-Body Human Relighting

ECCV 2022
0
citations

BodyFusion: Real-Time Capture of Human Motion and Surface Geometry Using a Single Depth Camera

ICCV 2017
0
citations

V2V3D: View-to-View Denoised 3D Reconstruction for Light Field Microscopy

CVPR 2025
0
citations

Neural Fluid Simulation on Geometric Surfaces

ICLR 2025
0
citations

Neural Physical Simulation with Multi-Resolution Hash Grid Encoding

AAAI 2024
0
citations

DiffPerformer: Iterative Learning of Consistent Latent Guidance for Diffusion-based Human Video Generation

CVPR 2024
0
citations

MMVP: A Multimodal MoCap Dataset with Vision and Pressure Sensors

CVPR 2024
0
citations

HHMR: Holistic Hand Mesh Recovery by Enhancing the Multimodal Controllability of Graph Diffusion Models

CVPR 2024
0
citations

Collage: Light-Weight Low-Precision Strategy for LLM Training

ICML 2024
0
citations

DoubleFusion: Real-Time Capture of Human Performances With Inner Body Shapes From a Single Depth Sensor

CVPR 2018 (arXiv)
0
citations

SimulCap: Single-View Human Performance Capture With Cloth Simulation

CVPR 2019
0
citations

Numerically Accurate Hyperbolic Embeddings Using Tiling-Based Models

NeurIPS 2019
0
citations

A New Defense Against Adversarial Images: Turning a Weakness into a Strength

NeurIPS 2019
0
citations

PlayVirtual: Augmenting Cycle-Consistent Virtual Trajectories for Reinforcement Learning

NeurIPS 2021
0
citations

Representing Hyperbolic Space Accurately using Multi-Component Floats

NeurIPS 2021
0
citations

Understanding Hyperdimensional Computing for Parallel Single-Pass Learning

NeurIPS 2022
0
citations

Mask-based Latent Reconstruction for Reinforcement Learning

NeurIPS 2022
0
citations

Triangulation Residual Loss for Data-efficient 3D Pose Estimation

NeurIPS 2023
0
citations

Coneheads: Hierarchy Aware Attention

NeurIPS 2023
0
citations

Simplifying Graph Convolutional Networks

ICML 2019
0
citations