Sen Wang

28
Papers
12
Total Citations

Papers (28)

General Scene Adaptation for Vision-and-Language Navigation

ICLR 2025
10
citations

Quantifying and Narrowing the Unknown: Interactive Text-to-Video Retrieval via Uncertainty Minimization

ICCV 2025
2
citations

FlowRAM: Grounding Flow Matching Policy with Region-Aware Mamba Framework for Robotic Manipulation

CVPR 2025
0
citations

Is Less More? Exploring Token Condensation as Training-free Test-time Adaptation

ICCV 2025
0
citations

From Enhancement to Understanding: Build a Generalized Bridge for Low-light Vision via Semantically Consistent Unsupervised Fine-tuning

ICCV 2025
0
citations

SAMPO: Scale-wise Autoregression with Motion Prompt for Generative World Models

NeurIPS 2025
0
citations

DynaRend: Learning 3D Dynamics via Masked Future Rendering for Robotic Manipulation

NeurIPS 2025
0
citations

Medium-Difficulty Samples Constitute Smoothed Decision Boundary for Knowledge Distillation on Pruned Datasets

ICLR 2025
0
citations

MoMask: Generative Masked Modeling of 3D Human Motions

CVPR 2024
0
citations

VidLoc: A Deep Spatio-Temporal Model for 6-DoF Video-Clip Relocalization

CVPR 2017
0
citations

Detailed Human Shape Estimation From a Single Image by Hierarchical Mesh Deformation

CVPR 2019
0
citations

ZSTAD: Zero-Shot Temporal Activity Detection

CVPR 2020
0
citations

LiDAR-Aug: A General Rendering-Based Augmentation Framework for 3D Object Detection

CVPR 2021
0
citations

Generating Diverse and Natural 3D Human Motions From Text

CVPR 2022
0
citations

DSVT: Dynamic Sparse Voxel Transformer With Rotated Sets

CVPR 2023
0
citations

Interactive Visual Hull Refinement for Specular and Transparent Object Surface Reconstruction

ICCV 2015
0
citations

Detailed Surface Geometry and Albedo Recovery From RGB-D Video Under Natural Illumination

ICCV 2017
0
citations

TextPlace: Visual Place Recognition and Topological Localization Through Reading Scene Texts

ICCV 2019
0
citations

Semantics Disentangling for Generalized Zero-Shot Learning

ICCV 2021
0
citations

EventHPE: Event-Based 3D Human Pose and Shape Estimation

ICCV 2021
0
citations

3D Human Shape Reconstruction from a Polarization Image

ECCV 2020
0
citations

Object Wake-Up: 3D Object Rigging from a Single Image

ECCV 2022
0
citations

TM2T: Stochastic and Tokenized Modeling for the Reciprocal Generation of 3D Human Motions and Texts

ECCV 2022
0
citations

PDFactor: Learning Tri-Perspective View Policy Diffusion Field for Multi-Task Robotic Manipulation

CVPR 2025
0
citations

M3GYM: A Large-Scale Multimodal Multi-view Multi-person Pose Dataset for Fitness Activity Understanding in Real-world Settings

CVPR 2025
0
citations

Learning Object Bounding Boxes for 3D Instance Segmentation on Point Clouds

NeurIPS 2019
0
citations

Improved Feature Distillation via Projector Ensemble

NeurIPS 2022
0
citations

RVD: A Handheld Device-Based Fundus Video Dataset for Retinal Vessel Segmentation

NeurIPS 2023
0
citations