Hao Dong

38
Papers
189
Total Citations

Papers (38)

Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation

AAAI 2024arXiv
58
citations

OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints

CVPR 2025
43
citations

UniGarmentManip: A Unified Framework for Category-Level Garment Manipulation via Dense Visual Correspondence

CVPR 2024
29
citations

No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation

CVPR 2024
27
citations

ET-SEED: EFFICIENT TRAJECTORY-LEVEL SE(3) EQUIVARIANT DIFFUSION POLICY

ICLR 2025
15
citations

GM-MoE: Low-Light Enhancement with Gated-Mechanism Mixture-of-Experts

ICCV 2025
5
citations

Towards Robust Multimodal Open-set Test-time Adaptation via Adaptive Entropy-aware Optimization

ICLR 2025
4
citations

Object-Centric Prompt-Driven Vision-Language-Action Model for Robotic Manipulation

CVPR 2025arXiv
4
citations

CheckManual: A New Challenge and Benchmark for Manual-based Appliance Manipulation

CVPR 2025
2
citations

Adaptive Articulated Object Manipulation On The Fly with Foundation Model Reasoning and Part Grounding

ICCV 2025arXiv
2
citations

Robot Structure Prior Guided Temporal Attention for Camera-to-Robot Pose Estimation From Image Sequence

CVPR 2023
0
citations

PartManip: Learning Cross-Category Generalizable Part Manipulation Policy From Point Cloud Observations

CVPR 2023arXiv
0
citations

Semantic Image Synthesis via Adversarial Learning

ICCV 2017arXiv
0
citations

Contrastive Multimodal Fusion With TupleInfoNCE

ICCV 2021arXiv
0
citations

Leveraging SE(3) Equivariance for Learning 3D Geometric Shape Assembly

ICCV 2023arXiv
0
citations

Learning Foresightful Dense Visual Affordance for Deformable Object Manipulation

ICCV 2023arXiv
0
citations

Unpaired Image-to-Image Translation using Adversarial Consistency Loss

ECCV 2020
0
citations

AdaAfford: Learning to Adapt Manipulation Affordance for 3D Articulated Objects via Few-Shot Interactions

ECCV 2022
0
citations

Domain Randomization-Enhanced Depth Simulation and Restoration for Perceiving and Grasping Specular and Transparent Objects

ECCV 2022
0
citations

Unseen Visual Anomaly Generation

CVPR 2025
0
citations

PartRM: Modeling Part-Level Dynamics with Large Cross-State Reconstruction Model

CVPR 2025
0
citations

GarmentPile: Point-Level Visual Affordance Guided Retrieval and Adaptation for Cluttered Garments Manipulation

CVPR 2025
0
citations

DPU: Dynamic Prototype Updating for Multimodal Out-of-Distribution Detection

CVPR 2025
0
citations

GFPack++: Attention-Driven Gradient Fields for Optimizing 2D Irregular Packing

ICCV 2025
0
citations

CADGrasp: Learning Contact and Collision Aware General Dexterous Grasping in Cluttered Scenes

NeurIPS 2025
0
citations

An Automatic Sound and Complete Abstraction Method for Generalized Planning with Baggable Types

AAAI 2025
0
citations

Scalable Geometric Fracture Assembly via Co-creation Space among Assemblers

AAAI 2024
0
citations

ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation

CVPR 2024
0
citations

GFPose: Learning 3D Human Pose Prior With Gradient Fields

CVPR 2023arXiv
0
citations

Generative 3D Part Assembly via Dynamic Graph Learning

NeurIPS 2020
0
citations

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning

NeurIPS 2022
0
citations

TarGF: Learning Target Gradient Field to Rearrange Objects without Explicit Goal Specification

NeurIPS 2022
0
citations

Where2Explore: Few-shot Affordance Learning for Unseen Novel Categories of Articulated Objects

NeurIPS 2023
0
citations

Find What You Want: Learning Demand-conditioned Object Attribute Space for Demand-driven Navigation

NeurIPS 2023
0
citations

Learning Score-based Grasping Primitive for Human-assisting Dexterous Grasping

NeurIPS 2023
0
citations

Generative Category-level Object Pose Estimation via Diffusion Models

NeurIPS 2023
0
citations

Learning Environment-Aware Affordance for 3D Articulated Object Manipulation under Occlusions

NeurIPS 2023
0
citations

SimMMDG: A Simple and Effective Framework for Multi-modal Domain Generalization

NeurIPS 2023
0
citations