Hao Dong

38

Papers

189

Total Citations

Papers (38)

Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation

OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints

UniGarmentManip: A Unified Framework for Category-Level Garment Manipulation via Dense Visual Correspondence

No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation

ET-SEED: EFFICIENT TRAJECTORY-LEVEL SE(3) EQUIVARIANT DIFFUSION POLICY

GM-MoE: Low-Light Enhancement with Gated-Mechanism Mixture-of-Experts

Towards Robust Multimodal Open-set Test-time Adaptation via Adaptive Entropy-aware Optimization

Object-Centric Prompt-Driven Vision-Language-Action Model for Robotic Manipulation

CheckManual: A New Challenge and Benchmark for Manual-based Appliance Manipulation

Adaptive Articulated Object Manipulation On The Fly with Foundation Model Reasoning and Part Grounding

Robot Structure Prior Guided Temporal Attention for Camera-to-Robot Pose Estimation From Image Sequence

PartManip: Learning Cross-Category Generalizable Part Manipulation Policy From Point Cloud Observations

Semantic Image Synthesis via Adversarial Learning

Contrastive Multimodal Fusion With TupleInfoNCE

Leveraging SE(3) Equivariance for Learning 3D Geometric Shape Assembly

Learning Foresightful Dense Visual Affordance for Deformable Object Manipulation

Unpaired Image-to-Image Translation using Adversarial Consistency Loss

AdaAfford: Learning to Adapt Manipulation Affordance for 3D Articulated Objects via Few-Shot Interactions

Domain Randomization-Enhanced Depth Simulation and Restoration for Perceiving and Grasping Specular and Transparent Objects

Unseen Visual Anomaly Generation

PartRM: Modeling Part-Level Dynamics with Large Cross-State Reconstruction Model

GarmentPile: Point-Level Visual Affordance Guided Retrieval and Adaptation for Cluttered Garments Manipulation

DPU: Dynamic Prototype Updating for Multimodal Out-of-Distribution Detection

GFPack++: Attention-Driven Gradient Fields for Optimizing 2D Irregular Packing

CADGrasp: Learning Contact and Collision Aware General Dexterous Grasping in Cluttered Scenes

An Automatic Sound and Complete Abstraction Method for Generalized Planning with Baggable Types

Scalable Geometric Fracture Assembly via Co-creation Space among Assemblers

ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation

GFPose: Learning 3D Human Pose Prior With Gradient Fields

Generative 3D Part Assembly via Dynamic Graph Learning

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning

TarGF: Learning Target Gradient Field to Rearrange Objects without Explicit Goal Specification

Where2Explore: Few-shot Affordance Learning for Unseen Novel Categories of Articulated Objects

Find What You Want: Learning Demand-conditioned Object Attribute Space for Demand-driven Navigation

Learning Score-based Grasping Primitive for Human-assisting Dexterous Grasping

Generative Category-level Object Pose Estimation via Diffusion Models

Learning Environment-Aware Affordance for 3D Articulated Object Manipulation under Occlusions

SimMMDG: A Simple and Effective Framework for Multi-modal Domain Generalization