Hao Luo

21
Papers
117
Total Citations

Papers (21)

The All-Seeing Project V2: Towards General Relation Comprehension of the Open World

ECCV 2024
86
citations

Reinforcement Learning Friendly Vision-Language Model for Minecraft

ECCV 2024
14
citations

Unified Multimodal Understanding via Byte-Pair Visual Encoding

ICCV 2025
7
citations

Free-Form Motion Control: Controlling the 6D Poses of Camera and Objects in Video Generation

ICCV 2025
4
citations

PlayerOne: Egocentric World Simulator

NeurIPS 2025
3
citations

Making Old Film Great Again: Degradation-aware State Space Model for Old Film Restoration

CVPR 2025
3
citations

Enhancing Hyperspectral Images via Diffusion Model and Group-Autoencoder Super-resolution Network

AAAI 2024
0
citations

DiffAug: Enhance Unsupervised Contrastive Learning with Domain-Knowledge-Free Diffusion-based Data Augmentation

ICML 2024
0
citations

Accelerating Parallel Sampling of Diffusion Models

ICML 2024
0
citations

Beyond Appearance: A Semantic Controllable Self-Supervised Learning Framework for Human-Centric Visual Tasks

CVPR 2023
0
citations

MSINet: Twins Contrastive Search of Multi-Scale Interaction for Object ReID

CVPR 2023
0
citations

TransReID: Transformer-Based Object Re-Identification

ICCV 2021
0
citations

Revisiting Vision Transformer from the View of Path Ensemble

ICCV 2023
0
citations

Unstructured Feature Decoupling for Vehicle Re-identification

ECCV 2022
0
citations

BVT-IMA: Binary Vision Transformer with Information-Modified Attention

AAAI 2024
0
citations

AnyI2V: Animating Any Conditional Image with Motion Control

ICCV 2025
0
citations

Preacher: Paper-to-Video Agentic System

ICCV 2025
0
citations

VideoOrion: Tokenizing Object Dynamics in Videos

ICCV 2025
0
citations

Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy

ICCV 2025
0
citations

CL2CM: Improving Cross-Lingual Cross-Modal Retrieval via Cross-Lingual Knowledge Transfer

AAAI 2024
0
citations

VTC-LFC: Vision Transformer Compression with Low-Frequency Components

NeurIPS 2022
0
citations