Houqiang Li

83
Papers
153
Total Citations

Papers (83)

EG4D: Explicit Generation of 4D Object without Score Distillation

ICLR 2025 (arXiv)
39
citations

TinySAM: Pushing the Envelope for Efficient Segment Anything Model

AAAI 2025
37
citations

Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models

AAAI 2025
22
citations

DesignDiffusion: High-Quality Text-to-Design Image Generation with Diffusion Models

CVPR 2025
19
citations

SmartEraser: Remove Anything from Images using Masked-Region Guidance

CVPR 2025
12
citations

Long-term Temporal Context Gathering for Neural Video Compression

ECCV 2024
12
citations

RaCFormer: Towards High-Quality 3D Object Detection via Query-based Radar-Camera Fusion

CVPR 2025
11
citations

Revisiting Open-Set Panoptic Segmentation

AAAI 2024
1
citations

KGDM: A Diffusion Model to Capture Multiple Relation Semantics for Knowledge Graph Embedding

AAAI 2024
0
citations

InstructDiffusion: A Generalist Modeling Interface for Vision Tasks

CVPR 2024
0
citations

Generative Latent Coding for Ultra-Low Bitrate Image Compression

CVPR 2024
0
citations

Instance-aware Exploration-Verification-Exploitation for Instance ImageGoal Navigation

CVPR 2024
0
citations

Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning

ICML 2024
0
citations

From Yes-Men to Truth-Tellers: Addressing Sycophancy in Large Language Models with Pinpoint Tuning

ICML 2024
0
citations

Semi-Supervised Domain Adaptation With Subspace Learning for Visual Recognition

CVPR 2015
0
citations

SOM: Semantic Obviousness Metric for Image Quality Assessment

CVPR 2015
0
citations

Comparative Deep Learning of Hybrid Representations for Image Recommendations

CVPR 2016
0
citations

Jointly Modeling Embedding and Translation to Bridge Video and Language

CVPR 2016
0
citations

Video Captioning With Transferred Semantic Attributes

CVPR 2017 (arXiv)
0
citations

Feature Selective Networks for Object Detection

CVPR 2018 (arXiv)
0
citations

Multi-Cue Correlation Filters for Robust Visual Tracking

CVPR 2018
0
citations

Towards Open-Set Identity Preserving Face Synthesis

CVPR 2018 (arXiv)
0
citations

Unsupervised Deep Tracking

CVPR 2019
0
citations

Iterative Alignment Network for Continuous Sign Language Recognition

CVPR 2019
0
citations

Quantization Networks

CVPR 2019
0
citations

M-LVC: Multiple Frames Prediction for Learned Video Compression

CVPR 2020
0
citations

Transformation GAN for Unsupervised Image Synthesis and Representation Learning

CVPR 2020
0
citations

Improving Sign Language Translation With Monolingual Data by Sign Back-Translation

CVPR 2021 (arXiv)
0
citations

Representing Videos As Discriminative Sub-Graphs for Action Recognition

CVPR 2021
0
citations

Unsupervised Pre-Training for Person Re-Identification

CVPR 2021 (arXiv)
0
citations

Model-Aware Gesture-to-Gesture Translation

CVPR 2021
0
citations

ATSO: Asynchronous Teacher-Student Optimization for Semi-Supervised Image Segmentation

CVPR 2021
0
citations

Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking

CVPR 2021 (arXiv)
0
citations

Generating Diverse Structure for Image Inpainting With Hierarchical VQ-VAE

CVPR 2021 (arXiv)
0
citations

Revisiting Knowledge Distillation: An Inheritance and Exploration Framework

CVPR 2021
0
citations

Uformer: A General U-Shaped Transformer for Image Restoration

CVPR 2022
0
citations

Contextual Similarity Distillation for Asymmetric Image Retrieval

CVPR 2022
0
citations

Large-Scale Pre-Training for Person Re-Identification With Noisy Labels

CVPR 2022 (arXiv)
0
citations

Domain-Agnostic Prior for Transfer Semantic Segmentation

CVPR 2022 (arXiv)
0
citations

Asymmetric Feature Fusion for Image Retrieval

CVPR 2023
0
citations

Human Pose As Compositional Tokens

CVPR 2023 (arXiv)
0
citations

Stare at What You See: Masked Image Modeling Without Reconstruction

CVPR 2023 (arXiv)
0
citations

AltFreezing for More General Video Face Forgery Detection

CVPR 2023
0
citations

HandNeRF: Neural Radiance Fields for Animatable Interacting Hands

CVPR 2023 (arXiv)
0
citations

CVAE-GAN: Fine-Grained Image Generation Through Asymmetric Training

ICCV 2017
0
citations

Relation Distillation Networks for Video Object Detection

ICCV 2019
0
citations

Joint Inductive and Transductive Learning for Video Object Segmentation

ICCV 2021 (arXiv)
0
citations

SignBERT: Pre-Training of Hand-Model-Aware Representation for Sign Language Recognition

ICCV 2021 (arXiv)
0
citations

Conditional DETR for Fast Training Convergence

ICCV 2021 (arXiv)
0
citations

Instance-Wise Hard Negative Example Generation for Contrastive Learning in Unpaired Image-to-Image Translation

ICCV 2021 (arXiv)
0
citations

3D Local Convolutional Neural Networks for Gait Recognition

ICCV 2021
0
citations

Learning Deep Local Features With Multiple Dynamic Attentions for Large-Scale Image Retrieval

ICCV 2021
0
citations

TransVG: End-to-End Visual Grounding With Transformers

ICCV 2021 (arXiv)
0
citations

Sign Language Translation with Iterative Prototype

ICCV 2023 (arXiv)
0
citations

DIRE for Diffusion-Generated Image Detection

ICCV 2023 (arXiv)
0
citations

SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning

ICCV 2023 (arXiv)
0
citations

Focus on Your Target: A Dual Teacher-Student Framework for Domain-Adaptive Semantic Segmentation

ICCV 2023 (arXiv)
0
citations

Masked Motion Predictors are Strong 3D Action Representation Learners

ICCV 2023 (arXiv)
0
citations

Cyclic-Bootstrap Labeling for Weakly Supervised Object Detection

ICCV 2023 (arXiv)
0
citations

CMD: Self-Supervised 3D Action Representation Learning with Cross-Modal Mutual Distillation

ECCV 2022
0
citations

TAPE: Task-Agnostic Prior Embedding for Image Restoration

ECCV 2022
0
citations

CMT: Context-Matching-Guided Transformer for 3D Tracking in Point Clouds

ECCV 2022
0
citations

MVP: Multimodality-Guided Visual Pre-training

ECCV 2022
0
citations

Geometric Representation Learning for Document Image Rectification

ECCV 2022
0
citations

Motion Information Propagation for Neural Video Compression

CVPR 2023
0
citations

Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters

CVPR 2025 (arXiv)
0
citations

Towards Practical Real-Time Neural Video Compression

CVPR 2025
0
citations

OPTICAL: Leveraging Optimal Transport for Contribution Allocation in Dataset Distillation

CVPR 2025
0
citations

Active Perception Meets Rule-Guided RL: A Two-Phase Approach for Precise Object Navigation in Complex Environments

ICCV 2025
0
citations

S3R-GS: Streamlining the Pipeline for Large-Scale Street Scene Reconstruction

ICCV 2025
0
citations

Enhancing the Outcome Reward-based RL Training of MLLMs with Self-Consistency Sampling

NeurIPS 2025
0
citations

SUF: Stabilized Unconstrained Fine-Tuning for Offline-to-Online Reinforcement Learning

AAAI 2024
0
citations

Promoting Stochasticity for Expressive Policies via a Simple and Efficient Regularization Method

NeurIPS 2020
0
citations

Dual Progressive Prototype Network for Generalized Zero-Shot Learning

NeurIPS 2021
0
citations

Contextual Similarity Aggregation with Self-attention for Visual Re-ranking

NeurIPS 2021
0
citations

Probing Inter-modality: Visual Parsing with Self-Attention for Vision-and-Language Pre-training

NeurIPS 2021
0
citations

LDSA: Learning Dynamic Subtask Assignment in Cooperative Multi-Agent Reinforcement Learning

NeurIPS 2022
0
citations

Hand-Object Interaction Image Generation

NeurIPS 2022
0
citations

Multi-Agent First Order Constrained Optimization in Policy Space

NeurIPS 2023
0
citations

CLIP4HOI: Towards Adapting CLIP for Practical Zero-Shot HOI Detection

NeurIPS 2023
0
citations

Hierarchical Multi-Agent Skill Discovery

NeurIPS 2023
0
citations

State Sequences Prediction via Fourier Transform for Representation Learning

NeurIPS 2023
0
citations

DIFFER: Decomposing Individual Reward for Fair Experience Replay in Multi-Agent Reinforcement Learning

NeurIPS 2023
0
citations