Huchuan Lu

30
Papers
342
Total Citations

Papers (30)

Magic Tokens: Select Diverse Tokens for Multi-modal Object Re-Identification

CVPR 2024
49
citations

TOP-ReID: Multi-Spectral Object Re-identification with Token Permutation

AAAI 2024arXiv
45
citations

Multi-view Aggregation Network for Dichotomous Image Segmentation

CVPR 2024
38
citations

SUTrack: Towards Simple and Unified Single Object Tracking

AAAI 2025
37
citations

Fantastic Animals and Where to Find Them: Segment Any Marine Animal with Dual SAM

CVPR 2024
32
citations

UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory

CVPR 2024
27
citations

The Devil is in Temporal Token: High Quality Video Reasoning Segmentation

CVPR 2025
19
citations

EVEv2: Improved Baselines for Encoder-Free Vision-Language Models

ICCV 2025arXiv
18
citations

VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Prior

ICCV 2025arXiv
17
citations

Two-stream Beats One-stream: Asymmetric Siamese Network for Efficient Visual Tracking

AAAI 2025
15
citations

EvSign: Sign Language Recognition and Translation with Streaming Events

ECCV 2024
13
citations

CLIMB-ReID: A Hybrid CLIP-Mamba Framework for Person Re-Identification

AAAI 2025
12
citations

High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity

ICLR 2025arXiv
5
citations

ReNeg: Learning Negative Embedding with Reward Guidance

CVPR 2025
5
citations

Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding

AAAI 2025
4
citations

Mono2Stereo: A Benchmark and Empirical Study for Stereo Conversion

CVPR 2025arXiv
3
citations

CCL-LGS: Contrastive Codebook Learning for 3D Language Gaussian Splatting

ICCV 2025
2
citations

Efficient Motion Prompt Learning for Robust Visual Tracking

ICML 2025
1
citations

DME: Unveiling the Bias for Better Generalized Monocular Depth Estimation

AAAI 2024
0
citations

Hybrid-SORT: Weak Cues Matter for Online Multi-Object Tracking

AAAI 2024
0
citations

CAT: A Unified Click-and-Track Framework for Realistic Tracking

ICCV 2025
0
citations

Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters

CVPR 2024
0
citations

Towards Automatic Power Battery Detection: New Challenge Benchmark Dataset and Baseline

CVPR 2024
0
citations

IDEA: Inverted Text with Cooperative Deformable Aggregation for Multi-modal Object Re-Identification

CVPR 2025
0
citations

DefMamba: Deformable Visual State Space Model

CVPR 2025
0
citations

Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception

CVPR 2024
0
citations

MambaPro: Multi-Modal Object Re-identification with Mamba Aggregation and Synergistic Prompt

AAAI 2025
0
citations

Spider: A Unified Framework for Context-dependent Concept Segmentation

ICML 2024
0
citations

FineRS: Fine-grained Reasoning and Segmentation of Small Objects with Reinforcement Learning

NeurIPS 2025
0
citations

Large Occluded Human Image Completion via Image-Prior Cooperating

AAAI 2024
0
citations