Yaowei Wang

49 Papers
247 Total Citations

Papers (49)

Tracking Meets LoRA: Faster Training, Larger Model, Stronger Performance

ECCV 2024 (arXiv)
92
citations

CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition

CVPR 2024
75
citations

HARDVS: Revisiting Human Activity Recognition with Dynamic Vision Sensors

AAAI 2024 (arXiv)
64
citations

Spatial Understanding from Videos: Structured Prompts Meet Simulation Data

NeurIPS 2025
7
citations

Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning

ICCV 2025
4
citations

Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning

CVPR 2025
2
citations

Sound Bridge: Associating Egocentric and Exocentric Videos via Audio Cues

CVPR 2025
2
citations

Video Language Model Pretraining with Spatio-temporal Masking

CVPR 2025
1
citations

RS-vHeat: Heat Conduction Guided Efficient Remote Sensing Foundation Model

ICCV 2025
0
citations

Cassic: Towards Content-Adaptive State-Space Models for Learned Image Compression

ICCV 2025
0
citations

Unsupervised Degradation Representation Aware Transform for Real-World Blind Image Super-Resolution

AAAI 2025
0
citations

Pilot: Building the Federated Multimodal Instruction Tuning Framework

AAAI 2025
0
citations

Mixed-Effects Contextual Bandits

AAAI 2024
0
citations

RTracker: Recoverable Tracking via PN Tree Structured Memory

CVPR 2024
0
citations

Prompt-Driven Dynamic Object-Centric Learning for Single Domain Generalization

CVPR 2024
0
citations

Regressor-Segmenter Mutual Prompt Learning for Crowd Counting

CVPR 2024
0
citations

Modality-Collaborative Test-Time Adaptation for Action Recognition

CVPR 2024
0
citations

Multi-Factor Adaptive Vision Selection for Egocentric Video Question Answering

ICML 2024
0
citations

Unsupervised Cross-Dataset Transfer Learning for Person Re-Identification

CVPR 2016
0
citations

Contrastive Neural Architecture Search With Neural Architecture Comparators

CVPR 2021 (arXiv)
0
citations

Learning Scalable ℓ∞-Constrained Near-Lossless Image Compression via Joint Lossy Image and Residual Compression

CVPR 2021
0
citations

Towards More Flexible and Accurate Object Tracking With Natural Language: Algorithms and Benchmark

CVPR 2021 (arXiv)
0
citations

Fine-Grained Object Classification via Self-Supervised Pose Alignment

CVPR 2022 (arXiv)
0
citations

Boosting Crowd Counting via Multifaceted Attention

CVPR 2022 (arXiv)
0
citations

M5Product: Self-Harmonized Contrastive Learning for E-Commercial Multi-Modal Pretraining

CVPR 2022
0
citations

Unlearnable Clusters: Towards Label-Agnostic Unlearnable Examples

CVPR 2023 (arXiv)
0
citations

Integrally Pre-Trained Transformer Pyramid Networks

CVPR 2023 (arXiv)
0
citations

KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation

CVPR 2023 (arXiv)
0
citations

AsyFOD: An Asymmetric Adaptation Paradigm for Few-Shot Domain Adaptive Object Detection

CVPR 2023
0
citations

Exploiting Multi-Grain Ranking Constraints for Precisely Searching Visually-Similar Vehicles

ICCV 2017
0
citations

Learning Long-Term Dependencies for Action Recognition With a Biologically-Inspired Deep Network

ICCV 2017 (arXiv)
0
citations

Transductive Episodic-Wise Adaptive Metric for Few-Shot Learning

ICCV 2019
0
citations

Conformer: Local Features Coupling Global Representations for Visual Recognition

ICCV 2021 (arXiv)
0
citations

Strip-MLP: Efficient Token Interaction for Vision MLP

ICCV 2023
0
citations

CiteTracker: Correlating Image and Text for Visual Tracking

ICCV 2023 (arXiv)
0
citations

Large Batch Optimization for Object Detection: Training COCO in 12 Minutes

ECCV 2020
0
citations

An Asymmetric Modeling for Action Assessment

ECCV 2020
0
citations

Mixed-Precision Neural Network Quantization via Learned Layer-Wise Importance

ECCV 2022
0
citations

DAS: Densely-Anchored Sampling for Deep Metric Learning

ECCV 2022
0
citations

CIGAR: Cross-Modality Graph Reasoning for Domain Adaptive Object Detection

CVPR 2023
0
citations

AutoSSVH: Exploring Automated Frame Sampling for Efficient Self-Supervised Video Hashing

CVPR 2025
0
citations

NN-Former: Rethinking Graph Structure in Neural Architecture Representation

CVPR 2025
0
citations

Building Vision Models upon Heat Conduction

CVPR 2025
0
citations

DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering

CVPR 2025
0
citations

Learning Compatible Multi-Prize Subnetworks for Asymmetric Retrieval

CVPR 2025 (arXiv)
0
citations

Continual Adaptation: Environment-Conditional Parameter Generation for Object Detection in Dynamic Scenarios

ICCV 2025
0
citations

LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation

ICCV 2025
0
citations

Learning to Share in Networked Multi-Agent Reinforcement Learning

NeurIPS 2022
0
citations

Learning Mask-aware CLIP Representations for Zero-Shot Segmentation

NeurIPS 2023
0
citations