Wei Lin

30
Papers
1,261
Total Citations

Papers (30)

MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models

NeurIPS 2025
1,227
citations

Semantic Convergence: Harmonizing Recommender Systems via Two-Stage Alignment and Behavioral Semantic Tokenization

AAAI 2025
14
citations

LiveXiv - A Multi-Modal live benchmark based on Arxiv papers content

ICLR 2025 (arXiv)
11
citations

Point-to-Region Loss for Semi-Supervised Point-Based Crowd Counting

CVPR 2025
4
citations

How to Trade Off the Quantity and Capacity of Teacher Ensemble: Learning Categorical Distribution to Stochastically Employ a Teacher for Distillation

AAAI 2024
2
citations

Efficient Long Context Fine-tuning with Chunk Flow

ICML 2025
2
citations

Proximal Mapping Loss: Understanding Loss Functions in Crowd Counting & Localization

ICLR 2025
1
citations

PortraitBooth: A Versatile Portrait Model for Fast Identity-preserved Personalization

CVPR 2024
0
citations

Towards Robust Learning to Optimize with Theoretical Guarantees

CVPR 2024
0
citations

From Fourier to Neural ODEs: Flow Matching for Modeling Complex Systems

ICML 2024
0
citations

FESSNC: Fast Exponentially Stable and Safe Neural Controller

ICML 2024
0
citations

A Statistical Theory of Regularization-Based Continual Learning

ICML 2024
0
citations

Switched Flow Matching: Eliminating Singularities via Switching ODEs

ICML 2024
0
citations

Learning From Synthetic Data for Crowd Counting in the Wild

CVPR 2019
0
citations

SwapText: Image Based Texts Transfer in Scenes

CVPR 2020 (arXiv)
0
citations

Cross-View Cross-Scene Multi-View Crowd Counting

CVPR 2021
0
citations

Optimal Transport Minimization: Crowd Localization on Density Maps for Semi-Supervised Counting

CVPR 2023
0
citations

Video Test-Time Adaptation for Action Recognition

CVPR 2023 (arXiv)
0
citations

ActMAD: Activation Matching To Align Distributions for Test-Time-Training

CVPR 2023 (arXiv)
0
citations

MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge

ICCV 2023 (arXiv)
0
citations

MATE: Masked Autoencoders are Online 3D Test-Time Learners

ICCV 2023 (arXiv)
0
citations

CycDA: Unsupervised Cycle Domain Adaptation to Learn from Image to Video

ECCV 2022
0
citations

PerLA: Perceptive 3D Language Assistant

CVPR 2025
0
citations

KOEnsAttack: Towards Efficient Data-Free Black-Box Adversarial Attacks via Knowledge-Orthogonalized Substitute Ensembles

ICCV 2025
0
citations

Robust 3D Object Detection using Probabilistic Point Clouds from Single-Photon LiDARs

ICCV 2025
0
citations

Zero-Sum vs. Positive-Sum: Effects of Inter-team Competition Modes and Haptic Feedback on Team Flow in Multi-team VR

ISMAR 2025
0
citations

Adversarial-Inspired Backdoor Defense via Bridging Backdoor and Adversarial Attacks

AAAI 2025
0
citations

Arithmetic Feature Interaction Is Necessary for Deep Tabular Learning

AAAI 2024
0
citations

Hypergraph Neural Architecture Search

AAAI 2024
0
citations

LaFTer: Label-Free Tuning of Zero-shot Classifier using Language and Unlabeled Image Collections

NeurIPS 2023
0
citations