Luo
24 Papers
697 Total Citations
Papers (24)
MobileNetV4: Universal Models for the Mobile Ecosystem
ECCV 2024 (arXiv)
407
citations
Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training
CVPR 2025
68
citations
FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference
ICLR 2025 (arXiv)
51
citations
Preserving Diversity in Supervised Fine-Tuning of Large Language Models
ICLR 2025 (arXiv)
33
citations
Unlocking Multimodal Mathematical Reasoning via Process Reward Model
NeurIPS 2025 (arXiv)
29
citations
Multi-Agent Collaboration via Evolving Orchestration
NeurIPS 2025 (arXiv)
25
citations
FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities
NeurIPS 2025 (arXiv)
20
citations
Uncertainty-aware sign language video retrieval with probability distribution modeling
ECCV 2024 (arXiv)
10
citations
REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models
ECCV 2024 (arXiv)
10
citations
Last-Iterate Convergence Properties of Regret-Matching Algorithms in Games
ICLR 2025 (arXiv)
7
citations
Latent Chain-of-Thought for Visual Reasoning
NeurIPS 2025 (arXiv)
7
citations
Simultaneous Swap Regret Minimization via KL-Calibration
NeurIPS 2025 (arXiv)
6
citations
Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Prompts
ECCV 2024 (arXiv)
6
citations
WorldWeaver: Generating Long-Horizon Video Worlds via Rich Perception
NeurIPS 2025 (arXiv)
5
citations
FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression
CVPR 2025
4
citations
Attention! Your Vision Language Model Could Be Maliciously Manipulated
NeurIPS 2025 (arXiv)
3
citations
WeakMCN: Multi-task Collaborative Network for Weakly Supervised Referring Expression Comprehension and Segmentation
CVPR 2025
3
citations
Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits
ICLR 2025 (arXiv)
2
citations
When GNNs meet symmetry in ILPs: an orbit-based feature augmentation approach
ICLR 2025 (arXiv)
1
citation
DViN: Dynamic Visual Routing Network for Weakly Supervised Referring Expression Comprehension
CVPR 2025
0
citations
Geometric Algorithms for Neural Combinatorial Optimization with Constraints
NeurIPS 2025 (arXiv)
0
citations
CodeMerge: Codebook-Guided Model Merging for Robust Test-Time Adaptation in Autonomous Driving
NeurIPS 2025 (arXiv)
0
citations
Don’t Forget the Enjoin: FocalLoRA for Instruction Hierarchical Alignment in Large Language Models
NeurIPS 2025
0
citations
DSAS: A Universal Plug-and-Play Framework for Attention Optimization in Multi-Document Question Answering
NeurIPS 2025 (arXiv)
0
citations