Luo

24
Papers
697
Total Citations

Papers (24)

MobileNetV4: Universal Models for the Mobile Ecosystem

ECCV 2024, arXiv
407
citations

Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training

CVPR 2025
68
citations

FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference

ICLR 2025, arXiv
51
citations

Preserving Diversity in Supervised Fine-Tuning of Large Language Models

ICLR 2025, arXiv
33
citations

Unlocking Multimodal Mathematical Reasoning via Process Reward Model

NeurIPS 2025, arXiv
29
citations

Multi-Agent Collaboration via Evolving Orchestration

NeurIPS 2025, arXiv
25
citations

FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities

NeurIPS 2025, arXiv
20
citations

Uncertainty-aware sign language video retrieval with probability distribution modeling

ECCV 2024, arXiv
10
citations

REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models

ECCV 2024, arXiv
10
citations

Last-Iterate Convergence Properties of Regret-Matching Algorithms in Games

ICLR 2025, arXiv
7
citations

Latent Chain-of-Thought for Visual Reasoning

NeurIPS 2025, arXiv
7
citations

Simultaneous Swap Regret Minimization via KL-Calibration

NeurIPS 2025, arXiv
6
citations

Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Prompts

ECCV 2024, arXiv
6
citations

WorldWeaver: Generating Long-Horizon Video Worlds via Rich Perception

NeurIPS 2025, arXiv
5
citations

FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression

CVPR 2025
4
citations

Attention! Your Vision Language Model Could Be Maliciously Manipulated

NeurIPS 2025, arXiv
3
citations

WeakMCN: Multi-task Collaborative Network for Weakly Supervised Referring Expression Comprehension and Segmentation

CVPR 2025
3
citations

Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits

ICLR 2025, arXiv
2
citations

When GNNs meet symmetry in ILPs: an orbit-based feature augmentation approach

ICLR 2025, arXiv
1
citation

DViN: Dynamic Visual Routing Network for Weakly Supervised Referring Expression Comprehension

CVPR 2025
0
citations

Geometric Algorithms for Neural Combinatorial Optimization with Constraints

NeurIPS 2025, arXiv
0
citations

CodeMerge: Codebook-Guided Model Merging for Robust Test-Time Adaptation in Autonomous Driving

NeurIPS 2025, arXiv
0
citations

Don’t Forget the Enjoin: FocalLoRA for Instruction Hierarchical Alignment in Large Language Models

NeurIPS 2025
0
citations

DSAS: A Universal Plug-and-Play Framework for Attention Optimization in Multi-Document Question Answering

NeurIPS 2025, arXiv
0
citations