Luo

33
Papers
380
Total Citations

Papers (33)

stagNet: An Attentive Semantic RNN for Group Activity Recognition

ECCV 2018
152
citations

Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training

CVPR 2025
68
citations

FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference

ICLR 2025arXiv
51
citations

Unlocking Multimodal Mathematical Reasoning via Process Reward Model

NeurIPS 2025arXiv
29
citations

Multi-Agent Collaboration via Evolving Orchestration

NeurIPS 2025arXiv
25
citations

FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities

NeurIPS 2025arXiv
20
citations

Uncertainty-aware sign language video retrieval with probability distribution modeling

ECCV 2024arXiv
10
citations

Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Prompts

ECCV 2024arXiv
6
citations

Simultaneous Swap Regret Minimization via KL-Calibration

NeurIPS 2025arXiv
6
citations

FlashSloth : Lightning Multimodal Large Language Models via Embedded Visual Compression

CVPR 2025
4
citations

Attention! Your Vision Language Model Could Be Maliciously Manipulated

NeurIPS 2025arXiv
3
citations

WeakMCN: Multi-task Collaborative Network for Weakly Supervised Referring Expression Comprehension and Segmentation

CVPR 2025
3
citations

Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits

ICLR 2025arXiv
2
citations

When GNNs meet symmetry in ILPs: an orbit-based feature augmentation approach

ICLR 2025arXiv
1
citations

Learning and Matching Multi-View Descriptors for Registration of Point Clouds

ECCV 2018
0
citations

Bi-Real Net: Enhancing the Performance of 1-bit CNNs with Improved Representational Capability and Advanced Training Algorithm

ECCV 2018
0
citations

Video Re-localization

ECCV 2018
0
citations

GeoDesc: Learning Local Descriptors by Integrating Geometry Constraints

ECCV 2018
0
citations

Two at Once: Enhancing Learning and Generalization Capacities via IBN-Net

ECCV 2018
0
citations

Macro-Micro Adversarial Network for Human Parsing

ECCV 2018
0
citations

DViN: Dynamic Visual Routing Network for Weakly Supervised Referring Expression Comprehension

CVPR 2025
0
citations

Geometric Algorithms for Neural Combinatorial Optimization with Constraints

NeurIPS 2025arXiv
0
citations

Don’t Forget the Enjoin: FocalLoRA for Instruction Hierarchical Alignment in Large Language Models

NeurIPS 2025
0
citations

VQA-E: Explaining, Elaborating, and Enhancing Your Answers for Visual Questions

ECCV 2018
0
citations

MVSNet: Depth Inference for Unstructured Multi-view Stereo

ECCV 2018
0
citations

StarMap for Category-Agnostic Keypoint and Viewpoint Estimation

ECCV 2018
0
citations

Acquisition of Localization Confidence for Accurate Object Detection

ECCV 2018
0
citations

``Factual'' or ``Emotional'': Stylized Image Captioning with Adaptive Learning and Attention

ECCV 2018
0
citations

Graph Distillation for Action Detection with Privileged Modalities

ECCV 2018
0
citations

DF-Net: Unsupervised Joint Learning of Depth and Flow using Cross-Task Consistency

ECCV 2018
0
citations

Learning to Navigate for Fine-grained Classification

ECCV 2018
0
citations

Deep Volumetric Video From Very Sparse Multi-View Performance Capture

ECCV 2018
0
citations

Unsupervised Domain Adaptation for 3D Keypoint Estimation via View Consistency

ECCV 2018
0
citations