Furu Wei

28
Papers
1,649
Total Citations
1
Affiliations

Affiliations

Microsoft Research

Papers (28)

Grounding Multimodal Large Language Models to the World

ICLR 2024
1,032
citations

Generative Representational Instruction Tuning

ICLR 2025
212
citations

Adapting Large Language Models via Reading Comprehension

ICLR 2024
126
citations

Imagine While Reasoning in Space: Multimodal Visualization-of-Thought

ICML 2025
115
citations

Learning to Rank in Generative Retrieval

AAAI 2024arXiv
67
citations

Preference Optimization for Reasoning with Pseudo Feedback

ICLR 2025
33
citations

Self-Boosting Large Language Models with Synthetic Preference Data

ICLR 2025
29
citations

ARLON: Boosting Diffusion Transformers with Autoregressive Models for Long Video Generation

ICLR 2025
27
citations

PEACE: Empowering Geologic Map Holistic Understanding with MLLMs

CVPR 2025arXiv
6
citations

Text Diffusion with Reinforced Conditioning

AAAI 2024arXiv
2
citations

Generic-to-Specific Distillation of Masked Autoencoders

CVPR 2023arXiv
0
citations

Rethinking DPO-style Diffusion Aligning Frameworks

ICCV 2025
0
citations

MathScale: Scaling Instruction Tuning for Mathematical Reasoning

ICML 2024
0
citations

Swin Transformer V2: Scaling Up Capacity and Resolution

CVPR 2022arXiv
0
citations

Image as a Foreign Language: BEiT Pretraining for Vision and Vision-Language Tasks

CVPR 2023
0
citations

Non-Contrastive Learning Meets Language-Image Pre-Training

CVPR 2023arXiv
0
citations

Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks

ECCV 2020
0
citations

Unified Language Model Pre-training for Natural Language Understanding and Generation

NeurIPS 2019
0
citations

MiniLM: Deep Self-Attention Distillation for Task-Agnostic Compression of Pre-Trained Transformers

NeurIPS 2020
0
citations

BERT Loses Patience: Fast and Robust Inference with Early Exit

NeurIPS 2020
0
citations

VLMo: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts

NeurIPS 2022
0
citations

On the Representation Collapse of Sparse Mixture of Experts

NeurIPS 2022
0
citations

TextDiffuser: Diffusion Models as Text Painters

NeurIPS 2023
0
citations

On the Pareto Front of Multilingual Neural Machine Translation

NeurIPS 2023
0
citations

Extensible Prompts for Language Models on Zero-shot Language Style Customization

NeurIPS 2023
0
citations

Optimizing Prompts for Text-to-Image Generation

NeurIPS 2023
0
citations

Language Is Not All You Need: Aligning Perception with Language Models

NeurIPS 2023
0
citations

Augmenting Language Models with Long-Term Memory

NeurIPS 2023
0
citations