Furu Wei

28

Papers

1,649

Total Citations

1

Affiliations

Affiliations

Microsoft Research

Papers (28)

Grounding Multimodal Large Language Models to the World

Generative Representational Instruction Tuning

Adapting Large Language Models via Reading Comprehension

Imagine While Reasoning in Space: Multimodal Visualization-of-Thought

Learning to Rank in Generative Retrieval

Preference Optimization for Reasoning with Pseudo Feedback

Self-Boosting Large Language Models with Synthetic Preference Data

ARLON: Boosting Diffusion Transformers with Autoregressive Models for Long Video Generation

PEACE: Empowering Geologic Map Holistic Understanding with MLLMs

Text Diffusion with Reinforced Conditioning

Generic-to-Specific Distillation of Masked Autoencoders

Rethinking DPO-style Diffusion Aligning Frameworks

MathScale: Scaling Instruction Tuning for Mathematical Reasoning

Swin Transformer V2: Scaling Up Capacity and Resolution

Image as a Foreign Language: BEiT Pretraining for Vision and Vision-Language Tasks

Non-Contrastive Learning Meets Language-Image Pre-Training

Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks

Unified Language Model Pre-training for Natural Language Understanding and Generation

MiniLM: Deep Self-Attention Distillation for Task-Agnostic Compression of Pre-Trained Transformers

BERT Loses Patience: Fast and Robust Inference with Early Exit

VLMo: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts

On the Representation Collapse of Sparse Mixture of Experts

TextDiffuser: Diffusion Models as Text Painters

On the Pareto Front of Multilingual Neural Machine Translation

Extensible Prompts for Language Models on Zero-shot Language Style Customization

Optimizing Prompts for Text-to-Image Generation

Language Is Not All You Need: Aligning Perception with Language Models

Augmenting Language Models with Long-Term Memory