Fei Huang

24
Papers
1,118
Total Citations

Papers (24)

mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration

CVPR 2024
601
citations

Preference Ranking Optimization for Human Alignment

AAAI 2024arXiv
334
citations

Hallucination Augmented Contrastive Learning for Multimodal Large Language Model

CVPR 2024
116
citations

Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent

ICLR 2025
53
citations

SymDPO: Boosting In-Context Learning of Large Multimodal Models with Symbol Demonstration Direct Preference Optimization

CVPR 2025
7
citations

AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization

CVPR 2025
4
citations

Platypus: A Generalized Specialist Model for Reading Text in Various Forms

ECCV 2024
2
citations

CateKV: On Sequential Consistency for Long-Context LLM Inference Acceleration

ICML 2025
1
citations

BUS: Efficient and Effective Vision-Language Pre-Training with Bottom-Up Patch Summarization.

ICCV 2023arXiv
0
citations

Training-Free Long-Context Scaling of Large Language Models

ICML 2024
0
citations

EcomGPT: Instruction-Tuning Large Language Models with Chain-of-Task Tasks for E-commerce

AAAI 2024arXiv
0
citations

SeqGPT: An Out-of-the-Box Large Language Model for Open Domain Sequence Understanding

AAAI 2024arXiv
0
citations

OmniParser: A Unified Framework for Text Spotting Key Information Extraction and Table Recognition

CVPR 2024
0
citations

Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch

ICML 2024
0
citations

Unsupervised Multi-Modal Neural Machine Translation

CVPR 2019
0
citations

HiTeA: Hierarchical Temporal-Aware Video-Language Pre-training

ICCV 2023arXiv
0
citations

Learning Trajectory-Word Alignments for Video-Language Tasks

ICCV 2023arXiv
0
citations

Decoupling Knowledge from Memorization: Retrieval-augmented Prompt Learning

NeurIPS 2022
0
citations

EMMA-X: An EM-like Multilingual Pre-training Algorithm for Cross-lingual Representation Learning

NeurIPS 2023
0
citations

RRHF: Rank Responses to Align Language Models with Human Feedback

NeurIPS 2023
0
citations

Debiased and Denoised Entity Recognition from Distant Supervision

NeurIPS 2023
0
citations

SPA: A Graph Spectral Alignment Perspective for Domain Adaptation

NeurIPS 2023
0
citations

SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue Agents

NeurIPS 2023
0
citations

Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs

NeurIPS 2023
0
citations