Fei Huang
24
Papers
1,118
Total Citations
Papers (24)
mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration
CVPR 2024
601
citations
Preference Ranking Optimization for Human Alignment
AAAI 2024 (arXiv)
334
citations
Hallucination Augmented Contrastive Learning for Multimodal Large Language Model
CVPR 2024
116
citations
Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent
ICLR 2025
53
citations
SymDPO: Boosting In-Context Learning of Large Multimodal Models with Symbol Demonstration Direct Preference Optimization
CVPR 2025
7
citations
AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization
CVPR 2025
4
citations
Platypus: A Generalized Specialist Model for Reading Text in Various Forms
ECCV 2024
2
citations
CateKV: On Sequential Consistency for Long-Context LLM Inference Acceleration
ICML 2025
1
citations
BUS: Efficient and Effective Vision-Language Pre-Training with Bottom-Up Patch Summarization
ICCV 2023 (arXiv)
0
citations
Training-Free Long-Context Scaling of Large Language Models
ICML 2024
0
citations
EcomGPT: Instruction-Tuning Large Language Models with Chain-of-Task Tasks for E-commerce
AAAI 2024 (arXiv)
0
citations
SeqGPT: An Out-of-the-Box Large Language Model for Open Domain Sequence Understanding
AAAI 2024 (arXiv)
0
citations
OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition
CVPR 2024
0
citations
Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch
ICML 2024
0
citations
Unsupervised Multi-Modal Neural Machine Translation
CVPR 2019
0
citations
HiTeA: Hierarchical Temporal-Aware Video-Language Pre-training
ICCV 2023 (arXiv)
0
citations
Learning Trajectory-Word Alignments for Video-Language Tasks
ICCV 2023 (arXiv)
0
citations
Decoupling Knowledge from Memorization: Retrieval-augmented Prompt Learning
NeurIPS 2022
0
citations
EMMA-X: An EM-like Multilingual Pre-training Algorithm for Cross-lingual Representation Learning
NeurIPS 2023
0
citations
RRHF: Rank Responses to Align Language Models with Human Feedback
NeurIPS 2023
0
citations
Debiased and Denoised Entity Recognition from Distant Supervision
NeurIPS 2023
0
citations
SPA: A Graph Spectral Alignment Perspective for Domain Adaptation
NeurIPS 2023
0
citations
SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue Agents
NeurIPS 2023
0
citations
Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs
NeurIPS 2023
0
citations