🧬Language Models

Language Modeling

Core language modeling techniques

100 papers15,401 total citations

Compare with other topics

Feb '24 — Jan '261715 papers

Top Conferences

ICLR: 50 CVPR: 22 AAAI: 14 ICML: 5 ECCV: 4 ICCV: 3

Top Papers

#1

InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

Zhe Chen, Jiannan Wu, Wenhai Wang et al.

LISA: Reasoning Segmentation via Large Language Model

Xin Lai, Zhuotao Tian, Yukang Chen et al.

VILA: On Pre-training for Visual Language Models

Ji Lin, Danny Yin, Wei Ping et al.

mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration

Qinghao Ye, Haiyang Xu, Jiabo Ye et al.

SALMONN: Towards Generic Hearing Abilities for Large Language Models

Changli Tang, Wenyi Yu, Guangzhi Sun et al.

YaRN: Efficient Context Window Extension of Large Language Models

Bowen Peng, Jeffrey Quesnelle, Honglu Fan et al.

OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation

Qidong Huang, Xiaoyi Dong, Pan Zhang et al.

Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces

Jihan Yang, Shusheng Yang, Anjali W. Gupta et al.

LLaVA-CoT: Let Vision Language Models Reason Step-by-Step

Guowei Xu, Peng Jin, ZiangWu ZiangWu et al.

OpenChat: Advancing Open-source Language Models with Mixed-Quality Data

Guan Wang, Sijie Cheng, Xianyuan Zhan et al.

Safety Alignment Should be Made More Than Just a Few Tokens Deep

Xiangyu Qi, Ashwinee Panda, Kaifeng Lyu et al.

Mixture-of-Agents Enhances Large Language Model Capabilities

Junlin Wang, Jue Wang, Ben Athiwaratkun et al.

Large Language Models as Tool Makers

Tianle Cai, Xuezhi Wang, Tengyu Ma et al.

On Scaling Up a Multilingual Vision and Language Model

Xi Chen, Josip Djolonga, Piotr Padlewski et al.

Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models

Samuel Marks, Can Rager, Eric Michaud et al.

SaProt: Protein Language Modeling with Structure-aware Vocabulary

Jin Su, Chenchen Han, Yuyang Zhou et al.

Language Models Represent Space and Time

Wes Gurnee, Max Tegmark

mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models

Jiabo Ye, Haiyang Xu, Haowei Liu et al.

Listen, Think, and Understand

Yuan Gong, Hongyin Luo, Alexander Liu et al.

R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization

Jingyi Zhang, Jiaxing Huang, Huanjin Yao et al.

Zhongjing: Enhancing the Chinese Medical Capabilities of Large Language Model through Expert Feedback and Real-World Multi-Turn Dialogue

Songhua Yang, Hanjie Zhao, Senbin Zhu et al.

AAAI 2024arXiv:2308.03549

large language modelschinese medicineexpert feedbackmulti-turn dialogue+4

204

citations

#22

Think before you speak: Training Language Models With Pause Tokens

Sachin Goyal, Ziwei Ji, Ankit Singh Rawat et al.

Can Large Language Models Infer Causation from Correlation?

Zhijing Jin, Jiarui Liu, Zhiheng LYU et al.

MUSE: Machine Unlearning Six-Way Evaluation for Language Models

Weijia Shi, Jaechan Lee, Yangsibo Huang et al.

ICLR 2025arXiv:2407.06460

machine unlearninglanguage modelsprivacy leakageverbatim memorization+4

157

citations

#25

DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Guangxuan Xiao, Jiaming Tang, Jingwei Zuo et al.

Training Language Models to Reason Efficiently

Daman Arora, Andrea Zanette

Physics of Language Models: Part 3.2, Knowledge Manipulation

Zeyuan Allen-Zhu, Yuanzhi Li

SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal

Tinghao Xie, Xiangyu Qi, Yi Zeng et al.

ICLR 2025arXiv:2406.14598

safety refusal evaluationlarge language modelsfine-grained taxonomieslinguistic augmentations+4

141

citations

#29

Linearity of Relation Decoding in Transformer Language Models

Evan Hernandez, Arnab Sen Sharma, Tal Haklay et al.

SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models

Yuzhou Huang, Liangbin Xie, Xintao Wang et al.

Large Language Models as Analogical Reasoners

Michihiro Yasunaga, Xinyun Chen, Yujia Li et al.

GSVA: Generalized Segmentation via Multimodal Large Language Models

Zhuofan Xia, Dongchen Han, Yizeng Han et al.

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

Shengpeng Ji, Ziyue Jiang, Wen Wang et al.

ICLR 2025arXiv:2408.16532

acoustic codec tokenizeraudio language modelingvector quantizationaudio reconstruction+3

125

citations

#34

Layer by Layer: Uncovering Hidden Representations in Language Models

Oscar Skean, Md Rifat Arefin, Dan Zhao et al.

ToolACE: Winning the Points of LLM Function Calling

Weiwen Liu, Xu Huang, Xingshan Zeng et al.

LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory

Di Wu, Hongwei Wang, Wenhao Yu et al.

Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference

Han Zhao, Min Zhang, Wei Zhao et al.

Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

Xiong Wang, Yangze Li, Chaoyou Fu et al.

Understanding Catastrophic Forgetting in Language Models via Implicit Inference

Suhas Kotha, Jacob Springer, Aditi Raghunathan

Synapse: Trajectory-as-Exemplar Prompting with Memory for Computer Control

Longtao Zheng, Rundong Wang, Xinrun Wang et al.

LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding

Chuwei Luo, Yufan Shen, Zhaoqing Zhu et al.

Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process

Tian Ye, Zicheng Xu, Yuanzhi Li et al.

OR-Bench: An Over-Refusal Benchmark for Large Language Models

Jiaxing Cui, Wei-Lin Chiang, Ion Stoica et al.

When Attention Sink Emerges in Language Models: An Empirical View

Xiangming Gu, Tianyu Pang, Chao Du et al.

ICLR 2025arXiv:2410.10781

attention sink phenomenonlanguage model pre-trainingsoftmax normalizationkey biases+4

90

citations

#45

Remarkable Robustness of LLMs: Stages of Inference?

Vedang Lad, Jin Hwa Lee, Wes Gurnee et al.

NeurIPS 2025arXiv:2406.19384

layer deletionadjacent layer swappingstructural interventionsinference stages+4

89

citations

#46

Training Socially Aligned Language Models on Simulated Social Interactions

Ruibo Liu, Ruixin Yang, Chenyan Jia et al.

Making Text Embedders Few-Shot Learners

Chaofan Li, Minghao Qin, Shitao Xiao et al.

In-Context Pretraining: Language Modeling Beyond Document Boundaries

Weijia Shi, Sewon Min, Maria Lomeli et al.

A Benchmark for Learning to Translate a New Language from One Grammar Book

Garrett Tanzer, Mirac Suzgun, Eline Visser et al.

Move as You Say Interact as You Can: Language-guided Human Motion Generation with Scene Affordance

Zan Wang, Yixin Chen, Baoxiong Jia et al.

EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture Modeling

Haiyang Liu, Zihao Zhu, Giorgio Becherini et al.

Towards Foundation Models for Knowledge Graph Reasoning

Mikhail Galkin, Xinyu Yuan, Hesham Mostafa et al.

Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models

Javier Ferrando, Oscar Obeso, Senthooran Rajamanoharan et al.

ICLR 2025arXiv:2411.14257

sparse autoencodershallucination mechanismsentity recognitionknowledge awareness+3

77

citations

#54

TLControl: Trajectory and Language Control for Human Motion Synthesis

WEILIN WAN, Zhiyang Dou, Taku Komura et al.

ECCV 2024arXiv:2311.17135

human motion synthesistrajectory controllanguage controlvq-vae+4

77

citations

#55

GraphRouter: A Graph-based Router for LLM Selections

Tao Feng, Yanzhen Shen, Jiaxuan You

ICLR 2025arXiv:2410.03834

llm selectionheterogeneous graph constructioninductive graph frameworkedge prediction mechanism+3

77

citations

#56

Eliciting Human Preferences with Language Models

Belinda Li, Alex Tamkin, Noah Goodman et al.

Generalization v.s. Memorization: Tracing Language Models’ Capabilities Back to Pretraining Data

Xinyi Wang, Antonis Antoniades, Yanai Elazar et al.

GROUNDHOG: Grounding Large Language Models to Holistic Segmentation

Yichi Zhang, Ziqiao Ma, Xiaofeng Gao et al.

Graph Neural Prompting with Large Language Models

Yijun Tian, Huan Song, Zichen Wang et al.

AAAI 2024arXiv:2309.15427

graph neural networksknowledge graphslarge language modelsretrieval-augmented generation+4

74

citations

#60

MMTEB: Massive Multilingual Text Embedding Benchmark

Kenneth Enevoldsen, Isaac Chung, Imene Kerboua et al.

ICLR 2025arXiv:2502.13595

text embedding evaluationmultilingual benchmarksinstruction following taskslong-document retrieval+4

74

citations

#61

LLaFS: When Large Language Models Meet Few-Shot Segmentation

Lanyun Zhu, Tianrun Chen, Deyi Ji et al.

Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language Models

Wenbin Wang, Liang Ding, Minyan Zeng et al.

Towards 3D Molecule-Text Interpretation in Language Models

Sihang Li, Zhiyuan Liu, Yanchen Luo et al.

OmniGlue: Generalizable Feature Matching with Foundation Model Guidance

Hanwen Jiang, Arjun Karpur, Bingyi Cao et al.

On the Learnability of Watermarks for Language Models

Chenchen Gu, XIANG LI, Percy Liang et al.

Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

Zhen Ye, Peiwen Sun, Jiahe Lei et al.

Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation

Shanshan Zhong, Zhongzhan Huang, Shanghua Gao et al.

ELLA-V: Stable Neural Codec Language Modeling with Alignment-Guided Sequence Reordering

Yakun Song, Zhuo Chen, Xiaofei Wang et al.

GPT4Point: A Unified Framework for Point-Language Understanding and Generation

Zhangyang Qi, Ye Fang, Zeyi Sun et al.

DSBench: How Far Are Data Science Agents from Becoming Data Science Experts?

Liqiang Jing, Zhehui Huang, Xiaoyang Wang et al.

ICLR 2025arXiv:2409.07703

data science agentslarge language modelslarge vision-language modelsdata analysis tasks+4

62

citations

#71

Driving Everywhere with Large Language Model Policy Adaptation

Boyi Li, Yue Wang, Jiageng Mao et al.

Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge Neurons

Yuheng Chen, Pengfei Cao, Yubo Chen et al.

AAAI 2024arXiv:2308.13198

knowledge neuronsmultilingual language modelsfactual knowledge storageintegrated gradients method+4

59

citations

#73

AI Sandbagging: Language Models can Strategically Underperform on Evaluations

Teun van der Weij, Felix Hofstätter, Oliver Jaffe et al.

Towards Semantic Equivalence of Tokenization in Multimodal LLM

Shengqiong Wu, Hao Fei, Xiangtai Li et al.

SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation

Yi-Chia Chen, WeiHua Li, Cheng Sun et al.

Editing Language Model

Based Knowledge Graph Embeddings

AAAI 2024arXiv:2305.14908

language model editinghallucination mitigationprompt-based editingunsupervised training+3

57

citations

#77

Language Model Inversion

John X. Morris, Wenting Zhao, Justin Chiu et al.

BEND: Benchmarking DNA Language Models on Biologically Meaningful Tasks

Frederikke Marin, Felix Teufel, Marc Horlacher et al.

Can Language Beat Numerical Regression? Language-Based Multimodal Trajectory Prediction

Inhwan Bae, Junoh Lee, Hae-Gon Jeon

LatestEval: Addressing Data Contamination in Language Model Evaluation through Dynamic and Time

Sensitive Test Construction - Yucheng Li, Frank Guerin, Chenghua Lin

AAAI 2024arXiv:2312.12343

data contaminationlanguage model evaluationreading comprehensiondynamic evaluation+4

53

citations

#81

Fit and Prune: Fast and Training-free Visual Token Pruning for Multi-modal Large Language Models

Weihao Ye, Qiong Wu, Wenhao Lin et al.

Jack of All Tasks Master of Many: Designing General-Purpose Coarse-to-Fine Vision-Language Model

Shraman Pramanick, Guangxing Han, Rui Hou et al.

Physics of Language Models: Part 2.2, How to Learn From Mistakes on Grade-School Math Problems

Tian Ye, Zicheng Xu, Yuanzhi Li et al.

SocialCounterfactuals: Probing and Mitigating Intersectional Social Biases in Vision-Language Models with Counterfactual Examples

Phillip Howard, Avinash Madasu, Tiep Le et al.

Calibrating Large Language Models with Sample Consistency

Qing Lyu, Kumar Shridhar, Chaitanya Malaviya et al.

CoMo: Controllable Motion Generation through Language Guided Pose Code Editing

Yiming Huang, WEILIN WAN, Yue Yang et al.

ECCV 2024arXiv:2403.13900

text-to-motion generationcontrollable motion editingdiscrete pose codeslarge language models+4

48

citations

#87

Language Model Can Listen While Speaking

Ziyang Ma, Yakun Song, Chenpeng Du et al.

What does the Knowledge Neuron Thesis Have to do with Knowledge?

Jingcheng Niu, Andrew Liu, Zining Zhu et al.

Organize the Web: Constructing Domains Enhances Pre-Training Data Curation

Alexander Wettig, Kyle Lo, Sewon Min et al.

ALLaM: Large Language Models for Arabic and English

M Saiful Bari, Yazeed Alnumay, Norah Alzahrani et al.

DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM

Yixuan Wu, Yizhou Wang, Shixiang Tang et al.

FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows"

Yifei Ming, Senthil Purushwalkam, Shrey Pandit et al.

ICLR 2025

faithfulness hallucinationretrieval-augmented generationcontextual evaluation benchmarkunanswerable context handling+3

45

citations

#93

Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning

Andy (DiJia) Su, Hanlin Zhu, Yingchen Xu et al.

EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions

Kai Chen, Yunhao Gou, Runhui Huang et al.

Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis

Kai Chen, Chunwei Wang, Kuo Yang et al.

DLF: Disentangled-Language-Focused Multimodal Sentiment Analysis

Pan Wang, Qiang Zhou, Yawen Wu et al.

Style2Talker: High-Resolution Talking Head Generation with Emotion Style and Art Style

Shuai Tan, Bin Ji, Ye Pan

AAAI 2024arXiv:2403.06365

talking head generationemotion style transferart style transferaudio-driven animation+4

43

citations

#98

CC-OCR: A Comprehensive and Challenging OCR Benchmark for Evaluating Large Multimodal Models in Literacy

Zhibo Yang, Jun Tang, Zhaohai Li et al.

ICCV 2025arXiv:2412.02210

large multimodal modelsoptical character recognitionmultilingual text readingdocument parsing+4

42

citations

#99

Articulate-Anything: Automatic Modeling of Articulated Objects via a Vision-Language Foundation Model

Long Le, Jason Xie, William Liang et al.

Few-Shot Detection of Machine-Generated Text using Style Representations

Rafael Rivera Soto, Kailin Koch, Aleem Khan et al.

ICLR 2024

41

citations

Language Modeling

Top Conferences

Related Topics (Language Models)

Top Papers

InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

LISA: Reasoning Segmentation via Large Language Model

VILA: On Pre-training for Visual Language Models

mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration

SALMONN: Towards Generic Hearing Abilities for Large Language Models

YaRN: Efficient Context Window Extension of Large Language Models

OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation

Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces

LLaVA-CoT: Let Vision Language Models Reason Step-by-Step

OpenChat: Advancing Open-source Language Models with Mixed-Quality Data

Safety Alignment Should be Made More Than Just a Few Tokens Deep

Mixture-of-Agents Enhances Large Language Model Capabilities

Large Language Models as Tool Makers

On Scaling Up a Multilingual Vision and Language Model

Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models

SaProt: Protein Language Modeling with Structure-aware Vocabulary

Language Models Represent Space and Time

mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models

Listen, Think, and Understand

R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization

Zhongjing: Enhancing the Chinese Medical Capabilities of Large Language Model through Expert Feedback and Real-World Multi-Turn Dialogue

Think before you speak: Training Language Models With Pause Tokens

Can Large Language Models Infer Causation from Correlation?

MUSE: Machine Unlearning Six-Way Evaluation for Language Models

DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Training Language Models to Reason Efficiently

Physics of Language Models: Part 3.2, Knowledge Manipulation

SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal

Linearity of Relation Decoding in Transformer Language Models

SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models

Large Language Models as Analogical Reasoners

GSVA: Generalized Segmentation via Multimodal Large Language Models

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

Layer by Layer: Uncovering Hidden Representations in Language Models

ToolACE: Winning the Points of LLM Function Calling

LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory

Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference

Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

Understanding Catastrophic Forgetting in Language Models via Implicit Inference

Synapse: Trajectory-as-Exemplar Prompting with Memory for Computer Control

LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding

Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process

OR-Bench: An Over-Refusal Benchmark for Large Language Models

When Attention Sink Emerges in Language Models: An Empirical View

Remarkable Robustness of LLMs: Stages of Inference?

Training Socially Aligned Language Models on Simulated Social Interactions

Making Text Embedders Few-Shot Learners

In-Context Pretraining: Language Modeling Beyond Document Boundaries

A Benchmark for Learning to Translate a New Language from One Grammar Book

Move as You Say Interact as You Can: Language-guided Human Motion Generation with Scene Affordance

EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture Modeling

Towards Foundation Models for Knowledge Graph Reasoning

Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models

TLControl: Trajectory and Language Control for Human Motion Synthesis

GraphRouter: A Graph-based Router for LLM Selections

Eliciting Human Preferences with Language Models

Generalization v.s. Memorization: Tracing Language Models’ Capabilities Back to Pretraining Data

GROUNDHOG: Grounding Large Language Models to Holistic Segmentation

Graph Neural Prompting with Large Language Models

MMTEB: Massive Multilingual Text Embedding Benchmark

LLaFS: When Large Language Models Meet Few-Shot Segmentation

Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language Models

Towards 3D Molecule-Text Interpretation in Language Models

OmniGlue: Generalizable Feature Matching with Foundation Model Guidance

On the Learnability of Watermarks for Language Models

Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation

ELLA-V: Stable Neural Codec Language Modeling with Alignment-Guided Sequence Reordering

GPT4Point: A Unified Framework for Point-Language Understanding and Generation

DSBench: How Far Are Data Science Agents from Becoming Data Science Experts?

Driving Everywhere with Large Language Model Policy Adaptation

Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge Neurons

AI Sandbagging: Language Models can Strategically Underperform on Evaluations

Towards Semantic Equivalence of Tokenization in Multimodal LLM

SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation

Editing Language Model