Large language model - Wikipedia

Source: en.wikipedia.org

This article provides an in-depth exploration of large language models (LLMs), their architecture, training processes, and historical development, focusing on technical aspects and advancements in the field of natural language processing.

Editorial Highlights

  • LLMs are trained with self-supervised machine learning on vast amounts of text for natural language processing tasks.
  • The transformer architecture, introduced in 2017, revolutionized language modeling by allowing efficient parallelization and handling of longer contexts (see the sketch after this list).
  • Recent developments include the rise of open-weight models and multimodal capabilities in LLMs.
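
To make the second highlight concrete, below is a minimal sketch of scaled dot-product self-attention with a causal mask, the core operation of the transformer architecture. It is an illustrative toy in NumPy, not any particular model's implementation; the shapes, weight matrices, and function name are assumptions chosen for clarity.

```python
# Minimal sketch of causal scaled dot-product self-attention (illustrative only).
import numpy as np

def causal_self_attention(x, w_q, w_k, w_v):
    """x: (seq_len, d_model); w_q/w_k/w_v: (d_model, d_head) -- hypothetical shapes."""
    q = x @ w_q                                   # queries
    k = x @ w_k                                   # keys
    v = x @ w_v                                   # values
    d_head = q.shape[-1]
    scores = q @ k.T / np.sqrt(d_head)            # (seq_len, seq_len) similarity scores
    # Causal mask: each position may attend only to itself and earlier tokens.
    mask = np.triu(np.ones_like(scores, dtype=bool), k=1)
    scores = np.where(mask, -np.inf, scores)
    # Row-wise softmax turns scores into attention weights.
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v                            # (seq_len, d_head) context vectors

# Toy usage with random parameters.
rng = np.random.default_rng(0)
seq_len, d_model, d_head = 4, 8, 8
x = rng.normal(size=(seq_len, d_model))
w_q, w_k, w_v = (rng.normal(size=(d_model, d_head)) for _ in range(3))
print(causal_self_attention(x, w_q, w_k, w_v).shape)  # (4, 8)
```

Because every position's attention weights are computed as one matrix product over the whole sequence, the operation parallelizes across tokens during training, which is the efficiency gain the transformer brought over sequential recurrent models.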