LLM
An LLM (Large Language Model) is an AI model trained on enormous text corpora to predict the next word (or, more precisely, the next token) in a sequence.
An LLM (Large Language Model) is an AI model trained on enormous text corpora to predict the next word (or, more precisely, the next token) in a sequence. At scale, it produces coherent text, translates, summarises, reasons and generates code.
Modern LLMs are based on the [transformer](/ressources/glossaire-de-la-tech/transformer) architecture and typically have hundreds of billions of parameters. They are first pre-trained on raw text, then fine-tuned (instruction tuning, RLHF) to follow instructions and respect safety rules.
Major examples in 2026: Claude (Anthropic), GPT (OpenAI), Gemini (Google), Mistral, Llama (Meta, open weights), Qwen (Alibaba). They are accessible via API or runnable locally for open-weights models.
