Foundation Concepts / Core Definitions

LLM

Beginner [2/5]

Large Language Model Generative language model Neural language model

Definition

A Large Language Model (LLM) is an AI system trained on massive amounts of text data to understand and generate human-like text. These models learn patterns, facts, and reasoning abilities from billions of words of text.

LLMs are the foundation of modern AI assistants like ChatGPT, Claude, and Gemini. They can perform a wide variety of tasks including writing, coding, analysis, translation, and more.

Key Concepts

Parameters: The learned weights in the model (billions for large models)
Training data: The text corpus the model learned from
Inference: The process of generating output from the model
Context window: How much text the model can consider at once

Examples

Popular LLMs

Examples of Large Language Models

• GPT-4 (OpenAI) - Powers ChatGPT
• Claude (Anthropic) - Known for safety and helpfulness
• Gemini (Google) - Multimodal capabilities
• Llama (Meta) - Open-source model family
• Mistral - Efficient open-source models

Different companies have developed their own LLMs with varying capabilities and strengths.

Model Sizes

Understanding Scale

Small:   ~7B parameters   (can run on consumer hardware)
Medium:  ~70B parameters  (requires powerful GPUs)
Large:   ~175B+ parameters (requires data centers)

More parameters generally = more capable, but also:
• More expensive to run
• Slower inference
• Higher memory requirements

Model size affects both capability and practical deployment considerations.

Interactive Exercise

🔬

Explore LLM Capabilities

LLMs can handle remarkably diverse tasks. List 5 different types of tasks you could ask an LLM to perform:

Think beyond simple Q&A - consider creative, analytical, and technical tasks.

Pro Tips

LLMs are trained on data up to a cutoff date - they don't know recent events
They can be confidently wrong - always verify important facts
Different LLMs have different strengths and weaknesses
The same LLM can give different answers to the same prompt

Definition

Key Concepts

Examples

Interactive Exercise

Related Terms