
How ChatGPT Actually Works (Simplified)

[Image: A robotic and human hand reach toward each other under transparent "AI" letters on a blue, circuit-patterned background.]

ChatGPT is probably the most well-known AI product right now. Millions of people use it daily. But most people have no idea what's happening when they type a question and get a response. Let's walk through it without drowning in technical jargon.


Understanding the Role of a Large Language Model


ChatGPT is built on a type of AI called a large language model (LLM). The word "large" refers to the size of the model — it has billions of parameters (think of parameters as adjustable dials that affect how the system processes text). The word "language" means it works with text. The word "model" means it's a mathematical representation of patterns in that text.

At its most basic level, an LLM does one thing: given a sequence of words, it predicts what word should come next. That's the fundamental operation. When you ask ChatGPT a question, it generates a response one word (actually one token, but we'll get to that) at a time, predicting the most likely next word based on everything that came before.
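That single operation, "given what came before, what comes next?", can be illustrated with a hypothetical hand-built probability table. (A real model computes these probabilities with a neural network over billions of parameters; the table here is purely a stand-in.)

```python
# Toy next-word predictor. The lookup table stands in for the neural
# network: keys are the preceding word, values are the probability of
# each candidate next word.
NEXT_WORD_PROBS = {
    "the": {"cat": 0.4, "dog": 0.35, "weather": 0.25},
    "cat": {"sat": 0.6, "ran": 0.4},
}

def predict_next(word):
    """Return the most likely next word given the previous word."""
    candidates = NEXT_WORD_PROBS[word]
    return max(candidates, key=candidates.get)

print(predict_next("the"))  # -> cat
```

Everything ChatGPT produces is built by repeating this one step, with vastly richer probabilities conditioned on the entire conversation rather than just the last word.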


Training Methodology Overview

Training an LLM happens in stages.


Stage 1: Pre-training. The model is fed an enormous amount of text from the internet — websites, books, articles, code repositories, forums, and more. It reads all of this and learns statistical patterns about language. Which words tend to follow other words? How are sentences structured? What topics are associated with which kinds of language? It's not memorizing the text. It's learning the patterns within the text. This stage requires massive computational resources and takes weeks or months on thousands of specialized processors.
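Real pre-training is far more sophisticated, but the spirit of "learning which words follow which" can be sketched with a simple bigram count over a tiny made-up corpus:

```python
from collections import defaultdict, Counter

# A tiny stand-in corpus; real pre-training uses trillions of words.
corpus = "the cat sat on the mat and the cat ran".split()

# Count how often each word follows each other word (bigram statistics).
follows = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    follows[prev][nxt] += 1

# "the" is followed by "cat" twice and "mat" once, so "cat" is the
# statistically most likely continuation of "the" in this corpus.
print(follows["the"].most_common(1))  # [('cat', 2)]
```

A neural network learns much richer patterns than adjacent-word counts, but the principle is the same: statistics extracted from text, not facts looked up in a database.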

Stage 2: Fine-tuning. After pre-training, the model knows a lot about language, but it isn't particularly helpful in conversation. Fine-tuning adjusts the model using more specific data — often examples of high-quality conversations in which a human played both the user and the assistant roles. This teaches the model what a helpful, well-structured response looks like.

Stage 3: Reinforcement learning from human feedback (RLHF). Human reviewers rate the model's responses. The model is then adjusted to produce more responses that humans rated highly and fewer that they rated poorly. This is what distinguishes a model that generates grammatically correct text from one that generates text that's actually useful and appropriate.
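The real procedure uses reinforcement-learning algorithms over the full neural network, but the core idea — nudge the model toward responses humans rated highly — can be caricatured as adjusting a probability table with a reward signal. Everything here (the words, ratings, and learning rate) is invented for illustration:

```python
# Caricature of RLHF: shift a next-word probability table toward words
# that appeared in highly rated responses. Real systems use
# policy-gradient methods over billions of parameters.
probs = {"sure": 0.5, "maybe": 0.3, "no": 0.2}
ratings = {"sure": +1.0, "maybe": 0.0, "no": -1.0}  # human feedback
LEARNING_RATE = 0.1

# Move each probability in the direction of its rating...
for word in probs:
    probs[word] = max(probs[word] + LEARNING_RATE * ratings[word], 1e-6)

# ...then renormalize so the probabilities still sum to 1.
total = sum(probs.values())
probs = {w: p / total for w, p in probs.items()}
```

After the update, highly rated words are more likely and poorly rated ones less likely, which is the whole point of the feedback loop.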


What happens when you type a prompt?


When you type a message, here's what happens behind the scenes:

Your text gets broken into tokens. Tokens aren't exactly words — they're chunks of text. Common words are usually one token. Longer or rarer words get split into multiple tokens. The word "cybersecurity" might be two tokens: "cyber" and "security." On average, one token is about three-quarters of a word.
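The splitting step can be sketched with a naive greedy longest-match tokenizer over a tiny, made-up vocabulary. Real tokenizers (such as byte-pair encoding) learn their vocabulary from data, but the resulting splits look similar:

```python
# Naive greedy tokenizer: repeatedly take the longest prefix of the
# remaining text that appears in the vocabulary.
VOCAB = {"cyber", "security", "chat", "gpt", "the"}

def tokenize(text):
    tokens = []
    while text:
        for end in range(len(text), 0, -1):
            if text[:end] in VOCAB:
                tokens.append(text[:end])
                text = text[end:]
                break
        else:
            # Unknown character: emit it as its own token.
            tokens.append(text[0])
            text = text[1:]
    return tokens

print(tokenize("cybersecurity"))  # ['cyber', 'security']
```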

Those tokens get processed through the model's neural network. The network examines all the tokens in your prompt (and any conversation history), passes them through its layers, and computes a probability for every possible next token. The token with the highest probability gets selected. Then it repeats: all previous tokens plus the new one go back through the network to predict the next token. This happens over and over until the response is complete.
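That loop — predict, append, repeat — can be sketched end to end. The `model` function here is a stub lookup keyed on the last token only; a real network scores every token in its vocabulary conditioned on the entire sequence:

```python
# Autoregressive generation: predict the next token, append it, and
# feed the extended sequence back in until an end marker appears.
TABLE = {"how": "are", "are": "you", "you": "?", "?": "<end>"}

def model(tokens):
    """Stand-in for the neural network: next token given the sequence."""
    return TABLE[tokens[-1]]

def generate(prompt_tokens, max_tokens=10):
    tokens = list(prompt_tokens)
    for _ in range(max_tokens):
        nxt = model(tokens)
        if nxt == "<end>":
            break
        tokens.append(nxt)
    return tokens

print(generate(["how"]))  # ['how', 'are', 'you', '?']
```

Note that each new token requires a full pass through the network, which is why responses stream out word by word rather than appearing all at once.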

There's some randomness built in. The model doesn't always pick the single highest-probability token. A setting called "temperature" controls the amount of randomness. Low temperature means the model mostly picks the top prediction (more predictable, more repetitive). High temperature means it's more willing to pick less likely tokens (more creative, but also more likely to go off the rails).
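Temperature is easy to see in code. Dividing the raw scores by the temperature before converting them to probabilities sharpens the distribution (low temperature) or flattens it (high temperature). The scores below are invented for illustration:

```python
import math
import random

def sample_with_temperature(logits, temperature):
    """Scale raw scores by temperature, softmax them into
    probabilities, and sample one token."""
    scaled = {tok: score / temperature for tok, score in logits.items()}
    # Softmax: exponentiate and normalize (subtracting the max for
    # numerical stability) so the values sum to 1.
    max_s = max(scaled.values())
    exps = {tok: math.exp(s - max_s) for tok, s in scaled.items()}
    total = sum(exps.values())
    probs = {tok: e / total for tok, e in exps.items()}
    tokens, weights = zip(*probs.items())
    return random.choices(tokens, weights=weights)[0]

logits = {"cat": 3.0, "dog": 2.0, "weather": 0.5}
# Very low temperature: almost always the top prediction ("cat").
# High temperature: "dog" and "weather" get picked far more often.
```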


What the model doesn't do

It doesn't search the internet (unless a search feature is specifically enabled). When you ask it a factual question, it generates the answer from patterns it learned during training, without looking anything up. This is why it can confidently state something wrong — it's predicting what a correct-sounding answer looks like based on patterns, not verifying facts.

It doesn't remember previous conversations (unless memory features are enabled). Each conversation starts fresh. It doesn't know who you are, what you asked last week, or what you're working on unless you tell it in the current conversation.

It doesn't understand what it's saying. This is the hardest thing for people to accept. The responses can be articulate, detailed, and convincing. But the model has no concept of truth, meaning, or correctness. It produces text that statistically resembles a helpful response. Sometimes that lines up perfectly with reality. Sometimes it doesn't.


Why does it make things up?

This is called hallucination, and it's not a bug — it's a byproduct of how the system works. The model is always generating text. It doesn't have a mechanism for saying "I genuinely don't know this." When it encounters a question it doesn't have good training data for, it still produces confident-sounding text because that's what the pattern-matching leads to. The patterns say that questions are answered with confident, detailed responses. So it generates one, regardless of whether the content is accurate.
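One way to see why there's no "I don't know" escape hatch: the network's final step is a softmax, which turns any set of raw scores into a valid probability distribution. Some token always gets picked, even when the underlying scores carry no real information:

```python
import math
import random

def softmax(scores):
    """Turn any list of raw scores into probabilities that sum to 1."""
    exps = [math.exp(s - max(scores)) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

# Even meaningless random scores become a perfectly valid distribution:
random_scores = [random.gauss(0, 1) for _ in range(5)]
probs = softmax(random_scores)
# The probabilities always sum to 1, so *some* token is always chosen;
# there is no "none of the above" outcome built into this step.
```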

This is why you should never trust AI output on factual matters without verifying it, especially for anything involving numbers, dates, citations, legal information, medical information, or anything where accuracy actually matters. The model doesn't know the difference between a correct fact and a plausible-sounding fabrication.


The context window

Every LLM has a context window — a limit on how much text it can process at once. This includes both your input and the model's response. Older models had context windows of a few thousand tokens. Newer models can handle tens of thousands or more.

When a conversation exceeds the context window, the model starts losing track of earlier parts. It might forget instructions you gave at the beginning or contradict something it said earlier. This isn't the model being unreliable; it's a technical limitation. The earlier text literally isn't being processed anymore.
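The effect of exceeding the window can be sketched as simple truncation (real systems vary in exactly how they trim, but the outcome is the same: the oldest text stops being processed). The eight-token limit below is a toy number:

```python
CONTEXT_WINDOW = 8  # toy limit; real models allow thousands of tokens

def fit_to_context(tokens, limit=CONTEXT_WINDOW):
    """Keep only the most recent tokens that fit in the window.
    Anything earlier is simply dropped and never processed."""
    return tokens[-limit:]

conversation = ["always", "answer", "in", "french", ".", "hi", "there",
                "how", "are", "you", "today", "?"]
visible = fit_to_context(conversation)
# The instruction "always answer in french" has fallen out of the
# window, so the model can no longer follow it.
print(visible)  # ['.', 'hi', 'there', 'how', 'are', 'you', 'today', '?']
```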


Bottom line

ChatGPT is a next-word prediction engine trained on a massive amount of text. It's impressive at generating human-sounding responses because it has learned incredibly detailed statistical patterns of how language works. But it's not thinking, reasoning, or understanding. Knowing this helps you use it more effectively; you'll know when to trust its output, when to double-check it, and why it sometimes produces garbage with total confidence.


Next up, we'll look at a topic that bridges AI and cybersecurity: how attackers are using AI right now to make their attacks faster, cheaper, and harder to detect.
