Deep Dive into LLMs like ChatGPT

📋 Summary

📖 Topics

Click any topic card to view the full transcript for that segment.

Building ChatGPT: The Full Pipeline

▶

Andrej walks through the entire pipeline of building an LLM: downloading the internet, tokenization, pre-training, and the key stages of creating something like ChatGPT.

pipelinepre-trainingChatGPTarchitecture

Pre-Training: Learning from the Internet

▶

Detailed walkthrough of the pre-training stage: data collection (FineWeb dataset), tokenization, the massive compute requirements, and what the model actually learns from internet-scale data.

pre-trainingdatatokenizationFineWeb

Fine-Tuning and RLHF: Making Models Helpful

▶

How supervised fine-tuning and RLHF transform a base model into a helpful assistant. The difference between a model that predicts text and one that follows instructions.

fine-tuningRLHFalignmentinstruction following

Cognitive Psychology and LLM Implications

▶

Andrej discusses the cognitive and psychological implications of LLMs — what they tell us about human intelligence, memory, and reasoning. Are LLMs thinking or just pattern matching?

psychologycognitionintelligencereasoning

Practical Guide: Using LLMs Effectively

▶

Practical advice on prompt engineering, understanding LLM limitations (hallucinations, sharp edges), and getting the most out of tools like ChatGPT.

prompt engineeringpracticallimitationshallucinations

Andrej Karpathy Interview

📋 Summary

📖 Topics

Building ChatGPT: The Full Pipeline

📝 Transcript for this segment

Pre-Training: Learning from the Internet

📝 Transcript for this segment

Fine-Tuning and RLHF: Making Models Helpful

📝 Transcript for this segment

Cognitive Psychology and LLM Implications

📝 Transcript for this segment

Practical Guide: Using LLMs Effectively

📝 Transcript for this segment