← Back to Interviews

Andrej Karpathy Interview

Deep Dive into LLMs like ChatGPT | Andrej Karpathy · 260429

5
Topics
5761
Segments
3h31m
Duration
-
Key Quotes

📋 Summary

Loading...

📖 Topics

Click any topic card to view the full transcript for that segment.

Building ChatGPT: The Full Pipeline

Andrej walks through the entire pipeline of building an LLM: downloading the internet, tokenization, pre-training, and the key stages of creating something like ChatGPT.

pipelinepre-trainingChatGPTarchitecture

Pre-Training: Learning from the Internet

Detailed walkthrough of the pre-training stage: data collection (FineWeb dataset), tokenization, the massive compute requirements, and what the model actually learns from internet-scale data.

pre-trainingdatatokenizationFineWeb

Fine-Tuning and RLHF: Making Models Helpful

How supervised fine-tuning and RLHF transform a base model into a helpful assistant. The difference between a model that predicts text and one that follows instructions.

fine-tuningRLHFalignmentinstruction following

Cognitive Psychology and LLM Implications

Andrej discusses the cognitive and psychological implications of LLMs — what they tell us about human intelligence, memory, and reasoning. Are LLMs thinking or just pattern matching?

psychologycognitionintelligencereasoning

Practical Guide: Using LLMs Effectively

Practical advice on prompt engineering, understanding LLM limitations (hallucinations, sharp edges), and getting the most out of tools like ChatGPT.

prompt engineeringpracticallimitationshallucinations