All People
Andrej Karpathy

Andrej Karpathy

AI educator, ex-Tesla/OpenAI

Recent Activity10 videos · 89 x-posts

Recent Activity

grep SOURCE=
I'm being accused of overhyping the [site everyone heard too much about today already].

Highlights: Karpathy responds to criticism about overhyping a popular site.

Worth reading: Reflects his engagement with public discourse on AI hype.

LLMEvaluation
Drafted a blog post - Used an LLM to meticulously improve the argument over 4 hours.

Highlights: Karpathy used an LLM to refine a blog post over 4 hours, showcasing iterative AI-assisted writing.

Worth reading: Demonstrates a practical workflow for leveraging LLMs to improve argument quality.

LLM
Judging by my tl there is a growing gap in understanding of AI capability. The first issue I think is around

Highlights: Karpathy observes a widening gap in public understanding of AI capabilities and identifies a key issue.

Worth reading: Highlights a critical communication challenge in the AI field.

LLM
Everything about the LLM stack is different (neural architecture, training data, training algorithms, and especially optimization pressure) so

Highlights: Karpathy emphasizes the distinctiveness of the LLM stack across multiple dimensions.

Worth reading: Provides insight into the fundamental differences in LLM development compared to traditional software.

LLMInfra
A few random notes from claude coding quite a bit last few weeks. Coding workflow. Given the latest lift in LLM

Highlights: Karpathy shares notes on using Claude for coding, focusing on workflow improvements from recent LLM advances.

Worth reading: Offers practical observations on AI-assisted coding with Claude.

LLMTooling
Excited to share that I am starting an AI+Education company called Eureka Labs. The announcement: --- We are

Highlights: Karpathy announces the launch of Eureka Labs, an AI+Education company.

Worth reading: Marks a significant career move into AI education.

LLM
A few random notes from claude coding quite a bit last few weeks. Given the latest lift in LLM coding capability, like many others I rapidly went from about 80% manual+autocomplete coding and 20% agents in November to 80% agent coding and 20% edits+touchups in ...

Highlights: Karpathy notes a shift from manual coding to agent-based coding due to improved LLM capabilities.

Worth reading: Demonstrates rapid adoption of AI coding agents in practice.

AgentLLMTooling
I'm being accused of overhyping the [site everyone heard too much about today already].

Highlights: Karpathy responds to criticism of overhyping a popular site.

Worth reading: Shows Karpathy's engagement with public perception of AI hype.

LLMDeployment
Judging by my tl there is a growing gap in understanding of AI capability. The first issue I think is around

Highlights: Karpathy notes a growing gap in understanding AI capability.

Worth reading: Highlights public misunderstanding of AI progress.

Safety
LLMs are emerging as a new kind of intelligence, simultaneously a lot smarter than I expected and a lot dumber than I expected. In any case they

Highlights: Karpathy observes LLMs are both smarter and dumber than expected.

Worth reading: Captures the paradoxical nature of LLM capabilities.

LLM
I've never felt this much behind as a programmer. The profession is being dramatically refactored as the bits

Highlights: Karpathy feels behind as a programmer due to AI-driven refactoring.

Worth reading: Reflects the impact of AI on software development.

Tooling
Drafted a blog post - Used an LLM to meticulously improve the argument over 4 hours.

Highlights: Karpathy used an LLM to refine a blog post argument over 4 hours.

Worth reading: Shows practical use of LLMs for iterative writing improvement.

LLM
Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.

Highlights: Karpathy announces joining Anthropic, emphasizing frontier LLM work and continued education passion.

Worth reading: Major talent move in AI industry, signaling Anthropic's growing influence.

LLMSafetyAgent
Personal update: I've joined Anthropic. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.

Highlights: Karpathy announces his move to Anthropic, focusing on R&D while continuing his education passion.

Worth reading: Signals a major talent shift from OpenAI to Anthropic, impacting the AI landscape.

LLMSafetyDeployment
LLM Knowledge Bases Something I'm finding very useful recently: using LLMs to build personal knowledge bases for various topics of research interest. In this way, a large fraction of my recent token throughput is going less into manipulating code, and more into manipulating

Highlights: Using LLMs to build personal knowledge bases for research topics, shifting token usage from code to knowledge manipulation.

Worth reading: Shows a practical application of LLMs for personal knowledge management.

LLMRAGTooling
A few random notes from claude coding quite a bit last few weeks. Coding workflow. Given the latest lift in LLM coding capability, like many others I rapidly went from about 80% manual+autocomplete coding and 20% agents in November to 80% agent coding and 20% edits+touchups in

Highlights: Shift from 80% manual coding to 80% agent coding due to improved LLM coding capabilities.

Worth reading: Illustrates the rapid adoption of AI agents in coding workflows.

AgentToolingDeployment
I'm being accused of overhyping the [site everyone heard too much about today already].

Highlights: Karpathy responds to criticism of overhyping a popular site, likely related to AI or tech.

Worth reading: Provides insight into Karpathy's perspective on hype in the AI community.

LLM
Very interested in what the coming era of highly bespoke software might look like. Example from this morning - I've become a bit loosy goosy with my cardio recently so I decided to do a more srs, regimented experiment to try to lower my Resting Heart Rate from 50 -> 45, over https://t.co/EDULdIpWmE

Highlights: Karpathy explores bespoke software for personal health experiments, aiming to lower his resting heart rate.

Worth reading: Illustrates the potential of personalized software for health optimization, a trend in AI-driven lifestyle management.

AgentTooling
LLM Knowledge Bases Something I'm finding very useful recently: using LLMs to build personal knowledge bases for various topics of research interest. In this way, a large fraction of my recent token throughput is going less into manipulating code, and more into manipulating

Highlights: Karpathy uses LLMs to build personal knowledge bases for research topics, shifting token usage from code manipulation to knowledge manipulation.

Worth reading: Shows a practical application of LLMs for personal knowledge management, relevant to researchers and knowledge workers.

LLMRAGTooling
I'm being accused of overhyping the [site everyone heard too much about today already].

Highlights: Karpathy responds to criticism of overhyping a popular site.

Worth reading: Shows Karpathy's engagement with public perception of his commentary.

LLM
There's a new kind of coding I call "vibe coding", where you fully give in to the vibes, embrace exponentials, and forget that the code even exists. Also I just talk to Composer with SuperWhisper.

Highlights: Karpathy coins 'vibe coding' as an AI-assisted coding style where developers rely on AI and ignore code details.

Worth reading: Introduces a popular concept shaping AI-assisted development.

ToolingAgent
Judging by my tl there is a growing gap in understanding of AI capability. The first issue I think is around recency and tier of use. I think a lot of people tried the free tier of ChatGPT somewhere last year and allowed it to inform their views on AI a little too much. People on X are the first to know.

Highlights: Karpathy notes a growing gap in AI understanding due to outdated or limited use of ChatGPT's free tier.

Worth reading: Highlights the disconnect between public perception and actual AI progress.

LLMEvaluation
A few random notes from claude coding quite a bit last few weeks. Given the latest lift in LLM coding capability, like many others I rapidly went from about 80% manual+autocomplete coding and 20% agents in November to 80% agent coding and 20% edits+touchups in

Highlights: Karpathy describes his shift from manual coding to heavy reliance on AI agents for coding.

Worth reading: Illustrates the rapid adoption of AI coding agents in practice.

AgentTooling
My most amusing interaction was where the model (I think I was given some earlier version with a ...

Highlights: Karpathy shares an amusing interaction with an early model version.

Worth reading: Provides insight into his experience with early LLM behavior.

LLM
I'm starting to get into a habit of reading everything (blogs, articles, book chapters, ...)

Highlights: Karpathy mentions his habit of extensive reading.

Worth reading: Reflects his approach to staying informed in AI.

LLM
My most amusing interaction was where the model (I think I was given some earlier version with a

Highlights: Karpathy shares an amusing interaction with an AI model, possibly an earlier version.

Worth reading: Provides insight into Karpathy's experiences with AI model behavior.

LLMSafety
I'm starting to get into a habit of reading everything (blogs, articles, book chapters,…)

Highlights: Karpathy mentions adopting a habit of extensive reading.

Worth reading: Reflects Karpathy's approach to continuous learning and information consumption.

Tooling
Very interested in what the coming era of highly bespoke software might look like.

Highlights: Karpathy expresses interest in the future of customized software.

Worth reading: Indicates Karpathy's forward-looking perspective on software development trends.

AgentInfra
2025 LLM Year in Review

Highlights: Karpathy shares a review of LLM developments in 2025.

Worth reading: Summarizes key LLM advancements from Karpathy's perspective.

LLMEvaluation
Judging by my tl there is a growing gap in understanding of AI capability. The first issue I think is around

Highlights: Karpathy observes a growing gap in understanding AI capabilities, starting with a first issue.

Worth reading: Highlights a key concern about public perception versus actual AI progress.

LLMSafety
I'm starting to get into a habit of reading everything (blogs, articles, book chapters,…)

Highlights: Karpathy shares his habit of extensive reading to stay informed.

Worth reading: Reflects his approach to continuous learning in AI.

LLM
My most amusing interaction was where the model (I think I was given some earlier version with a

Highlights: Karpathy recounts an amusing interaction with an early model version.

Worth reading: Provides insight into his hands-on experience with AI models.

LLM
By training LLMs against auto

Highlights: Karpathy discusses training LLMs using auto-generated data or objectives.

Worth reading: Reveals a technical approach to improving LLM training.

Fine-tuningLLM
My most amusing interaction was where the model (I think I was given some earlier version with a ...

Highlights: Karpathy shares an amusing interaction with an AI model, likely referring to a chatbot or language model.

Worth reading: Provides insight into Karpathy's hands-on experience with AI models.

LLM
I'm starting to get into a habit of reading everything (blogs, articles, book chapters, ...)

Highlights: Karpathy mentions developing a habit of extensive reading across various sources.

Worth reading: Reflects his approach to staying informed and learning.

LLM
Judging by my tl there is a growing gap in understanding of AI capability. The first issue I think is around ...

Highlights: Karpathy observes a growing gap in understanding AI capabilities, pointing to a key issue.

Worth reading: Highlights a critical challenge in AI communication and education.

Safety
LLM Knowledge Bases Something I'm finding very useful recently: using LLMs to build personal knowledge bases for various topics of research interest.

Highlights: Karpathy finds using LLMs to build personal knowledge bases for research topics very useful.

Worth reading: Shows a practical application of LLMs for personal knowledge management.

LLMRAG
Excited to share that I am starting an AI+Education company called Eureka Labs.

Highlights: Karpathy announces his new AI+Education company Eureka Labs.

Worth reading: Highlights Karpathy's latest venture in AI education.

LLMDeployment
Drafted a blog post. Used an LLM to meticulously improve the argument over 4 hours. Wow, feeling great, it’s so convincing! Fun idea let’s ask it to argue the opposite. LLM demolishes the entire argument and convinces me that the opposite is in fact true. lol

Highlights: LLMs can be used to improve arguments, but they can also convincingly argue the opposite, revealing their persuasive power and potential pitfalls.

Worth reading: Illustrates the double-edged nature of LLMs in reasoning and argumentation.

LLMSafety
The hottest new programming language is English

Highlights: Natural language is becoming a dominant interface for programming, especially with LLMs.

Worth reading: Captures a key trend in AI-assisted coding and the shift towards natural language programming.

LLMTooling
By training LLMs against automatically verifiable rewards across a number of environments (e.g. think math/code puzzles), the LLMs spontaneously develop strategies that look like 'reasoning' to humans - they learn to break down problem solving into intermediate calculations and they learn a number of problem-solving techniques.

Highlights: LLMs can develop emergent reasoning-like behaviors through reinforcement learning with verifiable rewards.

Worth reading: Shows how training methods can lead to spontaneous reasoning capabilities in LLMs.

LLMFine-tuningEvaluation
Drafted a blog post - Used an LLM to meticulously improve the argument over 4 hours. - LLM demolishes the entire argument and convinces me that the opposite is in fact true.

Highlights: Karpathy used an LLM to improve a blog post, but the LLM convinced him the opposite argument was true.

Worth reading: Shows how LLMs can challenge and refine reasoning, not just generate text.

LLM
The hottest new programming language is English

Highlights: Karpathy suggests English is becoming the new programming language due to LLMs.

Worth reading: Captures a key insight about how LLMs are changing programming.

LLM
By training LLMs against auto-generated data, we can achieve... [content truncated in search result]

Highlights: Karpathy discusses training LLMs with auto-generated data.

Worth reading: Highlights a technique for improving LLM training efficiency.

LLMFine-tuning
My most amusing interaction was where the model (I think I was given some earlier version with a

Highlights: Karpathy recounts an amusing interaction with an early model version.

Worth reading: Shows his hands-on experience with AI model behavior.

LLM
One common issue with personalization in all LLMs is how distracting memory seems to be for the models.

Highlights: Identifies a key challenge in LLM personalization: memory distraction.

Worth reading: Highlights a fundamental problem in making LLMs personalized.

LLMFine-tuning
- Drafted a blog post - Used an LLM to meticulously improve the argument over 4 hours.

Highlights: Karpathy describes using an LLM to refine a blog post argument over 4 hours.

Worth reading: Demonstrates a practical use case of LLMs for writing improvement.

LLMTooling
Judging by my tl there is a growing gap in understanding of AI capability. The first issue I think is around

Highlights: Notes a growing gap in understanding AI capabilities, starting with a specific issue.

Worth reading: Reflects on public perception vs. reality of AI progress.

Evaluation
I'm starting to get into a habit of reading everything (blogs, articles, book chapters,…)

Highlights: Karpathy shares his habit of extensive reading across various formats.

Worth reading: Offers insight into his learning approach and information diet.

LLM
LLM Knowledge Bases Something I'm finding very useful recently: using LLMs to build personal knowledge bases for various topics of research interest. In this way, a large fraction of my recent token throughput is going less into manipulating code, and more into manipulating

Highlights: Karpathy describes using LLMs to build personal knowledge bases, shifting his token usage from code manipulation to knowledge manipulation.

Worth reading: Illustrates a practical and novel use of LLMs for personal knowledge management.

LLMRAG
I'm being accused of overhyping the [site everyone heard too much about today already].

Highlights: Karpathy responds to accusations of overhyping a popular site, likely related to AI.

Worth reading: Shows Karpathy's engagement with public discourse and his perspective on hype in AI.

LLM
2025 LLM Year in Review

Highlights: Karpathy posted a review of LLM developments in 2025, likely summarizing key trends and breakthroughs.

Worth reading: Provides a high-level perspective on the state of LLMs from a leading AI researcher.

LLM
LLM Knowledge Bases Something I'm finding very useful recently: using LLMs to build personal knowledge bases for various topics of research interest.

Highlights: Karpathy advocates using LLMs to create personal knowledge bases for research.

Worth reading: Highlights a practical application of LLMs for knowledge management.

LLMRAG
LLM Knowledge Bases Something I'm finding very useful recently: using LLMs to build personal knowledge bases for various topics of research interest. In this way, a large fraction of my recent token throughput is going less into manipulating code, and more into manipulating

Highlights: Karpathy uses LLMs to construct personal knowledge bases, shifting his focus from code to knowledge management.

Worth reading: Illustrates a practical application of LLMs for personal productivity and research.

LLMRAG
I'm being accused of overhyping the [site everyone heard too much about today already].

Highlights: Karpathy responds to criticism about overhyping a popular site, showing self-awareness about hype cycles.

Worth reading: Reveals Karpathy's perspective on the balance between excitement and realism in AI.

Tooling
LLMs are emerging as a new kind of intelligence, simultaneously a lot smarter than I expected and a lot dumber than I expected. In any case they

Highlights: Karpathy observes that LLMs are both smarter and dumber than expected, highlighting their paradoxical nature.

Worth reading: Provides a balanced perspective on LLM capabilities from a leading AI figure.

LLM
A few random notes from claude coding quite a bit last few weeks. Coding workflow. Given the latest lift in LLM

Highlights: Karpathy shares notes on using Claude for coding, reflecting on improvements in LLM-based coding workflows.

Worth reading: Offers practical insights into advanced AI-assisted coding from an expert practitioner.

LLMTooling
My most amusing interaction was where the model (I think I was given some earlier version with a

Highlights: Karpathy recounts an amusing interaction with an early version of a model.

Worth reading: Provides insight into Karpathy's experience with early model capabilities.

LLM
One common issue with personalization in all LLMs is how distracting memory seems to be for the models.

Highlights: Karpathy notes that memory in LLMs can be distracting for personalization.

Worth reading: Highlights a key challenge in LLM personalization.

LLM
Judging by my tl there is a growing gap in understanding of AI capability. The first issue I think is around

Highlights: Karpathy observes a growing gap in understanding AI capability.

Worth reading: Raises awareness about misconceptions in AI capability.

LLM
LLMs are emerging as a new kind of intelligence, simultaneously a lot smarter than I expected and a lot dumber than I expected. In any case they

Highlights: Karpathy describes LLMs as both smarter and dumber than expected.

Worth reading: Captures the paradoxical nature of current LLM intelligence.

LLM
I'm being accused of overhyping the [site everyone heard too much about today already].

Highlights: Karpathy responds to accusations of overhyping a site.

Worth reading: Shows Karpathy's engagement with public perception of AI hype.

LLM
+1 for "context engineering" over "prompt engineering". When in every industrial-strength LLM app, context engineering is the delicate art and science of filling the context window

Highlights: Advocates for 'context engineering' as a more accurate term than 'prompt engineering' for industrial LLM applications.

Worth reading: Reframes how we think about optimizing LLM inputs in production.

LLMDeploymentTooling
LLM Knowledge Bases Something I'm finding very useful recently: using LLMs to build personal knowledge bases for various topics of research interest. In this way, a large fraction of my recent token throughput is going less into manipulating code, and more into manipulating

Highlights: Using LLMs to create personal knowledge bases for research, shifting focus from code to knowledge manipulation.

Worth reading: Demonstrates a practical, personal use case for LLMs beyond coding.

LLMRAGTooling
2025 LLM Year in Review By training LLMs against automatically verifiable rewards across a number of environments (e.g. think math/code puzzles), the LLMs spontaneously develop strategies that look like "reasoning" to humans - they learn to break down problem solving into intermediate calculations and they learn a number of probl

Highlights: LLMs trained with verifiable rewards develop human-like reasoning strategies, breaking down problems into intermediate steps.

Worth reading: Highlights a key insight into how LLMs can learn reasoning without explicit instruction.

LLMEvaluationFine-tuning
Very interested in what the coming era of highly bespoke software might look like. Example from this morning - I've become a bit loosy goosy with my cardio recently so I decided to do a more srs, regimented experiment to try to lower my Resting Heart Rate from 50 -> 45, over ...

Highlights: Karpathy explores personalized software for health experiments, reflecting on bespoke software trends.

Worth reading: Shows how AI can enable highly personalized, data-driven self-improvement tools.

AgentTooling
By training LLMs against automatically verifiable rewards across a number of environments (e.g. think math/code puzzles), the LLMs spontaneously develop strategies that look like 'reasoning' to humans - they learn to break down problem solving into intermediate calculations and they learn a number of problem-solving techniques.

Highlights: LLMs trained with verifiable rewards develop reasoning-like behaviors, breaking problems into intermediate steps.

Worth reading: Key insight into how reinforcement learning can elicit reasoning in LLMs without explicit programming.

EvaluationFine-tuning
LLM Knowledge Bases Something I'm finding very useful recently: using LLMs to build personal knowledge bases for various topics of research interest. In this way, a large fraction of my recent token throughput is going less into manipulating code, and more into manipulating knowledge.

Highlights: Karpathy advocates using LLMs to create personal knowledge bases, shifting focus from code to knowledge management.

Worth reading: Highlights a practical application of LLMs for personal knowledge organization and research.

LLMRAG
I've never felt this much behind as a programmer. I have a sense that I could be 10X more powerful if I just properly string together what has become available.

Highlights: Karpathy expresses feeling behind as a programmer due to rapidly advancing AI tools, but sees potential for 10X productivity.

Worth reading: Reflects the challenge and opportunity of integrating modern AI tools into programming workflows.

ToolingDeployment
My most amusing interaction was where the model (I think I was given some earlier version with a ...)

Highlights: Karpathy shares a humorous anecdote about an early model interaction.

Worth reading: Insight into early model behavior and user experience.

LLM
Three days ago I left autoresearch tuning nanochat for ~2 days on depth=12 model.

Highlights: Karpathy describes running an auto-research experiment tuning a small chat model.

Worth reading: Shows practical application of autonomous research in LLM tuning.

AgentFine-tuning
We're missing (at least one) major paradigm for LLM learning. Not sure what to call it,

Highlights: Karpathy identifies a gap in current LLM learning paradigms.

Worth reading: Provocative thought on future directions for LLM training.

LLMInfra
Bought a new Mac mini to properly tinker with claws over the weekend. The apple store person told me they are selling like hotcakes and everyone is confused :) I'm definitely a bit sus'd to run OpenClaw specifically - giving my private data/keys to 400K lines of vibe coded

Highlights: Karpathy bought a Mac mini to experiment with 'claws' (likely a typo for 'Claude' or 'Claw' agent), noting that Apple Store staff said they are selling well and customers are confused. He is cautious about running OpenClaw due to security concerns with vibe-coded code.

Worth reading: Shows Karpathy's hands-on approach with new AI tools and his awareness of security risks in AI-generated code.

AgentTooling
2025 LLM Year in Review

Highlights: Karpathy posted a summary of LLM developments in 2025, likely reflecting on key trends and milestones.

Worth reading: Provides Karpathy's perspective on the state of LLMs in 2025.

LLM
Very interested in what the coming era of highly bespoke software ... Example from this morning - I've become a bit loosy goosy with my cardio recently so I decided to do a more srs, regimented experiment to try to lower my Resting Heart Rate from 50 -> 45, over https://t.co/EDULdIpWmE

Highlights: Karpathy expresses interest in bespoke software and shares a personal experiment to lower his resting heart rate using a structured approach.

Worth reading: Illustrates Karpathy's methodical mindset applied to personal health, and his interest in personalized software.

Tooling
Bought a new Mac mini to properly tinker with claws over the weekend. The apple store person told me they are selling like hotcakes and everyone is confused :) I'm definitely a bit sus'd to run OpenClaw specifically - giving my private data/keys to 400K lines of vibe coded

Highlights: Karpathy bought a Mac mini to experiment with 'claws' (likely a typo for 'Claude' or an agent framework) and expresses concern about running open-source code with private data.

Worth reading: Shows Karpathy's hands-on approach to AI agent development and his security concerns with community code.

AgentTooling
Very interested in what the coming era of highly bespoke software might look like. Example from this morning - I've become a bit loosy goosy with my cardio recently so I decided to do a more srs, regimented experiment to try to lower my Resting Heart Rate from 50 -> 45, over https://t.co/EDULdIpWmE

Highlights: Karpathy envisions a future of highly personalized software, using his own health experiment as an example.

Worth reading: Highlights Karpathy's interest in AI-driven personalization and self-experimentation.

AgentTooling
Judging by my tl there is a growing gap in understanding of AI capability. The first issue I think is around recency and tier of use. I think a lot of people tried the free tier of ChatGPT somewhere last year and allowed it to inform their views on AI a little too much.

Highlights: Karpathy observes a growing gap in AI capability understanding, noting that many base their views on outdated free-tier ChatGPT experiences.

Worth reading: Highlights a common misconception about AI progress and the importance of staying current.

LLMEvaluation
By training LLMs against automatically verifiable rewards across a number of environments (e.g. think math/code puzzles), the LLMs spontaneously develop strategies that look like 'reasoning' to humans - they learn to break down problem solving into intermediate calculations and they learn a number of probl

Highlights: Karpathy explains how training LLMs with verifiable rewards leads to emergent reasoning behaviors.

Worth reading: Key insight into how reinforcement learning can produce reasoning capabilities in LLMs.

LLMFine-tuningEvaluation
I'm being accused of overhyping the [site everyone heard too much about today already]. To add a few words beyond just memes in jest - obviously when you take a look at the activity, it's a lot of garbage - spams, scams, slop, the crypto people, highly ...

Highlights: Karpathy defends against accusations of overhyping a site, acknowledging the prevalence of spam and scams.

Worth reading: Shows Karpathy's nuanced view on hype versus reality in AI-related platforms.

DeploymentSafety
How I use LLMs

How I use LLMs

Andrej Karpathy

Feb 27, 10:29 PM·2,358,015 viewsYouTube

Highlights: The video provides a practical, example-driven walkthrough of how to effectively use Large Language Models in daily life, covering everything from basic interactions to understanding pricing tiers and model selection. It demystifies the growing LLM ecosystem by showing concrete applications and explaining when to use different models.

Worth watching: Andrej Karpathy's expertise and clear teaching style make complex AI concepts accessible, offering actionable insights for both beginners and experienced users looking to optimize their LLM usage.

Deep Dive into LLMs like ChatGPT

Deep Dive into LLMs like ChatGPT

Andrej Karpathy

Feb 5, 06:23 PM·6,058,917 viewsYouTube

Highlights: This video provides a comprehensive overview of how Large Language Models like ChatGPT are developed, covering the full training stack from data collection to deployment. It also offers practical mental models for understanding their 'psychology' and optimizing their use in real-world applications.

Worth watching: Worth watching because Andrej Karpathy, a leading AI researcher, delivers an accessible yet thorough explanation that bridges technical depth with practical application insights, making complex LLM concepts understandable for general audiences.

2025 LLM Year in Review

Highlights: Karpathy reflects on the key developments in LLMs over 2025, likely discussing training against auto-generated data.

Worth reading: Provides a high-level summary of LLM progress from one of the field's leading experts.

LLM
Excited to share that I am starting an AI+Education company called Eureka Labs.

Highlights: Karpathy launches Eureka Labs, an AI+Education venture.

Worth reading: Shows his commitment to AI in education, a key area for democratizing learning.

LLMDeploymentTooling
Let's reproduce GPT-2 (124M)

Let's reproduce GPT-2 (124M)

Andrej Karpathy

Jun 9, 11:31 PM·1,048,747 viewsYouTube

Highlights: This video provides a comprehensive, hands-on walkthrough of reproducing the GPT-2 (124M) model from scratch, covering network architecture, training optimization, and hyperparameter tuning based on original papers. It demonstrates the full training pipeline with practical implementation details and concludes with generated text samples to evaluate model performance.

Worth watching: Worth watching for its educational value in understanding transformer-based language model implementation and training optimization, presented by a renowned AI educator with clear, practical demonstrations.

Let's build the GPT Tokenizer

Let's build the GPT Tokenizer

Andrej Karpathy

Feb 20, 05:11 PM·1,069,817 viewsYouTube

Highlights: The video explains that tokenizers are a separate, crucial component in LLMs, using Byte Pair Encoding to translate between text and tokens. It demonstrates building the GPT tokenizer from scratch, highlighting its distinct training process and core encode/decode functions.

Worth watching: Worth watching to understand a fundamental yet often overlooked part of how LLMs process text, presented clearly by an expert in the field.

[1hr Talk] Intro to Large Language Models

[1hr Talk] Intro to Large Language Models

Andrej Karpathy

Nov 23, 02:27 AM·3,554,837 viewsYouTube

Highlights: This talk demystifies Large Language Models (LLMs) by explaining them as a new computing paradigm analogous to operating systems, where models like ChatGPT serve as the core technical component. It covers their fundamental workings, future trajectory, and unique security challenges in an accessible way for general audiences.

Worth watching: Andrej Karpathy provides a clear, foundational understanding of LLMs from one of the field's leading educators, making complex concepts accessible while addressing practical implications and security considerations that remain highly relevant.

The hottest new programming language is English

Highlights: Karpathy suggests that natural language is becoming the dominant way to program, thanks to AI.

Worth reading: Captures a paradigm shift in programming towards AI-driven code generation.

LLMTooling
Let's build GPT: from scratch, in code, spelled out.

Let's build GPT: from scratch, in code, spelled out.

Andrej Karpathy

Jan 17, 04:33 PM·7,077,458 viewsYouTube

Highlights: This video provides a hands-on coding tutorial where Andrej Karpathy builds a GPT model from scratch, implementing the transformer architecture described in 'Attention is All You Need' and connecting it to real-world applications like GPT-2/3 and ChatGPT. It demonstrates the practical implementation of autoregressive language modeling while showing GitHub Copilot (itself a GPT model) assisting in writing the code, creating a meta-learning experience.

Worth watching: It's worth watching because it demystifies complex AI concepts through clear, practical coding examples and connects theoretical papers to real implementations, making advanced transformer architectures accessible to developers and enthusiasts.

Building makemore Part 5: Building a WaveNet

Building makemore Part 5: Building a WaveNet

Andrej Karpathy

Nov 21, 12:32 AM·268,962 viewsYouTube

Highlights: This video demonstrates how to evolve a simple 2-layer MLP into a deeper, tree-like neural network architecture that resembles DeepMind's WaveNet (2016). It shows the practical implementation process using PyTorch's torch.nn module while explaining the underlying mechanics of deep learning development.

Worth watching: It's worth watching because it provides a clear, hands-on walkthrough of building a complex neural network from simpler components, offering valuable insights into both PyTorch fundamentals and the architectural thinking behind influential models like WaveNet.

Building makemore Part 4: Becoming a Backprop Ninja

Building makemore Part 4: Becoming a Backprop Ninja

Andrej Karpathy

Oct 11, 05:56 PM·335,387 viewsYouTube

Highlights: This video demonstrates manual backpropagation through a complete 2-layer MLP with BatchNorm, covering gradients from cross entropy loss through embedding tables. It builds intuitive understanding of gradient flow at the tensor level, beyond scalar implementations like micrograd, while reinforcing core deep learning concepts.

Worth watching: Essential viewing for developers wanting to move beyond autograd black boxes and truly understand gradient computation in neural networks, presented by one of the field's most effective educators.

Building makemore Part 3: Activations & Gradients, BatchNorm

Building makemore Part 3: Activations & Gradients, BatchNorm

Andrej Karpathy

Oct 4, 04:41 PM·489,022 viewsYouTube

Highlights: This video examines the statistical challenges in training deep neural networks, focusing on how improperly scaled activations and gradients can cause instability. It introduces Batch Normalization as a key technique to stabilize training by normalizing layer inputs.

Worth watching: Worth watching for its practical insights into diagnosing and fixing common deep learning training issues, presented by an expert with clear visualizations of internal network behavior.

Building makemore Part 2: MLP

Building makemore Part 2: MLP

Andrej Karpathy

Sep 12, 02:43 PM·524,893 viewsYouTube

Highlights: This video demonstrates building a multilayer perceptron (MLP) for character-level language modeling, covering essential ML fundamentals like training, hyperparameter tuning, and evaluation. It provides practical insights into handling train/dev/test splits and diagnosing under/overfitting in neural networks.

Worth watching: It's worth watching for hands-on implementation of MLPs with clear explanations of core machine learning concepts, making it accessible for both beginners and practitioners looking to solidify their understanding.

10 videos · 89 x-posts · All time