
Andrej Karpathy

AI educator, ex-Tesla/OpenAI

Recent Interests: AI

He is actively exploring the practical applications of LLMs, particularly for building personal knowledge bases and enhancing coding workflows, while also experimenting with and tuning small-scale models like nanochat.

Recent Activity · 10 videos · 15 x-posts

Building makemore Part 2: MLP

Andrej Karpathy

Sep 12, 02:43 PM · 524,893 views · YouTube

Highlights: This video demonstrates building a multilayer perceptron (MLP) for character-level language modeling, covering essential ML fundamentals like training, hyperparameter tuning, and evaluation. It provides practical insights into handling train/dev/test splits and diagnosing under/overfitting in neural networks.

Worth watching: A hands-on implementation of MLPs with clear explanations of core machine learning concepts, accessible both to beginners and to practitioners looking to solidify their understanding.
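
As a rough, illustrative sketch (not code from the video), the train/dev/test split the summary mentions might look like this; the function name and fractions here are hypothetical:

```python
import random

def split_dataset(examples, train_frac=0.8, dev_frac=0.1, seed=42):
    """Shuffle examples and split them into train/dev/test portions."""
    examples = list(examples)
    random.Random(seed).shuffle(examples)
    n = len(examples)
    n_train = int(train_frac * n)
    n_dev = int(dev_frac * n)
    train = examples[:n_train]
    dev = examples[n_train:n_train + n_dev]
    test = examples[n_train + n_dev:]  # remainder goes to test
    return train, dev, test

train, dev, test = split_dataset(range(100))
print(len(train), len(dev), len(test))  # 80 10 10
```

The dev split is used for hyperparameter tuning, with the test split held out for a final evaluation — the workflow the video walks through for diagnosing under/overfitting.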

Building makemore Part 3: Activations & Gradients, BatchNorm

Andrej Karpathy

Oct 4, 04:41 PM · 489,022 views · YouTube

Highlights: This video examines the statistical challenges in training deep neural networks, focusing on how improperly scaled activations and gradients can cause instability. It introduces Batch Normalization as a key technique to stabilize training by normalizing layer inputs.

Worth watching: Offers practical insights into diagnosing and fixing common deep learning training issues, with clear visualizations of internal network behavior from an expert in the field.
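
A minimal sketch of the normalization step at the heart of Batch Normalization (illustrative only; the real layer also learns a per-feature gain and bias and tracks running statistics for inference):

```python
def batchnorm(batch, eps=1e-5):
    """Normalize a batch of scalar activations to zero mean, unit variance.

    eps guards against division by zero when the batch variance is tiny.
    """
    n = len(batch)
    mean = sum(batch) / n
    var = sum((x - mean) ** 2 for x in batch) / n
    return [(x - mean) / (var + eps) ** 0.5 for x in batch]

out = batchnorm([1.0, 2.0, 3.0, 4.0])
# out is roughly [-1.34, -0.45, 0.45, 1.34]: zero mean, unit variance
```

Normalizing each layer's inputs this way is what keeps activations and gradients well-scaled as depth grows, which is the instability the video diagnoses.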

Building makemore Part 4: Becoming a Backprop Ninja

Andrej Karpathy

Oct 11, 05:56 PM · 335,387 views · YouTube

Highlights: This video demonstrates manual backpropagation through a complete 2-layer MLP with BatchNorm, covering gradients from cross entropy loss through embedding tables. It builds intuitive understanding of gradient flow at the tensor level, beyond scalar implementations like micrograd, while reinforcing core deep learning concepts.

Worth watching: Essential viewing for developers wanting to move beyond autograd black boxes and truly understand gradient computation in neural networks, presented by one of the field's most effective educators.
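
The spirit of the exercise can be sketched on a toy two-step expression, checking hand-derived gradients against finite differences (illustrative only; the video works through a full 2-layer MLP with BatchNorm, not this toy):

```python
def forward(w, x, b):
    # tiny two-step graph: z = w*x + b, loss = z**2
    z = w * x + b
    return z * z

def manual_grads(w, x, b):
    # backprop by hand through the same two steps
    z = w * x + b
    dz = 2 * z       # d(z^2)/dz
    dw = dz * x      # chain rule: dz/dw = x
    db = dz * 1.0    # dz/db = 1
    return dw, db

w, x, b, h = 0.5, 2.0, -0.5, 1e-6
dw, db = manual_grads(w, x, b)
# numerical check with central differences
num_dw = (forward(w + h, x, b) - forward(w - h, x, b)) / (2 * h)
num_db = (forward(w, x, b + h) - forward(w, x, b - h)) / (2 * h)
```

Comparing analytic gradients to numerical estimates like this is the standard sanity check when replacing autograd with hand-written backward passes.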

Building makemore Part 5: Building a WaveNet

Andrej Karpathy

Nov 21, 12:32 AM · 268,962 views · YouTube

Highlights: This video demonstrates how to evolve a simple 2-layer MLP into a deeper, tree-like neural network architecture that resembles DeepMind's WaveNet (2016). It shows the practical implementation process using PyTorch's torch.nn module while explaining the underlying mechanics of deep learning development.

Worth watching: Provides a clear, hands-on walkthrough of building a complex neural network from simpler components, offering valuable insights into both PyTorch fundamentals and the architectural thinking behind influential models like WaveNet.
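
The tree-like idea can be caricatured in a few lines: fuse consecutive pairs level by level until the whole context is combined (a toy sketch, with string concatenation standing in for the learned linear layers the video actually uses):

```python
def fuse_pairs(seq, combine):
    """One WaveNet-style level: fuse each pair of consecutive units into one."""
    assert len(seq) % 2 == 0, "each level halves the sequence length"
    return [combine(seq[i], seq[i + 1]) for i in range(0, len(seq), 2)]

# toy stand-in for a learned layer: just concatenate the strings
level0 = list("abcdefgh")                            # 8 characters
level1 = fuse_pairs(level0, lambda a, b: a + b)      # 4 pairs
level2 = fuse_pairs(level1, lambda a, b: a + b)      # 2 quadruples
level3 = fuse_pairs(level2, lambda a, b: a + b)      # 1 group covering all 8
```

Each level halves the sequence, so information from the full context is merged gradually rather than squashed in a single step — the key structural difference from a flat MLP.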

Let's build GPT: from scratch, in code, spelled out.

Andrej Karpathy

Jan 17, 04:33 PM · 7,077,458 views · YouTube

Highlights: This video provides a hands-on coding tutorial where Andrej Karpathy builds a GPT model from scratch, implementing the transformer architecture described in 'Attention is All You Need' and connecting it to real-world applications like GPT-2/3 and ChatGPT. It demonstrates the practical implementation of autoregressive language modeling while showing GitHub Copilot (itself a GPT model) assisting in writing the code, creating a meta-learning experience.

Worth watching: Demystifies complex AI concepts through clear, practical coding examples and connects theoretical papers to real implementations, making advanced transformer architectures accessible to developers and enthusiasts.
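
A bare-bones sketch of the causal self-attention at the core of the transformer, in plain Python (illustrative only; real implementations use batched tensor operations and learned query/key/value projections):

```python
import math

def softmax(xs):
    m = max(xs)  # subtract max for numerical stability
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def causal_self_attention(q, k, v):
    """q, k, v: lists of equal-length vectors, one per position.

    Causal mask: position t attends only to positions 0..t.
    """
    d = len(q[0])
    out = []
    for t in range(len(q)):
        # scaled dot-product scores against the visible positions
        scores = [sum(qi * ki for qi, ki in zip(q[t], k[s])) / math.sqrt(d)
                  for s in range(t + 1)]
        weights = softmax(scores)
        # weighted sum of the visible value vectors
        out.append([sum(w * v[s][j] for s, w in enumerate(weights))
                    for j in range(len(v[0]))])
    return out

q = k = [[1.0, 0.0], [0.0, 1.0]]   # one query/key per position
v = [[2.0, 0.0], [0.0, 3.0]]
out = causal_self_attention(q, k, v)
# position 0 can only attend to itself, so out[0] == v[0]
```

The causal mask is what makes the model autoregressive: each position's prediction depends only on earlier tokens, matching how text is generated left to right.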

[1hr Talk] Intro to Large Language Models

Andrej Karpathy

Nov 23, 02:27 AM · 3,554,837 views · YouTube

Highlights: This talk demystifies Large Language Models (LLMs) by explaining them as a new computing paradigm analogous to operating systems, where models like ChatGPT serve as the core technical component. It covers their fundamental workings, future trajectory, and unique security challenges in an accessible way for general audiences.

Worth watching: Andrej Karpathy provides a clear, foundational understanding of LLMs from one of the field's leading educators, making complex concepts accessible while addressing practical implications and security considerations that remain highly relevant.

Let's build the GPT Tokenizer

Andrej Karpathy

Feb 20, 05:11 PM · 1,069,817 views · YouTube

Highlights: The video explains that tokenizers are a separate, crucial component in LLMs, using Byte Pair Encoding to translate between text and tokens. It demonstrates building the GPT tokenizer from scratch, highlighting its distinct training process and core encode/decode functions.

Worth watching: Clarifies a fundamental yet often overlooked part of how LLMs process text, presented clearly by an expert in the field.
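
One round of the Byte Pair Encoding idea described here can be sketched in a few lines (illustrative only; the `most_common_pair` and `merge` helper names are hypothetical, and a real tokenizer repeats this merge many times to build its vocabulary):

```python
from collections import Counter

def most_common_pair(ids):
    """Find the most frequent adjacent pair of token ids."""
    return Counter(zip(ids, ids[1:])).most_common(1)[0][0]

def merge(ids, pair, new_id):
    """Replace every occurrence of `pair` with the new token id."""
    out, i = [], 0
    while i < len(ids):
        if i < len(ids) - 1 and (ids[i], ids[i + 1]) == pair:
            out.append(new_id)
            i += 2
        else:
            out.append(ids[i])
            i += 1
    return out

ids = list("aaabdaaabac".encode("utf-8"))  # start from raw UTF-8 bytes
pair = most_common_pair(ids)               # here: (97, 97), i.e. "aa"
ids = merge(ids, pair, 256)                # 256 = first id beyond raw bytes
```

Decoding reverses the merges back to bytes, which is why the tokenizer's vocabulary and its training corpus are entirely separate from the language model itself.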

Let's reproduce GPT-2 (124M)

Andrej Karpathy

Jun 9, 11:31 PM · 1,048,747 views · YouTube

Highlights: This video provides a comprehensive, hands-on walkthrough of reproducing the GPT-2 (124M) model from scratch, covering network architecture, training optimization, and hyperparameter tuning based on original papers. It demonstrates the full training pipeline with practical implementation details and concludes with generated text samples to evaluate model performance.

Worth watching: Valuable for understanding transformer-based language model implementation and training optimization, presented with clear, practical demonstrations by a renowned AI educator.

Deep Dive into LLMs like ChatGPT

Andrej Karpathy

Feb 5, 06:23 PM · 6,058,917 views · YouTube

Highlights: This video provides a comprehensive overview of how Large Language Models like ChatGPT are developed, covering the full training stack from data collection to deployment. It also offers practical mental models for understanding their 'psychology' and optimizing their use in real-world applications.

Worth watching: Andrej Karpathy, a leading AI researcher, delivers an accessible yet thorough explanation that bridges technical depth with practical application insights, making complex LLM concepts understandable for general audiences.

How I use LLMs

Andrej Karpathy

Feb 27, 10:29 PM · 2,358,015 views · YouTube

Highlights: The video provides a practical, example-driven walkthrough of how to effectively use Large Language Models in daily life, covering everything from basic interactions to understanding pricing tiers and model selection. It demystifies the growing LLM ecosystem by showing concrete applications and explaining when to use different models.

Worth watching: Andrej Karpathy's expertise and clear teaching style make complex AI concepts accessible, offering actionable insights for both beginners and experienced users looking to optimize their LLM usage.

LLM Knowledge Bases. Something I'm finding very useful recently: using LLMs to build personal knowledge bases for various topics of research interest. In this way, a large fraction of my recent token throughput is going less into manipulating code, and more into manipulating...

Highlights: Karpathy is shifting focus from coding to using LLMs to build and manage personal knowledge bases for research, indicating a move towards knowledge compounding and organization.

Worth reading: It reveals a practical, high-level workflow shift for an AI expert, moving from pure code generation to structured knowledge management using LLMs.

LLM · RAG · Tooling
LLMs are emerging as a new kind of intelligence, simultaneously a lot smarter than I expected and a lot dumber than I expected.

Highlights: Karpathy expresses the dual-nature surprise of LLM capabilities, acknowledging both their advanced and surprisingly limited aspects.

Worth reading: It captures a nuanced, expert perspective on the current state and paradoxical nature of LLM intelligence.

LLM · Evaluation
Three days ago I left autoresearch tuning nanochat for ~2 days on depth=12 model.

Highlights: Karpathy is experimenting with automated research and fine-tuning processes for a smaller model (nanochat), indicating hands-on work in model optimization.

Worth reading: It shows direct, technical experimentation with automated fine-tuning workflows on specific model architectures.

Fine-tuning · Agent · Tooling
The hottest new programming language is English

Highlights: English is becoming the primary interface for programming and interacting with AI systems, suggesting a shift toward natural language as a programming paradigm.

Worth reading: It highlights the fundamental shift in how humans will interact with and instruct computational systems.

LLM · Tooling
Judging by my tl there is a growing gap in understanding of AI capability. The first issue I think is around recency and tier of use. I think a lot of people tried the free tier of ChatGPT somewhere last year and allowed it to inform their views on AI a little too much. This is...

Highlights: Public understanding of AI capabilities is lagging, partly because many formed opinions based on outdated or limited (free-tier) experiences with models like ChatGPT.

Worth reading: It addresses the perception gap in AI progress, which is crucial for realistic public and professional discourse.

LLM · Evaluation
Very interested in what the coming era of highly bespoke software might look like. Example from this morning - I've become a bit loosy goosy with my cardio recently so I decided to do a more srs, regimented experiment to try to lower my Resting Heart Rate from 50 -> 45, over https://t.co/EDULdIpWmE

Highlights: Karpathy is experimenting with personalized software for health tracking, specifically aiming to lower his resting heart rate through a structured approach.

Worth reading: It illustrates the trend towards highly customized, personal software applications driven by individual needs.

Tooling
2025 LLM Year in Review

Highlights: Karpathy published a review article summarizing key developments and trends in the LLM field for the year 2025.

Worth reading: Provides an expert retrospective on the state of LLM technology from a leading AI researcher.

LLM · Evaluation
A few random notes from claude coding quite a bit last few weeks. Coding workflow. Given the latest lift in LLM coding capability, like many others I rapidly went from about 80% manual+autocomplete coding and 20% agents in November to 80% agent coding and 20% edits+touchups in

Highlights: Karpathy's coding workflow has dramatically shifted from mostly manual coding to predominantly using AI agents for code generation, with human input reduced to editing and touch-ups.

Worth reading: It demonstrates a significant, rapid shift in developer productivity and workflow due to advances in LLM coding assistants.

Agent · LLM · Tooling