Global Timeline
Nathan Lambert·Apr 15, 2026
What I expect to come next and why, focused on the open-closed gap.
Highlights: The author predicts that by mid-2026, the gap between open and closed AI models will significantly narrow, with open models achieving performance parity in key areas. This shift is expected to be driven by advancements in training efficiency, data curation, and collaborative development within the open-source community.
Worth reading: It offers a forward-looking perspective on the evolving AI landscape, grounded in technical trends, making it valuable for developers, researchers, and anyone interested in the future of accessible AI technology.
Nathan Lambert·Apr 14, 2026
What I've been up to!
Highlights: The post offers a personal update on Nathan Lambert's multifaceted contributions to AI/ML, including the ATOM Report for technical insights, a post-training course for practical education, and his book for broader dissemination of knowledge. It highlights the importance of bridging research, education, and community engagement in advancing the field.
Worth reading: It provides a concise overview of current projects from an active researcher, useful for those interested in AI/ML trends, educational resources, or community contributions.
Nathan Lambert·May 12, 2026
Further reflections on China's high-participation, open-first AI ecosystem.
Highlights: The post explores how open model ecosystems, particularly in China, create compounding benefits through high participation and open-first strategies. It argues that openness accelerates innovation and leads to more robust AI development compared to closed systems.
Worth reading: It provides a unique perspective on the dynamics of open AI ecosystems, especially in the context of China's approach, which is often underrepresented in Western discussions.
Another dance around fears of open-source.
Highlights: The post critiques the 'Claude Mythos' narrative that overstates risks of open-weight AI models, arguing it's a form of fearmongering that distracts from more substantive discussions. It suggests this pattern reflects recurring anxieties in open-source debates rather than new, evidence-based concerns.
Worth reading: It offers a critical perspective on current AI discourse, challenging common assumptions about open-source risks and encouraging more nuanced evaluation of model accessibility.
Nathan Lambert·May 7, 2026
Lessons from my trip to talk to most of the leading AI labs in China.
Highlights: China's AI labs are highly focused on practical applications and large-scale engineering, often prioritizing rapid iteration over theoretical novelty. The ecosystem is characterized by intense competition, strong government support, and a unique blend of open-source contributions and proprietary development.
Worth reading: Offers rare firsthand insights into China's AI landscape, revealing how cultural and policy differences shape research priorities and innovation cycles.
Eugene Yan·May 3, 2026
Context as infra, taste as config, verification for autonomy, scale via delegation, closing the loop.
Highlights: The post frames working with AI as a compound process, where context serves as infrastructure, taste as configuration, and verification enables autonomy. It emphasizes scaling through delegation and closing the loop for continuous improvement. The core insight is that effective AI collaboration requires intentional design of these elements.
Worth reading: It offers a practical framework for integrating AI into workflows, moving beyond hype to actionable strategies. The author's experience with applied AI provides credible, nuanced advice for practitioners.
A learning-oriented workflow for understanding new open-weight model releases
Highlights: The post presents a systematic, learning-focused approach for analyzing new open-weight LLM architectures, emphasizing practical understanding over theoretical abstraction. It likely details a repeatable workflow that helps practitioners efficiently grasp architectural innovations and their implications.
Worth reading: It offers actionable guidance for staying current with rapidly evolving LLM releases, making it valuable for developers, researchers, and enthusiasts seeking to deepen their practical understanding of model architectures.
Nathan Lambert·May 16, 2026
An eventful month with one flagship release after another
Highlights: The post reviews a wave of major open model releases including Gemma 4, DeepSeek V4, Kimi K2.6, MiMo 2.5, and GLM-5.1, highlighting the rapid pace of innovation in open AI. It focuses on CAISI's V4 assessment, providing a comparative analysis of performance and capabilities across these models.
Worth reading: If you follow open-source AI, this post offers a concise yet comprehensive snapshot of the latest model landscape, saving you hours of individual paper reviews.
Sebastian Raschka·May 16, 2026
From Gemma 4 to DeepSeek V4, How New Open-Weight LLMs Are Reducing Long-Context Costs
Highlights: This post covers recent advances in LLM architectures aimed at reducing memory and compute costs for long-context processing, including KV sharing, multi-head caching (mHC), and compressed attention mechanisms. Key examples include Gemma 4's and DeepSeek V4's approaches to efficient attention, which enable handling longer sequences without proportional resource increases.
Worth reading: For practitioners and researchers working with LLMs, this article provides a concise overview of cutting-edge techniques that address the scalability bottleneck of long-context models, offering practical insights into how open-weight models are evolving.
And yes, I hate consortia too.
Highlights: The article argues that despite general skepticism toward consortia, the AI field urgently requires an open model consortium to ensure transparency, collaboration, and ethical standards. This collective approach is framed as essential for addressing the rapid, often opaque advancements in AI development.
Worth reading: It offers a pragmatic perspective on overcoming industry fragmentation and highlights the critical role of open collaboration in shaping responsible AI innovation.
The complex factors that determine the single evaluation number so many focus on. Plus, how this changes in the future.
Highlights: The post critiques the oversimplification of AI performance metrics, particularly the 'open-closed performance gap' often reduced to a single number. It argues this gap is shaped by complex, interdependent factors beyond simple comparisons, and explores how these dynamics might evolve with future AI advancements.
Worth reading: It offers a nuanced perspective on evaluating AI systems, moving beyond surface-level metrics to consider underlying complexities and future implications, which is valuable for practitioners and enthusiasts seeking deeper understanding.
Nathan Lambert·May 4, 2026
‘Distillation attacks’ is a horrible term for what is happening right now.
Highlights: The post criticizes the term 'distillation attacks' as misleading and argues that the current trend of smaller models learning from larger ones is a natural and beneficial progression in AI development.
Worth reading: It offers a clear, critical perspective on a hot topic in AI, helping readers understand the nuances of model distillation beyond the hype.
@stas00
Highlights: Stas Bekman warns about a common issue with FA4 (Flash Attention 4) involving cutlass.cute loading.
Worth reading: Useful for developers experimenting with Flash Attention 4.
@emollick.bsky.social
Ghostty-based macOS terminal with vertical tabs and notifications for AI coding agents
Python supercharged for the fastai library
@tdietterich.bsky.social
Observable Plot charts for Datasette Agent
A comparison of many lossless image compression formats.
Mobile companion for Google's Antigravity IDE. Mirror AI sessions on your phone, send messages, stop generation, automate via 9 MCP tools or OpenAPI.
@emollick.bsky.social
@emollick.bsky.social