Timeline

156 items from all sources, sorted by time

My bets on open models, mid-2026

Nathan Lambert·Apr 15, 2026

What I expect to come next and why, focused on the open-closed gap.

Highlights: The author predicts that by mid-2026, the gap between open and closed AI models will significantly narrow, with open models achieving performance parity in key areas. This shift is expected to be driven by advancements in training efficiency, data curation, and collaborative development within the open-source community.

Worth reading: It offers a forward-looking perspective on the evolving AI landscape, grounded in technical trends, making it valuable for developers, researchers, and anyone interested in the future of accessible AI technology.

Blog

Tue, Apr 14

What I’ve been building: ATOM Report, post-training course, finishing my book, and ongoing research

Nathan Lambert·Apr 14, 2026

What I've been up to!

Highlights: The post offers a personal update on Nathan Lambert's multifaceted contributions to AI/ML, including the ATOM Report for technical insights, a post-training course for practical education, and his book for broader dissemination of knowledge. It highlights the importance of bridging research, education, and community engagement in advancing the field.

Worth reading: It provides a concise overview of current projects from an active researcher, useful for those interested in AI/ML trends, educational resources, or community contributions.

Blog

Thu, Apr 9

Claude Mythos and misguided open-weight fearmongering

Nathan Lambert·Apr 9, 2026

Another dance around fears of open-source.

Highlights: The post critiques the 'Claude Mythos' narrative that overstates risks of open-weight AI models, arguing it's a form of fearmongering that distracts from more substantive discussions. It suggests this pattern reflects recurring anxieties in open-source debates rather than new, evidence-based concerns.

Worth reading: It offers a critical perspective on current AI discourse, challenging common assumptions about open-source risks and encouraging more nuanced evaluation of model accessibility.

Blog

Sat, Apr 11

The inevitable need for an open model consortium

Nathan Lambert·Apr 11, 2026

And yes, I hate consortia too.

Highlights: The article argues that despite general skepticism toward consortia, the AI field urgently requires an open model consortium to ensure transparency, collaboration, and ethical standards. This collective approach is framed as essential for addressing the rapid, often opaque advancements in AI development.

Worth reading: It offers a pragmatic perspective on overcoming industry fragmentation and highlights the critical role of open collaboration in shaping responsible AI innovation.

Blog

Thu, Apr 16

Marc Lanctot

@sharky6000.bsky.social

Last summer, I posted a thread and screenshots of ny replay of Castles of Dr. Creep on a #c64 emulator. This year I am replaying the sequel: Dungeons of Dr. Creep on my #commodore64 Ultimate. Tonight I finished a really hard level called Dirty Tricks! 🙌😁 Here is a thread and a few shorts 👇👇👇

Apr 16, 02:50 AM·❤️ 1🔄 1·💬 1

Simon Willison

@simonwillison.net

A claude.ai feature I really like is you can tell it to "clone x/y from GitHub" and it can then answer questions about a repo, or use snippets of code from that repo to help build new artifacts - used that just now to solve a minor friction simonwillison.net/2026/Apr/16/...

Apr 16, 12:31 AM·❤️ 34🔄 0·💬 3

LLMTooling

Wed, Apr 15

Emily M. Bender

@emilymbender.bsky.social

Last year, someone (specifically, OUP) asked me to write an encyclopedia entry for "AI". I've just finished reviewing the copy edits, so hopefully it will be in the world soon. Meanwhile, a teaser: >>

Apr 15, 10:47 PM·❤️ 72🔄 18·💬 3

Amy Zhang

@axz.bsky.social

Feeling FOMO that I can't be at #CHI2026 this year but please check out all the great work that our @socialfutureslab.bsky.social + friends are presenting (see below for paper links). And say hi to @kjfeng.me @aliciaguo.com, Katie Yurechko, and Tony Zhou who are at the conference!

Apr 15, 09:59 PM·❤️ 8🔄 1·💬 1

Ethan Mollick

@emollick.bsky.social

Instead of the gold standard, we can, as a thought experiment, imagine an inference standard of exchange, the FLOP. (As opposed to tokens, this accounts for AI ability) With some AI help, I figure $1 buys roughly 10^17 managed-LLM inference FLOPs So that $4 coffee would cost half an exaFLOP, choom

Apr 15, 07:45 PM·❤️ 27🔄 0·💬 3

LLMInfra

Nathan Lambert

@natolambert.bsky.social

I spent some time trying to distill all the complex factors impacting open models -- economics, capabilities, distribution, policy, etc. -- into a clear list of beliefs. Here they are in full. www.interconnects.ai/p/my-bets-on...

Apr 15, 06:32 PM·❤️ 24🔄 4·💬 1

LLMDeployment

Simon Willison

@simonwillison.net

The example prompt for Google's new Gemini Flash TTS text-to-speed model is a lot simonwillison.net/2026/Apr/15/...

Apr 15, 05:16 PM·❤️ 60🔄 7·💬 9

LLMMulti-modal

Ethan Mollick

@emollick.bsky.social

This is becoming a pattern in AI that makes talking about capabilities challenging. First, there are overstated claims (like the flubbed Erdos problems that were announced last year), then minor wins (AI helps with discovery) then breakthroughs. The first stage feels like (& often is) hype, but…

Apr 15, 05:10 PM·❤️ 63🔄 7·💬 7

Evaluation

Mark Riedl

@markriedl.bsky.social

On my way to give a talk at CNN’s NYC headquarters. Taking the opportunity to wear a niche AI humor t-shirt that probably only made sense in 2016

Apr 15, 05:00 PM·❤️ 34🔄 0·💬 2

Mark Riedl

@markriedl.bsky.social

Huh?

Apr 15, 03:48 PM·❤️ 10🔄 0·💬 7

Mark Riedl

@markriedl.bsky.social

Hey computer science faculty peeps! Are we prepared for the near future where every high school student and incoming college freshman has vibe-coded an AI agent as high school “research”? Exciting. And scary. We are going to need to update our priors.

Apr 15, 03:02 PM·❤️ 26🔄 4·💬 5

AgentDeployment

hardmaru

@hardmaru.bsky.social

We are hiring Software Engineers in Tokyo to help us scale Sakana AI’s R&D efforts. If you are interested in building the data pipelines and full stack infrastructure needed to push the boundaries of automated scientific discovery, we would love to hear from you. 🗼🎌 sakana.ai/careers/#sof...

Apr 15, 02:31 PM·❤️ 8🔄 2·💬 1

InfraDeployment

Ben Recht

@beenwrekt.bsky.social

The long legacy of simulation in control theory and what it can teach us about transferring policies from GPU to reality.

Apr 15, 02:28 PM·❤️ 11🔄 0

DeploymentInfra

Thomas Dietterich

@tdietterich.bsky.social

I'm late to the game -- I only recently discovered @techtrenches.dev Highly recommended reading!

Apr 15, 04:35 AM·❤️ 11🔄 3

zalandoresearch/pytorch-vq-vae

Jupyter Notebook⭐ 602·starred by pcuenca

PyTorch implementation of VQ-VAE by Aäron van den Oord et al.

Highlights: This repository provides a PyTorch implementation of Vector Quantized Variational Autoencoder (VQ-VAE), a neural architecture that learns discrete latent representations for images. It demonstrates how to use vector quantization in the latent space to capture important features while maintaining reconstruction quality.

Worth reading: The implementation is clean and well-documented, making it an excellent educational resource for understanding how VQ-VAEs work and how to implement them in PyTorch.

deep-learningpytorchvaevq-vae

DWarez/kernels_bench

Python⭐ 9·starred by sayakpaul

Highlights: This project appears to benchmark computational kernels, likely focusing on performance comparisons of core operations in Python. It provides a framework for evaluating execution speed and efficiency across different implementations or hardware configurations.

Worth reading: For developers working on performance-critical applications, it offers insights into optimizing computational kernels and understanding performance trade-offs.

EvaluationTooling

millionco/claude-doctor

TypeScript⭐ 198·starred by philschmid

Diagnose your Claude Code sessions

Highlights: This project provides diagnostic tools for Claude Code sessions, helping developers identify issues and optimize their interactions with Claude's coding capabilities. It offers session analysis and debugging features specifically tailored for Claude's code generation workflows.

Worth reading: It addresses a practical need for developers working with Claude's coding features, providing insights that can improve productivity and code quality.

claudecodedoctor

ToolingEvaluation

Tue, Apr 14

Nathan Lambert

@natolambert.bsky.social

One of my key strategies with Interconnects is to develop the practice of making my work obviously compelling to a wider audience, keeping them hooked over time and wondering what I'm up to, etc. www.interconnects.ai/p/what-ive-b...

Apr 14, 09:06 PM·❤️ 7🔄 0·💬 1

Deployment