Intelligence.Log

2026-04-15

Extracted: 55 items. Sources: GitHub, Bluesky, X, Blogs.

++ AI OVERVIEW ++

Today's discourse highlights the practical and philosophical frontiers of AI. On the technical side, there's a resurgence of interest in foundational models, with a PyTorch VQ-VAE implementation gaining stars, while Nathan Lambert and Ethan Mollick tackle complex questions about open model economics and a potential "FLOP standard" for measuring AI value. Meanwhile, leaders like Mark Riedl and Emily M. Bender are looking ahead, pondering the educational impact of a generation fluent in "vibe-coding" and carefully defining the term "AI" itself for a major publication.

grep TOPIC=

grep SOURCE=

sort --by=

millionco/claude-doctor★ 0.2k▲ 6/10

Diagnose your Claude Code sessions

Starred byphilschmid|[Tooling][Evaluation]

“This project provides diagnostic tools for Claude Code sessions, helping developers identify issues and optimize their interactions with Claude's coding capabilities. It offers session analysis and debugging features specifically tailored for Claude's code generation workflows.”

zalandoresearch/pytorch-vq-vae★ 0.6k▲ 3/10

PyTorch implementation of VQ-VAE by Aäron van den Oord et al.

Starred bypcuenca|[Multi-modal]

“This repository provides a PyTorch implementation of Vector Quantized Variational Autoencoder (VQ-VAE), a neural architecture that learns discrete latent representations for images. It demonstrates how to use vector quantization in the latent space to capture important features while maintaining reconstruction quality.”

DWarez/kernels_bench★ 0.0k▲ 2/10

Starred bysayakpaul|[Evaluation][Tooling]

“This project appears to benchmark computational kernels, likely focusing on performance comparisons of core operations in Python. It provides a framework for evaluating execution speed and efficiency across different implementations or hardware configurations.”

BSKY

Simon WillisonApr 15, 05:16 PM

The example prompt for Google's new Gemini Flash TTS text-to-speed model is a lot simonwillison.net/2026/Apr/15/...

❤️ 60 Likes|[LLM][Multi-modal]

BSKY

Mark RiedlApr 15, 05:00 PM

On my way to give a talk at CNN’s NYC headquarters. Taking the opportunity to wear a niche AI humor t-shirt that probably only made sense in 2016

❤️ 34 Likes|

BSKY

Mark RiedlApr 15, 03:48 PM

Huh?

❤️ 10 Likes|

BSKY

Mark RiedlApr 15, 03:02 PM

Hey computer science faculty peeps! Are we prepared for the near future where every high school student and incoming college freshman has vibe-coded an AI agent as high school “research”? Exciting. And scary. We are going to need to update our priors.

❤️ 26 Likes|[Agent][Deployment]

BSKY

Thomas DietterichApr 15, 04:35 AM

I'm late to the game -- I only recently discovered @techtrenches.dev Highly recommended reading!

❤️ 11 Likes|

BSKY

Nathan LambertApr 15, 06:32 PM

I spent some time trying to distill all the complex factors impacting open models -- economics, capabilities, distribution, policy, etc. -- into a clear list of beliefs. Here they are in full. www.interconnects.ai/p/my-bets-on...

❤️ 24 Likes|[LLM][Deployment]

BSKY

hardmaruApr 15, 02:31 PM

We are hiring Software Engineers in Tokyo to help us scale Sakana AI’s R&D efforts. If you are interested in building the data pipelines and full stack infrastructure needed to push the boundaries of automated scientific discovery, we would love to hear from you. 🗼🎌 sakana.ai/careers/#sof...

❤️ 8 Likes|[Infra][Deployment]

BSKY

Ethan MollickApr 15, 07:45 PM

Instead of the gold standard, we can, as a thought experiment, imagine an inference standard of exchange, the FLOP. (As opposed to tokens, this accounts for AI ability) With some AI help, I figure $1 buys roughly 10^17 managed-LLM inference FLOPs So that $4 coffee would cost half an exaFLOP, choom

❤️ 27 Likes|[LLM][Infra]

BSKY

Ethan MollickApr 15, 05:10 PM

This is becoming a pattern in AI that makes talking about capabilities challenging. First, there are overstated claims (like the flubbed Erdos problems that were announced last year), then minor wins (AI helps with discovery) then breakthroughs. The first stage feels like (& often is) hype, but…

❤️ 63 Likes|[Evaluation]

BSKY

Emily M. BenderApr 15, 10:47 PM

Last year, someone (specifically, OUP) asked me to write an encyclopedia entry for "AI". I've just finished reviewing the copy edits, so hopefully it will be in the world soon. Meanwhile, a teaser: >>

❤️ 72 Likes|

BSKY

Ben RechtApr 15, 02:28 PM

The long legacy of simulation in control theory and what it can teach us about transferring policies from GPU to reality.

❤️ 11 Likes|[Deployment][Infra]

BSKY

Amy ZhangApr 15, 09:59 PM

Feeling FOMO that I can't be at #CHI2026 this year but please check out all the great work that our @socialfutureslab.bsky.social + friends are presenting (see below for paper links). And say hi to @kjfeng.me @aliciaguo.com, Katie Yurechko, and Tony Zhou who are at the conference!

❤️ 8 Likes|

Andrej Karpathy@karpathy

LLM Knowledge Bases. Something I'm finding very useful recently: using LLMs to build personal knowledge bases for various topics of research interest. In this way, a large fraction of my recent token throughput is going less into manipulating code, and more into manipulating...

[LLM][RAG][Tooling]

“DeepSeek Summary: Karpathy is shifting focus from coding to using LLMs to build and manage personal knowledge bases for research, indicating a move towards knowledge compounding and organization.”

Andrej Karpathy@karpathy

LLMs are emerging as a new kind of intelligence, simultaneously a lot smarter than I expected and a lot dumber than I expected.

[LLM][Evaluation]

“DeepSeek Summary: Karpathy expresses the dual-nature surprise of LLM capabilities, acknowledging both their advanced and surprisingly limited aspects.”

Andrej Karpathy@karpathy

Three days ago I left autoresearch tuning nanochat for ~2 days on depth=12 model.

[Fine-tuning][Agent][Tooling]

“DeepSeek Summary: Karpathy is experimenting with automated research and fine-tuning processes for a smaller model (nanochat), indicating hands-on work in model optimization.”

Simon Willison@simonw

Screenshot from a video game where a team of raccoons go on a heist

[Multi-modal]

“DeepSeek Summary: Simon Willison shares an AI-generated image prompt result showing creative multi-modal AI capabilities.”

Simon Willison@simonw

It's interesting how 'better at code' has become the defining goal of almost every AI lab over the...

[Agent][Evaluation]

“DeepSeek Summary: Willison observes the AI industry's intense focus on improving coding capabilities as a primary benchmark for progress.”

Soumith Chintala@soumithchintala

reading 'AI News' (previously Smol Talk) is probably the highest-leverage 45 mins

[LLM]

“DeepSeek Summary: Soumith Chintala recommends 'AI News' (formerly Smol Talk) as a highly valuable 45-minute investment for staying informed about AI developments.”

Soumith Chintala@soumithchintala

Open LLMs need to get organized and co-ordinated about sharing human feedback.

[LLM][Evaluation]

“DeepSeek Summary: Soumith Chintala calls for better organization and coordination among open LLM projects regarding the sharing of human feedback data.”

Soumith Chintala@soumithchintala

MacStudio you ask? Apple Engineering's actual time spent on PyTorch support

[Infra][Deployment]

“DeepSeek Summary: Soumith Chintala comments on Apple's engineering investment in PyTorch support, likely in the context of Mac Studio hardware.”

Soumith Chintala@soumithchintala

Sometimes we forget that NVIDIA wins because it's a software company.

[Infra][Tooling]

“DeepSeek Summary: Soumith Chintala reminds that NVIDIA's success is fundamentally driven by its software capabilities, not just hardware.”

Francois Chollet@fchollet

I think it's clear that for many smaller companies that invested in deep learning, it turned out

[Evaluation][Deployment]

“DeepSeek Summary: Chollet suggests that deep learning investments haven't paid off for many smaller companies, implying practical limitations or misalignment with business needs.”

David Ha@hardmaru

David Ha @hardmaru and team are super practical scientifically research driven geniuses . And this is amazing to see ‍ ‍

[Evaluation]

“DeepSeek Summary: A third party praises David Ha (@hardmaru) and his team for being practical, scientifically research-driven geniuses.”

Max Woolf@minimaxir

Impressive model based on a few minutes of playing, but disappointing to see no mention at all of a model card, red teaming, yesterday's incident,

[Safety][Evaluation]

“DeepSeek Summary: Max Woolf critiques a new AI model for lacking proper documentation (model card) and safety testing (red teaming), while also referencing a recent incident.”

Max Woolf@minimaxir

me irl

“DeepSeek Summary: A personal, casual post expressing a relatable feeling or situation.”

Max Woolf@minimaxir

“DeepSeek Summary: A tweet with engagement (19 likes) but no visible text content in the provided snippet.”

Max Woolf@minimaxir

“DeepSeek Summary: A tweet with significant views (468) but no visible text content in the provided snippet.”

Phil Wang@lucidrains

Stand-up comedy's gain is game criticism's loss: comedian @PhilNWang speaks with clarity and insight about the five video games he would like to

“DeepSeek Summary: Phil Wang discusses video games with clarity and insight, suggesting his transition from game criticism to comedy.”

Phil Wang@lucidrains

Phil Wang // Insta: @wangpix (@PhilNWang). 324 likes 24 replies. I got to cover for the excellent @HadleyFreeman in the Guardian today so

“DeepSeek Summary: Phil Wang mentions covering for Hadley Freeman at The Guardian, indicating journalistic work.”

Sasha Rush@srush_io

Some personal news: I recently joined Cursor. Cursor is a small, ambitious team, and they've created...

[Tooling]

“DeepSeek Summary: Sasha Rush announces joining Cursor, describing it as a small, ambitious team.”

Stas Bekman@stas00

If you were holding off to try @MSFTDeepSpeed ZeRO++ it looks like deepspeed@master should

[Infra][Deployment]

“DeepSeek Summary: Suggests that DeepSpeed's ZeRO++ feature is now available in the master branch and worth trying.”

Stas Bekman@stas00

If you're trying out FA4, you're likely to run into not being able to load cutlass.cute

[Infra][Tooling]

“DeepSeek Summary: Warns about a potential issue when experimenting with FA4 (likely FlashAttention 4) related to loading cutlass.cute.”

Stas Bekman@stas00

Thanks to an awesome contribution from @omarnomad The Machine Learning Engineering Open book now can

[Tooling]

“DeepSeek Summary: Acknowledges a contribution that enhanced the 'Machine Learning Engineering Open book'.”

Stas Bekman@stas00

Classical Jensen math. Unidirectional bandwidth is topped at 450GB/s, and then there comes a protocol overhead of two digit percentage.

[Infra]

“DeepSeek Summary: Discusses a hardware bandwidth limit (450GB/s) and significant protocol overhead affecting performance.”

Sayak Paul@sayakpaul

Had a nice time chatting about the state of diffusion models and some text-to-image data shenanigans at

[Multi-modal]

“DeepSeek Summary: Sayak Paul discusses the current state of diffusion models and text-to-image data issues.”

Sayak Paul@sayakpaul

Details:

“DeepSeek Summary: A brief post by Sayak Paul sharing unspecified details.”

Sayak Paul@sayakpaul

Giving a talk here is by far the most fulfilling experience of my life!

“DeepSeek Summary: Sayak Paul expresses that giving a talk was an extremely fulfilling personal experience.”

Philipp Schmid@philschmid

How to use Gemma 4 with the Gemini API and Google AI Studio. www.philschmid.de.

[LLM][Tooling]

“DeepSeek Summary: A technical guide on integrating Gemma 4 with Google's Gemini API and AI Studio platform.”

Philipp Schmid@philschmid

Random thought. We are going to be so much faster at creating and building.

“DeepSeek Summary: An optimistic reflection on the accelerating pace of innovation and development capabilities.”

Ethan Mollick@emollick

In discussions of AI and jobs, we put too much emphasis on the technology and not enough on...

[Deployment]

“DeepSeek Summary: Critiques the overemphasis on AI technology itself in job discussions, suggesting other factors are being overlooked.”

Ethan Mollick@emollick

As stories about AI increasingly become stories of either catastrophe or salvation,...

[Safety]

“DeepSeek Summary: Observes that AI narratives are polarizing into extremes of doom or utopia, missing more balanced discussions.”

Ethan Mollick@emollick

As someone involved in academic research on AI, it is notable to me that most of the key...

[Evaluation]

“DeepSeek Summary: Notes from an academic research perspective that key developments in AI are happening outside traditional academic institutions.”

Ethan Mollick@emollick

So much work is going into faking continual learning and memory for AIs,...

[LLM][Tooling]

“DeepSeek Summary: Comments on the significant engineering effort being devoted to simulating continuous learning and memory capabilities in AI systems.”

Emily M. Bender@emilymbender

For those playing along at home, here's a 'AI is sentient!' argument bingo card.

[LLM][Safety][Evaluation]

“DeepSeek Summary: Emily M. Bender shares a bingo card mocking common arguments about AI sentience, highlighting her critical stance on AI hype.”

Emily M. Bender@emilymbender

This is infuriating and also was totally predictable. Thank you @daveyalba

[Safety][Evaluation][Deployment]

“DeepSeek Summary: Emily M. Bender expresses frustration over a predictable issue in AI, likely related to ethics or misuse, acknowledging another user's contribution.”

Naomi Saphra@NaomiSaphra

If you still don't know what you're doing Friday at #ICML2024, I'm going to suggest our HiLD

[LLM][Evaluation]

“DeepSeek Summary: Promoting a HiLD event at ICML 2024 conference, indicating involvement in machine learning research community.”

Naomi Saphra@NaomiSaphra

I'll meet you at this button.

[Agent]

“DeepSeek Summary: Brief, possibly cryptic tweet that could reference a specific location or inside joke within the ML community.”

Naomi Saphra@NaomiSaphra

I work on understanding and improving training for NLP models, with a focus on studying how structures and mechanistic behaviors emerge over the

[LLM][Fine-tuning][Evaluation]

“DeepSeek Summary: Describes research focus on NLP model training dynamics and emergent mechanistic behaviors in language models.”

Ben Recht@beenwrekt

For the first time in almost a decade, I'm teaching a class on learning and control.

[Evaluation]

“DeepSeek Summary: Ben Recht is returning to teaching a course on learning and control after nearly ten years.”

Ben Recht@beenwrekt

Part one of a new blog series: using the discovery of vitamins as a parable for why replication crises in [science/ML] matter.

[Evaluation]

“DeepSeek Summary: Introduces a blog series using the history of vitamin discovery as an analogy to discuss the importance of replication in scientific and machine learning research.”

Ben Recht@beenwrekt

Revisiting Sutton's Bitter Lesson in the wake of GPT-5.

[LLM][Evaluation]

“DeepSeek Summary: Re-examines the 'Bitter Lesson'—the idea that general methods leveraging computation scale best—in the context of advanced models like GPT-5.”

BLOG

My bets on open models, mid-2026

What I expect to come next and why, focused on the open-closed gap.

By Nathan Lambert

“The author predicts that by mid-2026, the gap between open and closed AI models will significantly narrow, with open models achieving performance parity in key areas. This shift is expected to be driven by advancements in training efficiency, data curation, and collaborative development within the open-source community.”

-- END OF LOG --

[STATS] 55 items · Filter applied