Intelligence.Log

2026-04-15

Extracted: 55 items. Sources: GitHub, Bluesky, X, Blogs.
++ AI OVERVIEW ++
Today's discourse spans the practical and philosophical frontiers of AI. On the technical side, a PyTorch VQ-VAE implementation is drawing GitHub stars, while Nathan Lambert lays out his beliefs on open model economics and Ethan Mollick floats a "FLOP standard" thought experiment for pricing goods in inference compute. Meanwhile, Mark Riedl asks whether computer science faculty are ready for incoming students who have already vibe-coded AI agents, and Emily M. Bender reports finishing an encyclopedia entry on "AI" for OUP.
GH
millionco/claude-doctor | 0.2k | 6/10

Diagnose your Claude Code sessions

Starred by philschmid | [Tooling][Evaluation]
This project provides diagnostic tools for Claude Code sessions, helping developers identify issues and optimize their interactions with Claude's coding capabilities. It offers session analysis and debugging features specifically tailored for Claude's code generation workflows.
GH

PyTorch implementation of VQ-VAE by Aäron van den Oord et al.

Starred by pcuenca | [Multi-modal]
This repository provides a PyTorch implementation of Vector Quantized Variational Autoencoder (VQ-VAE), a neural architecture that learns discrete latent representations for images. It demonstrates how to use vector quantization in the latent space to capture important features while maintaining reconstruction quality.
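The quantization step described above can be sketched in a few lines. This is a minimal illustration in plain Python, not the repository's code: the function names, the toy codebook, and the sample latents are all assumptions made for the example, and a real implementation would use PyTorch tensors and a learned codebook.

```python
# Minimal sketch of vector quantization: each continuous latent vector is
# replaced by its nearest entry in a codebook. All names and values here are
# hypothetical; they are not taken from the starred repository.

def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def quantize(latents, codebook):
    """Return (indices, quantized vectors) for a list of latent vectors."""
    indices = []
    quantized = []
    for z in latents:
        # Pick the codebook row closest to this latent vector.
        idx = min(range(len(codebook)),
                  key=lambda k: squared_distance(z, codebook[k]))
        indices.append(idx)
        quantized.append(codebook[idx])
    return indices, quantized

# Toy codebook with three 2-D entries, and three encoder outputs.
codebook = [[0.0, 0.0], [1.0, 1.0], [-1.0, 1.0]]
latents = [[0.9, 1.2], [0.1, -0.2], [-0.8, 0.7]]

indices, quantized = quantize(latents, codebook)
print(indices)    # discrete codes, one per latent vector
print(quantized)  # the codebook vectors a decoder would receive
```

The discrete indices are what make the latent representation "discrete"; the decoder only ever sees codebook vectors.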
GH
DWarez/kernels_bench | 0.0k | 2/10

Starred by sayakpaul | [Evaluation][Tooling]
This project appears to benchmark computational kernels, likely focusing on performance comparisons of core operations in Python. It provides a framework for evaluating execution speed and efficiency across different implementations or hardware configurations.
BSKY
simonwillison.net Simon Willison

The example prompt for Google's new Gemini Flash TTS text-to-speech model is a lot simonwillison.net/2026/Apr/15/...

❤️ 60 Likes | [LLM][Multi-modal]
BSKY
markriedl.bsky.social Mark Riedl

On my way to give a talk at CNN’s NYC headquarters. Taking the opportunity to wear a niche AI humor t-shirt that probably only made sense in 2016

❤️ 34 Likes
BSKY
markriedl.bsky.social Mark Riedl

Huh?

❤️ 10 Likes
BSKY
markriedl.bsky.social Mark Riedl

Hey computer science faculty peeps! Are we prepared for the near future where every high school student and incoming college freshman has vibe-coded an AI agent as high school “research”? Exciting. And scary. We are going to need to update our priors.

❤️ 26 Likes | [Agent][Deployment]
BSKY
Thomas Dietterich

I'm late to the game -- I only recently discovered @techtrenches.dev Highly recommended reading!

❤️ 11 Likes
BSKY
natolambert.bsky.social Nathan Lambert

I spent some time trying to distill all the complex factors impacting open models -- economics, capabilities, distribution, policy, etc. -- into a clear list of beliefs. Here they are in full. www.interconnects.ai/p/my-bets-on...

❤️ 24 Likes | [LLM][Deployment]
BSKY
hardmaru.bsky.social hardmaru

We are hiring Software Engineers in Tokyo to help us scale Sakana AI’s R&D efforts. If you are interested in building the data pipelines and full stack infrastructure needed to push the boundaries of automated scientific discovery, we would love to hear from you. 🗼🎌 sakana.ai/careers/#sof...

❤️ 8 Likes | [Infra][Deployment]
BSKY
emollick.bsky.social Ethan Mollick

Instead of the gold standard, we can, as a thought experiment, imagine an inference standard of exchange, the FLOP. (As opposed to tokens, this accounts for AI ability.) With some AI help, I figure $1 buys roughly 10^17 managed-LLM inference FLOPs. So that $4 coffee would cost half an exaFLOP, choom.

❤️ 27 Likes | [LLM][Infra]
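The arithmetic behind the "FLOP standard" post can be checked with a short calculation. This is only a sketch: the exchange rate of 10^17 FLOPs per dollar is the figure quoted in the post, while the constant and function names are hypothetical and chosen for illustration.

```python
# Worked version of the post's back-of-the-envelope conversion.
# FLOPS_PER_DOLLAR is the estimate quoted in the post, not a measured value.
FLOPS_PER_DOLLAR = 1e17   # floating-point operations bought by $1 (quoted)
EXAFLOP = 1e18            # 10^18 floating-point operations

def price_in_flops(dollars, flops_per_dollar=FLOPS_PER_DOLLAR):
    """Convert a dollar price into a count of inference FLOPs."""
    return dollars * flops_per_dollar

coffee_flops = price_in_flops(4)            # the $4 coffee
coffee_exaflops = coffee_flops / EXAFLOP    # same price in exaFLOPs

print(f"{coffee_flops:.0e} FLOPs = {coffee_exaflops:.1f} exaFLOP")
```

Under the quoted rate, a $4 coffee comes to 4 x 10^17 FLOPs, or 0.4 exaFLOP, which the post rounds to "half an exaFLOP".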
BSKY
emollick.bsky.social Ethan Mollick

This is becoming a pattern in AI that makes talking about capabilities challenging. First, there are overstated claims (like the flubbed Erdos problems that were announced last year), then minor wins (AI helps with discovery) then breakthroughs. The first stage feels like (& often is) hype, but…

❤️ 63 Likes | [Evaluation]
BSKY
emilymbender.bsky.social Emily M. Bender

Last year, someone (specifically, OUP) asked me to write an encyclopedia entry for "AI". I've just finished reviewing the copy edits, so hopefully it will be in the world soon. Meanwhile, a teaser: >>

❤️ 72 Likes
BSKY
beenwrekt.bsky.social Ben Recht

The long legacy of simulation in control theory and what it can teach us about transferring policies from GPU to reality.

❤️ 11 Likes | [Deployment][Infra]
BSKY
axz.bsky.social Amy Zhang

Feeling FOMO that I can't be at #CHI2026 this year but please check out all the great work that our @socialfutureslab.bsky.social + friends are presenting (see below for paper links). And say hi to @kjfeng.me @aliciaguo.com, Katie Yurechko, and Tony Zhou who are at the conference!

❤️ 8 Likes
X
LLM Knowledge Bases. Something I'm finding very useful recently: using LLMs to build personal knowledge bases for various topics of research interest. In this way, a large fraction of my recent token throughput is going less into manipulating code, and more into manipulating...
[LLM][RAG][Tooling]
“DeepSeek Summary: Karpathy is shifting focus from coding to using LLMs to build and manage personal knowledge bases for research, indicating a move towards knowledge compounding and organization.
X
LLMs are emerging as a new kind of intelligence, simultaneously a lot smarter than I expected and a lot dumber than I expected.
[LLM][Evaluation]
“DeepSeek Summary: Karpathy expresses the dual-nature surprise of LLM capabilities, acknowledging both their advanced and surprisingly limited aspects.
X
Three days ago I left autoresearch tuning nanochat for ~2 days on depth=12 model.
[Fine-tuning][Agent][Tooling]
“DeepSeek Summary: Karpathy is experimenting with automated research and fine-tuning processes for a smaller model (nanochat), indicating hands-on work in model optimization.
X
Screenshot from a video game where a team of raccoons go on a heist
[Multi-modal]
“DeepSeek Summary: Simon Willison shares an AI-generated image prompt result showing creative multi-modal AI capabilities.
X
It's interesting how 'better at code' has become the defining goal of almost every AI lab over the...
[Agent][Evaluation]
“DeepSeek Summary: Willison observes the AI industry's intense focus on improving coding capabilities as a primary benchmark for progress.
X
soumithchintala Soumith Chintala
reading 'AI News' (previously Smol Talk) is probably the highest-leverage 45 mins
[LLM]
“DeepSeek Summary: Soumith Chintala recommends 'AI News' (formerly Smol Talk) as a highly valuable 45-minute investment for staying informed about AI developments.
X
soumithchintala Soumith Chintala
Open LLMs need to get organized and co-ordinated about sharing human feedback.
[LLM][Evaluation]
“DeepSeek Summary: Soumith Chintala calls for better organization and coordination among open LLM projects regarding the sharing of human feedback data.
X
soumithchintala Soumith Chintala
MacStudio you ask? Apple Engineering's actual time spent on PyTorch support
[Infra][Deployment]
“DeepSeek Summary: Soumith Chintala comments on Apple's engineering investment in PyTorch support, likely in the context of Mac Studio hardware.
X
soumithchintala Soumith Chintala
Sometimes we forget that NVIDIA wins because it's a software company.
[Infra][Tooling]
“DeepSeek Summary: Soumith Chintala points out that NVIDIA's success is fundamentally driven by its software capabilities, not just hardware.
X
I think it's clear that for many smaller companies that invested in deep learning, it turned out
[Evaluation][Deployment]
“DeepSeek Summary: Chollet suggests that deep learning investments haven't paid off for many smaller companies, implying practical limitations or misalignment with business needs.
X
David Ha
David Ha @hardmaru and team are super practical, scientifically research-driven geniuses. And this is amazing to see
[Evaluation]
“DeepSeek Summary: A third party praises David Ha (@hardmaru) and his team for being practical, scientifically research-driven geniuses.
X
minimaxir Max Woolf
Impressive model based on a few minutes of playing, but disappointing to see no mention at all of a model card, red teaming, yesterday's incident,
[Safety][Evaluation]
“DeepSeek Summary: Max Woolf critiques a new AI model for lacking proper documentation (model card) and safety testing (red teaming), while also referencing a recent incident.
X
minimaxir Max Woolf
me irl
“DeepSeek Summary: A personal, casual post expressing a relatable feeling or situation.
X
minimaxir Max Woolf
“DeepSeek Summary: A tweet with engagement (19 likes) but no visible text content in the provided snippet.
X
minimaxir Max Woolf
“DeepSeek Summary: A tweet with significant views (468) but no visible text content in the provided snippet.
X
lucidrains Phil Wang
Stand-up comedy's gain is game criticism's loss: comedian @PhilNWang speaks with clarity and insight about the five video games he would like to
“DeepSeek Summary: A post praising comedian Phil Wang (@PhilNWang) for speaking with clarity and insight about video games.
X
lucidrains Phil Wang
@PhilNWang (324 likes, 24 replies): I got to cover for the excellent @HadleyFreeman in the Guardian today so
“DeepSeek Summary: Phil Wang (@PhilNWang) mentions covering for Hadley Freeman in the Guardian.
X
srush_io Sasha Rush
Some personal news: I recently joined Cursor. Cursor is a small, ambitious team, and they've created...
[Tooling]
“DeepSeek Summary: Sasha Rush announces joining Cursor, describing it as a small, ambitious team.
X
If you were holding off to try @MSFTDeepSpeed ZeRO++ it looks like deepspeed@master should
[Infra][Deployment]
“DeepSeek Summary: Suggests that DeepSpeed's ZeRO++ feature is now available in the master branch and worth trying.
X
If you're trying out FA4, you're likely to run into not being able to load cutlass.cute
[Infra][Tooling]
“DeepSeek Summary: Warns about a potential issue when experimenting with FA4 (likely FlashAttention 4) related to loading cutlass.cute.
X
Thanks to an awesome contribution from @omarnomad The Machine Learning Engineering Open book now can
[Tooling]
“DeepSeek Summary: Acknowledges a contribution that enhanced the 'Machine Learning Engineering Open book'.
X
Classical Jensen math. Unidirectional bandwidth is topped at 450GB/s, and then there comes a protocol overhead of two digit percentage.
[Infra]
“DeepSeek Summary: Discusses a hardware bandwidth limit (450GB/s) and significant protocol overhead affecting performance.
X
sayakpaul Sayak Paul
Had a nice time chatting about the state of diffusion models and some text-to-image data shenanigans at
[Multi-modal]
“DeepSeek Summary: Sayak Paul discusses the current state of diffusion models and text-to-image data issues.
X
sayakpaul Sayak Paul
Details:
“DeepSeek Summary: A brief post by Sayak Paul sharing unspecified details.
X
sayakpaul Sayak Paul
Giving a talk here is by far the most fulfilling experience of my life!
“DeepSeek Summary: Sayak Paul expresses that giving a talk was an extremely fulfilling personal experience.
X
philschmid Philipp Schmid
How to use Gemma 4 with the Gemini API and Google AI Studio. www.philschmid.de.
[LLM][Tooling]
“DeepSeek Summary: A technical guide on integrating Gemma 4 with Google's Gemini API and AI Studio platform.
X
philschmid Philipp Schmid
Random thought. We are going to be so much faster at creating and building.
“DeepSeek Summary: An optimistic reflection on the accelerating pace of innovation and development capabilities.
X
Ethan Mollick
In discussions of AI and jobs, we put too much emphasis on the technology and not enough on...
[Deployment]
“DeepSeek Summary: Critiques the overemphasis on AI technology itself in job discussions, suggesting other factors are being overlooked.
X
Ethan Mollick
As stories about AI increasingly become stories of either catastrophe or salvation,...
[Safety]
“DeepSeek Summary: Observes that AI narratives are polarizing into extremes of doom or utopia, missing more balanced discussions.
X
Ethan Mollick
As someone involved in academic research on AI, it is notable to me that most of the key...
[Evaluation]
“DeepSeek Summary: Notes from an academic research perspective that key developments in AI are happening outside traditional academic institutions.
X
Ethan Mollick
So much work is going into faking continual learning and memory for AIs,...
[LLM][Tooling]
“DeepSeek Summary: Comments on the significant engineering effort being devoted to simulating continuous learning and memory capabilities in AI systems.
X
Emily M. Bender
For those playing along at home, here's a 'AI is sentient!' argument bingo card.
[LLM][Safety][Evaluation]
“DeepSeek Summary: Emily M. Bender shares a bingo card mocking common arguments about AI sentience, highlighting her critical stance on AI hype.
X
Emily M. Bender
This is infuriating and also was totally predictable. Thank you @daveyalba
[Safety][Evaluation][Deployment]
“DeepSeek Summary: Emily M. Bender expresses frustration over a predictable issue in AI, likely related to ethics or misuse, acknowledging another user's contribution.
X
Naomi Saphra
If you still don't know what you're doing Friday at #ICML2024, I'm going to suggest our HiLD
[LLM][Evaluation]
“DeepSeek Summary: Promotes a HiLD event on Friday at the ICML 2024 conference.
X
Naomi Saphra
I'll meet you at this button.
[Agent]
“DeepSeek Summary: Brief, possibly cryptic tweet that could reference a specific location or inside joke within the ML community.
X
Naomi Saphra
I work on understanding and improving training for NLP models, with a focus on studying how structures and mechanistic behaviors emerge over the
[LLM][Fine-tuning][Evaluation]
“DeepSeek Summary: Describes research focus on NLP model training dynamics and emergent mechanistic behaviors in language models.
X
Ben Recht
For the first time in almost a decade, I'm teaching a class on learning and control.
[Evaluation]
“DeepSeek Summary: Ben Recht is returning to teaching a course on learning and control after nearly ten years.
X
Ben Recht
Part one of a new blog series: using the discovery of vitamins as a parable for why replication crises in [science/ML] matter.
[Evaluation]
“DeepSeek Summary: Introduces a blog series using the history of vitamin discovery as an analogy to discuss the importance of replication in scientific and machine learning research.
X
Ben Recht
Revisiting Sutton's Bitter Lesson in the wake of GPT-5.
[LLM][Evaluation]
“DeepSeek Summary: Re-examines the 'Bitter Lesson'—the idea that general methods leveraging computation scale best—in the context of advanced models like GPT-5.
BLOG

What I expect to come next and why, focused on the open-closed gap.

The author predicts that by mid-2026, the gap between open and closed AI models will significantly narrow, with open models achieving performance parity in key areas. This shift is expected to be driven by advancements in training efficiency, data curation, and collaborative development within the open-source community.
-- END OF LOG --
[STATS] 55 items · Filter applied
Powered by Horizon + DeepSeek