Intelligence.Log

2026-05-10

Extracted: 42 items. Sources: GitHub, Bluesky, X.

++ AI OVERVIEW ++

The conversation today is sharply divided between pragmatic tooling and critical reflection. On the infrastructure side, the Rust graph library `petgraph` gained traction, while Thomas Dietterich highlighted the growing push to layer symbolic systems on top of LLMs to address core weaknesses like probabilistic execution and attribution. Meanwhile, Emily M. Bender voiced strong skepticism, dismissing "recent advances in AI/LLMs" as a turn-off in academic writing and lamenting the labor required to fact-check synthetic text. Mark Riedl added a lighter note by pointing to a new, free AI literacy course from the US Department of Labor, though its reception is tempered by broader industry fatigue. Finally, Naomi Saphra’s quip about *The Sheep Detectives* (2026) and its inaccurate portrayal of animal cognition serves as a playful reminder that even in entertainment, representation and accuracy matter.

◆ Signal

Co-Starred · Last 7 days

Repos independently starred by multiple AI leaders in the week ending 2026-05-10. Stronger signal = more overlap.

antirez/ds4

×2 starrers▲ 7/10★ 2.7k

DeepSeek 4 Flash local inference engine for Metal

by:lucidrains simonw

[Deployment][LLM]

|2026-05-07 → 2026-05-08

grep TOPIC=

grep SOURCE=

sort --by=

petgraph/petgraph★ 3.9k▲ 4/10

Graph data structure library for Rust.

Starred byminimaxir|[Infra]

“Petgraph is a comprehensive graph data structure library for Rust, offering a variety of graph types (undirected, directed, with or without node/edge weights) and classic graph algorithms (DFS, BFS, shortest paths, minimum spanning trees, etc.). It is widely used in the Rust ecosystem for modeling and solving graph problems efficiently.”

BSKY

Mark RiedlMay 10, 11:46 PM

The US Department of Labor has put out a new, free AI literacy course. Princeton CITP analyzed it blog.citp.princeton.edu/2026/05/05/m...

❤️ 8 Likes|[Safety]

BSKY

Mark RiedlMay 10, 04:44 PM

alife imitates art

❤️ 11 Likes|

BSKY

Thomas DietterichMay 10, 06:08 PM

This points to an important direction: layering symbolic systems on top of LLMs. These can overcome the main shortcomings of LLM architectures: probabilistic execution, continual learning, attribution, and (maybe) uncertainty quantification. 1/

❤️ 23 Likes|[LLM][Agent]

BSKY

Emily M. BenderMay 10, 09:12 PM

I guess I'm glad this is out there, but also I am infuriated that people have to spend their time doing this. OF COURSE synthetic text extruding machines are going to output something that *looks like* but is not an analysis. >>

❤️ 202 Likes|[Evaluation][LLM]

BSKY

Emily M. BenderMay 10, 08:56 PM

I can't think of anything that makes me want to read a paper less than encountering "Recent advances in AI/LLMs" in the abstract/intro. You can step off that bandwagon. Life is better over here!

❤️ 80 Likes|[Evaluation]

BSKY

Emily M. BenderMay 10, 01:03 PM

Tomorrow!

❤️ 27 Likes|

BSKY

Naomi SaphraMay 10, 04:17 PM

I had intended to see The Sheep Detectives (2026) (Rated PG) until Jill Lepore panned its inaccurate portrayal of animal cognition.

❤️ 27 Likes|

Andrej Karpathy@karpathy

My most amusing interaction was where the model (I think I was given some earlier version with a ...

[LLM]

“DeepSeek Summary: Karpathy shares an amusing interaction with an AI model, likely referring to a chatbot or language model.”

Andrej Karpathy@karpathy

I'm starting to get into a habit of reading everything (blogs, articles, book chapters, ...)

[LLM]

“DeepSeek Summary: Karpathy mentions developing a habit of extensive reading across various sources.”

Andrej Karpathy@karpathy

Judging by my tl there is a growing gap in understanding of AI capability. The first issue I think is around ...

[Safety]

“DeepSeek Summary: Karpathy observes a growing gap in understanding AI capabilities, pointing to a key issue.”

Simon Willison@simonw

A short note that the predictions that LLMs would favor 'boring technology' that's once you attach them to a good coding agent harness at least

[Agent][LLM]

“DeepSeek Summary: LLMs may favor boring technology when attached to a good coding agent harness.”

Simon Willison@simonw

I'm beginning to suspect that a key skill in working effectively with coding agents is developing an intuition for when you don't need to

[Agent][Tooling]

“DeepSeek Summary: Key skill with coding agents: intuition for when not to intervene.”

Simon Willison@simonw

Vibe coding is irresponsibly building software through dice rolls, not caring what code is produced

[Agent][Safety]

“DeepSeek Summary: Vibe coding defined as irresponsible software development.”

Harrison Chase@hwchase17

TL;DR: More and more agents need a workspace: a computer where they can run code, install packages, and access files. Sandboxes provide this

[Agent][Infra]

“DeepSeek Summary: Agents require a sandboxed workspace to execute code and access files, highlighting the need for secure execution environments.”

Harrison Chase@hwchase17

Your harness, your memory ... The “best” way to build agentic systems has changed dramatically over the past three years. When ChatGPT came out,

[Agent][LLM]

“DeepSeek Summary: The approach to building agentic systems has evolved significantly since ChatGPT's release, emphasizing memory and harness.”

Harrison Chase@hwchase17

We launched LangSmith Agent Builder this week as a no-code way to build agents. A key part of Agent builder is it's memory system. In this

[Agent][Tooling]

“DeepSeek Summary: LangSmith Agent Builder enables no-code agent creation with a focus on memory systems.”

Harrison Chase@hwchase17

When building agents, you need to iterate on production data much more than when building traditional software. You need to iterate on how

[Agent][Evaluation]

“DeepSeek Summary: Agent development requires more iteration on production data compared to traditional software.”

Jim Fan@DrJimFan

In this context, I define world modeling as predicting the next plausible world state (or a longer duration of states) conditioned on an action.

[Agent][Multi-modal]

“DeepSeek Summary: Jim Fan defines world modeling as predicting future world states conditioned on actions.”

Jeremy Howard@jeremyphoward

Here's a complete unedited video of asking Grok for its views on the Israel/Palestine situation. It first searches twitter for what Elon thinks.

[Safety][Evaluation][LLM]

“DeepSeek Summary: Jeremy Howard demonstrates that Grok prioritizes Elon Musk's opinions when asked about Israel/Palestine.”

Jeremy Howard@jeremyphoward

Jeremy Howard (@jeremyphoward). 189 replies. I replicated this result, that Grok focuses nearly entirely on finding out what Elon thinks in

[Safety][Evaluation][LLM]

“DeepSeek Summary: Jeremy Howard replicates a finding that Grok's responses center on Elon Musk's perspective.”

Soumith Chintala@soumithchintala

we've been working on democratizing fast kernel writing on the @PyTorch team. try

[Infra][Tooling]

“DeepSeek Summary: Soumith Chintala announces efforts to democratize fast kernel writing within the PyTorch team.”

Francois Chollet@fchollet

There's a big difference between solving a problem from first principles vs applying a solution

[Evaluation]

“DeepSeek Summary: Distinguishes between fundamental problem-solving and rote application.”

Francois Chollet@fchollet

The 3rd edition of my book Deep Learning with Python is being printed right now, and will be in bookstores within 2 weeks. You can order it now from Amazon

[Deployment]

“DeepSeek Summary: Announces the upcoming release of the 3rd edition of his book.”

Fei-Fei Li@drfeifei

Very excited to share @theworldlabs 's latest research work RTFM!! It's a real-time, ...

[Multi-modal]

“DeepSeek Summary: Fei-Fei Li shares excitement about World Labs' real-time research work RTFM.”

Max Woolf@minimaxir

LOL

“DeepSeek Summary: Max Woolf posted a simple 'LOL' tweet.”

Max Woolf@minimaxir

congrats to OpenAI on winning the Turing Test

[LLM][Evaluation]

“DeepSeek Summary: Max Woolf sarcastically congratulates OpenAI for supposedly winning the Turing Test.”

Stas Bekman@stas00

I have been compiling LLM/VLM training logbooks/chronicles. This is the one of the best sources to

[LLM][Tooling]

“DeepSeek Summary: Compiling LLM/VLM training logbooks as a key resource.”

Stas Bekman@stas00

Thanks to an awesome contribution from @omarnomad The Machine Learning Engineering Open book now can

[Tooling]

“DeepSeek Summary: Machine Learning Engineering Open Book updated with community contribution.”

Stas Bekman@stas00

This is a long overdue section of the ML Engineering Understanding Training Loss Patterns

[Fine-tuning]

“DeepSeek Summary: New section on understanding training loss patterns in ML Engineering.”

Stas Bekman@stas00

Modern art. Artist: PyTorch memory profiler Model: Llama-8B The piece on the left is the

[LLM][Infra][Tooling]

“DeepSeek Summary: Humorous take on PyTorch memory profiler output as modern art.”

Philipp Schmid@philschmid

I read three technical reports from Moonshot AI's Kimi K2.5 paper, Cursor's Composer 2 report and blog post, and Chroma's Context-1 write-up

[LLM][Tooling][RAG]

“DeepSeek Summary: Philipp Schmid shares his reading of three recent technical reports from Moonshot AI, Cursor, and Chroma.”

Philipp Schmid@philschmid

Random thought. We are going to be so much faster at creating and building.

[Agent][Deployment]

“DeepSeek Summary: Philipp Schmid expresses optimism about increased speed in creation and building due to AI.”

Philipp Schmid@philschmid

Skills have become one of the most used extension points in agents. They're flexible, easy to make, and simple to distribute.

[Agent][Tooling]

“DeepSeek Summary: Philipp Schmid highlights the importance of skills as extension points in AI agents.”

Ethan Mollick@emollick

On the plus side with Opus 4.7, if it does decide to think it produces BY FAR the best

[LLM]

“DeepSeek Summary: Opus 4.7 produces the best results when it decides to think.”

Ethan Mollick@emollick

We found that telling the AI "you are a great physicist" doesn't make it significantly more accurate at answering physics questions, nor does "

[Evaluation][LLM]

“DeepSeek Summary: Role prompting does not significantly improve AI accuracy on physics questions.”

Ethan Mollick@emollick

Amazing to see the two worst forms of AI posting in a QT. The original post misinterprets a

[Safety]

“DeepSeek Summary: Criticizes two poor forms of AI posting and misinterpretation in a quote tweet.”

Naomi Saphra@NaomiSaphra

New preprint! Everyone loves causal interp. It's coherently defined! It makes testable predictions

[LLM][Evaluation]

“DeepSeek Summary: Announces a new preprint on causal interpretability, emphasizing its coherent definition and testable predictions.”

Ben Recht@beenwrekt

For the first time in almost a decade, I'm teaching a class on learning and control.

[Evaluation]

“DeepSeek Summary: Ben Recht announces teaching a class on learning and control after nearly a decade.”

Ben Recht@beenwrekt

Building a theory of the architecture of organizing machines and people.

[Agent]

“DeepSeek Summary: He is working on a theory for organizing machines and people.”

Ben Recht@beenwrekt

With more equations than usual, I explain how policy gradient gives you a framework to randomly search for

[Evaluation]

“DeepSeek Summary: He explains policy gradient as a framework for random search, with equations.”

Ben Recht@beenwrekt

On unquantifiable costs and inherent tradeoffs in decision theory.

[Safety]

“DeepSeek Summary: He discusses unquantifiable costs and tradeoffs in decision theory.”

-- END OF LOG --

[STATS] 42 items · Filter applied