Intelligence.Log

2026-05-10

Extracted: 42 items. Sources: GitHub, Bluesky, X.
++ AI OVERVIEW ++
The conversation today is sharply divided between pragmatic tooling and critical reflection. On the infrastructure side, the Rust graph library `petgraph` gained traction, while Thomas Dietterich highlighted the growing push to layer symbolic systems on top of LLMs to address core weaknesses like probabilistic execution and attribution. Meanwhile, Emily M. Bender voiced strong skepticism, dismissing "recent advances in AI/LLMs" as a turn-off in academic writing and lamenting the labor required to fact-check synthetic text. Mark Riedl added a lighter note by pointing to a new, free AI literacy course from the US Department of Labor, though its reception is tempered by broader industry fatigue. Finally, Naomi Saphra’s quip about *The Sheep Detectives* (2026) and its inaccurate portrayal of animal cognition serves as a playful reminder that even in entertainment, representation and accuracy matter.
◆ Signal

Co-Starred · Last 7 days

Repos independently starred by multiple AI leaders in the week ending 2026-05-10. Stronger signal = more overlap.

antirez/ds4
×2 starrers7/102.7k

DeepSeek 4 Flash local inference engine for Metal

|
[Deployment][LLM]
|2026-05-072026-05-08
grep TOPIC=
grep SOURCE=
sort --by=
GH
petgraph/petgraph3.9k4/10

Graph data structure library for Rust.

Starred byminimaxir|[Infra]
Petgraph is a comprehensive graph data structure library for Rust, offering a variety of graph types (undirected, directed, with or without node/edge weights) and classic graph algorithms (DFS, BFS, shortest paths, minimum spanning trees, etc.). It is widely used in the Rust ecosystem for modeling and solving graph problems efficiently.
BSKY
markriedl.bsky.socialMark Riedl

The US Department of Labor has put out a new, free AI literacy course. Princeton CITP analyzed it blog.citp.princeton.edu/2026/05/05/m...

❤️ 8 Likes|[Safety]
BSKY
markriedl.bsky.socialMark Riedl

alife imitates art

❤️ 11 Likes|
BSKY
t
Thomas Dietterich

This points to an important direction: layering symbolic systems on top of LLMs. These can overcome the main shortcomings of LLM architectures: probabilistic execution, continual learning, attribution, and (maybe) uncertainty quantification. 1/

❤️ 23 Likes|[LLM][Agent]
BSKY
emilymbender.bsky.socialEmily M. Bender

I guess I'm glad this is out there, but also I am infuriated that people have to spend their time doing this. OF COURSE synthetic text extruding machines are going to output something that *looks like* but is not an analysis. >>

❤️ 202 Likes|[Evaluation][LLM]
BSKY
emilymbender.bsky.socialEmily M. Bender

I can't think of anything that makes me want to read a paper less than encountering "Recent advances in AI/LLMs" in the abstract/intro. You can step off that bandwagon. Life is better over here!

❤️ 80 Likes|[Evaluation]
BSKY
emilymbender.bsky.socialEmily M. Bender

Tomorrow!

❤️ 27 Likes|
BSKY
nsaphra.bsky.socialNaomi Saphra

I had intended to see The Sheep Detectives (2026) (Rated PG) until Jill Lepore panned its inaccurate portrayal of animal cognition.

❤️ 27 Likes|
X
My most amusing interaction was where the model (I think I was given some earlier version with a ...
[LLM]
“DeepSeek Summary: Karpathy shares an amusing interaction with an AI model, likely referring to a chatbot or language model.
X
I'm starting to get into a habit of reading everything (blogs, articles, book chapters, ...)
[LLM]
“DeepSeek Summary: Karpathy mentions developing a habit of extensive reading across various sources.
X
Judging by my tl there is a growing gap in understanding of AI capability. The first issue I think is around ...
[Safety]
“DeepSeek Summary: Karpathy observes a growing gap in understanding AI capabilities, pointing to a key issue.
X
A short note that the predictions that LLMs would favor 'boring technology' that's once you attach them to a good coding agent harness at least
[Agent][LLM]
“DeepSeek Summary: LLMs may favor boring technology when attached to a good coding agent harness.
X
I'm beginning to suspect that a key skill in working effectively with coding agents is developing an intuition for when you don't need to
[Agent][Tooling]
“DeepSeek Summary: Key skill with coding agents: intuition for when not to intervene.
X
Vibe coding is irresponsibly building software through dice rolls, not caring what code is produced
[Agent][Safety]
“DeepSeek Summary: Vibe coding defined as irresponsible software development.
X
hwchase17Harrison Chase
TL;DR: More and more agents need a workspace: a computer where they can run code, install packages, and access files. Sandboxes provide this
[Agent][Infra]
“DeepSeek Summary: Agents require a sandboxed workspace to execute code and access files, highlighting the need for secure execution environments.
X
hwchase17Harrison Chase
Your harness, your memory ... The “best” way to build agentic systems has changed dramatically over the past three years. When ChatGPT came out,
[Agent][LLM]
“DeepSeek Summary: The approach to building agentic systems has evolved significantly since ChatGPT's release, emphasizing memory and harness.
X
hwchase17Harrison Chase
We launched LangSmith Agent Builder this week as a no-code way to build agents. A key part of Agent builder is it's memory system. In this
[Agent][Tooling]
“DeepSeek Summary: LangSmith Agent Builder enables no-code agent creation with a focus on memory systems.
X
hwchase17Harrison Chase
When building agents, you need to iterate on production data much more than when building traditional software. You need to iterate on how
[Agent][Evaluation]
“DeepSeek Summary: Agent development requires more iteration on production data compared to traditional software.
X
DrJimFanJim Fan
In this context, I define world modeling as predicting the next plausible world state (or a longer duration of states) conditioned on an action.
[Agent][Multi-modal]
“DeepSeek Summary: Jim Fan defines world modeling as predicting future world states conditioned on actions.
X
jeremyphowardJeremy Howard
Here's a complete unedited video of asking Grok for its views on the Israel/Palestine situation. It first searches twitter for what Elon thinks.
[Safety][Evaluation][LLM]
“DeepSeek Summary: Jeremy Howard demonstrates that Grok prioritizes Elon Musk's opinions when asked about Israel/Palestine.
X
jeremyphowardJeremy Howard
Jeremy Howard (@jeremyphoward). 189 replies. I replicated this result, that Grok focuses nearly entirely on finding out what Elon thinks in
[Safety][Evaluation][LLM]
“DeepSeek Summary: Jeremy Howard replicates a finding that Grok's responses center on Elon Musk's perspective.
X
soumithchintalaSoumith Chintala
we've been working on democratizing fast kernel writing on the @PyTorch team. try
[Infra][Tooling]
“DeepSeek Summary: Soumith Chintala announces efforts to democratize fast kernel writing within the PyTorch team.
X
There's a big difference between solving a problem from first principles vs applying a solution
[Evaluation]
“DeepSeek Summary: Distinguishes between fundamental problem-solving and rote application.
X
d
Fei-Fei Li
Very excited to share @theworldlabs 's latest research work RTFM!! It's a real-time, ...
[Multi-modal]
“DeepSeek Summary: Fei-Fei Li shares excitement about World Labs' real-time research work RTFM.
X
minimaxirMax Woolf
LOL
“DeepSeek Summary: Max Woolf posted a simple 'LOL' tweet.
X
minimaxirMax Woolf
congrats to OpenAI on winning the Turing Test
[LLM][Evaluation]
“DeepSeek Summary: Max Woolf sarcastically congratulates OpenAI for supposedly winning the Turing Test.
X
I have been compiling LLM/VLM training logbooks/chronicles. This is the one of the best sources to
[LLM][Tooling]
“DeepSeek Summary: Compiling LLM/VLM training logbooks as a key resource.
X
Thanks to an awesome contribution from @omarnomad The Machine Learning Engineering Open book now can
[Tooling]
“DeepSeek Summary: Machine Learning Engineering Open Book updated with community contribution.
X
This is a long overdue section of the ML Engineering Understanding Training Loss Patterns
[Fine-tuning]
“DeepSeek Summary: New section on understanding training loss patterns in ML Engineering.
X
Modern art. Artist: PyTorch memory profiler Model: Llama-8B The piece on the left is the
[LLM][Infra][Tooling]
“DeepSeek Summary: Humorous take on PyTorch memory profiler output as modern art.
X
philschmidPhilipp Schmid
I read three technical reports from Moonshot AI's Kimi K2.5 paper, Cursor's Composer 2 report and blog post, and Chroma's Context-1 write-up
[LLM][Tooling][RAG]
“DeepSeek Summary: Philipp Schmid shares his reading of three recent technical reports from Moonshot AI, Cursor, and Chroma.
X
philschmidPhilipp Schmid
Random thought. We are going to be so much faster at creating and building.
[Agent][Deployment]
“DeepSeek Summary: Philipp Schmid expresses optimism about increased speed in creation and building due to AI.
X
philschmidPhilipp Schmid
Skills have become one of the most used extension points in agents. They're flexible, easy to make, and simple to distribute.
[Agent][Tooling]
“DeepSeek Summary: Philipp Schmid highlights the importance of skills as extension points in AI agents.
X
e
Ethan Mollick
On the plus side with Opus 4.7, if it does decide to think it produces BY FAR the best
[LLM]
“DeepSeek Summary: Opus 4.7 produces the best results when it decides to think.
X
e
Ethan Mollick
We found that telling the AI "you are a great physicist" doesn't make it significantly more accurate at answering physics questions, nor does "
[Evaluation][LLM]
“DeepSeek Summary: Role prompting does not significantly improve AI accuracy on physics questions.
X
e
Ethan Mollick
Amazing to see the two worst forms of AI posting in a QT. The original post misinterprets a
[Safety]
“DeepSeek Summary: Criticizes two poor forms of AI posting and misinterpretation in a quote tweet.
X
N
Naomi Saphra
New preprint! Everyone loves causal interp. It's coherently defined! It makes testable predictions
[LLM][Evaluation]
“DeepSeek Summary: Announces a new preprint on causal interpretability, emphasizing its coherent definition and testable predictions.
X
b
Ben Recht
For the first time in almost a decade, I'm teaching a class on learning and control.
[Evaluation]
“DeepSeek Summary: Ben Recht announces teaching a class on learning and control after nearly a decade.
X
b
Ben Recht
Building a theory of the architecture of organizing machines and people.
[Agent]
“DeepSeek Summary: He is working on a theory for organizing machines and people.
X
b
Ben Recht
With more equations than usual, I explain how policy gradient gives you a framework to randomly search for
[Evaluation]
“DeepSeek Summary: He explains policy gradient as a framework for random search, with equations.
X
b
Ben Recht
On unquantifiable costs and inherent tradeoffs in decision theory.
[Safety]
“DeepSeek Summary: He discusses unquantifiable costs and tradeoffs in decision theory.
-- END OF LOG --
[STATS] 42 items · Filter applied
Powered by Horizon + DeepSeek