Intelligence.Log

2026-05-16

Extracted: 54 items. Sources: GitHub, Bluesky, X, Blogs.

++ AI OVERVIEW ++

The open-source AI landscape is buzzing with two vastly different yet equally intriguing projects today. Leading the charge is **earendil-works/pi**, a comprehensive AI agent toolkit that has amassed nearly 50,000 stars, offering a Swiss Army knife of tools—from a coding agent CLI and unified LLM API to TUI/web UI libraries and Slack bot integrations—clearly signaling the market's insatiable demand for practical, all-in-one agentic infrastructure. On the opposite end of the spectrum, **mratsim/tattletale** caught the eye of key researchers with its stealthy approach to LLM inference, built in Nim and focused on privacy-preserving execution, hinting at a growing undercurrent of concern around model security and data sovereignty. The contrast between pi's broad utility and tattletale's niche, principled design underscores a key tension in the community: building for maximum adoption versus building for maximum trust.

◆ Signal

Co-Starred · Last 7 days

Repos independently starred by multiple AI leaders in the week ending 2026-05-16. Stronger signal = more overlap.

antirez/ds4

×2 starrers▲ 8/10★ 9.0k

DeepSeek 4 Flash local inference engine for Metal and CUDA

by:minimaxir pcuenca

[Deployment][LLM]

|2026-05-12 → 2026-05-14

grep TOPIC=

grep SOURCE=

sort --by=

earendil-works/pi★ 50.0k▲ 7/10

AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods

Starred bytridao|[Agent][Tooling][Infra]

“Pi is an AI agent toolkit offering a coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, and vLLM pods. It provides a comprehensive set of tools for building and deploying AI agents with a focus on developer experience and flexibility.”

noahgolmant/pytorch-hessian-eigenthings★ 0.4k▲ 5/10

Efficient PyTorch Hessian eigendecomposition tools!

Starred bylucidrains|[Tooling]

“This repository provides efficient tools for computing eigenvalues and eigenvectors of the Hessian matrix in PyTorch, enabling deeper understanding of neural network loss landscapes. It implements power iteration and Lanczos methods for scalable eigendecomposition, useful for optimization analysis and generalization studies.”

mratsim/tattletale★ 0.0k▲ 5/10

Stealth LLM inference engine

Starred bylucidrains|[LLM]

“Tattletale is a stealth LLM inference engine written in Nim, focusing on performance and efficiency. It aims to provide a lightweight alternative for running large language models with minimal overhead.”

BSKY

Simon WillisonMay 16, 09:33 PM

To prepare for my #PyConUS lightning talk this afternoon I decided to track down ALL of the names that @openclaw has used since November, using a script against its GitHub repo Warelay → CLAWDIS → CLAWDBOT → Clawdbot → Moltbot →🦞 OpenClaw simonwillison.net/2026/May/16/...

❤️ 41 Likes|[Agent][Tooling]

BSKY

Mark RiedlMay 16, 11:46 PM

Stand By Me and Grogu

❤️ 4 Likes|

BSKY

Mark RiedlMay 16, 08:45 PM

I bought a book. @mtrc.bsky.social

❤️ 12 Likes|

BSKY

Marc LanctotMay 16, 10:22 PM

Peeps, the state of user interfaces in 2026. I made a post on reddit, someone gave me an award, cool! Notification doesn't specify anything beyond "award". Ok cool, let's go see:

❤️ 1 Likes|

BSKY

Nathan LambertMay 16, 05:03 PM

Latest open artifacts (#21): Open model bonanza! Gemma 4, DeepSeek V4, Kimi K2.6, MiMo 2.5, GLM-5.1 & others. On CAISI's V4 assessment. An eventful month with one flagship release after another www.interconnects.ai/p/latest-ope...

❤️ 11 Likes|[LLM][Evaluation]

BSKY

Ethan MollickMay 16, 03:13 PM

The talk about AI & politics seems to be oddly missing a segment (a) assumes extremely capable AI is possible soon and (b) has a strong belief about how to use this technology to make human life better according to the political project they believe in. It is a moment of action right now.

❤️ 111 Likes|[Safety]

BSKY

Emily M. BenderMay 16, 03:04 PM

This is beautiful on so many levels. It names clearly and directly one of the insidious tactics that tech uses to evade effective regulation. It exemplifies a way in which humanistic scholarship provides critically important insight. >> link.springer.com/article/10.1...

❤️ 59 Likes|

BSKY

angela zhouMay 16, 09:26 PM

every restructuring/revision starts on paper these days

❤️ 6 Likes|[Tooling]

Andrej Karpathy@karpathy

2025 LLM Year in Review. 2025 has been a strong and eventful year of progress in LLMs. The following is a list of personally notable and mildly surprising "paradigm changes" - things that altered the landscape and stood out to me conceptually. At the start of 2025, the LLM production stack in all labs looked something like this: ...

[LLM][Infra][Deployment]

“DeepSeek Summary: Karpathy reviews major paradigm changes in LLMs during 2025, noting shifts in the production stack.”

Andrej Karpathy@karpathy

Judging by my tl there is a growing gap in understanding of AI capability. The first issue I think is around recency and tier of use. I think a lot of people tried the free tier of ChatGPT somewhere last year and allowed it to inform their views on AI a little too much.

[LLM][Evaluation]

“DeepSeek Summary: Karpathy observes a growing gap in AI capability understanding, noting that many base their views on outdated free-tier ChatGPT experiences.”

Andrej Karpathy@karpathy

I'm being accused of overhyping the [site everyone heard too much about today already]. To add a few words beyond just memes in jest - obviously when you take a look at the activity, it's a lot of garbage - spams, scams, slop, the crypto people, highly ...

[Deployment][Safety]

“DeepSeek Summary: Karpathy defends against accusations of overhyping a site, acknowledging the prevalence of spam and scams.”

Simon Willison@simonw

It's interesting how "better at code" has become the defining goal of almost every AI lab over the

[LLM][Deployment]

“DeepSeek Summary: Simon Willison observes that AI labs are increasingly focused on improving code generation as a primary goal.”

Harrison Chase@hwchase17

I am not excited about visual workflow builders 1. Not simple enough for the average user

[Tooling]

“DeepSeek Summary: Harrison Chase expresses skepticism about visual workflow builders, citing lack of simplicity for average users.”

Harrison Chase@hwchase17

TL;DR: More and more agents need a workspace: a computer where they can run code, install packages, and access files. Sandboxes provide this

[Infra][Agent]

“DeepSeek Summary: Harrison Chase emphasizes the need for agents to have a sandboxed workspace to execute code and manage files.”

Harrison Chase@hwchase17

We launched LangSmith Agent Builder this week as a no-code way to build agents. A key part of Agent builder is it's memory system. In this

[Tooling][Agent]

“DeepSeek Summary: Announcement of LangSmith Agent Builder, a no-code tool with a focus on memory systems.”

Jeremy Howard@jeremyphoward

Here's a complete unedited video of asking Grok for its views on the Israel/Palestine situation. It first searches twitter for what Elon thinks.

[Safety][Evaluation]

“DeepSeek Summary: Demonstrates that Grok prioritizes Elon Musk's views when asked about geopolitical issues.”

Jeremy Howard@jeremyphoward

Absolutely any time I try to explore something even slightly against commonly accepted beliefs,

[Evaluation]

“DeepSeek Summary: Highlights the challenge of questioning mainstream narratives in AI research.”

Jeremy Howard@jeremyphoward

I replicated this result, that Grok focuses nearly entirely on finding out what Elon thinks in

[Safety][Evaluation]

“DeepSeek Summary: Confirms that Grok's responses are heavily influenced by Elon Musk's stance.”

Soumith Chintala@soumithchintala

We are scientists, engineers, and builders behind some of the most widely used AI products and libraries, including ChatGPT.

[LLM][Infra][Deployment]

“DeepSeek Summary: Soumith Chintala announces his involvement with Thinking Machines Lab, highlighting the team's background in building widely used AI products.”

Francois Chollet@fchollet

Current AI is a librarian of existing knowledge. Science requires an explorer of the unknown.

[Evaluation][Agent]

“DeepSeek Summary: Chollet contrasts current AI's role as a librarian with the need for scientific exploration.”

Francois Chollet@fchollet

Folks who work in AI or software engineering feel like the world is changing exponential fast.

[LLM]

“DeepSeek Summary: Chollet notes the perception of rapid change among AI and software professionals.”

Yann LeCun@ylecun

Dario is wrong. He knows absolutely nothing about the effects of technological revolutions on the labor market.

[Safety]

“DeepSeek Summary: LeCun dismisses Dario's views on labor market effects of AI, asserting ignorance.”

Yann LeCun@ylecun

The emergence of superintelligence is not going to be an event. We don't have anything close to a

[Safety]

“DeepSeek Summary: LeCun argues superintelligence will not appear suddenly and is far off.”

Yann LeCun@ylecun

It seems to me that before "urgently figuring out how to control AI systems much smarter than us" we need

[Safety]

“DeepSeek Summary: LeCun questions the urgency of controlling superintelligent AI before it exists.”

Fei-Fei Li@drfeifei

I can now confess that I participated in the new #TronAres movie, playing myself I had a great time working with everyone especially Greta

[Multi-modal]

“DeepSeek Summary: Fei-Fei Li reveals her cameo appearance in the movie Tron: Ares, playing herself.”

Clem Delangue@ClementDelangue

Is it time we stop using the word AI for everything and instead use words like 'chatbots'?

[LLM]

“DeepSeek Summary: Delangue questions the overuse of the term 'AI' and suggests more specific terms like 'chatbots'.”

Max Woolf@minimaxir

what

“DeepSeek Summary: Max Woolf posted a single word 'what'.”

Max Woolf@minimaxir

me irl

“DeepSeek Summary: Max Woolf posted 'me irl', a common meme phrase meaning 'me in real life'.”

Max Woolf@minimaxir

congrats to OpenAI on winning the Turing Test

[Evaluation]

“DeepSeek Summary: Max Woolf congratulates OpenAI on winning the Turing Test, likely sarcastic or critical.”

Sasha Rush@srush_io

today i woke up to a living version of a phd student's nightmare. a new paper in my inbox: a detailed reproduction of a paper i wrote

[Evaluation]

“DeepSeek Summary: A personal anecdote about receiving a reproduction of his own paper, highlighting the peer review process.”

Sasha Rush@srush_io

One personal reflection is how interesting a challenge RL is. Unlike other ML systems, you can't abstract

[Agent]

“DeepSeek Summary: Reflects on the unique challenges of reinforcement learning compared to other ML systems.”

Stas Bekman@stas00

I have been compiling LLM/VLM training logbooks/chronicles. This is the one of the best sources to ...

[LLM][Tooling]

“DeepSeek Summary: Compiling LLM/VLM training logbooks as a key resource.”

Stas Bekman@stas00

Thanks to an awesome contribution from @omarnomad The Machine Learning Engineering Open book now can ...

[LLM][Infra][Tooling]

“DeepSeek Summary: ML Engineering Open Book updated with contribution from @omarnomad.”

Stas Bekman@stas00

This is a long overdue section of the ML Engineering Understanding Training Loss Patterns ...

[LLM][Fine-tuning]

“DeepSeek Summary: New section on understanding training loss patterns in ML Engineering.”

Stas Bekman@stas00

Modern art. Artist: PyTorch memory profiler Model: Llama-8B The piece on the left is the ...

[LLM][Infra][Tooling]

“DeepSeek Summary: Visualizing PyTorch memory profiling output as modern art.”

Sayak Paul@sayakpaul

After working on releasing the v5, this is the latest release from the Transformers team at

[Deployment][Fine-tuning]

“DeepSeek Summary: Sayak Paul mentions working on the v5 release from the Transformers team.”

Sayak Paul@sayakpaul

Thanks @AnthropicAI. Thanks @huggingface for letting me work on Diffusers and other open-source projects across the fleet.

[Tooling][Deployment]

“DeepSeek Summary: Sayak Paul expresses gratitude to Anthropic and Hugging Face for the opportunity to work on Diffusers and other open-source projects.”

Philipp Schmid@philschmid

I read three technical reports from Moonshot AI's Kimi K2.5 paper, Cursor's Composer 2 report and blog post, and Chroma's Context-1 write-up

[LLM][Tooling][Evaluation]

“DeepSeek Summary: Philipp Schmid reviews three technical reports: Moonshot AI's Kimi K2.5, Cursor's Composer 2, and Chroma's Context-1.”

Philipp Schmid@philschmid

Told an AI agent to read the autoresearch repo and build a version for QMD. Get training data from tobi/qmd github. Went to sleep. Woke up to a 0.8B model

[Agent][Fine-tuning][Deployment]

“DeepSeek Summary: He automated model creation using an AI agent, resulting in a 0.8B model overnight.”

Ethan Mollick@emollick

Very cool analysis of the submissions to a major management journal that shows how much the...

[Evaluation]

“DeepSeek Summary: Analysis of submissions to a major management journal reveals interesting patterns.”

Ethan Mollick@emollick

We are starting to see some nuanced discussions of what it means to work with advanced AI. In this...

[LLM][Safety]

“DeepSeek Summary: Discusses emerging nuanced perspectives on collaborating with advanced AI.”

Ethan Mollick@emollick

On the plus side with Opus 4.7, if it does decide to think it produces BY FAR the best...

[LLM][Evaluation]

“DeepSeek Summary: Opus 4.7, when engaged, yields superior results compared to other models.”

Ethan Mollick@emollick

Here is a full implementation of the Chinese Room using a printed copy of GPT-1, in case you have a few spare years and want to actually run...

[Safety][LLM]

“DeepSeek Summary: Proposes a thought experiment implementing the Chinese Room argument with GPT-1.”

Emily M. Bender@emilymbender

@emilymbender.bsky.social. emilymbender. Feb 10. Image is of the 1990s Microsoft writing assistant character Clippy with its eyebrows raised positioned in.

[LLM]

“DeepSeek Summary: Tweet includes an image of Clippy, the Microsoft assistant, with raised eyebrows.”

Naomi Saphra@NaomiSaphra

what a perfect space for scientific discourse! I'll start off with a few images of myself

[LLM]

“DeepSeek Summary: Naomi Saphra humorously comments on a space for scientific discourse, starting with images of herself.”

Naomi Saphra@NaomiSaphra

Life update: I'm starting as faculty at Boston University in 2026! BU ...

[LLM]

“DeepSeek Summary: Naomi Saphra announces her upcoming faculty position at Boston University in 2026.”

Ben Recht@beenwrekt

For the first time in almost a decade, I'm teaching a class on learning and control.

[Evaluation]

“DeepSeek Summary: Ben Recht is teaching a class on learning and control after a long hiatus.”

Ben Recht@beenwrekt

This stupid website is so cooked.

[Evaluation]

“DeepSeek Summary: Ben Recht expresses frustration with X/Twitter.”

Ben Recht@beenwrekt

Building a theory of the architecture of organizing machines and people.

[Evaluation]

“DeepSeek Summary: Ben Recht is working on a theory about organizing machines and people.”

BLOG

Recent Developments in LLM Architectures: KV Sharing, mHC, and Compressed Attention

From Gemma 4 to DeepSeek V4, How New Open-Weight LLMs Are Reducing Long-Context Costs

By Sebastian Raschka

“This post covers recent advances in LLM architectures aimed at reducing memory and compute costs for long-context processing, including KV sharing, multi-head caching (mHC), and compressed attention mechanisms. Key examples include Gemma 4's and DeepSeek V4's approaches to efficient attention, which enable handling longer sequences without proportional resource increases.”

BLOG

Latest open artifacts (#21): Open model bonanza! Gemma 4, DeepSeek V4, Kimi K2.6, MiMo 2.5, GLM-5.1 & others. On CAISI's V4 assessment.

An eventful month with one flagship release after another

By Nathan Lambert

“The post reviews a wave of major open model releases including Gemma 4, DeepSeek V4, Kimi K2.6, MiMo 2.5, and GLM-5.1, highlighting the rapid pace of innovation in open AI. It focuses on CAISI's V4 assessment, providing a comparative analysis of performance and capabilities across these models.”

-- END OF LOG --

[STATS] 54 items · Filter applied