← Back to Stars

← 2026-W19|latest →

Digest

Week 20

May 11 – May 17, 2026

23

Repos

65

Bsky

298

X

3

Blogs

46

Authors

7

Days

◆ Signal

Co-Starred This Week

Repos independently starred by multiple AI leaders — the strongest cross-person signal in the weekly feed.

×2 starrers▲ 8/10★ 9.0k

DeepSeek 4 Flash local inference engine for Metal and CUDA

by:minimaxir pcuenca

|

[Deployment][LLM]

|2026-05-12 → 2026-05-14

Summary

AI

This week in AI was marked by intense focus on open-weight models, with multiple releases and analyses converging across GitHub, Bluesky, and blog articles. The open model ecosystem saw significant commentary, particularly around China's high-participation, open-first AI ecosystem as discussed by Nathan Lambert in his blog 'How open model ecosystems compound'. Meanwhile, Sebastian Raschka's blog 'Recent Developments in LLM Architectures' covered KV sharing, mHC, and compressed attention, reflecting technical advances in models like Gemma 4 and DeepSeek V4. On GitHub, the antirez/ds4 repo (DeepSeek 4 Flash local inference engine) was starred by multiple prominent figures (pcuenca and minimaxir), indicating strong interest in running cutting-edge models locally. The pi agent toolkit (earendil-works/pi) also gained traction, with a dedicated awesome list (qualisero/awesome-pi-agent) appearing. Bluesky discussions were dominated by critical perspectives on AI's societal impact, with Emily M. Bender's posts about ChatGPT being a 'product' and the 'Zombie Internet' concept receiving high engagement. Ethan Mollick and Simon Willison contributed thoughtful posts on AI responsibility and practical tooling, respectively. X posts from Andrej Karpathy, Simon Willison, and Harrison Chase explored coding agents, memory systems, and the evolving landscape of agentic workflows. Overall, the week balanced technical innovation with ethical reflection, highlighting the growing gap between AI capability hype and grounded analysis.

Notable Repos

ggml-org/llama.cpp

LLM inference in C/C++, starred by pcuenca; foundational for local LLM deployment.

pcuenca

★ 110.7k

earendil-works/pi

AI agent toolkit including coding agent CLI, unified LLM API, TUI & web UI libraries; starred by tridao.

tridao

★ 50.0k

marimo-team/marimo

Reactive notebook for Python, AI-native editor; starred by minimaxir.

minimaxir

★ 21.0k

DeepSeek 4 Flash local inference engine for Metal and CUDA; starred by minimaxir and pcuenca.

minimaxir, pcuenca

★ 9.0k

abetlen/llama-cpp-python

Python bindings for llama.cpp; starred by pcuenca.

pcuenca

★ 10.3k

sharkdp/hyperfine

Command-line benchmarking tool; starred by minimaxir.

minimaxir

★ 28.1k

mitchellh/vouch

Community trust management system based on explicit vouches; starred by sayakpaul.

sayakpaul

★ 4.4k

cactus-compute/needle

26m function call model that runs on incredibly small devices; starred by simonw.

simonw

★ 534

Notable Blogs

How open model ecosystems compound

Reflections on China's high-participation, open-first AI ecosystem.

— Nathan Lambert

Recent Developments in LLM Architectures: KV Sharing, mHC, and Compressed Attention

From Gemma 4 to DeepSeek V4, how new open-weight LLMs are reducing long-context costs.

— Sebastian Raschka

Latest open artifacts (#21): Open model bonanza!

Open model bonanza including Gemma 4, DeepSeek V4, Kimi K2.6, and others.

— Nathan Lambert

Notable X Posts

“Judging by my tl there is a growing gap in understanding of AI capability. The first issue I think is around”

@karpathy

“A short note that the predictions that LLMs would favor 'boring technology' that's once you attach them to a good coding agent harness at least”

@simonw

“In the hot path as the agent is running. The agent can decided to (or the user can prompt it to) update its memory as it is working on the core”

@hwchase17

Key Discussions

“ChatGPT as a product, not a tool”

@Emily M. Bender

Bsky

Emily M. Bender reminds that ChatGPT is a product; everything typed is data sent to OpenAI.

“AI productivity claims vs reality”

@Emily M. Bender

Bsky

Emily M. Bender cites 404media article noting AI productivity gains haven't led to better products or shorter work weeks.

“ArXiV new LLM policy”

@Mark Riedl

Bsky

Mark Riedl shares screenshots of ArXiV's new LLM policy, sparking discussion.

“Human responsibility for AI use”

@Ethan Mollick

Bsky

Ethan Mollick argues making humans responsible for AI use is reasonable for academic research.

“LLM shebang scripts”

@Simon Willison

Bsky

Simon Willison describes using LLM CLI tool in shebang lines to write executable scripts in English.

Trending

Open Model Releases and Architectures

Multiple blog posts and GitHub repos focused on new open-weight models like DeepSeek V4 and Gemma 4, with discussions on architectures (KV sharing, compressed attention) and ecosystem impacts.

Local Inference and Tooling

The antirez/ds4 repo (DeepSeek 4 Flash local inference) was starred by multiple prominent figures, alongside llama.cpp and its Python bindings, indicating strong interest in running LLMs locally.

AI Ethics and Societal Impact

Emily M. Bender's Bluesky posts about ChatGPT as a product and the 'Zombie Internet' concept received high engagement, reflecting ongoing critical discourse on AI's role.

Coding Agents and Agentic Workflows

The pi agent toolkit and related awesome list gained stars; X posts from Karpathy, Simon Willison, and Harrison Chase discussed coding agents, memory systems, and best practices.

AI Productivitiy and Responsibility

Ethan Mollick's Bluesky posts on human responsibility for AI use and the missing segment in AI & politics debates sparked discussion.

Daily Logs

May 17 (Sun)→May 16 (Sat)→May 15 (Fri)→May 14 (Thu)→May 13 (Wed)→May 12 (Tue)→May 11 (Mon)→

23 repos · 65 bluesky · 298 x · 3 blogs · 7 days

Powered by DeepSeek