Intelligence.Log

2026-04-16

Extracted: 23 items. Sources: GitHub, Bluesky, Blogs.

++ AI OVERVIEW ++

Today's focus is on AI-assisted development, highlighted by Simon Willison's praise for a Claude.ai feature that can clone a GitHub repository and then answer questions about it or reuse snippets of its code to build new artifacts. This reflects a trend of AI integrating more deeply into the developer workflow, moving beyond simple chat toward code comprehension and reuse. As a nostalgic counterpoint, Marc Lanctot is replaying "Dungeons of Dr. Creep" on his Commodore 64 Ultimate, a sign of continued interest in retro computing within the tech community.

nicobailon/pi-subagents ★ 0.8k ▲ 7/10

Pi extension for async subagent delegation with truncation, artifacts, and session sharing

Starred by philschmid|[Agent][Deployment][Tooling]

“This TypeScript project extends the Pi framework to enable asynchronous subagent delegation with features like truncation, artifact management, and session sharing. It provides a structured approach to distributing AI agent workloads while maintaining context and resource efficiency.”

ashwwwin/automation-mcp ★ 0.4k ▲ 7/10

Control your Mac with detailed mouse, keyboard, screen, and window management capabilities.

Starred by philschmid|[Agent][Tooling]

“This project implements a Model Context Protocol (MCP) server that enables LLMs to control macOS systems through detailed mouse, keyboard, screen, and window management capabilities. It provides granular automation tools for desktop interaction, bridging AI systems with physical computer control.”

vllm-project/tpu-inference ★ 0.3k ▲ 7/10

TPU inference for vLLM, with unified JAX and PyTorch support.

Starred by sayakpaul|[Infra][Deployment]

“Enables high-performance TPU inference for vLLM, offering unified support for both JAX and PyTorch backends. This allows efficient large language model serving on Google's specialized hardware.”

j4orz/teenygrad ★ 0.1k ▲ 6/10

llm201n: neural networks zero to super hero. the bridge from micrograd to tinygrad!

Starred by jph00|[LLM][Tooling]

“Teenygrad is an educational neural network framework that builds from basic autograd concepts to advanced implementations, serving as a bridge between minimal gradient systems like micrograd and more comprehensive frameworks like tinygrad. It provides hands-on learning for understanding neural network fundamentals through progressive complexity.”

espennilsen/pi ★ 0.1k ▲ 2/10

Starred by philschmid|[Agent][Tooling]

“The espennilsen/pi repository is a TypeScript project tagged [Agent][Tooling]. Its name, together with the other Pi extensions starred in this log, suggests it relates to the Pi agent framework rather than to Pi calculations or mathematical constants, though the listing carries no description to confirm this. It has 91 stars.”

omaclaren/pi-markdown-preview ★ 0.0k ▲ 2/10

Rendered markdown + LaTeX preview for pi

Starred by philschmid|[Tooling]

“This project provides a rendered markdown and LaTeX preview for pi, which in the context of the other repositories in this log most likely means the Pi agent framework rather than the Raspberry Pi. It renders technical content that includes mathematical notation.”

BSKY

Simon Willison · Apr 16, 12:31 AM

A claude.ai feature I really like is you can tell it to "clone x/y from GitHub" and it can then answer questions about a repo, or use snippets of code from that repo to help build new artifacts - used that just now to solve a minor friction simonwillison.net/2026/Apr/16/...

❤️ 34 Likes|[LLM][Tooling]

BSKY

Marc Lanctot · Apr 16, 02:50 AM

Last summer, I posted a thread and screenshots of my replay of Castles of Dr. Creep on a #c64 emulator. This year I am replaying the sequel: Dungeons of Dr. Creep on my #commodore64 Ultimate. Tonight I finished a really hard level called Dirty Tricks! 🙌😁 Here is a thread and a few shorts 👇👇👇

❤️ 1 Likes|

BSKY

Simon Willison · Apr 16, 05:26 PM

Shocking result on my pelican benchmark this morning, I got a better pelican from a 21GB local Qwen3.6-35B-A3B running on my laptop than I did from the new Opus 4.7! simonwillison.net/2026/Apr/16/... Qwen on the left, Opus on the right

❤️ 120 Likes|[LLM][Evaluation][Deployment]

BSKY

Mark Riedl · Apr 16, 10:57 PM

Apropos of nothing, Sir Ben Kingsley is a living treasure who also happened to sign on to some epically bad movies like Species.

❤️ 3 Likes|

BSKY

Mark Riedl · Apr 16, 04:35 PM

The person who attacked Altman’s home referenced AI doomer philosophies as motivation for the attack

❤️ 8 Likes|[Safety]

BSKY

Mark Riedl · Apr 16, 02:47 PM

I hope @bsky.app gets their rogue “reginos” under control (whatever a “regino” is, I cannot keep up with all these new fancy tech terms)

❤️ 7 Likes|

BSKY

Nathan Lambert · Apr 16, 08:06 PM

New video! Talking through my 10+ open model pieces from early 2026 and how they fit together. They're all trying to figure out where open models go next. www.youtube.com/watch?v=hKIX...

❤️ 6 Likes|[LLM][Evaluation]

BSKY

Nathan Lambert · Apr 16, 02:45 PM

Opus 4.7 has a new tokenizer. This means it's also a new base model. Glory days of pretraining still very much going.

❤️ 78 Likes|[LLM][Fine-tuning]

BSKY

Nathan Lambert · Apr 16, 02:45 PM

The current pace of token-efficient reasoning improvements across minor Claude Opus/GPT model versions is pretty wild. All signs point to this continuing. 4.6 to 4.7 could've been presented as a fairly large model bump in the past with this plot.

❤️ 24 Likes|[LLM][Evaluation]

BSKY

Ethan Mollick · Apr 16, 08:09 PM

I think the adaptive thinking requirement in the new Claude Opus 4.7 is bad in the ways that all AI effort routers are bad, but magnified by the fact that there is no manual override like in ChatGPT. It regularly decides that non-math/code stuff is "low effort" & produces worse results.

❤️ 46 Likes|[LLM][Evaluation]

BSKY

Ethan Mollick · Apr 16, 07:41 PM

I have found that asking for a sestina regularly triggers Opus 4.7's safety guardrails. The forbidden poetic form!

❤️ 55 Likes|[Safety][LLM]

BSKY

Ethan Mollick · Apr 16, 03:26 PM

Claude remains irreducibly Claude, across many generations. If you know, you know. (The fact that models have distinct personalities that are consistent across generations is technically interesting; it also makes it easy to use new releases when they come along, because they feel very similar.)

❤️ 56 Likes|[LLM][Evaluation]

BSKY

Emily M. Bender · Apr 16, 11:45 PM

Curious to hear from other podcasters -- do you see variation in downloads by day of the week the ep is released? For MAIHT3k, the sweet spot seems to be Tuesday, and I'm not sure why.

❤️ 3 Likes|[Evaluation]

BSKY

Emily M. Bender · Apr 16, 06:49 PM

This is not a drill! @ghostdoc2026.bsky.social is coming to SIFF!

❤️ 12 Likes|

BSKY

Emily M. Bender · Apr 16, 03:44 PM

When people ask me about "AI" policy, I like to point out that there is policy being made at many different levels and even if our national government is currently a shitshow, local action really matters, like zoning against data centers and "AI" policy in schools.

❤️ 30 Likes|[Safety]

BSKY

Emily M. Bender · Apr 16, 03:39 PM

A "pause AI" letter that I actually like (and signed)! Because it's not about imagined doomsday scenarios, but rather putting the brakes on the rush to impose synthetic text extruding machines in schools in particular: actionnetwork.org/petitions/ca...

❤️ 90 Likes|[Safety]

BLOG

Qwen3.6-35B-A3B on my laptop drew me a better pelican than Claude Opus 4.7

For anyone who has been (inadvisably) taking my pelican riding a bicycle benchmark (https://simonwillison.net/tags/pelican-riding-a-bicycle/) seriously as a robust way to test models, here are pelicans from this morning's two big model releases - ...

By Simon Willison

“The post reports that Qwen3.6-35B-A3B, running locally on a laptop as a 21GB model, drew a better pelican riding a bicycle than the new Claude Opus 4.7. Willison himself cautions against treating the pelican benchmark as a robust test of models, so the result is best read as one data point suggesting that locally runnable open models can match or beat leading proprietary models on specific creative tasks.”

-- END OF LOG --

[STATS] 23 items · Filter applied