2026-05-19
HRM-Text is a 1B text generation model based on the HRM architecture, strengthened by task completion and latent space reasoning.
Data collection and analysis for a PyCon talk on GitHub Actions security across Python packages.
auth and other api helpers for mummy
On-policy distillation is on track to be a lasting method in post-training. The list of areas would be: Instruction tuning (SFT/IFT) RLHF Direct Preference Optimization (DPO et al) RLVR On-policy Distillation (OPD) New classes of methods are rare! Excited to play.
My notes on Gemini 3.5 Flash - 3x the price of Gemini 3 Flash but Google are planning to use it for many of their own products simonwillison.net/2026/May/19/...
Against the constant pressure of *genAI, genAI, genAI*, I am really appreciating @ai2.bsky.social 's work on creating tools for critical needs -- like crop maps and forest loss analysis. They just did a nice release on @hf.co. huggingface.co/blog/allenai...
Gmail's automatically generated responses (which can appear whether or not you ask for them) cement human anchoring bias: The tendency for people to heavily rely on what they have already seen. The effects are insidious, subconsciously influencing what we believe.
Yet another sobering post from @noahpinion.blogsky.venki.dev open.substack.com/pub/noahpini...
🚨Our paper is out in PNAS: we found classic human persuasion techniques worked on AIs in a "parahuman" way, making them agree to objectionable requests (increasing compliance from 35% to 51%) It worked on a range of major recent LLMs though newer models do resist more www.pnas.org/doi/10.1073/...
Also had some early access to Gemini 3.5 Flash. Very fast for a flash model and very capable, though not as powerful as a full frontier model. I added it to the gallery or procedurally generated one-shot towns (it made one error that it corrected): hg-20f7d1a3ce.netlify.app#gemini-3-5-f...
Gemini Omni is quite good at instruction following: "sea otter in a pilot's uniform explains why Spirit Airlines went bankrupt to a river otter who is distracted by their laptop while they are in a hot air balloon over NYC. in the next balloon over, william shakespeare fights a robot made of pizza"
Had early access to Gemini Omni: "a dramatic reading of Death by Water from the Wasteland by a man eating garlic bread while balanced on a unicycle on a small platform over a churning sea of tomato sauce in which, at the center, sites a meatball with bright blue eyes wearing a top hat"
Wow some terrible reporting about Google's latest horrible ideas about how to distort information access in the name of "convenience" (or something): techcrunch.com/2026/05/19/g... A short thread 🧵>>
Excited to share our paper! Due Process on Hold: A Queueing Framework for Improving Access in SNAP arxiv.org/abs/2605.15165 Millions of Americans interface with the social safety net via call centers that are too congested. In Holmes v. Knodell, bad operations = procedural due process violation.
<p>I put together these annotated slides from my five minute lightning talk at PyCon US 2026, using the <a href="https://tools.simonwillison.net/annotated-presentations">latest iteration</a> of my <a href="https://simonwillison.net/2023/Aug/6/annotated-presentations/">annotated presentation...
<p>Today at Google I/O, Google <a href="https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-5/">released Gemini 3.5 Flash</a>. This one skipped the <code>-preview</code> modifier and went straight to general availability, and Google appear to be using it for a whole lot...