2026-05-07
DeepSeek 4 Flash local inference engine for Metal
Every so often I think about how, in 2022, for $24B we could had "prototype vaccines ready for each of the 26 known viral families that cause human disease" so they can be deployed in 100 days if there was ever a need. This effort was not funded. ifp.org/why-barda-de...
Under-reported details of the xAI/Anthropic Colossus data center deal: Anthropic get Colossus 1 but xAI keep using the larger Colossus 2, Colossus 1 has a REALLY bad environmental record, and xAI just shut down a bunch of older models on 2 weeks' notice simonwillison.net/2026/May/7/x...
@beenwrekt.bsky.social brilliant as usual: "Indeed, the language of mathematical rationality is a Bayesian language game, always working to box out the unmeasurable and unquantifiable. It demands language without ambiguity, but of course, language is always ambiguous, fluid, and evolving."
Visiting most of the leading Chinese AI labs, I'm struck by a culture that's extremely well suited to building LLMs with fewer resources, but one happening in a very different ecosystem, more companies at play, almost no data industry, etc. Full report: www.interconnects.ai/p/notes-from...
So Claude Mythos was, indeed, not marketing hype. Remember this is a general purpose model that just happens to be good at finding exploits because good models are good at lots of things. Expect similar from OpenAI & Google. And from open models in 8 months. hacks.mozilla.org/2026/05/behi...
@alexhanna.bsky.social and I are so excited to announce that THE AI CON has been selected as Book in Common for 2026-27 at Cal State Chico! We're excited for thousands of folks to read and engage with our work. We'll be visiting campus on April 7, 2027 for a public event.
Seattle friends! This event on May 17, with Shelley Fairweather-Vega at Folio: The Seattle Athenaeum should be really fun. Join us! www.folioseattle.org/event-detail...
Are large language models mathematically rational? I swear I’m not dodging the question in this post, but it depends on your perspective.
<p>There weren't a lot of big new announcements from Anthropic at yesterday's Code w/ Claude event, but the biggest by far was the deal they've struck with SpaceX/xAI to use "all of the capacity of their Colossus data center".</p> <p>As I mentioned in my <a...
Lessons from my trip to talk to most of the leading AI labs in China.