Intelligence.Log

Friday, May 22, 2026

Extracted: 74 items. Sources: 40. Filter: Score >= 5.0

++ Daily.Brief ++

今日AI领域多项重大动态：Anthropic每年支付150亿美元使用马斯克数据中心[#item-theverge-com-science-935229-spacex-anthropic-ipo-ai-capacity]，微软与安永联合投资10亿美元帮助企业采用AI[#item-bloomberg-com-news-articles-2026-05-21-microsoft-and-ey-to-s]，Hark融资7亿美元开发通用AI界面[#item-techcrunch-com-2026-05-21-hark-raises-700m-series-a-for-its-]。研究方面，OpenAI GPT-next以不到1000美元推翻80年历史的Erdős问题[#item-latent-space-p-ainews-openai-gpt-next-disproves]，并提出了基于代理的思维链调优[#item-arxiv-org-abs-2605-20201]和数据探针方法[#item-arxiv-org-abs-2605-18801]。工具更新包括Codegraph预索引代码知识图谱[#item-github-com-colbymchenry-codegraph]、Superpowers代理技能框架[#item-github-com-obra-superpowers]及基于Karpathy观察的Claude Code改进配置[#item-github-com-multica-ai-andrej-karpathy-skills]。观点洞察指出，OpenAI和Anthropic近期事件正改变AI行业格局[#item-axios-com-2026-05-21-ai-news-cycle-openai-anthropic-spacex]，Anthropic的Code with Claude展示了AI编程未来[#item-technologyreview-com-2026-05-21-1137735-anthropics-code-with]，而谷歌新AI IDE Antigravity 2.0遭遇负面反馈[#item-newsletter-pragmaticengineer-com-p-the-pulse-antigravity-20-]。

> Headlines & Launches

9.0Anthropic is paying $15 billion a year for access to Elon Musk’s data centers | The Verge

Anthropic每年支付150亿美元使用马斯克的数据中心

theverge.com#anthropic #data-center #spacex

8.5Microsoft and EY to Spend $1 Billion to Help Clients Adopt AI - Bloomberg

微软与安永联合投资10亿美元帮助企业采用AI

bloomberg.com#microsoft #enterprise-ai #investment

8.5Hark raises $700M Series A for its secretive 'universal' AI interface | TechCrunch

Hark融资7亿美元开发通用AI界面

techcrunch.com#startup #funding #ai-interface

8.0Pentagon Tests Rival AI Models in Race to Replace Anthropic

五角大楼测试OpenAI和Google模型以替代Anthropic

bloomberg.com#military-ai #openai #google

7.5Anthropic in Early Talks to Use Microsoft AI Chips, Information Reports - Bloomberg

Anthropic与微软洽谈使用其AI芯片

bloomberg.com#anthropic #microsoft #ai-chips

7.5Heretic has been served a legal notice by Meta, Inc.

Meta向Heretic项目发出法律通知，涉及开源AI模型合规问题。

Reddit r/LocalLLaMA#legal #open-source #meta

7.0Meta lays off thousands of employees to offset AI investments | The Verge

Meta裁员数千人以抵消AI投资成本

theverge.com#meta #layoffs #ai-investment

6.0White House Postpones AI Cybersecurity Order Signing by Trump - Bloomberg

白宫推迟特朗普签署AI网络安全行政令

bloomberg.com#policy #cybersecurity #white-house

5.8Waymo pauses Atlanta service as its robotaxis keep driving into floods

Waymo因自动驾驶出租车频繁驶入洪水暂停亚特兰大服务。

HN (250)#autonomous-driving #waymo #safety

> Research & Innovation

8.5[AINews] OpenAI GPT-next disproves 80 year old Erdős planar unit distance problem for under $1000

OpenAI GPT-next以不到1000美元推翻80年历史的Erdős问题。

Latent Space#openai #mathematics #reasoning[Model Release][Evals]

8.0Long-Context Reasoning Through Proxy-Based Chain-of-Thought Tuning

提出基于代理的思维链调优，提升长上下文推理能力。

ArXiv cs.CL#long-context #reasoning #chain-of-thought[Context Engineering][Planning]

7.5Position: Let's Develop Data Probes to Fundamentally Understand How Data Affects LLM Performance

提出开发数据探针以理解数据如何影响LLM性能。

ArXiv cs.AI#llm #data-analysis #interpretability

7.5DecisionBench: A Benchmark for Emergent Delegation in Long-Horizon Agentic Workflows

提出用于长周期智能体工作流中紧急委托的基准。

ArXiv cs.AI#benchmark #agent #delegation[Evals][Agent Harness]

7.5MedicalBench: Evaluating Large Language Models Toward Improved Medical Concept Extraction

提出MedicalBench基准，评估LLM医学概念提取能力。

ArXiv cs.CL#benchmark #medical #llm[Evals]

7.0AgentNLQ: A General-Purpose Agent for Natural Language to SQL

提出通用NL2SQL智能体AgentNLQ。

ArXiv cs.AI#nl2sql #agent #database[Tool Use]

7.0Data Scaling as Progressive Coverage of a Predictive Contribution Spectrum

研究数据缩放定律，提出预测贡献谱的渐进覆盖假说。

ArXiv cs.CL#scaling-laws #data #theory

7.0FlowLM: Few-Step Language Modeling via Diffusion-to-Flow Adaptation

通过扩散到流匹配适应实现少步语言建模。

ArXiv cs.CL#flow-matching #diffusion #language-model

7.0Honesty in a small model drops from 35% to 0% by changing the tone of the prompt. Sharing the findings.

研究发现改变提示语气可使小模型诚实度从35%降至0%。

Reddit r/LocalLLaMA#honesty #prompt-engineering #small-model[Evals]

7.0Masked Diffusion Language Models are Strong and Steerable Text-Based World Models for Agentic RL [R]

掩码扩散语言模型作为agentic RL的强世界模型。

Reddit r/MachineLearning#diffusion #world-model #reinforcement-learning[Agent Harness]

7.0Multi-Stream LLMs: new paper on parallelizing/separating prompts, thinking, I/O

新论文提出多流LLM并行化提示、思考与I/O。

HN (56)#llm #parallelization #multi-stream[Context Engineering]

6.5Trustworthy Agent Network: Trust in Agent Networks Must Be Baked In, Not Bolted On

提出可信智能体网络，强调信任需内建而非附加。

ArXiv cs.AI#agent #trust #security[Agent Harness]

6.5Embedding by Elicitation: Dynamic Representations for Bayesian Optimization of System Prompts

通过启发式嵌入动态表示优化系统提示。

ArXiv cs.AI#prompt-optimization #bayesian-optimization #system-prompt[Context Engineering]

6.5Under Pressure: Emotional Framing Induces Measurable Behavioral Shifts and Structured Internal Geometry in Small Language Models

研究情感框架如何改变小语言模型的行为和内部几何结构。

ArXiv cs.CL#emotion #behavior #small-lm

6.5I created an LLM post-training method called RPS. Preliminary results show that it improved Qwen3-8b's program synthesis reliability. [R]

提出RPS后训练方法，提升Qwen3-8b程序合成可靠性。

Reddit r/MachineLearning#post-training #program-synthesis #qwen[Post-Training]

6.3karpathy/autoresearch

AI 代理自动进行单 GPU nanochat 训练研究。

GitHub trending:python (+259★)#ai-agent #training #automation[Agent Harness]

6.0Learn-by-Wire Training Control Governance: Bounded Autonomous Training Under Stress for Stability and Efficiency

提出Learn-by-Wire训练控制治理，提升训练稳定性。

ArXiv cs.AI#training #stability #governance[Post-Training]

6.0Interference-Aware Multi-Task Unlearning

提出干扰感知的多任务机器遗忘方法。

ArXiv cs.AI#machine-unlearning #multi-task #privacy

5.5Operationalizing Document AI: A Microservice Architecture for OCR and LLM Pipelines in Production

提出用于OCR和LLM管线的微服务架构，弥合学术与生产差距。

ArXiv cs.AI#document-ai #microservice #production

5.5Parallel LLM Reasoning for Bias-Resilient, Robust Conceptual Abstraction

提出并行LLM推理以实现抗偏见的鲁棒概念抽象。

ArXiv cs.CL#llm #reasoning #bias-mitigation[Planning]

5.5Pseudo-Siamese Network for Planning in Target-Oriented Proactive Dialogues

提出伪孪生网络用于目标导向主动对话中的规划。

ArXiv cs.CL#dialogue #planning #proactive[Planning]

5.0Evaluating the Utility of Personal Health Records in Personalized Health AI

评估个人健康记录在个性化健康AI中的效用。

ArXiv cs.AI#healthcare #phr #personalized-ai

5.0KAN-MLP-Mixer: A comprehensive investigation of the usage of Kolmogorov-Arnold Networks (KANs) for improving IMU-based Human Activity Recognition

探索KAN用于改进基于IMU的人类活动识别。

ArXiv cs.AI#kan #human-activity-recognition #imu

5.0Shiny Stories, Hidden Struggles: Investigating the Representation of Disability Through the Lens of LLMs

研究LLM对残疾的表征，揭示隐藏困境。

ArXiv cs.CL#llm #bias #disability

5.0Improving Quantized Model Performance in Qualitative Analysis with Multi-Pass Prompt Verification

通过多轮提示验证改进量化模型在定性分析中的性能。

ArXiv cs.CL#quantization #qualitative-analysis #prompt-verification

> Engineering & Resources

8.7colbymchenry/codegraph

Codegraph：预索引代码知识图谱，减少AI编码工具调用。

GitHub trending:all (+4294★)#code-knowledge-graph #ai-coding #local[Coding Agents][Context Engineering]

8.3obra/superpowers

Superpowers：代理技能框架与软件开发方法论。

GitHub trending:all (+1576★)#agent-framework #skills #methodology[Agent Harness]

8.0Two hours that changed AI

分析OpenAI和Anthropic近期重大事件对AI行业的影响

axios.com#openai #anthropic #industry-impact

8.0Anthropic’s Code with Claude showed off coding's future—whether you like it or not | MIT Technology Review

评论Anthropic的Code with Claude展示AI编程未来

technologyreview.com#anthropic #ai-coding #claude[Coding Agents]

7.9multica-ai/andrej-karpathy-skills

基于Karpathy观察的Claude Code改进配置。

GitHub trending:all (+2614★)#claude-code #coding-agents #prompt-engineering[Coding Agents]

7.5Datasette Agent

发布Datasette Agent，一个可扩展的AI助手。

Simon Willison#datasette #ai-assistant #open-source[Agent Harness]

7.5antirez/ds4

antirez发布DeepSeek 4 Flash本地推理引擎。

Co-Starred#deepseek #local-inference #metal[Model Release]

7.5Imbad0202/academic-research-skills

Claude Code 学术研究技能，自动化研究到写作流程。

GitHub trending:python (+2579★)#claude-code #research #academic[Coding Agents]

7.4HKUDS/CLI-Anything

CLI-Anything：让所有软件支持AI代理原生交互。

GitHub trending:all (+656★)#cli #agent-native #tool-use[Tool Use]

7.2Indexing a year of video locally on a 2021 MacBook with Gemma4-31B (50GB swap)

在2021款MacBook上使用Gemma4-31B本地索引一年视频，展示本地AI能力。

HN (288)#local-llm #video-indexing #gemma

7.1can1357/oh-my-pi

终端 AI 编码代理，支持哈希锚定编辑、LSP、子代理等。

GitHub trending:all (+500★)#coding-agent #terminal #open-source[Coding Agents]

7.0Giving Agents Computers — Ivan Burazin, Daytona

与Daytona CEO讨论Agent云平台、裸金属沙箱和RL评估。

Latent Space#agent #cloud #sandbox[Agent Harness]

7.0Google is pitching an AI agent ecosystem to consumers who may not buy it - TechCrunch

谷歌向消费者推销AI代理生态系统，但可能不被接受。

techcrunch.com#google #agent #consumer[Agent Harness]

7.0anthropics/claude-plugins-official

Anthropic官方Claude插件目录发布。

GitHub trending:all (+682★)#claude #plugins #anthropic[Coding Agents]

6.9Lum1104/Understand-Anything

将代码转为交互式知识图谱，支持探索和问答。

GitHub trending:typescript (+666★)#knowledge-graph #code-analysis #interactive

6.6Launch HN: Runtime (YC P26) – Sandboxed coding agents for everyone on a team

Runtime发布沙箱化编码代理，面向团队协作。

HN (67)#coding-agents #sandbox #team-collaboration[Coding Agents]

6.5antoinezambelli/forge

自托管 LLM 工具调用和多步代理工作流的 Python 框架。

GitHub trending:python (+398★)#llm #tool-calling #agent-framework[Tool Use][Agent Harness]

6.5Expedia to Launch Agentic AI Tools for B2B Partners - Skift

Expedia为B2B合作伙伴推出代理AI工具。

skift.com#travel #agent #b2b[Agent Harness]

6.5Tencent Hy 30B/7B/1.8B

腾讯发布Hy-MT2系列多语言翻译模型，含30B/7B/1.8B版本。

Reddit r/LocalLLaMA#translation #multilingual #tencent[Model Release]

6.2Alishahryar1/free-claude-code

免费使用 Claude Code 的工具，支持终端和 VSCode。

GitHub trending:python (+450★)#claude-code #free #vscode[Coding Agents]

6.2ChromeDevTools/chrome-devtools-mcp

Chrome DevTools MCP 服务器，为编码代理提供浏览器调试能力。

GitHub trending:all (+151★)#coding-agent #mcp #devtools[Coding Agents]

6.1google-gemini/gemini-cli

Google Gemini 开源 CLI 代理，终端内调用 Gemini。

GitHub trending:typescript (+100★)#gemini #cli #open-source[Model Release]

6.0datasette-agent-sprites 0.1a0

发布datasette-agent-sprites 0.1a0，支持精灵图。

Simon Willison#datasette #plugin #release

6.0datasette-agent-charts 0.1a2

发布datasette-agent-charts 0.1a2，图表功能。

Simon Willison#datasette #chart #release

6.0datasette-agent 0.1a3

发布datasette-agent 0.1a3，核心更新。

Simon Willison#datasette #agent #release

6.0Waiting for Qwen 3.7 open weight... The new King has arrived...

社区期待Qwen 3.7开源权重发布，认为将成新标杆。

Reddit r/LocalLLaMA#qwen #open-source #model-release[Model Release]

5.7Throwing AI-generated walls of text into conversations

批评在对话中滥用AI生成的长文本

HN (496)#ai-ethics #conversation

5.7Show HN: Agent.email – sign up via curl, claim with a human OTP

Agent.email为AI代理提供邮箱，支持curl注册。

HN (60)#ai-agents #email #tool-use[Tool Use]

5.6google-labs-code/stitch-skills

Stitch MCP 服务器的代理技能库，兼容编码代理。

GitHub trending:typescript (+69★)#mcp #agent-skills #open-standard[Agent Harness]

5.5Qwen3.6 35Ba3 has changed my workflows and even how I use my computer

用户分享Qwen3.6 35Ba3模型如何改变其工作流和电脑使用方式。

Reddit r/LocalLLaMA#qwen #workflow #coding-agent[Coding Agents]

5.4teng-lin/notebooklm-py

非官方 NotebookLM Python API 及代理技能，支持编程访问。

GitHub trending:all (+186★)#notebooklm #python-api #agentic

5.4The memory shortage is causing a repricing of consumer electronics

AI导致内存短缺，推高消费电子产品价格。

HN (91)#memory #pricing #ai-impact

5.3dotnet/skills

微软发布.NET和C#的AI编码代理技能库。

GitHub trending:all (+129★)#dotnet #csharp #coding-agents[Coding Agents]

5.2software-mansion/argent

代理工具包，用于控制、调试和剖析 iOS/Android 应用。

GitHub trending:typescript (+67★)#mobile #debugging #agentic

5.1Was my $48K GPU server worth it?

作者分享价值4.8万美元GPU服务器的使用体验，涉及AI训练成本。

HN (279)#gpu #infrastructure #cost

5.0ZhuLinsen/daily_stock_analysis

LLM 驱动的股票分析系统，集成多数据源和决策仪表盘。

GitHub trending:python (+226★)#llm #finance #dashboard

5.0LatitudeGames/Equinox-31B · Hugging Face

LatitudeGames发布Equinox-31B模型，基于Gemma 31B微调。

Reddit r/LocalLLaMA#model-release #gemma #fine-tune[Model Release]

5.0110 tok/s with 12GB VRAM on Qwen3.6 35B A3B and ik_llama.cpp

用户报告在12GB VRAM上以110 tok/s运行Qwen3.6 35B A3B模型。

Reddit r/LocalLLaMA#performance #qwen #llamacpp

5.0For everyone that uses OpenCode / Pi - Heres your promptprocessing fix!

llama.cpp PR修复了OpenCode/pi中的持续prompt处理问题。

Reddit r/LocalLLaMA#llamacpp #bug-fix #coding-agent[Coding Agents]

[STATS] 74 items · 40 sources · Score >= 5.0