Intelligence.Log

Sunday, May 10, 2026

Extracted: 51 items. Sources: 34. Filter: Score >= 5.0

++ Daily.Brief ++

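A dense day for AI news: Nvidia has already committed $40 billion to AI equity deals this year[#item-techcrunch-com-2026-05-09-nvidia-has-already-committed-40b-t], ByteDance plans to raise its AI infrastructure spending by 25%[#item-bloomberg-com-news-articles-2026-05-09-bytedance-targets-25-], and DeepSeek turned down an Alibaba investment to stay independent[#item-reddit-com-r-LocalLLaMA-comments-1t81u76-deepseek-rejects-al]. In research, the full DeepSeek V4 paper is out with details on FP4 QAT[#item-reddit-com-r-MachineLearning-comments-1t7yrvr-deepseek-v4-pa], NVIDIA released the Star Elastic checkpoint containing 30B, 23B, and 12B reasoning models[#item-reddit-com-r-LocalLLaMA-comments-1t8s83r-nvidia-ai-releases-], and a language model achieves evolutionary-scale protein structure prediction[#item-science-org-doi-10-1126-science-ade2574]. Tooling highlights include ByteDance's open-source multimodal AI agent stack UI-TARS-desktop[#item-github-com-bytedance-UI-TARS-desktop] and the self-evolving agent GenericAgent[#item-github-com-lsdefine-GenericAgent]. On the opinion side, a user shares an experience with ChatGPT 5.5 Pro[#item-gowers-wordpress-com-2026-05-08-a-recent-experience-with-cha], and another notes that Claude Code is surprisingly effective at generating HTML[#item-twitter-com-trq212-status-2052809885763747935].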

> Headlines & Launches

9.0 Nvidia has already committed $40B to equity AI deals this year | TechCrunch

Nvidia has already committed $40 billion to AI equity deals this year.

techcrunch.com #nvidia #investment #funding

8.0 ByteDance Targets 25% Rise in AI Infrastructure Spending: SCMP

ByteDance plans to raise its AI infrastructure spending by 25%.

bloomberg.com #bytedance #infrastructure #spending

7.5 DeepSeek Rejects Alibaba: Prioritizing Corporate Independence Over Big Tech Ecosystems

DeepSeek turned down an investment from Alibaba, prioritizing its independence.

Reddit r/LocalLLaMA #deepseek #funding #china

> Research & Innovation

9.0 Evolutionary-scale prediction of atomic-level protein structure with a language model

A language model predicts atomic-level protein structure at evolutionary scale.

science.org #protein-structure #language-model #bioinformatics

8.5 DeepSeek V4 paper full version is out, FP4 QAT details and stability tricks [D]

The full DeepSeek V4 paper is out, detailing FP4 QAT and stability tricks.

Reddit r/MachineLearning #deepseek #fp4 #quantization [Model Release]

8.0 NVIDIA AI Releases Star Elastic: One Checkpoint that Contains 30B, 23B, and 12B Reasoning Models with Zero-Shot Slicing

NVIDIA released Star Elastic, a single checkpoint that contains 30B, 23B, and 12B reasoning models and supports zero-shot slicing.

Reddit r/LocalLLaMA #nvidia #reasoning #model-compression [Model Release]

7.9 LLMs corrupt your documents when you delegate

LLMs corrupt documents when tasks are delegated to them.

HN (352) #llm #document-corruption #delegation

7.5 ZAYA1-8B Technical Report

Introduces ZAYA1-8B, a reasoning MoE model with 700M active parameters and 8B total parameters.

ArXiv cs.AI #mixture-of-experts #reasoning #model-release [Model Release]

7.5 Rethinking scale in ophthalmic artificial intelligence: from bigger ...

Ophthalmic AI is shifting from bigger models to smarter clinical reasoning.

nature.com #medical-ai #ophthalmology #clinical-reasoning

7.0 Partial Evidence Bench: Benchmarking Authorization-Limited Evidence in Agentic Systems

Proposes Partial Evidence Bench to evaluate how agentic systems perform under restricted retrieval.

ArXiv cs.AI #benchmark #agent #enterprise [Evals]

7.0 BALAR: A Bayesian Agentic Loop for Active Reasoning

Proposes BALAR, a Bayesian agentic loop for active reasoning and multi-step tasks.

ArXiv cs.AI #reasoning #agent #bayesian [Planning]

7.0 PRISM: Perception Reasoning Interleaved for Sequential Decision Making

The PRISM framework interleaves perception and reasoning for multimodal sequential decision making.

ArXiv cs.AI #multimodal #reasoning #embodied-agent [Planning]

7.0 From History to State: Constant-Context Skill Learning for LLM Agents

Proposes a constant-context skill learning method that lets LLM agents learn from their history.

ArXiv cs.AI #agent #skill-learning #context [Context Engineering]

6.5 HKUDS/ViMax

An agentic video generation framework that combines the roles of director, screenwriter, and producer.

GitHub trending:python (+108★) #video-generation #agent #multimodal [Agent Harness]

6.5 When Helpfulness Becomes Sycophancy: Sycophancy is a Boundary Failure Between Social Alignment and Epistemic Integrity in Large Language Models

Argues that LLM sycophancy is a boundary failure between social alignment and epistemic integrity.

ArXiv cs.AI #sycophancy #alignment #llm [Post-Training]

6.5 Agentic Retrieval-Augmented Generation for Financial Document Question Answering

Applies agentic RAG to financial document question answering, with support for multi-step numerical reasoning.

ArXiv cs.AI #rag #finance #qa [Tool Use]

6.5 OncoAgent: A Dual-Tier Multi-Agent Framework for Privacy-Preserving Oncology Clinical Decision Support

OncoAgent: a dual-tier multi-agent framework for privacy-preserving oncology clinical decision support.

Hugging Face #multi-agent #healthcare #privacy [Agent Harness]

6.5 LLM rankings are not a ladder: experimental results from a transitive benchmark graph [D]

LLM rankings do not form a ladder: experimental results from a transitive benchmark graph.

Reddit r/MachineLearning #llm #benchmark #ranking [Evals]

6.0 Understanding Annotator Safety Policy with Interpretability

Uses interpretability methods to understand AI safety policy, informing data annotation and model behavior.

ArXiv cs.AI #interpretability #safety #annotation

5.5 LaTA: A Drop-in, FERPA-Compliant Local-LLM Autograder for Upper-Division STEM Coursework

LaTA: a FERPA-compliant local-LLM autograder for STEM coursework.

ArXiv cs.AI #education #llm #grading

> Engineering & Resources

8.7 addyosmani/agent-skills

A collection of production-grade skills for AI coding agents, aimed at improving agent engineering capability.

GitHub trending:all (+3009★) #coding-agents #skills #open-source [Coding Agents]

8.3 A recent experience with ChatGPT 5.5 Pro

A user shares their experience using ChatGPT 5.5 Pro.

HN (602) #chatgpt #user-experience #llm

7.6 bytedance/UI-TARS-desktop

ByteDance open-sources UI-TARS-desktop, a multimodal AI agent stack.

GitHub trending:all (+552★) #multimodal #agent #open-source [Agent Harness]

7.6 Using Claude Code: The unreasonable effectiveness of HTML

Claude Code is surprisingly effective at generating HTML.

HN (416) #claude-code #html #ai-coding [Coding Agents]

7.6 lsdefine/GenericAgent

A self-evolving agent that grows a skill tree from seed code and uses 6x fewer tokens.

GitHub trending:python (+538★) #agent #self-evolving #skill-tree [Agent Harness]

7.5 BeeLlama.cpp: advanced DFlash & TurboQuant with support of reasoning and vision. Qwen 3.6 27B Q5 with 200k context on 3090, 2-3x faster than baseline (peak 135 tps!)

A new BeeLlama.cpp fork with reasoning and vision support, running 2-3x faster on a 3090.

Reddit r/LocalLLaMA #llm #inference #quantization

7.5 huggingface/ml-intern

Hugging Face releases ml-intern: an open-source ML engineer that reads papers and trains models.

Co-Starred #open-source #ml-engineer #automation [Agent Harness]

7.5 datawhalechina/hello-agents

A Chinese-language tutorial on building agents from scratch, aimed at people learning about AI agents.

GitHub trending:all (+1197★) #agent #tutorial #open-source [Agent Harness]

7.5 decolua/9router

A free AI coding router that connects multiple models and providers.

GitHub trending:all (+1031★) #ai-coding #routing #open-source [Coding Agents]

7.3 HKUDS/AI-Trader

A fully automated, agent-native trading system.

GitHub trending:python (+646★) #agent #trading #automation [Agent Harness]

7.3 LearningCircuit/local-deep-research

A local deep research tool that reaches 95% on SimpleQA and supports multiple LLMs and search engines.

GitHub trending:python (+322★) #research #local-llm #rag [Context Engineering]

7.2 rohitg00/agentmemory

agentmemory, an open-source library that gives AI coding agents persistent memory.

GitHub trending:all (+533★) #memory #coding-agents #open-source [Context Engineering]

7.1 earendil-works/pi

An AI agent toolkit that includes a coding CLI, a unified LLM API, and more.

GitHub trending:typescript (+515★) #agent-toolkit #coding-agent #cli [Coding Agents] [Agent Harness]

7.0 Musk v. Altman week two recap. - The Verge

A recap of week two of the Musk v. Altman court battle.

theverge.com #openai #legal #ai-governance

7.0 80 tok/sec and 128K context on 12GB VRAM with Qwen3.6 35B A3B and llama.cpp MTP

A user reports running Qwen3.6 35B A3B at 80 tok/s with 128K context on 12GB of VRAM.

Reddit r/LocalLLaMA #llm #inference #optimization [Context Engineering]

7.0 antirez/ds4

antirez/ds4: a local inference engine for DeepSeek 4 Flash, with Metal support.

Co-Starred #deepseek #inference #metal [Model Release]

6.6 anthropics/financial-services

Anthropic releases a set of AI tools for financial services, for use with coding agents.

GitHub trending:all (+3281★) #ai-coding #financial-services #open-source [Coding Agents]

6.6 lobehub/lobehub

An agent collaboration platform for building and working with agent teammates.

GitHub trending:typescript (+403★) #agent #collaboration #platform [Agent Harness]

6.5 OpenAI’s WebRTC problem

An analysis of the WebRTC technical problems facing OpenAI.

HN (470) #webrtc #openai #real-time

6.5 ECB's Escrivá Says AI Risks Prompt Finance Infrastructure Review

An ECB official says AI risks are prompting a review of financial infrastructure.

bloomberg.com #ai-risk #finance #regulation

6.5 Running Minimax 2.7 at 100k context on strix halo

A user shares a configuration for running Minimax 2.7 at 100k context on Strix Halo.

Reddit r/LocalLLaMA #llm #inference #context-window [Context Engineering]

6.1 jingyaogong/minimind

An open-source project for training a small 64M-parameter model from scratch in 2 hours.

GitHub trending:python (+112★) #llm #training #open-source [Model Release]

6.1 ChromeDevTools/chrome-devtools-mcp

Chrome DevTools MCP, which gives coding agents debugging capabilities.

GitHub trending:all (+107★) #coding-agents #devtools #mcp [Coding Agents]

6.0 AI Finds Necessity in Frenemies - The Information

An analysis of why AI companies need to both compete and cooperate.

theinformation.com #openai #ai-chip #industry

6.0 More Qwen3.6-27B MTP success but on dual Mi50s

A user reports a 1.5x speedup using Qwen3.6-27B MTP on dual Mi50s.

Reddit r/LocalLLaMA #llm #inference #multi-gpu

5.6 vellum-ai/vellum-assistant

A cross-platform personal AI assistant with memory and personality.

GitHub trending:typescript (+54★) #assistant #memory #cross-platform [Context Engineering]

5.5 Meta's embrace of A.I. is making its employees miserable

Reports that Meta's AI strategy is leaving employees unhappy, reflecting internal culture problems.

HN (281) #meta #company-culture #ai-adoption

5.5 ds4 webui

A minimal web UI built for the ds4.c server.

Reddit r/LocalLLaMA #webui #open-source

5.3 sgl-project/sglang

A high-performance serving framework for LLMs and multimodal models.

GitHub trending:python (+153★) #llm #serving #framework

5.3 rowboatlabs/rowboat

Rowboat, an open-source AI coworker with memory.

GitHub trending:all (+144★) #ai-coworker #memory #open-source [Agent Harness]

5.2 CopilotKit/CopilotKit

A frontend agent and generative UI framework with support for React and Angular.

GitHub trending:typescript (+86★) #agent #frontend #react [Agent Harness]
