Intelligence.Log

Tuesday, April 28, 2026

Extracted: 67 items. Sources: 33. Filter: Score >= 5.0

++ Daily.Brief ++

今日AI领域迎来重大变动，微软与OpenAI终止独家及收入分成协议，双方著名的AGI协议也随之失效，OpenAI重获自由。在研究方面，微软开源了40亿参数的图像转3D模型TRELLIS.2，同时新论文指出基于结果的奖励无法保证可验证推理。工具更新上，微软开源了前沿语音AI VibeVoice，并有项目提供免费使用Claude Code的途径。观点洞察中，业界正追踪已失效的AGI条款历史，并讨论生产级RAG系统的局限性，以及AI在物理世界中的应用。

> Headlines & Launches

9.6Microsoft and OpenAI end their exclusive and revenue-sharing deal

微软与OpenAI终止独家及收入分成协议。

HN (747)#openai #microsoft #partnership[Model Release]

9.5Microsoft and OpenAI's famed AGI agreement is dead - The Verge

微软与OpenAI的AGI协议终止，合作关系重新谈判。

theverge.com#openai #microsoft #agi[Model Release]

9.5OpenAI Breaks Free of Microsoft AI Exclusivity Pact - Bloomberg

OpenAI打破与微软的AI独家协议，重获自由。

bloomberg.com#openai #microsoft #exclusivity[Model Release]

9.0DeepMind's David Silver just raised $1.1B to build an AI that learns without human data

DeepMind David Silver融资11亿美元构建无人类数据学习AI。

techcrunch.com#funding #deepmind #unsupervised-learning[Model Release]

9.0We have a jury. | The Verge

马斯克与奥特曼就OpenAI未来展开法庭对决。

theverge.com#openai #lawsuit #governance

9.0Musk and Altman face off in trial that will determine OpenAI's future - Ars Technica

马斯克与奥特曼对簿公堂，决定OpenAI未来走向。

arstechnica.com#openai #lawsuit #governance

9.0Bloomberg Tech: OpenAI Drops Deal with Microsoft - Bloomberg

彭博播客报道OpenAI与微软分道扬镳。

bloomberg.com#openai #microsoft #podcast[Model Release]

9.0Elon Musk and Sam Altman are going to court over OpenAI's future

马斯克与奥特曼就OpenAI未来对簿公堂。

technologyreview.com#openai #lawsuit #governance

9.04TB of voice samples just stolen from 40k AI contractors at Mercor

AI承包商Mercor泄露4TB语音样本和身份证扫描件。

HN (436)#data-breach #privacy #ai-contractors

8.5China blocks Meta's acquisition of AI startup Manus

中国阻止Meta收购AI初创公司Manus，涉及国家监管与AI竞争。

HN (299)#acquisition #regulation #china

8.5Meta’s $2 billion Manus acquisition blocked by China.

Meta 20 亿美元收购 Manus 被中国发改委阻止

Reddit r/LocalLLaMA#acquisition #regulation #meta

8.3GitHub Copilot is moving to usage-based billing

GitHub Copilot将改为按用量计费。

HN (541)#github-copilot #billing #ai-coding[Coding Agents]

8.0EU tells Google to open up AI on Android; Google says that's "unwarranted intervention" - Ars Technica

欧盟要求谷歌开放安卓AI助手市场，谷歌称其过度干预。

arstechnica.com#regulation #android #ai-assistant

6.5Announcing our partnership with the Republic of Korea

DeepMind与韩国合作加速前沿AI科学突破。

DeepMind#partnership #deepmind #korea

> Research & Innovation

8.0Microsoft Presents "TRELLIS.2": An Open-Source, 4b-Parameter, Image-To-3D Model Producing Up To 1536³ PBR Textured Assets, Built On Native 3D VAES With 16× Spatial Compression, Delivering Efficient, Scalable, High-Fidelity Asset Generation.

微软开源40亿参数图像转3D模型TRELLIS.2，高保真生成。

Reddit r/LocalLLaMA#3d-generation #open-source #microsoft[Model Release]

7.5Outcome Rewards Do Not Guarantee Verifiable or Causally Important Reasoning

证明基于结果的奖励不能保证可验证或因果重要的推理。

ArXiv cs.CL#rlvr #reasoning #chain-of-thought[Post-Training]

7.0Memanto: Typed Semantic Memory with Information-Theoretic Retrieval for Long-Horizon Agents

提出带类型语义记忆和信息论检索的长时域智能体。

ArXiv cs.AI#agent #memory #long-horizon[Context Engineering][Agent Harness]

7.0AI framework autonomously outperforms human-designed R&D baselines

新AI框架自主优化训练数据、架构和算法，超越人类基线。

venturebeat.com#automl #optimization #framework

7.0Qwen3 4B outperforms cloud agents on code tasks—with Mahoraga research [R]

Qwen3 4B在代码任务上超越云agent，Mahoraga研究。

Reddit r/MachineLearning#qwen #code-agents #open-source[Coding Agents]

6.5Math Takes Two: A test for emergent mathematical reasoning in communication

提出测试LLM在对话中涌现数学推理能力的基准。

ArXiv cs.AI#llm #reasoning #benchmark[Evals]

6.5MolClaw: An Autonomous Agent with Hierarchical Skills for Drug Molecule Evaluation, Screening, and Optimization

提出具有分层技能的自主智能体用于药物分子评估与优化。

ArXiv cs.AI#agent #drug-discovery #hierarchical-skills[Agent Harness]

6.5Emergent Strategic Reasoning Risks in AI: A Taxonomy-Driven Evaluation Framework

提出AI战略推理风险的分类评估框架。

ArXiv cs.AI#llm #reasoning #risk[Evals]

6.5When Does LLM Self-Correction Help? A Control-Theoretic Markov Diagnostic and Verify-First Intervention

用控制论诊断LLM自我纠正何时有效并提出先验证策略。

ArXiv cs.AI#llm #self-correction #control-theory[Planning]

6.5Source-Modality Monitoring in Vision-Language Models

定义并研究视觉语言模型中的源模态监控能力。

ArXiv cs.CL#multimodal #vision-language #monitoring

6.5Incentivizing Neuro-symbolic Language-based Reasoning in VLMs via Reinforcement Learning

通过强化学习激励VLM中的神经符号语言推理。

ArXiv cs.CL#reinforcement-learning #vlm #neuro-symbolic[Post-Training]

6.5The 4B class of 2026 (benchmark)

4B 参数模型基准测试，对比不同模型性能

Reddit r/LocalLLaMA#benchmark #small-models[Evals]

6.0An Artifact-based Agent Framework for Adaptive and Reproducible Medical Image Processing

提出基于工件的智能体框架用于自适应医学图像处理。

ArXiv cs.AI#agent #medical-imaging #framework[Agent Harness]

6.0Read the Paper, Write the Code: Agentic Reproduction of Social-Science Results

使用LLM智能体复现社会科学研究结果。

ArXiv cs.AI#llm #agent #social-science[Agent Harness]

6.0Sound Agentic Science Requires Adversarial Experiments

强调科学智能体需要对抗性实验来验证可靠性。

ArXiv cs.AI#agent #adversarial #science[Agent Harness]

6.0Introducing Background Temperature to Characterise Hidden Randomness in Large Language Models

引入背景温度概念刻画LLM中隐藏的随机性。

ArXiv cs.AI#llm #randomness #temperature

6.0Shared Lexical Task Representations Explain Behavioral Variability In LLMs

发现共享词汇任务表征解释LLM行为变异性。

ArXiv cs.CL#llm #behavior #representation

6.0Lightweight Retrieval-Augmented Generation and Large Language Model-Based Modeling for Scalable Patient-Trial Matching

提出轻量级RAG和LLM用于可扩展的患者-试验匹配。

ArXiv cs.CL#rag #llm #healthcare[Context Engineering]

6.0Where Should LoRA Go? Component-Type Placement in Hybrid Language Models

研究LoRA在混合语言模型中的组件类型放置。

ArXiv cs.CL#lora #hybrid-models #fine-tuning

5.5When Cow Urine Cures Constipation on YouTube: Limits of LLMs in Detecting Culture-specific Health Misinformation

研究LLM检测文化特定健康错误信息的局限性。

ArXiv cs.CL#llm #misinformation #health

5.5An End-to-End Ukrainian RAG for Local Deployment. Optimized Hybrid Search and Lightweight Generation

为乌克兰语本地部署优化的端到端RAG系统。

ArXiv cs.CL#rag #ukrainian #local-deployment

5.5Knowledge-driven Augmentation and Retrieval for Integrative Temporal Adaptation

知识驱动的增强与检索用于整合时间适应。

ArXiv cs.CL#temporal-adaptation #retrieval #knowledge-augmentation

5.0Optimal Question Selection from a Large Question Bank for Clinical Field Recovery in Conversational Psychiatric Intake

研究如何从大量题库中为精神科对话选择最优问题。

ArXiv cs.CL#clinical #question-selection #nlp

5.0Adaptive Ultrasound Imaging with Physics-Informed NV-Raw2Insights-US AI

物理信息驱动的自适应超声成像AI模型。

Hugging Face#ultrasound #physics-informed #medical-ai

> Engineering & Resources

8.3mattpocock/skills

面向工程师的AI技能集，来自Claude Code配置。

GitHub trending:all (+5645★)#ai-coding #skills #claude-code[Coding Agents]

7.9microsoft/VibeVoice

微软开源前沿语音AI VibeVoice。

GitHub trending:all (+757★)#voice-ai #open-source #microsoft[Model Release]

7.9Alishahryar1/free-claude-code

免费使用Claude Code的终端/VSCode扩展。

GitHub trending:all (+2949★)#claude-code #free #vscode[Coding Agents]

7.7Show HN: OSS Agent I built topped the TerminalBench on Gemini-3-flash-preview

开源Agent在TerminalBench上超越谷歌官方模型。

HN (302)#open-source #agent #benchmark[Coding Agents][Evals]

7.5Tracking the history of the now-deceased OpenAI Microsoft AGI clause

追踪微软与OpenAI已失效的AGI条款历史。

Simon Willison#openai #microsoft #agi

7.5Luce DFlash: Qwen3.6-27B at up to 2x throughput on a single RTX 3090

Luce DFlash: Qwen3.6-27B 在单张 RTX 3090 上实现 2 倍吞吐量

Reddit r/LocalLLaMA#speculative-decoding #qwen #inference

7.5mattmireles/gemma-tuner-multimodal

Gemma Tuner Multimodal：在Apple Silicon上微调Gemma多模态模型。

Co-Starred#gemma #fine-tuning #multimodal[Post-Training]

7.5abhigyanpatwari/GitNexus

GitNexus：零服务器代码智能引擎，客户端知识图谱。

GitHub trending:all (+1102★)#code-intelligence #knowledge-graph[Coding Agents]

7.4badlogic/pi-mono

AI代理工具包，含编码CLI、统一LLM API等。

GitHub trending:typescript (+974★)#agents #toolkit #cli[Coding Agents][Agent Harness]

7.1gastownhall/beads

Beads：为编程agent提供内存升级。

GitHub trending:all (+498★)#coding-agent #memory #context[Context Engineering][Coding Agents]

7.0Join the new AI Agents Vibe Coding Course from Google and Kaggle

Google与Kaggle推出AI Agents Vibe Coding课程。

Google AI Blog#ai-agents #vibe-coding #education[Coding Agents]

7.0microsoft/VibeVoice

微软开源Whisper风格音频模型VibeVoice。

Simon Willison#audio #whisper #open-source[Model Release]

7.0Skymizer Taiwan Inc. Unveils Breakthrough Architecture Enabling Ultra-Large LLM Inference on a Single Card

Skymizer 发布新架构，单卡可运行 700B 参数模型

Reddit r/LocalLLaMA#inference #hardware #architecture

7.0huggingface/ml-intern

Hugging Face ml-intern：开源ML工程师，自动读论文、训练模型。

Co-Starred#open-source #automation #mlops[Agent Harness]

6.5Got OpenAI's privacy filter model running on-device via ExecuTorch

通过 ExecuTorch 在设备上运行 OpenAI 隐私过滤模型

Reddit r/LocalLLaMA#privacy #on-device #executorch

6.5Three limitations I keep hitting with retrieval-augmented generation in production and I'm running out of ideas [D]

Reddit讨论：生产RAG系统的三个局限性。

Reddit r/MachineLearning#rag #production #limitations[Context Engineering]

6.3TauricResearch/TradingAgents

TradingAgents：多智能体LLM金融交易框架。

GitHub trending:all (+248★)#multi-agent #finance #trading[Agent Harness]

6.0Physical AI that Moves the World — Qasar Younis & Peter Ludwig, Applied Intuition

Applied Intuition将AI应用于采矿、无人机等物理世界。

Latent Space#physical-ai #robotics #autonomous-vehicles

6.0GBNF grammar tweak for faster Qwen3.6 35B-A3B and Qwen3.6 27B

GBNF 语法优化提升 Qwen3.6 模型推理速度

Reddit r/LocalLLaMA#qwen #inference #optimization

6.0INT8 quantization gives me better accuracy than FP16 ! [D]

Reddit讨论：INT8量化比FP16精度更高。

Reddit r/MachineLearning#quantization #deep-learning

5.6RooCodeInc/Roo-Code

在代码编辑器中提供AI代理开发团队。

GitHub trending:typescript (+58★)#coding-agent #ide #ai[Coding Agents]

5.5Rethinking Publication: A Certification Framework for AI-Enabled Research

提出AI辅助研究的认证框架。

ArXiv cs.AI#ai-research #certification #framework

5.5Speech translation in Google Meet is now rolling out to mobile devices

Google Meet语音翻译功能向移动设备推出。

Simon Willison#speech-translation #google-meet #mobile

5.4ruvnet/ruflo

Claude的多智能体编排平台，支持自主工作流。

GitHub trending:typescript (+178★)#agents #orchestration #claude[Agent Harness]

5.3davila7/claude-code-templates

Claude Code模板的CLI工具，用于配置和监控。

GitHub trending:all (+154★)#claude-code #cli #templates[Coding Agents]

5.2openai/openai-cs-agents-demo

OpenAI Agents SDK实现的客服用例演示。

GitHub trending:python (+56★)#agents #openai #sdk[Agent Harness]

5.0The AI-powered R&D department: how agentic AI is supercharging engineering velocity - The Manufacturer

探讨AI代理如何加速工程研发效率。

themanufacturer.com#agentic-ai #engineering #productivity

5.0“The Abstraction Fallacy: Why AI Can Simulate But Not Instantiate Consciousness.” | The Verge

文章探讨AI模拟意识与实例化意识的区别。

theverge.com#consciousness #philosophy

5.0End-2-end tutorial on fine-tuning, the whole journey

端到端微调教程，以 wildfire 检测为例

Reddit r/LocalLLaMA#fine-tuning #tutorial[Post-Training]

[STATS] 67 items · 33 sources · Score >= 5.0