Intelligence.Log

Monday, April 27, 2026

Extracted: 45 items. Sources: 29. Filter: Score >= 5.0

++ Daily.Brief ++

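A busy day in AI. In research, a Swedish team proposes "kinematic intelligence" to help robots learn their physical limits, and a paper pairs strength prediction for geopolymer concrete with a generative LLM. In tooling, the AWS AgentCore update cuts AI agent setup to 3 API calls, and another project supports fine-tuning Gemma multimodal models on Apple Silicon. In commentary, an incident in which an AI agent deleted a production database has prompted a safety discussion, and OpenAI announced that it will no longer use SWE-bench Verified to evaluate frontier coding capabilities.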

> Research & Innovation

6.5 "Kinematic intelligence" helps robots learn their limits

Swedish researchers propose "kinematic intelligence" to help robots learn their physical limits.

arstechnica.com #robotics #kinematics #machine-learning

5.0 Two stage AI framework for strength prediction and generative LLM for geopolymer concrete

A two-stage AI framework for predicting geopolymer concrete strength, combined with a generative LLM.

nature.com #llm #materials-science #prediction

5.0 Evaluating large language model's performance in answering ...

Evaluates a large language model's performance on a specific question-answering task.

nature.com #llm #evaluation #qa [Evals]

> Engineering & Resources

9.1 mattpocock/skills

Matt Pocock releases a set of agent skills for real engineers.

GitHub trending:all (+2519★) #agent-skills #claude #developer-tools [Coding Agents] [Agent Harness]

8.6 An AI agent deleted our production database. The agent's confession is below

An AI agent deleted a production database, prompting discussion of AI safety.

HN (447) #ai-safety #agent #incident [Agent Harness]

8.0 AWS Cuts AI Agent Setup To 3 API Calls In AgentCore Update - Yahoo News Canada

The AWS AgentCore update cuts AI agent setup to 3 API calls and adds support for a CLI and a persistent filesystem.

ca.news.yahoo.com #aws #agent-framework #api [Agent Harness]

8.0 Enterprises are obsessing over model accuracy while ignoring the infrastructure layer where AI systems actually break.

Enterprises over-focus on model accuracy and overlook silent failures in the infrastructure layer of AI systems.

venturebeat.com #ai-infrastructure #reliability #context-decay [Context Engineering]

7.5 SWE-bench Verified no longer measures frontier coding capabilities

OpenAI announces that it will no longer use SWE-bench Verified to evaluate frontier coding capabilities.

HN (246) #llm #benchmark #coding [Evals]

7.5 mattmireles/gemma-tuner-multimodal

Fine-tune Gemma 4/3n multimodal models on Apple Silicon.

Co-Starred #fine-tuning #multimodal #gemma [Model Release]

7.5 Alishahryar1/free-claude-code

A terminal tool and VSCode extension for using Claude Code for free.

GitHub trending:all (+1701★) #claude-code #free #vscode [Coding Agents]

7.0 Musk-OpenAI and Big Tech Earnings on Deck This Week

In focus this week: the Musk-OpenAI lawsuit and Big Tech earnings.

theinformation.com #openai #elon-musk #earnings

7.0 Atlassian and HubSpot Join Shift From AI Flat Fees - The Information

Atlassian and HubSpot join the shift from flat AI fees to usage-based pricing.

theinformation.com #ai-pricing #saas #enterprise

7.0 Confirmed: SWE Bench is now a benchmaxxed benchmark

Community discussion of the SWE-Bench benchmark being over-optimized.

Reddit r/LocalLLaMA #benchmark #swe-bench #discussion [Evals]

7.0 Qwen3.6-27B-INT4 clocking 100 tps with 256k context length on 1x RTX 5090 via vllm 0.19

Qwen3.6-27B-INT4 reaches 100 tps on an RTX 5090.

Reddit r/LocalLLaMA #qwen #inference-speed #vllm

7.0 Opencode-power-pack - Claude Code skills ported to OpenCode

A toolkit that ports Claude Code skills to the open-source OpenCode.

Reddit r/LocalLLaMA #ai-coding #open-source #claude-code [Coding Agents]

7.0 Speculative Decoding Implementations: EAGLE-3, Medusa-1, PARD, Draft Models, N-gram and Suffix Decoding from scratch [P]

An open-source repository of speculative decoding implementations, covering EAGLE-3 and other methods.

Reddit r/MachineLearning #speculative-decoding #open-source #llm-inference

7.0 huggingface/ml-intern

HuggingFace's open-source ML engineer project, which automatically reads papers and trains models.

Co-Starred #open-source #automl #huggingface

6.7 badlogic/pi-mono

pi-mono: an AI agent toolkit that includes a coding agent CLI, among other tools.

GitHub trending:typescript (+525★) #agent-toolkit #coding-agent #open-source [Coding Agents]

6.7 anomalyco/opencode

OpenCode: an open-source coding agent tool.

GitHub trending:typescript (+512★) #coding-agent #open-source [Coding Agents]

6.6 abhigyanpatwari/GitNexus

GitNexus: a browser-based code knowledge graph engine.

GitHub trending:all (+700★) #knowledge-graph #code-intelligence

6.5 Introducing AutoMuon, a one line drop in for AdamW [P]

The AutoMuon optimizer, a one-line drop-in replacement for AdamW.

Reddit r/MachineLearning #optimizer #open-source #training

6.3 ruvnet/ruflo

An agent orchestration platform for Claude, with support for multi-agent swarms.

GitHub trending:typescript (+256★) #agent-orchestration #multi-agent #claude [Agent Harness]

6.2 trycua/cua

Open-source infrastructure for computer-use agents, including sandboxes, an SDK, and benchmarks.

GitHub trending:all (+182★) #computer-use #agent #open-source [Agent Harness]

6.0 Economists Rethink Chinese Forecasts as AI Fires Up Import Surge

Economists reassess their China forecasts as AI-driven chip demand fuels an import surge.

bloomberg.com #ai-economics #china #semiconductors

6.0 mesa PR with 37-130% llama.cpp pp perf gain for vulkan on Linux on Intel Xe2

A Mesa PR brings a performance gain to the llama.cpp Vulkan backend.

Reddit r/LocalLLaMA #llama.cpp #vulkan #performance

6.0 Intel B70: LLama.cpp SYCL vs LLama.cpp OpenVino vs LLM-Scaler

Community benchmarks comparing the performance of LLama.cpp's OpenVino backend on an Intel GPU.

Reddit r/LocalLLaMA #llm-inference #intel-gpu #benchmark

5.9 AI should elevate your thinking, not replace it

AI should elevate human thinking, not replace it.

HN (278) #ai-philosophy #human-ai #productivity

5.8 gastownhall/beads

A memory upgrade tool for coding agents.

GitHub trending:all (+152★) #coding-agent #memory #developer-tools [Coding Agents]

5.7 Show HN: AI memory with biological decay (52% recall)

An open-source AI memory system that simulates biological forgetting, with 52% recall.

HN (55) #rag #memory #open-source [Context Engineering]

5.6 openai/skills

A skills catalog for OpenAI Codex.

GitHub trending:python (+73★) #codex #skills #openai [Coding Agents]

5.5 Car Wash Mystery solved--Tool Call Degrades Intelligence.

Discussion of a case showing that tool calls can degrade model intelligence.

Reddit r/LocalLLaMA #tool-use #reasoning #llm [Tool Use]

5.5 Going from 3B/7B dense to Nemotron 3 Nano (hybrid Mamba-MoE) for multi-task reasoning - what changes in the fine-tuning playbook? [D]

Discussion of fine-tuning strategy when moving from dense models to a hybrid Mamba-MoE.

Reddit r/MachineLearning #fine-tuning #mamba #moe [Post-Training]

5.3 alexzhang13/rlm

A general-purpose, plug-and-play inference library for recursive language models.

GitHub trending:python (+100★) #recursive-language-model #inference #library

5.2 Google banks on AI edge to catch up to cloud rivals Amazon and Microsoft

Google relies on its AI edge to catch up with cloud rivals Amazon and Microsoft.

HN (48) #cloud #ai-strategy

5.2 different-ai/openwork

An open-source alternative to Claude Cowork, aimed at teams.

GitHub trending:typescript (+88★) #claude #open-source #team [Coding Agents]

5.2 google/langextract

A Python library that uses LLMs to extract structured information from unstructured text.

GitHub trending:python (+70★) #llm #information-extraction #python

5.2 openclaw/openclaw

A cross-platform, open-source personal AI assistant.

GitHub trending:all (+627★) #ai-assistant #open-source

5.2 zilliztech/memsearch

A Markdown-first memory system for AI agents.

GitHub trending:python (+48★) #memory #agent #markdown [Context Engineering]

5.1 davila7/claude-code-templates

A CLI tool for configuring and monitoring Claude Code.

GitHub trending:python (+284★) #claude-code #cli #developer-tools [Coding Agents]

5.0 HauhauCS (of "Uncensored Aggressive" fame) published an abliteration package that plagiarizes Heretic without attribution, and violates its license

HauhauCS published an abliteration package that plagiarizes Heretic.

Reddit r/LocalLLaMA #open-source #license #controversy

5.0 Qwen3.6 35B A3B Heretic (KLD 0.0015!) Incredible model. Best 35B I have found!

A user shares their experience with the Qwen3.6 35B A3B Heretic model.

Reddit r/LocalLLaMA #qwen #uncensored #model-review

5.0 Switched from Qwen3.6 35b-a3b to Qwen3.6 27b mid coding and it's noticeably better!

A user compares the coding experience of Qwen3.6 35B and 27B.

Reddit r/LocalLLaMA #qwen #coding #comparison

5.0 What is the best coding agent (CLI) like Claude Code for Local Development

The community asks for the best coding agent CLI for local development.

Reddit r/LocalLLaMA #coding-agent #local-llm #discussion [Coding Agents]

5.0 Why do only big ML labs dominate widely-used models despite many open-source pretrained models smaller labs could do RL on? [D]

Discussion of why only big labs' models dominate the market.

Reddit r/MachineLearning #open-source #industry #llm

5.0 LabelSets - open quality standard for AI training data (LQS v3.1) [D]

Proposes LQS v3.1, an open quality standard for AI training data.

Reddit r/MachineLearning #data-quality #standard #dataset
