Intelligence.Log

Monday, May 18, 2026

Extracted: 40 items. Sources: 22. Filter: Score >= 5.0

++ Daily.Brief ++

今日AI领域动态：Anthropic和OpenAI占据AI初创公司收入89%的份额，市场主导地位进一步巩固重大发布。研究方面，有论文对比了5种abliteration方法研究论文，并综述了LLM架构中KV共享等最新进展研究论文。工具更新上，Hugging Face发布开源ML工程师项目工具更新，另有OpenHuman个人AI超级智能项目工具更新。观点洞察指出，AI是技术而非产品观点洞察，且其应用瓶颈在于模糊需求而非速度观点洞察。

> Headlines & Launches

> Research & Innovation

7.585 GPU-hours comparing 5 abliteration methods on Qwen3.6-27B: benchmarks, safety, weight forensics - Abliterlitics

85 GPU小时对比5种abliteration方法，含基准测试。

Reddit r/LocalLLaMA#abliteration #safety #benchmark[Evals]

7.5Recent Developments in LLM Architectures: KV Sharing, mHC, and Compressed Attention [P]

综述LLM架构最新进展：KV共享、mHC和压缩注意力。

Reddit r/MachineLearning#llm-architecture #attention #kv-cache[Context Engineering]

7.0Benchmarking vLLM vs SGLang vs llama.cpp on a mixed Blackwell/Ada cluster

在混合Blackwell/Ada集群上对比vLLM、SGLang和llama.cpp的长上下文推理性能。

Reddit r/LocalLLaMA#inference-engine #benchmark #long-context[Evals]

7.0#1 on memory benchmark LongMemEval with Gemini Flash, not Pro [R]

使用Gemini Flash在LongMemEval上取得第一，非Pro版本。

Reddit r/MachineLearning#memory #benchmark #gemini[Evals]

6.5Testing llama.cpp MTP support on Qwen3.6 - RTX 5090

在RTX 5090上测试llama.cpp MTP对Qwen3.6的支持。

Reddit r/LocalLLaMA#llama.cpp #mtp #qwen

5.8HKUDS/ViMax

智能体视频生成框架，集导演、编剧、制片于一体。

GitHub trending:python (+174★)#video-generation #agentic #multimodal

> Engineering & Resources

8.3tinyhumansai/openhuman

OpenHuman：个人AI超级智能，私密且强大。

GitHub trending:all (+1690★)#personal-ai #open-source[Agent Harness]

8.1colbymchenry/codegraph

预索引代码知识图谱，减少Claude Code等工具调用。

GitHub trending:typescript (+857★)#code-graph #claude-code #cursor[Context Engineering][Coding Agents]

8.0huggingface/ml-intern

Hugging Face发布开源ML工程师项目，可读论文、训练模型。

Co-Starred#open-source #ml-engineer #automation[Agent Harness]

7.5antirez/ds4

antirez发布DeepSeek 4 Flash本地推理引擎，支持Metal。

Co-Starred#inference-engine #deepseek #metal[Model Release]

7.4I don't think AI will make your processes go faster

认为AI不会让流程更快，瓶颈在于模糊需求。

HN (495)#productivity #process

7.3Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep

开源代码搜索工具Semble，专为AI代理设计，比grep少用98%的token。

HN (165)#ai-coding #code-search #open-source[Coding Agents]

7.1anthropics/skills

Anthropic官方发布的Agent Skills公共仓库。

GitHub trending:python (+514★)#agent-skills #anthropic #official[Agent Harness]

7.1K-Dense-AI/scientific-agent-skills

面向科研、工程、金融等领域的AI代理技能集。

GitHub trending:all (+762★)#agent-skills #research #science[Agent Harness]

7.0Anil-matcha/Open-Generative-AI

开源AI视频平台替代品，支持200+模型。

GitHub trending:all (+703★)#video-generation #open-source[Model Release]

7.0llama: avoid copying logits during prompt decode in MTP by am17an · Pull Request #23198 · ggml-org/llama.cpp

llama.cpp PR优化MTP提示解码速度。

Reddit r/LocalLLaMA#llama.cpp #optimization #prompt-processing

6.7HKUDS/CLI-Anything

CLI-Anything：让所有软件成为Agent原生。

GitHub trending:all (+238★)#cli #agent-native[Agent Harness][Tool Use]

6.5AI is a technology not a product

AI是技术而非产品，讨论其本质。

HN (326)#ai-philosophy #product

6.5Subjecting AI to human doctor standards? - MobiHealthNews

研究强调AI医疗应以患者结果而非基准衡量。

mobihealthnews.com#healthcare #ai-evaluation[Evals]

6.5Moving from Composer 2/Kimi 2.6 to Qwen3.6:35b-a3b

用户分享使用Qwen3.6:35b-a3b模型进行日常软件开发的经验。

Reddit r/LocalLLaMA#llm #coding #local-llm[Coding Agents]

6.4joeseesun/qiaomu-anything-to-notebooklm

多源内容处理器，将微信文章等转为播客/PPT。

GitHub trending:python (+558★)#content-processing #notebooklm #claude

6.0University of Arizona students boo Eric Schmidt’s AI cheerleading during commencement

亚利桑那大学学生嘘Eric Schmidt的AI鼓吹。

theverge.com#public-opinion #ethics

6.0The power of structured workflows and small local models

结构化工作流与小型本地模型结合的有效性实验。

Reddit r/LocalLLaMA#agent #workflow #local-llm[Agent Harness]

6.0ROCm 7.13 nightly adds strix halo optimizations

ROCm 7.13 nightly为Strix Halo添加优化。

Reddit r/LocalLLaMA#amd #rocm #strix-halo

6.0Program misleading high school students into paying to perform academic misconduct in ML Research [D]

揭露一个误导高中生付费进行学术不端行为的项目。

Reddit r/MachineLearning#ethics #academic-misconduct

6.0rohitg00/skillkit

SkillKit让AI编程智能体跨平台共享技能。

GitHub trending:typescript (+32★)#ai-coding #agent #skills[Coding Agents]

6.0Apple Silicon costs more than OpenRouter

比较Apple Silicon与OpenRouter运行离线LLM的能耗成本。

HN (297)#llm #energy #cost

5.5Apple's Siri revamp could include auto-deleting chats - TechCrunch

苹果Siri改版可能包含自动删除聊天功能。

techcrunch.com#siri #privacy

5.5If you're giving a commencement speech in 2026, maybe don't ...

2026年毕业典礼演讲建议避免提及AI。

techcrunch.com#public-speaking #ai-sentiment

5.5Gemma-4-Gembrain-31B-it-uncensored-heretic Is Out Now, a Merge of Multiple Gemma 4 31B it Finetunes Designed to Boost Logical and Lateral Thinking for Improved Adherence, Increased Swipe Variety and Enhanced Creative Prose, With KLD of 0.0186 and 13/100 Refusals!

Gemma-4-Gembrain-31B合并模型发布，增强逻辑与创意。

Reddit r/LocalLLaMA#gemma #merge #uncensored[Model Release]

5.5Slop is making me feel disconnected from AI Research [D]

用户抱怨AI研究社区充斥低质量内容，感到疏离。

Reddit r/MachineLearning#community #discussion

5.5ML lead vs PM on eval-methodology layer independence. who's actually right here? [D]

ML负责人与PM就评估方法论的层独立性展开争论。

Reddit r/MachineLearning#eval #methodology #discussion[Evals]

5.4Andyyyy64/whichllm

根据硬件推荐最佳本地LLM，基于真实基准。

GitHub trending:python (+209★)#llm #benchmark #local[Evals]

5.4KeygraphHQ/shannon

自主AI渗透测试工具，分析源码并执行攻击。

GitHub trending:all (+200★)#ai-security #pentesting #autonomous

5.0TechCrunch Mobility: The AI skills arms race is coming for automotive

AI技能军备竞赛正在进入汽车行业。

techcrunch.com#automotive #ai-skills

5.0M5 vs DGX Spark vs Strix Halo vs RTX 6000

对比M5、DGX Spark等硬件在本地LLM上的性能。

Reddit r/LocalLLaMA#hardware #benchmark #local-llm

5.0Dual GPU llama.cpp speedup

双GPU llama.cpp加速技巧与split-mode tensor问题。

Reddit r/LocalLLaMA#llama.cpp #gpu #speedup

5.0MTP experiences on 7900xtx?

用户询问在7900xtx上使用MTP推测解码的体验。

Reddit r/LocalLLaMA#speculative-decoding #hardware

5.0How are you handling training data when public datasets don't match your use case? [D]

讨论当公共数据集不匹配用例时如何处理训练数据。

Reddit r/MachineLearning#training-data #discussion

[STATS] 40 items · 22 sources · Score >= 5.0