Intelligence.Log

Sunday, May 3, 2026

Extracted: 43 items. Sources: 22. Filter: Score >= 5.0

++ Daily.Brief ++

今日AI领域多项重磅动态：中国法院裁定公司不能仅以AI替代为由解雇员工[#item-bloomberg-com-news-articles-2026-05-02-chinese-court-rules-f]，同时奥斯卡宣布AI生成的演员和剧本不再具备参评资格[#item-techcrunch-com-2026-05-02-ai-generated-actors-and-scripts-ar]。研究方面，端到端自主科学发现在真实光学平台上实现[#item-arxiv-org-abs-2604-27092]，多智能体框架TradingAgents开源[#item-github-com-TauricResearch-TradingAgents]。工具更新中，VS Code自动插入“Co-Authored-by Copilot”引发争议[#item-github-com-microsoft-vscode-pull-310226]，而观点洞察指出AI智能体失控删除公司数据库[#item-nypost-com-2026-05-02-tech-ai-agent-goes-rogue-deletes-compa]，并对比了Qwen 3.6与Gemma 4视觉模型在基准测试与现实中的差距[#item-reddit-com-r-LocalLLaMA-comments-1t1te8y-qwen-36-wins-the-be]。

> Headlines & Launches

8.0Chinese Court Rules Firms Can't Lay Off Workers on AI Grounds

中国法院裁定公司不能仅以AI替代为由解雇员工。

bloomberg.com#ai-regulation #labor-law #china

7.5AI-generated actors and scripts are now ineligible for Oscars

奥斯卡宣布AI生成的演员和剧本不再具备参评资格。

techcrunch.com#ai-regulation #entertainment #oscars

> Research & Innovation

7.5End-to-end autonomous scientific discovery on a real optical platform

在真实光学平台上实现端到端自主科学发现。

ArXiv cs.AI#autonomous-discovery #optical-platform

7.5Think it, Run it: Autonomous ML pipeline generation via self-healing multi-agent AI

通过自愈多智能体AI自动化端到端机器学习流水线。

ArXiv cs.AI#multi-agent #ml-pipeline #automation[Agent Harness]

7.5Step-level Optimization for Efficient Computer-use Agents

提出步骤级优化以实现高效的计算机使用智能体。

ArXiv cs.AI#computer-use #agent-optimization #automation[Tool Use]

7.1Refusal in Language Models Is Mediated by a Single Direction

研究发现语言模型拒绝行为由单一方向介导。

HN (91)#llm #alignment #mechanistic-interpretability[Post-Training]

7.0When Your LLM Reaches End-of-Life: A Framework for Confident Model Migration in Production Systems

提出生产系统中LLM模型迁移的置信框架。

ArXiv cs.AI#llm #model-migration #production

7.0I built a transformer in C++17 from scratch — no PyTorch, no BLAS, no dependencies. Trains on CPU. 0.83M params, full analytical backprop, 76 min to val loss 1.64.

从零用C++17实现GPT风格Transformer，无依赖，可CPU训练。

Reddit r/LocalLLaMA#transformer #cpp #from-scratch

6.5TRUST: A Framework for Decentralized AI Service v.0.1

提出去中心化AI服务框架TRUST，用于大推理模型和多智能体系统。

ArXiv cs.AI#decentralized-ai #multi-agent #trust[Agent Harness]

6.5Implemented TurboQuant and results don’t fully match paper

复现TurboQuant量化方法，发现结果与论文不完全一致。

Reddit r/LocalLLaMA#quantization #reproducibility

6.0Compositional Meta-Learning for Mitigating Task Heterogeneity in Physics-Informed Neural Networks

提出组合元学习缓解物理信息神经网络中的任务异质性。

ArXiv cs.AI#meta-learning #pinns #pde

6.0Unpacking Vibe Coding: Help-Seeking Processes in Student-AI Interactions While Programming

研究学生在编程中与AI交互的求助过程（Vibe Coding）。

ArXiv cs.AI#ai-education #vibe-coding #human-ai-interaction[Coding Agents]

6.0Explainable artificial intelligence models using SHAP enhanced ...

使用SHAP增强的CatBoost、Bi-GRU和Tab Transformer的可解释AI模型。

nature.com#explainable-ai #shap #catboost

6.0Toy experiment: frozen Pythia-70M can use a forward-derived fast memory for contextual one-shot symbolic recall [D]

实验表明冻结的Pythia-70M可利用前向快速记忆进行上下文回忆。

Reddit r/MachineLearning#memory #context #toy-experiment[Context Engineering]

5.5Binary Spiking Neural Networks as Causal Models

对二元脉冲神经网络进行因果分析以解释其行为。

ArXiv cs.AI#spiking-neural-networks #causal-analysis

5.5Economic evaluation of artificial intelligence for cancer detection in ...

英国乳腺癌筛查中AI用于癌症检测的经济评估。

nature.com#healthcare #cancer-detection #economic-evaluation

> Engineering & Resources

8.3TauricResearch/TradingAgents

多智能体LLM金融交易框架开源。

GitHub trending:all (+2225★)#multi-agent #finance #llm[Agent Harness]

8.3ruvnet/ruflo

Claude多智能体编排平台ruflo发布。

GitHub trending:all (+1299★)#agent-orchestration #claude #multi-agent[Agent Harness]

7.9VS Code inserting 'Co-Authored-by Copilot' into commits regardless of usage

VS Code在提交中自动插入'Co-Authored-by Copilot'引发争议。

HN (755)#vscode #copilot #git[Coding Agents]

7.5We are finally there: Qwen3.6-27B + agentic search; 95.7% SimpleQA on a single 3090, fully local

Qwen3.6-27B结合代理搜索，在单张3090上实现95.7% SimpleQA。

Reddit r/LocalLLaMA#qwen #agentic-search #local-llm[Model Release][Evals]

7.5Qwen 3.6 wins the benchmarks, but Gemma 4 wins reality. 7 things I learned testing 27B/31B Vision models locally (vLLM / FP8) side by side. Benchmaxing seems real.

对比Qwen 3.6与Gemma 4视觉模型，发现基准测试与现实差距。

Reddit r/LocalLLaMA#vision-language-model #benchmark #comparison[Evals]

7.5mattmireles/gemma-tuner-multimodal

Gemma Tuner Multimodal：在Apple Silicon上微调Gemma多模态模型。

Co-Starred#fine-tuning #gemma #multimodal[Post-Training]

7.11jehuang/jcode

编码Agent框架jcode发布。

GitHub trending:all (+482★)#coding-agent #framework[Coding Agents]

7.0‘Never f–king guess’: AI agent confesses why it went haywire and deleted company database - New York Post

AI智能体失控删除公司数据库，引发安全讨论。

nypost.com#ai-safety #agent-failure #incident[Agent Harness]

7.0[RELEASE] - Finally, my first TTS model is out! 🎙️ Flare-TTS 28M

发布首个TTS模型Flare-TTS 28M，轻量级文本转语音模型。

Reddit r/LocalLLaMA#tts #open-source #small-model

7.0Kv cache quantization: ignorance, or malice?

讨论KV cache量化中的问题，质疑实现是疏忽还是故意。

Reddit r/LocalLLaMA#kv-cache #quantization #context-engineering[Context Engineering]

7.0I implemented meta paper [P]

实现Meta论文Scaling Test-Time Compute for Agentic Coding。

Reddit r/MachineLearning#agentic-coding #test-time-compute #implementation[Coding Agents]

7.0huggingface/ml-intern

Hugging Face开源ML Intern：自动读论文、训练模型并部署。

Co-Starred#ml-engineer #automation #open-source[Agent Harness]

6.5Qwen3.6-27B at 72 tok/s on RTX 3090 on Windows using native vLLM (no WSL, no Docker), portable launcher and installer

Qwen3.6-27B在RTX 3090上通过原生vLLM达到72 tok/s。

Reddit r/LocalLLaMA#qwen #vllm #windows[Model Release]

6.5Open Design: Use Your Coding Agent as a Design Engine

使用编码Agent作为设计引擎的开源项目Open Design。

HN (176)#coding-agent #design #open-source[Coding Agents]

6.5browserbase/skills

Claude Agent SDK新增网页浏览工具。

GitHub trending:all (+346★)#agent-sdk #web-browsing #claude[Tool Use]

6.4tirth8205/code-review-graph

Claude Code本地知识图谱，减少token消耗。

GitHub trending:python (+274★)#knowledge-graph #claude-code #code-review[Context Engineering]

6.3Q00/ouroboros

Agent OS：从提示到规范，简化Agent开发。

GitHub trending:python (+231★)#agent-os #specification #framework[Agent Harness]

6.1The agent harness belongs outside the sandbox

讨论Agent harness应放在沙箱之外。

HN (64)#agent #security #sandbox[Agent Harness]

6.0A Dark-Money Campaign Is Paying Influencers to Frame Chinese AI as a Threat

报道称有暗钱活动付费影响者将中国AI描绘成威胁。

Reddit r/LocalLLaMA#ai-politics #propaganda #china

6.0Warpdrv - my open-source Llama.cpp launcher for daily-driving Qwen 35b + 27b on Strix Halo + RTX Pro.

开源Llama.cpp启动器Warpdrv，用于本地运行大模型。

Reddit r/LocalLLaMA#llama.cpp #local-llm #open-source

5.5I BUILT MY FIRST MODEL FROM SCRATCH

从零构建40M参数语言模型SHARD，分享训练过程。

Reddit r/LocalLLaMA#llm #training #small-model

5.5Looking for feedback on OpenVidya: an open-source AI classroom layer for NCERT/CBSE [R]

开源AI课堂层OpenVidya，基于多智能体适应NCERT/CBSE课程。

Reddit r/MachineLearning#education #multi-agent #open-source

5.2Show HN: State of the Art of Coding Models, According to Hacker News Commenters

根据HN评论总结的编码模型最新进展。

HN (44)#coding-models #community[Coding Agents]

5.2google-research/timesfm

Google时间序列基础模型TimesFM。

GitHub trending:python (+87★)#time-series #foundation-model #forecasting[Model Release]

5.1HKUDS/AI-Trader

全自动Agent原生交易系统AI-Trader。

GitHub trending:python (+27★)#trading #agent #finance[Agent Harness]

5.0[AINews] AI Engineer World's Fair — Autoresearch, Memory, World Models, Tokenmaxxing, Agentic Commerce, and Vertical AI Call for Speakers

AI工程师世界博览会征稿，涵盖自动研究、记忆、世界模型等。

Latent Space#conference #cfp #ai-engineering

5.0Real World Physics-Informed AI Applications [D]

讨论物理信息AI在现实世界中的应用案例。

Reddit r/MachineLearning#physics-informed #applications

[STATS] 43 items · 22 sources · Score >= 5.0