2026-W19 Overview

本周 AI 代码编辑器生态迎来多项重要更新。Windsurf 发布 Wave 4 重大更新，引入 Devin Review 和 Quick Review 功能。Cursor 新增 PR Review、并行构建和上下文使用量分解功能，并推出 Cursor SDK 公测版。GitHub Copilot 发生重大变化，暂停新用户注册并移除 Opus 模型，同时代理模式正式发布。Gemini CLI 和 Claude Code 持续发布多个版本修复 Bug 和增加新功能。腾讯云 CodeBuddy 宣布价格大幅上调，最高达154%。

2026-05-04 — 2026-05-10

Updated

Editor Updates

Week-over-Week Overview

IDE

Cursor2.5+

●●●●●

releasefeaturefixintegration

→steady

Windsurfv2.2.17

●●●●●

releasefeaturefixperf

→steady

AugmentIntent 0.3.10

●●○○○

release

↓slowing

Trae

○○○○○

—

⚠silent

CLI / Plugin

Claude Code2.1.138

●●●●●

releasefeaturefixperfintegration

→steady

Copilot

●●●●●

releasefeatureintegration

→steady

Gemini CLIv0.42.0-nightly

●●●●●

releasefeaturefixperfintegration

→steady

OpenCode

●●●●●

fix

→steady

CodeBuddy2.95.1-next

●●●○○

release

→steady

Aider

○○○○○

—

→steady

activity:●IDE active●CLI active|WoW:✨ new⚠ silent↑ accel↓ slowing→ steady

IDE

Cursor

IDE2.5+

•新增 PR Review 体验、并行构建计划和快速操作按钮。
•新增上下文使用量分解视图。
•企业版新增模型控制、支出管理和使用分析。
•Bugbot 订阅模式调整，从每席位每月40美元变更。
•推出 Cursor SDK 公测版，支持构建和部署编码代理。

Cursor 本周发布了多项更新，包括 PR Review、并行构建计划、上下文使用量分解等新功能。企业版新增了模型控制和支出管理功能，同时推出了 Cursor SDK 公测版，允许团队构建自定义编码代理。

Releasebot, Cursor Blog →

Windsurf

IDEv2.2.17

•发布 Windsurf Wave 4，引入 Devin Review 和 Quick Review。
•Devin 代理现可用于终端。
•改进代理收件箱，新增列表显示选项。
•修复 Windows 更新问题和 Cascade 对话崩溃。
•改进会话侧边栏排序和筛选。

Windsurf 发布了 Wave 4 重大更新，为所有用户带来了 Devin Review 和 Quick Review 功能，并让 Devin 代理进入终端。同时修复了多个 Bug 并改进了性能。

Windsurf Changelog, Neowin →

Trae

IDE

本周暂无重大更新

Augment

IDEIntent 0.3.10

•工作区 UI 现显示 Auggie 信用使用统计。

Augment 发布了 Intent 0.3.10 版本，主要更新是在工作区 UI 中增加了 Auggie 信用使用统计功能，可悬停查看每个代理的详细使用情况。

Augment Code Changelog →

CLI

Claude Code

CLI / Plugin2.1.138

•新增企业反馈调查支持和更严格的自动模式规则。
•修复 MCP 服务器在 /clear 后消失的问题。
•修复 WebFetch 在大型 HTML 页面挂起的问题。
•修复代理 SDK reload_plugins 重新连接 MCP 服务器的问题。
•多项稳定性和性能修复。

Claude Code 本周发布了多个版本，主要修复了 MCP 服务器连接、WebFetch 挂起等稳定性问题，并新增了企业反馈调查支持。

Releasebot, npm →

Gemini CLI

CLI / Pluginv0.42.0-nightly

•新增 shell 命令安全评估功能。
•支持在压缩期间排队消息。
•修复非交互模式下 JSON 输出问题。
•修复自定义计划目录处理错误。
•新增 Auto Memory 收件箱流程。

Gemini CLI 本周发布了多个 nightly 和 preview 版本，新增了 shell 命令安全评估、消息排队和 Auto Memory 收件箱等功能，并修复了多项 Bug。

GitHub Releases →

OpenCode

CLI / Plugin

•修复 warp 流中仅显示已连接工作区的问题。
•修复包含 diff --git 文本时的 diff 渲染问题。
•修复 PTY websocket 连接问题。
•修复 v2 会话 API 响应中可选字段编码问题。
•修复 HTTP API 工作区适配器丢失实例上下文的问题。

OpenCode 本周修复了多个 Bug，包括 warp 流工作区显示、diff 渲染、PTY 连接和 API 响应编码等问题。

OpenCode Changelog →

Aider

CLI / Plugin

本周暂无重大更新

Copilot

CLI / Plugin

•暂停新用户注册 Copilot Pro、Pro+ 和学生计划。
•从 Pro 层级移除 Opus 模型。
•Copilot 代码审查评论类型现纳入使用指标 API。
•VS Code 中 Copilot 支持按语义搜索和 grep 查询。
•代理模式在 VS Code 和 JetBrains 上正式发布。

GitHub Copilot 本周发生重大变化：暂停了新用户注册并移除 Opus 模型，同时代理模式正式发布。VS Code 版本新增了语义搜索和 grep 查询功能。

TechSifted, GitHub Blog →

CodeBuddy

CLI / Plugin2.95.1-next

•腾讯云宣布 CodeBuddy 价格上调，最高达154%。
•发布多个 next 版本。

CodeBuddy 本周发布了多个 next 版本，同时腾讯云宣布其 AI 编码助手 CodeBuddy 价格将大幅上调，最高达154%，这是今年的第三次涨价。

BigGo Finance, npm →

Company Blogs

CursorMay 7

PR Review, Build Plan in Parallel, and Split PRs

Cursor 发布新版本，引入 PR Review 体验、并行构建计划和快速操作按钮。

#Cursor#PR Review#并行构建

CursorMay 6

Context Usage Breakdown

Cursor 新增上下文使用量分解视图，让用户了解代理的上下文使用情况。

#Cursor#上下文#使用量

CursorMay 4

Model controls, spend management, and usage analytics

Cursor 为企业管理员推出模型控制、支出管理和使用分析更新。

#Cursor#企业#管理

WindsurfMay 6

Fast and Comprehensive Code Review, Now in Windsurf

Windsurf 集成 Devin Review 和 Quick Review，将代码审查功能带入编辑器。

#Windsurf#代码审查#Devin

GoogleMay 4

Reduce friction and latency for long-running jobs with Webhooks in Gemini API

Gemini API 新增 Webhooks 支持，减少长时间运行任务的摩擦和延迟。

#Google#Gemini API#Webhooks

VercelMay 8

Chat SDK adds Messenger adapter support

Vercel Chat SDK 新增 Messenger 适配器支持，可构建支持消息、反应、多媒体下载等功能的代理。

#Vercel#Chat SDK#Messenger

VercelMay 8

Chat SDK adds web adapter support

Vercel Chat SDK 新增 Web 适配器，支持在浏览器中构建聊天 UI。

#Vercel#Chat SDK#Web

VercelMay 8

Chat SDK now supports conversation history

Vercel Chat SDK 新增跨平台对话历史支持，用户消息历史可在不同平台间持久化。

#Vercel#Chat SDK#对话历史

VercelMay 7

Next.js May 2026 security release

Next.js 发布安全更新，修复了13个安全公告，涉及拒绝服务、中间件绕过、SSRF等问题。

#Vercel#Next.js#安全

VercelMay 7

Vercel Flags now supports JSON values

Vercel Flags 新增 JSON 值支持，可将多个相关标志合并为一个功能标志。

#Vercel#Flags#JSON

VercelMay 6

Auto-add Git committers to your team

Vercel Pro 团队可自动将 Git 提交者添加到团队，支持自动审批模式。

#Vercel#团队#Git

VercelMay 6

Secure Marketplace credentials with Production-only access

Vercel 新增生产环境仅访问权限，可限制集成资源的使用范围以保护凭证。

#Vercel#安全#凭证

VercelMay 5

Query observability metrics using the Vercel CLI

Vercel CLI 新增 observability 指标查询功能，可通过 vercel metrics 命令分析应用性能。

#Vercel#CLI#可观测性

VercelMay 4

How General Intelligence used agents to build an agent platform on Vercel

General Intelligence 使用代理在 Vercel 上构建代理平台，8人团队每天每人提交10个PR和70+次提交。

#Vercel#代理#案例

Coding Agents Ecosystem

via AI Daily

High-signal items tagged coding-agents by the AI Daily pipeline this week — repos, tools, and writeups beyond the 10 tracked editors.

Reddit r/MachineLearningMay 8

score8.5

META Superintelligence Lab Presents: ProgramBench: Can SOTA AI Recreate Real Executable Programs(ffmpeg, SQLite, ripgrep) From Scratch Without The Internet?

Meta发布ProgramBench：测试AI能否从头复现ffmpeg等程序

#evals

GitHubMay 7

score8.4

addyosmani/agent-skills

agent-skills：AI编码代理的生产级工程技能库。

GitHubMay 7

score8.3

mksglu/context-mode

Context Mode优化AI编码智能体上下文窗口，减少98%输出。

#context-engineering

Simon WillisonMay 7

score8.0

Live blog: Code w/ Claude 2026

Anthropic Code w/ Claude 2026活动的现场博客。

Reddit r/LocalLLaMAMay 7

score8.0

2.5x faster inference with Qwen 3.6 27B using MTP - Finally a viable option for local agentic coding - 262k context on 48GB - Fixed chat template - Drop-in OpenAI and Anthropic API endpoints

Qwen 3.6 27B使用MTP实现2.5倍推理加速，本地编码可行。

HN (241)May 8

score7.9

AlphaEvolve: Gemini-powered coding agent scaling impact across fields

AlphaEvolve：Gemini驱动的编码代理跨领域扩展影响。

HN (416)May 10

score7.6

Using Claude Code: The unreasonable effectiveness of HTML

使用Claude Code生成HTML效果出奇好

HN (383)May 7

score7.5

Higher usage limits for Claude and a compute deal with SpaceX

Anthropic提高Claude使用限制并与SpaceX达成计算协议。

techcrunch.comMay 9

score7.5

Airbnb says AI now writes 60% of its new code | TechCrunch

Airbnb称AI现已编写其60%的新代码，展示AI编程的广泛应用。

theverge.comMay 8

score7.5

OpenAI launched a Codex extension for Chrome. | The Verge

OpenAI为Chrome推出Codex扩展

Reddit r/MachineLearningMay 7

score7.5

Model automatically developed by the AIBuildAI Agent ranked among top 5.7% out of 3,219 human teams in the Kaggle TGS Salt Identification Challenge [P]

AI自动开发的模型在Kaggle挑战中排名前5.7%。

globenewswire.comMay 7

score7.5

Coder Sets a New Standard for AI Coding with Self-Hosted, AI Model Agnostic Coder Agents - GlobeNewswire

Coder发布自托管、模型无关的AI编码代理新标准。

arstechnica.comMay 7

score7.5

Anthropic raises Claude Code usage limits, credits new deal with SpaceX - Ars Technica

Anthropic提高Claude Code使用限制，归功于与SpaceX的新计算协议。

GitHubMay 9

score7.3

anomalyco/opencode

开源编码代理，支持多种LLM和工具。

GitHubMay 10

score7.1

earendil-works/pi

AI智能体工具包，含编码CLI、统一LLM API等。

#agent-harness

GitHubMay 9

score7.0

decolua/9router

免费AI编码路由工具，连接多种模型

Simon WillisonMay 9

score7.0

Using Claude Code: The Unreasonable Effectiveness of HTML

使用Claude Code体验HTML的惊人效果。

infosecurity-magazine.comMay 8

score7.0

Cline Kanban Flaw Lets Websites Hijack AI Coding Agents - Infosecurity Magazine

Cline Kanban漏洞可让网站劫持AI编码代理。

Simon WillisonMay 8

score7.0

Behind the Scenes Hardening Firefox with Claude Mythos Preview

Mozilla使用Claude Mythos预览版加固Firefox安全。

Reddit r/LocalLLaMAMay 8

score7.0

I embedded an AI agent in my shell. It can now run interactive programs.

在shell中嵌入AI代理，可运行交互式程序

Live Rank

Chatbot Arena

Chatbot Arena Rankings
#	Model	Elo	Δ	Org
1	Claude Opus 4.7 Thinking	1570	—	Anthropic
2	Claude Opus 4.7	1560	—	Anthropic
3	Claude Opus 4.6 Thinking	1549	—	Anthropic
4	Claude Opus 4.6	1544	—	Anthropic
5	GLM 5.1	1531	—	Z.ai

SWE-bench Verified

SWE-bench Verified Leaderboard
Model	Resolved %	Org
live-SWE-agent + Claude 4.5 Opus medium (20251101)	79.2%	UIUC
Sonar Foundation Agent + Claude 4.5 Opus	79.2%	Sonar
TRAE + Doubao-Seed-Code	78.8%	ByteDance
live-SWE-agent + Gemini 3 Pro Preview (2025-11-18)	77.4%	UIUC
Atlassian Rovo Dev (2025-09-02)	76.8%	Atlassian
EPAM AI/Run Developer Agent v20250719 + Claude 4 Sonnet	76.8%	EPAM Systems, Inc.
mini-SWE-agent + Claude 4.5 Opus (high reasoning)	76.8%	Anthropic
ACoder	76.4%	ACoder
mini-SWE-agent + Gemini 3 Flash (high reasoning)	75.8%	Google DeepMind
mini-SWE-agent + MiniMax M2.5 (high reasoning)	75.8%	Minimax

Aider Leaderboard

Aider Code Editing Leaderboard
Model	Pass Rate	Δ
gpt-5 (high)	88%	—
gpt-5 (medium)	86.7%	—
o3-pro (high)	84.9%	—
gemini-2.5-pro-preview-06-05 (32k think)	83.1%	—
o3 (high)	81.3%	—

LiveCodeBench

LiveCodeBench Leaderboard
Model	Pass@1	Easy	Med	Hard
O4-Mini (High)	87.3%	98.4%	92.7%	71.1%
O3 (High)	84.7%	99.1%	89.8%	66.0%
O4-Mini (Medium)	84.5%	98.8%	92.2%	62.9%
DeepSeek-R1-0528	84.4%	99.2%	90.9%	63.6%
Gemini-2.5-Pro-06-05	84.3%	99.1%	92.2%	62.0%
Gemini-2.5-Pro-05-06	82.7%	98.8%	90.6%	59.4%
OpenReasoning-Nemotron-32B	81.0%	98.6%	87.5%	57.5%
EXAONE-4.0-32B	80.9%	98.8%	88.3%	56.3%
Qwen3-235B-A22B	80.4%	99.1%	88.8%	54.0%
XBai-o4-medium	80.1%	98.8%	90.1%	52.0%

← 2026-W18

2026-W20 2026-W19 2026-W18 2026-W17 2026-W16

2026-W20 →