Claude AI Harness
feat 🔍 PPT 现在「生成后会自己看一眼」:渲染 + 视觉质检闭环 🔍 PPT now "looks at itself" after building: a render + visual-QA loop
做 PPT 不再是「盲生成」。现在每份幻灯片在交付前都会被真正渲染成图、由 AI 逐页看一遍:检查有没有隐形/低对比文字、文字溢出被裁、装饰压字、元素重叠这类「一眼可见」的毛病。有硬伤就自动打回、列出每页问题并修正后重做,确认没问题才给你下载——所以基本不会再收到文字被裁掉或看不见的幻灯片了。主观的版面/留白只作建议、不会卡住一份能用的稿。 Building a deck is no longer "blind". Every presentation is now actually rendered to images and reviewed page-by-page by an AI before delivery — checking for invisible/low-contrast text, clipped overflow, decorations covering text, and overlapping elements. Hard defects are auto-rejected, listed per slide, fixed, and rebuilt; you only get the download once it's clean — so you should no longer receive slides with clipped or invisible text. Subjective layout/whitespace is advice only and won't block a usable deck.
feat 📊 PPT 生成大升级:矢量图标 + 设计系统 + 自动防溢出 📊 Major PPT upgrade: vector icons + design system + auto overflow guard
让 AI 做 PPT 现在能出专业观感的幻灯片,对齐官方 pptx skill 水准:① 矢量图标——内置 react-icons,概念自动配上 crisp 的圆形图标徽章(不再靠糊掉的 AI 大图);② 设计系统——统一调色板、衬线+无衬线字体配对、卡片阴影、去「AI 味」,并内置封面/2×3 网格/架构纵向流/双栏等版式范式;③ AI 生图——封面与氛围插图可一键生成嵌入;④ 自动防溢出卡口——生成前做确定性几何自检,元素越界会被拦下重排,不再出现被裁切的页面。只需说「做一份 XX 的 PPT」,下载即用、可二次编辑。 AI-built decks now reach a professional look, on par with the official pptx skill: ① Vector icons — built-in react-icons gives every concept a crisp circular icon badge (no more blurry AI images); ② Design system — unified palette, serif+sans pairing, card shadows, de-"AI-flavored" styling, plus built-in layout patterns (cover / 2×3 grid / vertical architecture flow / two-column); ③ AI image generation — one-shot covers and atmospheric illustrations embedded into slides; ④ Auto overflow guard — a deterministic geometry check before export rejects out-of-bounds elements and reflows them, so slides are never clipped. Just say "make a deck about X" — download-ready and fully editable.
perf ⚡ 流畅模式跑通完整工具链:评测 21/30,与 Claude 持平 ⚡ Smooth Mode now runs the full tool chain: 21/30 on eval, on par with Claude
流畅模式(DeepSeek)现在跑的是同一套完整 harness——联网搜索、抓网页、多轮推理全部保留,不再只是纯聊天。在 30 题的 agent 评测数据集上拿到 21/30(70%),与 Claude 路径(20/30)基本持平,远高于原始基线(37%)。换句话说:日常用流畅模式又快又稳,能力也不打折。 Smooth Mode (DeepSeek) now drives the same full harness — web search, page fetching and multi-round reasoning all kept, no longer plain chat only. On our 30-task agent eval it scores 21/30 (70%), on par with the Claude path (20/30) and far above the original baseline (37%). In short: Smooth Mode is fast and stable for everyday use, with no drop in capability.
feat ⚡ 流畅模式 —— DeepSeek 官方 API,快速稳定 ⚡ Smooth Mode — DeepSeek official API, fast and stable
顶栏点模型名字即可在「Claude Opus 4.7」和「⚡ 流畅模式」之间切换。流畅模式走 DeepSeek 官方 API,国内访问又快又稳,秒回、几乎不掉线,适合日常使用。它和 Claude 共用同一套完整 harness,联网搜索 / 抓网页 / 多轮推理等能力照常可用。选择会记住。 Tap the model name in the top bar to switch between "Claude Opus 4.7" and "⚡ Smooth Mode". Smooth Mode runs on DeepSeek's official API — fast and reliable in China, near-instant replies, rarely drops, great for everyday use. It shares the same full harness as Claude, so web search / page fetching / multi-round reasoning all keep working. Your choice is remembered.
feat 等待回复时的动态状态提示 Live status while waiting for a reply
发出消息后、第一个字到达前不再是一片空白。现在会显示「✶ 正在思考…」的流光动画(短语会轮换),模型在联网 / 读网页 / 写文档时实时切换成「正在搜索…」「正在阅读…」「正在撰写…」,正文开始流式输出后自动消失。参考 Claude Code / Claude.ai 的等待反馈。 After you send a message, the gap before the first token is no longer dead air. A shimmering "✶ Thinking…" indicator now shows (the phrase cycles), switching live to "Searching…" / "Reading…" / "Writing…" while the agent runs a tool, and vanishing once prose starts streaming. Inspired by Claude Code / Claude.ai.
fix 移动端 & iOS PWA 体验一揽子修复 Mobile & iOS PWA experience — a batch of fixes
围绕手机和"添加到主屏幕"的体验补齐:
  • 右侧"空气墙"空白:侧栏在手机上是抽屉,却仍占着一条 300px 网格轨道把内容挤到左边、右侧大片留白 —— 现已恢复单列满宽(720–1020px 宽度最明显)。
  • iOS 主屏 App 间歇性 "Load failed":聊天请求遇到 WebKit 这个网络坑时自动重连最多 2 次,对用户透明。
  • 主屏图标满铺铜色:修了 Android 自适应图标被裁后缩水发灰、iOS 圆角留米色边框的问题。
  • iOS 安装图文引导页/ios-install.html):内嵌二维码 + 添加到主屏幕步骤,方便分享给 iPhone 用户。
Polishing the phone and "Add to Home Screen" experience:
  • Right-side blank "wall": the sidebar is a drawer on phones but still reserved a 300px grid track, squishing content left with a big blank on the right — now single-column full width (most visible at 720–1020px).
  • Intermittent "Load failed" in the iOS home-screen app: chat requests now auto-retry up to twice on this WebKit network quirk, transparently.
  • Full-bleed home-screen icons: fixed Android adaptive-icon shrinkage/greying after masking and the iOS cream border after rounding.
  • iOS install guide page (/ios-install.html): embedded QR code + add-to-home-screen steps to share with iPhone users.
perf 搜索 / 抓取管线重写 —— 又快又准 Search & fetch pipeline rewrite — faster and more accurate
把检索管线对照 Claude.ai 逐层重写:Brave 内联片段优先(够富就不抓页)、Exa highlights 做查询相关抽取(替掉整页字符截断)、9 引擎兜底链保覆盖、system/tools 前缀缓存 + 历史只追加撑住多轮(最多 12 轮)。在 30 道 GAIA+FRAMES 基准上,自建 harness 从 37% 提到 67%,对标 Claude.ai 的 80%。 The retrieval pipeline was rewritten layer by layer against Claude.ai: inline Brave snippets first (skip the fetch when rich enough), Exa highlights for query-relevant extraction (replacing whole-page truncation), a 9-engine fallback chain for coverage, and prefix caching + append-only history to sustain many rounds (up to 12). On a 30-task GAIA+FRAMES benchmark the harness went from 37% to 67%, versus Claude.ai's 80%.
blog 技术博客:把搜索调到 Claude.ai 的水准 Tech blog: Tuning Search to Claude.ai Level
双语技术长文,拆解 Claude.ai 搜索"又快又准"的 5 个工程优化,再用自建 harness 的受控实验逐条验证(延迟、token、逐题 pass/fail,配框架图)。阅读全文 → A bilingual deep-dive: the 5 engineering optimizations behind Claude.ai's fast, accurate search, then validated one by one with controlled experiments on the hand-rolled harness (latency, tokens, per-task pass/fail, with diagrams). Read more →
feat 设置统一入口 + 左栏布局优化 Unified settings entry + sidebar layout fix
左栏底部新增「设置」入口,把"长期记忆 / 我的数据 / 关于"合并成带 tab 的设置模态 —— 顶栏不再被账户级按钮挤占。同时修了左栏的老问题:「最近」对话多到一定数量就把「收藏」和「文档」挤到看不见 —— 现在三段各自有滚动边界,收藏和文档永远可见。 A new "Settings" entry now lives at the bottom of the left sidebar, consolidating "Memory / My Data / About" into a tabbed modal — the top bar is no longer crowded with account-level buttons. We also fixed a long-standing sidebar layout bug: when "Recent" filled up, "Starred" and "Documents" got pushed off-screen. Each section now has its own scroll boundary and stays visible.
feat 隐私三件套 — 记忆 UI + 数据导出 + 一键彻底删除 Privacy triad — Memory UI + data export + one-click full deletion
围绕"数据主体权利"补齐三件事:
  • 长期记忆 UI:模型主动记下的事实现在能看 / 改 / 单条删除 / 清空全部
  • 导出我的数据:一个 JSON 包含全部对话 / 消息 / 文档 / 记忆 / 遥测 / 反馈
  • 彻底删除账号:两步确认(输邮箱 + 5s 倒计时),DB 里 threads / messages / documents / telemetry / user_memory / bug-reports 与 bug-images 全部级联清理
  • privacy.html 透明化:明确列出收集了什么字段、admin 可见范围、对应每条权利的操作入口
对应 GDPR/PIPL 的数据可携权 + 被遗忘权基本款。
Closing the loop on data-subject rights:
  • Memory UI: see, edit, delete individual entries, or wipe all of what the model has remembered about you
  • Export my data: one JSON containing all your threads / messages / documents / memory / telemetry / feedback
  • Full account deletion: two-step confirmation (type your email + 5s countdown), then cascading cleanup of every table plus bug-report files referencing your email
  • Transparent privacy.html: explicit field-level list of what's collected, admin visibility scope, and direct links to each rights-fulfillment entry
Covers the baseline GDPR/PIPL portability + erasure rights.
feat 跨对话长期记忆 Cross-conversation long-term memory
新增 manage_memory 工具 + user_memory 表。告诉模型一次你的身份 / 项目 / 长期偏好("我是软件工程师"、"请用中文"),新开任意对话都能延续 —— 不再每次重新自我介绍。模型自主决定记什么(read / append / replace 三种 action),不记录一次性问题或敏感信息。 New manage_memory tool + user_memory table. Tell the model once about your identity / project / persistent preferences ("I'm a software engineer", "reply in Chinese"), and every new conversation picks up where the last left off. The model decides autonomously what's worth remembering (read / append / replace actions); one-off questions and sensitive info are excluded by design.
perf 启用 Prompt Caching,input 等效成本砍 ~50%、TTFT 显著下降 Prompt caching enabled — ~50% effective input cost reduction, faster TTFT
system prompt 和工具定义走 Anthropic 的 ephemeral cache,多轮对话里这部分基本只付一次"创建"成本,后续按 10% 成本读取。telemetry 新增 cache_creation_tokens / cache_read_tokens 两列可在管理后台追踪命中率。实测 8 轮新加坡新闻对话中 cache_read 累计 100K+ tokens,input 端等效成本约下降一半。 System prompt and tool definitions now ride Anthropic's ephemeral cache — paid once per ~5 min window, replayed at 10% of normal input cost. Telemetry exposes cache_creation_tokens / cache_read_tokens in the admin panel for hit-rate tracking. Measured on an 8-turn session: ~100K cached tokens replayed, effective input cost roughly halved.
perf 主动上下文压缩 + Haiku 摘要长 tool_result Proactive context compression + Haiku-summarized tool results
长对话场景的压缩链路:超过预算阈值后用 Haiku 把过长的 tool_result(搜索全文等)摘成 1-2 句,按 content hash 缓存复用;早期工具轮整体摘要化;最后还有硬上限兜底。主对话仍走配置的好模型(默认 claude-opus-4-7),Haiku 只跑后台辅助任务 —— 不牺牲主回答质量。配套 telemetry 新增 compressed_from/to_tokenshaiku_calls 字段。 Long-conversation compression pipeline: above a token budget, oversized tool_results (search dumps, etc.) get Haiku-summarized to 1-2 sentences and cached by content hash; early tool rounds collapse to summaries; a hard cap is the last-line backstop. The main chat path still uses the configured strong model (default claude-opus-4-7), with Haiku only running offstage helper tasks — no quality trade-off on user-facing answers. Telemetry gains compressed_from/to_tokens and haiku_calls fields.
feat 反馈带上对话 ID,bug 复现一步到位 Bug reports auto-attach conversation ID
反馈表单现在自动带上当前对话 ID 并落进通知邮件,省去用户手抄。管理员能直接点进对应对话看上下文。 The bug-report modal now auto-attaches the active thread ID and includes it in notification emails — admins can jump straight into the conversation context.
fix Artifact 生成全链路体验改进 Artifact generation pipeline polish
一波连续修复:
  • 简单问题不再硬塞进 artifact,AI 先反问"要做成文档吗"再决定
  • docx 生成前先弹选项让用户确认风格 / 大纲,不再生成完才问
  • 预览面板支持 docx 直显,不再重复生成一份 html 副本
  • 进度条覆盖所有 artifact / Office 文件生成路径
  • 选项块从输出中段漂浮的 bug 修复,统一渲染在 message 末尾
  • 服务端拦截空 / 过短的 create_artifact,避免重试导致的空文档与重复
A batch of artifact-related fixes:
  • Simple questions no longer get force-shunted into artifacts — model asks first
  • For docx whitepapers, style/outline options now appear BEFORE generation, not after
  • Preview pane renders docx directly — no more parallel HTML regeneration
  • Progress indicators now cover all artifact / Office generation paths
  • Option blocks no longer float mid-message — consistently rendered at the end
  • Server rejects empty / under-length create_artifact calls — no more empty-doc spam from retries
feat Agent 可读回历史文档增量修改 Agents can now read & edit existing artifacts
新增 list_artifacts / get_artifact 工具。用户说"把第二章再扩两段"时,AI 先列出当前对话已有文档、读出原文,基于现有内容增量修改后 upsert —— 不再每次从零重新生成。 New list_artifacts / get_artifact tools. When users say "expand chapter two", the model now lists existing thread documents, reads the current body, and patches in place instead of regenerating from scratch.
feat LaTeX 数学公式渲染 LaTeX math rendering
回答中的行内 $...$ 和块级 $$...$$ 公式现在直接渲染成排版好的数学表达式(基于 KaTeX)。反向传播推导、损失函数、积分号、矩阵这种密集公式不再原文露出。 Inline $...$ and block-level $$...$$ formulas in responses now render as properly typeset math (powered by KaTeX). Dense formulas — backprop derivations, loss functions, integrals, matrices — no longer leak raw source.
feat UX 体验细节大复刻 claude.ai UX polish pass — closing the gap to claude.ai
Markdown 渲染补全(链接 / 有序列表 / 斜体 / 代码块语言标签 + 复制按钮)、代码块语法高亮(12 种语言,无外部依赖)、主题化弹层替换浏览器原生 prompt/confirm、滚动历史时浮现「跳到底部」按钮、空输入框时禁用发送按钮、文件拖拽上传、键盘快捷键(Cmd+N 新对话 / Esc 关面板 / Cmd+K 聚焦输入框)、侧边栏对话搜索与关键词高亮。 Full markdown rendering (links / ordered lists / italics / code-block language label + copy button), in-house syntax highlighting (12 languages, zero deps), themed inline modals replacing native prompt/confirm, scroll-to-bottom floating button, disabled send on empty input, file drag-and-drop, keyboard shortcuts (Cmd+N / Esc / Cmd+K), sidebar conversation search with keyword highlight.
fix 修复代码块路由 + 点赞按钮无反馈 Code-block routing + thumbs-up no-feedback fixes
Python / JS / CSS 等代码块不再被错误推送到右侧文档面板,现在直接在聊天中渲染(仅 HTML / SVG 走文档预览)。点赞 / 踩按钮加上可见的橙色背景高亮(彩色 emoji 不响应 CSS color,原来的实现看上去毫无变化)。 Python / JS / CSS code blocks no longer hijacked into the artifact panel — they now render inline (only HTML / SVG continue to the preview pane). Thumbs-up / down buttons gain a visible orange background tint (color emojis ignore CSS color, so the previous attempt was a visual no-op).
feat Office 文件生成 Office File Generation
在对话中直接生成 Word (.docx)、Excel (.xlsx) 和 PowerPoint (.pptx) 文件,在 Artifact 面板中一键下载。 Generate Word, Excel, and PowerPoint files directly from conversation — downloadable from the artifact panel.
blog 技术博客:用 30 道题量化我和 Claude.ai 的差距 Tech blog: Benchmarking My Agent vs Claude.ai with 30 Tasks
固定模型、只变 harness 的对照评测:裸模型 0% → 自建 harness 37% → Claude.ai 80%。差距主要在检索管线而非模型——这也是上面"管线重写"的起点。阅读全文 → A controlled benchmark holding the model fixed and varying only the harness: naked model 0% → hand-rolled harness 37% → Claude.ai 80%. The gap is mostly in the retrieval pipeline, not the model — the starting point for the rewrite above. Read more →
feat 用量追踪与管理后台 Usage Tracking & Admin Panel
新增按用户统计的 Token 用量面板,管理员可查看所有账户的月度 API 消耗。 Per-user token usage tracking with a dedicated admin panel tab and monthly breakdowns.
fix 图表渲染修复 Matplotlib & Chart Rendering
  • 自动注册中文字体,matplotlib 图表正确显示中文Auto-register Chinese fonts for matplotlib code execution
  • 修复重复生成文档和图表显示截断的问题Fix duplicate documents and chart display cutoff
feat 内联图表与 Bug 反馈增强 Inline Charts & Bug Report
Matplotlib 图表直接在聊天流中渲染;Bug 反馈支持图片上传、粘贴和拖拽。 Matplotlib charts render inline in chat. Bug reports support image upload, paste, and drag-drop.
perf 缓解等待焦虑 Reduced Waiting Anxiety
新增进度条、计时器和轮播提示,减少 AI 响应时的等待焦虑。API Key 池支持轮询和自动冷却。 Progress bar, elapsed timer, and rotating tips during AI responses. API key pool with round-robin rotation.
feat 分享功能与移动端文档访问 Sharing & Mobile Document Access
  • 生成临时分享链接(24 小时有效)Temporary share links with 24h expiry
  • 移动端浮动按钮快速访问文档Floating action button for mobile document access
  • Artifact 持久化到 SQLite,刷新页面不丢失Artifacts persist to SQLite — survive page refresh
  • 长文档生成断线可恢复Long document generation recoverable after disconnect
feat SQLite 存储与多引擎搜索 SQLite Storage & Multi-Engine Search
从纯 localStorage 迁移到 SQLite 后端。搜索支持 5 引擎级联回退:Serper → Tavily → Brave → Google CSE → DuckDuckGo。 Migrated to SQLite backend. Web search supports 5-engine fallback chain.
  • 长文档并行分章生成Long document generation with parallel chapter writing
  • DOCX 导出DOCX export for generated documents
  • 移动端响应式布局优化Mobile-optimized responsive layout
feat 多用户系统与 Bug 反馈 Multi-User Accounts & Bug Reports
用户注册与登录,Session 管理。内置 Bug 反馈功能,用户可直接在应用内提交问题。 User registration and login with session management. Built-in bug report feature.
feat 双主题与代码解释器 Dual Theme & Code Interpreter
大版本 UX 升级:日/夜主题切换、沙盒 JavaScript 代码执行、文档管理侧栏与版本追踪。 Major UX overhaul — light/dark theme, sandboxed JS execution, document sidebar with versioning.
feat v1 Agent 工具调用循环 Agentic Tool Use Loop
核心 Agent 架构上线:Claude 在循环中迭代调用工具(网络搜索、代码执行、Artifact 生成),直到任务完成。单次请求最多 8 轮,80K 上下文预算管理。 Core agentic architecture — Claude iteratively calls tools (web search, code execution, artifact creation) in a loop. Up to 8 rounds per request with 80K context budget.