SkillFM 帮你的 AI 先交一份只读体检报告。它可以查看用量线索、上下文负担、工具膨胀和安全风险，并给出清理审计建议。

怎么开始使用 SkillFM？

把首页提供的安装提示词复制粘贴到你的主 Agent (Claude Code / Cursor / OpenClaw)，AI 会启动 SkillFM 本地 sidecar，接入 Beacon Free。

SkillFM 怎么收费？

SkillFM 免费起步：Beacon Free 先提供 AI 体检、清理审计和基础省钱建议。需要更深监控、自动节省建议或团队级能力时，再按需升级。升级说明见 /beacon/upgrade。

SkillFM 支持哪些 AI Agent？

SkillFM 原生支持 Claude Code、Cursor、OpenClaw，以及任何能发起 HTTP 请求的 AI。Beacon 通过本地 sidecar 交付，不绑定任何单一 AI 平台。

Your agent is not slow. It is rereading old rules.

One-minute read

Split durable rules, reusable workflow, and task-specific input so prompt caching, auditing, and operator handoffs stay manageable.

#manage#prompt-caching#prompt-caching

Hook

The agent is not slow. It is rereading an old rule block every day.

Operator scene

Source signal

The surprising depths of prompt caching
This source anchors stable prompts, cache boundaries, and reusable prefix decisions. It also carries public attention, which helps calibrate the hook.
Prompt caching for cheaper LLM tokens
This source anchors stable prompts, cache boundaries, and reusable prefix decisions. It also carries public attention, which helps calibrate the hook.
Prompt caching with Claude | Claude
This source anchors stable prompts, cache boundaries, and reusable prefix decisions.

SkillFM judgment

Audit prompt drift before swapping models. The thing worth caching is the stable rule set your team actually manages.

Action checklist

1.Split one high-frequency workflow into durable rules, reusable workflow, and one-off task input.
2.Remove duplicated identity or safety language from the stable prefix and version the reusable block.
3.Record the next drift trigger so you can see whether the problem is template noise or model behavior.
4.Name the stable rule block so operators can tell what is durable and what is changing.
5.Write the reuse rule down once so the next run starts from the same boundary.

Product bridge

Check prompt drift

Inspect repeated context, stale instructions, and prefix drift before you reach for a new model.

Check prompt drift

The scene

In the current scan, The surprising depths of prompt caching and Prompt caching for cheaper LLM tokens point to the same operator mistake: durable rules, reusable workflow, and one-off input get bundled together, so every run pays to reread context that should have stayed stable. This is current, not theoretical. The surprising depths of prompt caching gives the live signal, while Prompt caching for cheaper LLM tokens shows why prompt hygiene keeps showing up in developer attention and operational guidance. Public engagement signal: 306 points, 72 comments, score 100. If the page only has structure and labels, both readers and machines are left guessing. It needs the problem stated first. The visible page has to carry the story before any markup can help. A useful page makes the problem obvious before it tries to be machine-friendly. That is what lets the next person reuse the same structure without rediscovering the premise.

Evidence and judgment

This is current, not theoretical. The surprising depths of prompt caching gives the live signal, while Prompt caching for cheaper LLM tokens shows why prompt hygiene keeps showing up in developer attention and operational guidance. Public engagement signal: 306 points, 72 comments, score 100. Audit prompt drift before swapping models. The thing worth caching is the stable rule set your team actually manages. Put the judgment in the body first, then let schema, RSS, and FAQ support it instead of replacing it. The useful question is whether a reader can tell what changed, what matters, and what to do next. The article should do that work before any metadata kicks in. When the judgment is visible, the page becomes easier to cite, easier to summarize, and harder to misread.

10-minute checklist

Split one high-frequency workflow into durable rules, reusable workflow, and one-off task input.; Remove duplicated identity or safety language from the stable prefix and version the reusable block.; Record the next drift trigger so you can see whether the problem is template noise or model behavior.; Name the stable rule block so operators can tell what is durable and what is changing.; Write the reuse rule down once so the next run starts from the same boundary. This checklist is not decorative; it keeps the first screen, title, summary, and body focused on the same question. If those pieces do not line up, the page is still a shell. The goal is to make a human-readable answer before the crawlable version exists. It also gives the editor a quick way to verify whether the page actually says what the markup will later repeat.

After state

After the split, the prompt becomes a named rule stack instead of a drifting wall of instructions. Readers can paraphrase the problem more easily, and search can quote a complete answer instead of a disconnected metadata fragment. That is the difference between a page people skim and a page people keep. Once the page reads clearly, distribution gets much easier. The page also becomes easier to hand off because the same judgment survives without extra explanation.

GEO / FAQ

Split the prompt prefix into three layers: durable rules, reusable workflow, and one-off input. Remove repeated identity, boundary, and format language, then version the stable block and review the most common workflow once. That makes drift easier to spot, cuts rereading, and helps you tell whether you actually need a new model. What should I inspect first? Split one high-frequency workflow into durable rules, reusable workflow, and one-off input.; Why does that come first? Remove duplicated identity or safety language from the stable prefix and version the reusable block.; What is the next product step? Inspect repeated context, stale instructions, and prefix drift before you reach for a new model. The goal is to make the page readable first and crawlable second. FAQ should help a human confirm the judgment, not just provide another field for markup. It should carry the same conclusion in a simpler form. The best FAQ entries answer the same question in shorter language and keep the page specific.

GEO / SEO

SkillFM RadarmanagePrompt caching for cheaper LLM tokensThe surprising depths of prompt caching

What should I inspect first?: Split one high-frequency workflow into durable rules, reusable workflow, and one-off input.
Why does that come first?: Remove duplicated identity or safety language from the stable prefix and version the reusable block.
What is the next product step?: Inspect repeated context, stale instructions, and prefix drift before you reach for a new model.

publish-ready cuts

The agent is not slow. It is rereading an old rule block every day. In the current scan, The surprising depths of prompt caching and Prompt caching for cheaper LLM tokens point to the same operator mistake: durable rules, reusable workflow, and one-off input get bundled together, so every run pays to reread context that should have stayed stable. This is current, not theoretical. The surprising depths of prompt caching gives the live signal, while Prompt caching for cheaper LLM tokens shows why prompt hygiene keeps showing up in developer attention and operational guidance. Public engagement signal: 306 points, 72 comments, score 100. After the split, the prompt becomes a named rule stack instead of a drifting wall of instructions. 1. Split one high-frequency workflow into durable rules, reusable workflow, and one-off task input. 2. Remove duplicated identity or safety language from the stable prefix and version the reusable block. 3. Record the next drift trigger so you can see whether the problem is template noise or model behavior. 4. Name the stable rule block so operators can tell what is durable and what is changing. 5. Write the reuse rule down once so the next run starts from the same boundary. Inspect repeated context, stale instructions, and prefix drift before you reach for a new model.

X thread

1.1. The agent is not slow. It is rereading old rules every day.
2.2. Before you swap models, split durable rules, reusable workflow, and one-off input.
3.3. If the stable block is unnamed, drift will look like model failure.
4.4. Version the stable rule set, then check cache and cost again.
5.5. Record the next drift trigger so the same problem does not repeat.
6.6. Do the audit first, then use Beacon / SkillFM to lock the workflow shape.

Short post

Prompt caching is really a prompt hygiene check. If your agent rereads old rules every run, split durable rules from one-off input.

Image brief

Cover: Hero visual showing a stable prompt prefix, reusable rules, and a changing task block.

Inline: Prompt prefix: Keep the stable rules first and the changing input last.

Thumbnail: Stable prompt prefixes: Manage: audit drift before swapping models.

Alt: Stable prompt prefix with a reusable rule block and a changing task block.

X post

Prompt caching is really a prompt hygiene check. If your agent rereads old rules every run, split durable rules from one-off input.

LinkedIn post

Next step

Inspect repeated context, stale instructions, and prefix drift before you reach for a new model.

Check prompt drift