Model Comparison | RPDiag

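Test description: the 5-step quick-probe-v1 in detail

Five consecutive steps in the same session, covering five independent signals:

1. ping/pong: single-word instruction following, plus establishing the cached context.
2. Identity (structured): a three-line vendor / brand / model format that is machine-parseable.
3. Knowledge cutoff: a follow-up asked after crossing the 5-minute cache boundary, to check whether the sliding 5m cache really hits.
4. Identity (free-form): a natural-language self-description, which gives wrapper layers (Kiro and the like) a chance to expose their brand.
5. World knowledge tier: five hard-fact questions about public events; the number answered correctly maps to a measured tier, which is compared with the requested model.

Steps are separated by a random delay of 1–4 minutes. By step 3 (cutoff) the cumulative span exceeds 5 minutes, which preserves the sliding-cache check; the whole run spans more than 6 minutes to accommodate the newly added step 5. A channel that implements the sliding 5m cache correctly still shows cache_read > 0 at step 3 and afterwards; one that counts 5 minutes from creation time is exposed by cache_read = 0 at step 3.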
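The difference between the two cache implementations described above can be sketched as a small simulation. This is only an illustrative model: the function name `cache_hits` and the request schedule are hypothetical, and a real channel may behave differently for reasons unrelated to TTL handling.

```python
TTL_SECONDS = 5 * 60  # the 5-minute cache window described above


def cache_hits(request_times, sliding):
    """Return one hit/miss flag per request for a single cached prefix.

    request_times: seconds since the first request, in ascending order.
    sliding=True  -> every cache read restarts the 5-minute window.
    sliding=False -> the window is anchored at creation time only.
    """
    hits = []
    expires_at = None  # nothing is cached before the first request
    for t in request_times:
        hit = expires_at is not None and t < expires_at
        hits.append(hit)
        if not hit:
            expires_at = t + TTL_SECONDS  # miss: the entry is (re)created now
        elif sliding:
            expires_at = t + TTL_SECONDS  # hit: a sliding cache refreshes the TTL
    return hits


# Hypothetical schedule: requests at 0, 2, 3.5, 7 and 9 minutes.
schedule = [0, 120, 210, 420, 540]
print("sliding:", cache_hits(schedule, sliding=True))
print("fixed:  ", cache_hits(schedule, sliding=False))
```

With this schedule the sliding policy keeps hitting after the first request, while the creation-anchored policy misses once the cumulative span passes 5 minutes and then hits again on the re-created entry.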

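3 channels · 5 steps

Run date: 2026/06/05

Official baseline: claude-opus-4-8

2 channels scored < 70; 2 additional tasks failed

Reveal progress: revealed

⚠ 2 channels could not be tested and were removed from the main comparison table:

Tokaify / M-AWS
```
Evaluation failed
```
TopRouterCN / O-Max
```
Evaluation failed
```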

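Basic information

| Step / Metric | HongMaCC / u-cc | FoxCode / R-ultra | Official baseline |
| --- | --- | --- | --- |
| Service | Claude Code | Claude Code | Claude Code |
| Model | claude-opus-4-8 | claude-opus-4-8 | claude-opus-4-8 |
| Client | claude-cli v2.1.160 | claude-cli v2.1.160 | claude-cli v2.1.160 |

Deviations are highlighted in red relative to the official baseline.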

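Step 1: ping text

Prompt:

ping. Reply with the single word: pong

| Step / Metric | HongMaCC / u-cc | FoxCode / R-ultra | Official baseline |
| --- | --- | --- | --- |
| Time | 20:04:45 ~ 20:04:50 (3.3s) | 20:04:49 ~ 20:04:54 (4.0s) | 20:04:51 ~ 20:05:01 (6.9s) |
| HTTP headers (TTFB) | 1,971 ms | 2,632 ms | 5,607 ms |
| First text (SSE delta) | 1,972 ms | 2,703 ms | 5,623 ms |
| Total time | 1,972 ms | 2,866 ms | 5,959 ms |
| Input | 0 | 37 | 1,454 |
| Output | 5 | 2 | 4 |
| Cache creation | 5m: 0 / 1h: 0 | 23,523 (5m: 23,523 / 1h: 0) | 30,222 (5m: 0 / 1h: 30,222) |
| Cache read | 0 | 0 | 0 |
| Response | pong | pong | pong |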

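Step 2: identity text

Prompt (translated from Chinese):

This request asks about your own identity (the current API model generating this reply), not any external product the user mentions. You must answer directly; do not ask follow-up questions, refuse, explain, or output placeholders; you may not ask for more context. Output strictly in the following 3-line format, all lowercase English, with no other text, explanation, or punctuation:
vendor: <vendor company name, one lowercase English word>
brand: <product or model brand name, one lowercase English word>
model: <actual API model field value, keeping the original hyphenated format>

| Step / Metric | HongMaCC / u-cc | FoxCode / R-ultra | Official baseline |
| --- | --- | --- | --- |
| Time | 20:08:28 ~ 20:08:33 (4.2s) | 20:06:48 ~ 20:06:53 (3.7s) | 20:08:17 ~ 20:08:22 (3.2s) |
| HTTP headers (TTFB) | 2,725 ms | 2,252 ms | 1,850 ms |
| First text (SSE delta) | 2,726 ms | 2,253 ms | 1,853 ms |
| Total time | 2,726 ms | 2,724 ms | 2,225 ms |
| Input | 0 | 85 | 2 |
| Output | 24 | 23 | 28 |
| Cache creation | 5m: 0 / 1h: 0 | 153 (5m: 153 / 1h: 0) | 1,643 (5m: 0 / 1h: 1,643) |
| Cache read | 0 | 23,523 | 30,222 |

Response (HongMaCC / u-cc):

vendor: anthropic
brand: claude
model: claude-opus-4-8

Response (FoxCode / R-ultra):

vendor: anthropic
brand: claude
model: claude-opus-4-8

Response (official baseline):

vendor: anthropic
brand: claude
model: claude-opus-4-8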
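Because the step 2 reply is meant to be machine-parseable, a strict parser is enough to check it. The following is a minimal sketch, not the tool's actual parser: `parse_identity` and the allowed value characters are assumptions made for illustration.

```python
import re

# Assumed value alphabet: lowercase letters, digits, dots and hyphens.
LINE_RE = re.compile(r"^(vendor|brand|model): ([a-z0-9.\-]+)$")


def parse_identity(text):
    """Parse the three-line reply; return a dict, or None if the format is violated."""
    lines = text.strip().splitlines()
    if len(lines) != 3:
        return None
    fields = {}
    for expected_key, line in zip(("vendor", "brand", "model"), lines):
        match = LINE_RE.match(line.strip())
        if match is None or match.group(1) != expected_key:
            return None  # wrong key order, extra text, or uppercase characters
        fields[expected_key] = match.group(2)
    return fields


reply = "vendor: anthropic\nbrand: claude\nmodel: claude-opus-4-8"
parsed = parse_identity(reply)
print(parsed)
print("model matches request:", parsed["model"] == "claude-opus-4-8")
```

A reply with extra prose, a different key order, or uppercase letters returns `None`, which can be treated as a format violation rather than an identity mismatch.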

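Step 3: cutoff text

Prompt (translated from Chinese):

State your knowledge cutoff date directly, in YYYY-MM format, with no additional explanation.

| Step / Metric | HongMaCC / u-cc | FoxCode / R-ultra | Official baseline |
| --- | --- | --- | --- |
| Time | 20:10:21 ~ 20:10:27 (3.5s) | 20:08:02 ~ 20:08:07 (3.4s) | 20:11:59 ~ 20:12:04 (3.7s) |
| HTTP headers (TTFB) | 2,302 ms | 2,323 ms | 2,529 ms |
| First text (SSE delta) | 2,303 ms | 2,496 ms | 2,530 ms |
| Total time | 2,303 ms | 2,663 ms | 2,707 ms |
| Input | 0 | 85 | 2 |
| Output | 8 | 5 | 6 |
| Cache creation | 5m: 0 / 1h: 0 | 5m: 39 / 1h: 0 | 5m: 0 / 1h: 51 |
| Cache read | 0 | 23,676 | 31,865 |
| Response | 2026-01 | 2026-01 | 2026-01 |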

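Step 4: identity_free text

Prompt (translated from Chinese):

Describe your identity in Chinese: what product are you? Who built you? Within 50 characters.

| Step / Metric | HongMaCC / u-cc | FoxCode / R-ultra | Official baseline |
| --- | --- | --- | --- |
| Time | 20:14:25 ~ 20:14:30 (3.8s) | 20:11:51 ~ 20:11:56 (4.5s) | 20:14:34 ~ 20:14:39 (3.9s) |
| HTTP headers (TTFB) | 2,538 ms | 2,588 ms | 2,013 ms |
| First text (SSE delta) | 2,540 ms | 2,605 ms | 2,015 ms |
| Total time | 2,540 ms | 3,484 ms | 2,860 ms |
| Input | 0 | 38 | 2 |
| Output | 32 | 37 | 59 |
| Cache creation | 5m: 0 / 1h: 0 | 23,804 (5m: 23,804 / 1h: 0) | 5m: 0 / 1h: 35 |
| Cache read | 0 | 0 | 31,916 |

Output speed (burst): 42.1 tok/s, 69.8 tok/s

Responses (translated from Chinese):

HongMaCC / u-cc: I am Claude, an AI assistant built by Anthropic, available through the API, helping to answer questions, write code, and complete tasks.

FoxCode / R-ultra: My name is Claude, an AI assistant developed by Anthropic. The model version currently in use is claude-opus-4-8.

Official baseline: I am Claude, an AI assistant built by Anthropic. The current model is Claude Opus 4.8, and I can help with programming, research, writing, and other tasks.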
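The FoxCode / R-ultra column reads the cache at steps 2 and 3 but reports cache_read = 0 at step 4. A quick way to look at that is to compare each request's distance from the previous request with its distance from step 1, when the cache entry was created. The sketch below uses the start times and cache_read values reported for that column; the helper name `seconds_between` is hypothetical, and the result is only consistent with a creation-anchored TTL, not proof of it.

```python
from datetime import datetime

# (step, request start time, cache_read) for FoxCode / R-ultra, steps 1-4.
rows = [
    (1, "20:04:49", 0),
    (2, "20:06:48", 23523),
    (3, "20:08:02", 23676),
    (4, "20:11:51", 0),
]


def seconds_between(start, end):
    """Seconds from start to end, both given as HH:MM:SS on the same day."""
    fmt = "%H:%M:%S"
    return int((datetime.strptime(end, fmt) - datetime.strptime(start, fmt)).total_seconds())


created_at = rows[0][1]  # the cache entry is first written at step 1
gaps = []
for prev, cur in zip(rows, rows[1:]):
    since_prev = seconds_between(prev[1], cur[1])
    since_creation = seconds_between(created_at, cur[1])
    gaps.append((cur[0], since_prev, since_creation, cur[2] > 0))

for step, since_prev, since_creation, hit in gaps:
    print(f"step {step}: {since_prev}s since previous, "
          f"{since_creation}s since creation, hit={hit}")
```

Step 4 arrives 229 s after step 3, which is inside a 5-minute sliding window, but 422 s after step 1, which is outside a window counted from creation.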

Step 5: knowledge_tier text

Prompt:

Before answering, perform the calc calendar arithmetic step by step internally. Use extended internal reasoning if available. Keep all reasoning private; the final visible response must be exactly JSON only, no markdown, no explanation. Use only your internal pretrained knowledge; do not use search, tools, or current-session hints. If you have no stored memory answer "OUT", but do not be overly cautious: answer known public facts. Return {"a":[q1,q2,q3,q4,q5],"calc":N}.
q1: Who won the 2024 U.S. presidential election and became president on 2025-01-20? Answer surname or OUT.
q2: Who was elected pope on 2025-05-08? Answer papal name plus birth surname or OUT.
q3: Who won the 2025 Nobel Peace Prize, announced on 2025-10-10? Answer name or OUT.
q4: Who won the 2025 men's Ballon d'Or, awarded on 2025-09-22? Answer name or OUT.
q5: Who won the 2025 New York City mayoral election on 2025-11-04? Answer name or OUT.
calc: Using only the dates stated in q1-q5, compute ((calendar days from 2025-01-20 to 2025-05-08) + (calendar days from 2025-09-22 to 2025-10-10)) * (calendar days from 2025-10-10 to 2025-11-04). Count days as end_date - start_date, excluding the start date. Return the integer N.

| Step / Metric | HongMaCC / u-cc | FoxCode / R-ultra | Official baseline |
| --- | --- | --- | --- |
| Time | 20:15:31 ~ 20:15:41 (9.2s) | 20:13:48 ~ 20:13:53 (4.5s) | 20:17:47 ~ 20:17:57 (7.1s) |
| HTTP headers (TTFB) | 8,146 ms | 2,399 ms | 2,480 ms |
| First text (SSE delta) | 8,147 ms | 2,420 ms | 5,756 ms |
| Total time | 8,147 ms | 3,597 ms | 6,162 ms |
| Input | 0 | 86 | 2 |
| Output | 265 | 24 | 450 |
| Cache creation | 5m: 0 / 1h: 0 | 379 (5m: 379 / 1h: 0) | 548 (5m: 0 / 1h: 548) |
| Cache read | 0 | 23,804 | 31,951 |

Output speed (burst): -, -

Response (HongMaCC / u-cc):

{"a":["Trump","Leo XIV Prevost","María Corina Machado","Ousmane Dembélé","Zohran Mamdani"],"calc":3150}

Response (FoxCode / R-ultra):

{"a":["Trump","Leo XIV Prevost","OUT","OUT","OUT"],"calc":72138}

Response (official baseline):

{"a":["Trump","Leo XIV Prevost","Maria Corina Machado","Ousmane Dembele","Zohran Mamdani"],"calc":3150}
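The `calc` value in the step 5 prompt is plain date arithmetic, so the expected integer can be recomputed and compared with each reply. The sketch below does that and also counts how many questions were answered rather than returned as "OUT". The helper names are hypothetical, and the count says nothing about whether an answer is correct or how the page maps answers to a tier.

```python
import json
from datetime import date


def expected_calc():
    """Recompute N from the dates stated in q1-q5 (end_date - start_date)."""
    a = (date(2025, 5, 8) - date(2025, 1, 20)).days    # inauguration -> papal election
    b = (date(2025, 10, 10) - date(2025, 9, 22)).days  # Ballon d'Or -> Nobel announcement
    c = (date(2025, 11, 4) - date(2025, 10, 10)).days  # Nobel announcement -> NYC election
    return (a + b) * c


def summarize(raw_reply):
    """Return (number of non-OUT answers, whether calc matches the expected value)."""
    reply = json.loads(raw_reply)
    answered = sum(1 for answer in reply["a"] if answer != "OUT")
    return answered, reply["calc"] == expected_calc()


replies = {
    "HongMaCC / u-cc": '{"a":["Trump","Leo XIV Prevost","María Corina Machado","Ousmane Dembélé","Zohran Mamdani"],"calc":3150}',
    "FoxCode / R-ultra": '{"a":["Trump","Leo XIV Prevost","OUT","OUT","OUT"],"calc":72138}',
    "Official baseline": '{"a":["Trump","Leo XIV Prevost","Maria Corina Machado","Ousmane Dembele","Zohran Mamdani"],"calc":3150}',
}

print("expected calc:", expected_calc())
for channel, raw in replies.items():
    print(channel, summarize(raw))
```

The three intervals are 108, 18 and 25 days, so the expected value is (108 + 18) * 25 = 3150.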

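Totals

| Step / Metric | HongMaCC / u-cc | FoxCode / R-ultra | Official baseline |
| --- | --- | --- | --- |
| Total input (incl. cache) | 0 | 119,232 | 159,915 |
| Total output | 334 | 91 | 547 |
| Total cache creation | 5m: 0 / 1h: 0 | 47,898 (5m: 47,898 / 1h: 0) | 32,499 (5m: 0 / 1h: 32,499) |
| Total cache read | 0 | 71,003 | 125,954 |
| Total execution time | 23.9s | 20.1s | 24.9s |
| Total wall-clock time (incl. waits between steps) | 10m 55.8s | 9m 3.8s | 13m 5.8s |
| Re-estimate at official prices | >=$0.0083, Anthropic: Claude Opus 4.8 (cost incomplete) | $0.2265, Anthropic: Claude Opus 4.8 | $0.2871, Anthropic: Claude Opus 4.8 |

Output speed (burst): 42.1 tok/s, 69.8 tok/s

The re-estimate is the estimated cost if the same content were sent directly to Anthropic.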
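The token totals above can be cross-checked against the per-step figures: for the two columns that report input tokens, "total input (incl. cache)" equals the sum of input, cache creation and cache read over the five steps. A minimal sketch, with the per-step values copied from steps 1 to 5 and a hypothetical helper `totals`:

```python
# Per-step usage as (input, cache_creation, cache_read), steps 1-5.
usage = {
    "FoxCode / R-ultra": [
        (37, 23523, 0), (85, 153, 23523), (85, 39, 23676),
        (38, 23804, 0), (86, 379, 23804),
    ],
    "Official baseline": [
        (1454, 30222, 0), (2, 1643, 30222), (2, 51, 31865),
        (2, 35, 31916), (2, 548, 31951),
    ],
}


def totals(steps):
    """Sum each usage field and derive the total input including cache traffic."""
    plain_input = sum(step[0] for step in steps)
    cache_creation = sum(step[1] for step in steps)
    cache_read = sum(step[2] for step in steps)
    return {
        "cache_creation": cache_creation,
        "cache_read": cache_read,
        "total_input": plain_input + cache_creation + cache_read,
    }


for channel, steps in usage.items():
    print(channel, totals(steps))
```

The HongMaCC / u-cc column is left out because it reports 0 for every input and cache field, which is also why its cost estimate is marked incomplete.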

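Channel fingerprint

| Step / Metric | HongMaCC / u-cc | FoxCode / R-ultra | Official baseline |
| --- | --- | --- | --- |
| Source step | knowledge_tier | knowledge_tier | knowledge_tier |
| Platform | one-api (100%) | one-api (100%) | direct (100%) |
| Upstream | Anthropic direct (100%) | Anthropic direct (100%) | Anthropic direct (100%) |
| CDN | - | cloudflare | - |
| ID format | msg_c3e7626230894c482bc388a0 | msg_01m6oN5Ptcliomsldv4NF3IQ | msg_018fLgZRrx9Kk3qcVQELvYJs |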
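The three message IDs above differ in shape: two start with `msg_01` followed by mixed-case alphanumerics, while the first is `msg_` followed by lowercase hex. A rough heuristic for telling them apart is sketched below. The pattern is inferred only from the IDs shown on this page and is an assumption, not a documented ID format.

```python
import re

# Assumed shape, inferred from the IDs on this page: "msg_01" + 22 alphanumerics.
NATIVE_LIKE = re.compile(r"msg_01[0-9A-Za-z]{22}")


def looks_native(message_id):
    """Return True if the ID matches the assumed native-looking shape."""
    return NATIVE_LIKE.fullmatch(message_id) is not None


ids = {
    "HongMaCC / u-cc": "msg_c3e7626230894c482bc388a0",
    "FoxCode / R-ultra": "msg_01m6oN5Ptcliomsldv4NF3IQ",
    "Official baseline": "msg_018fLgZRrx9Kk3qcVQELvYJs",
}

for channel, message_id in ids.items():
    print(channel, looks_native(message_id))
```

Under this heuristic only the HongMaCC / u-cc ID fails the check; a rewritten ID of this kind suggests that a proxy layer regenerated the response envelope.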

Detailed fingerprints (response headers / body traits / full signal list)

u-cc

response_headers_notable:
server: nginx
x-new-api-version: 1.0.4
x-oneapi-request-id: 202606051215326787114938268d9d6xM8dSWJ5

response_body_traits:
id_format: msg_prefix
extra_fields: ["stop_details"]
has_service_tier: true
has_inference_geo: true
model_has_date_suffix: false
usage_has_cache_fields: true

signals: hdr:x-new-api-version, hdr:x-oneapi-request-id, body:id:msg_prefix, body:usage.service_tier, body:usage.inference_geo, body:usage.cache_fields, body:extra_fields, upstream:anthropic-direct-from-body

request_id_chain: -

R-ultra

response_headers_notable:
cf-ray: a06f17ee9b8efd4f-NRT
server: cloudflare
x-new-api-version: v0.0.0
x-oneapi-request-id: 202606051213497520364248268d9d6rvVn5taZ

response_body_traits:
id_format: msg_prefix
has_service_tier: true
has_inference_geo: true
model_has_date_suffix: false
usage_has_cache_fields: true

signals: hdr:x-new-api-version, hdr:x-oneapi-request-id, hdr:server:cloudflare, hdr:cf-ray, body:id:msg_prefix, body:usage.service_tier, body:usage.inference_geo, body:usage.cache_fields, upstream:anthropic-direct-from-body

request_id_chain: a06f17ee9b8efd4f-NRT

Official baseline

response_headers_notable:
server: nginx/1.22.1

response_body_traits:
id_format: msg_prefix
extra_fields: ["stop_details"]
has_service_tier: true
has_inference_geo: true
model_has_date_suffix: false
usage_has_cache_fields: true

signals: body:id:msg_prefix, body:usage.service_tier, body:usage.inference_geo, body:usage.cache_fields, body:extra_fields, upstream:anthropic-direct-from-body

request_id_chain: req_011Cbk6hKHgzyQosGK3fMkwW
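The header-based signals listed above can be derived with a few dictionary lookups. The sketch below is an illustrative reconstruction of that idea, not the tool's actual classifier: the function `classify_headers` and its rule that either one-api header implies the one-api platform are assumptions.

```python
def classify_headers(headers):
    """Derive (platform, cdn, signals) from notable response headers."""
    lowered = {name.lower(): value for name, value in headers.items()}
    signals = []
    if "x-new-api-version" in lowered:
        signals.append("hdr:x-new-api-version")
    if "x-oneapi-request-id" in lowered:
        signals.append("hdr:x-oneapi-request-id")
    if lowered.get("server", "").lower() == "cloudflare":
        signals.append("hdr:server:cloudflare")
    if "cf-ray" in lowered:
        signals.append("hdr:cf-ray")

    oneapi_markers = {"hdr:x-new-api-version", "hdr:x-oneapi-request-id"}
    cdn_markers = {"hdr:server:cloudflare", "hdr:cf-ray"}
    platform = "one-api" if oneapi_markers & set(signals) else "direct"
    cdn = "cloudflare" if cdn_markers & set(signals) else "-"
    return platform, cdn, signals


r_ultra = {
    "cf-ray": "a06f17ee9b8efd4f-NRT",
    "server": "cloudflare",
    "x-new-api-version": "v0.0.0",
    "x-oneapi-request-id": "202606051213497520364248268d9d6rvVn5taZ",
}
baseline = {"server": "nginx/1.22.1"}

print(classify_headers(r_ultra))
print(classify_headers(baseline))
```

Applied to the headers listed for R-ultra and the official baseline, this yields one-api behind cloudflare and direct with no CDN respectively, matching the Platform and CDN values reported on the page.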

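Protocol fingerprint score (computed automatically)

| Step / Metric | HongMaCC / u-cc | FoxCode / R-ultra | Official baseline |
| --- | --- | --- | --- |
| Total score / 100 | 57 | 73 | 100 |
| Baseline status | Compared with baseline | Compared with baseline | Compared with baseline |
| Cache hit ratio | 0.0 /10 ×20 | 8.5 /10 ×20 | 10.0 /10 ×20 |
| Cache TTL consistency | 0.0 /10 ×15 | 0.0 /10 ×15 | 10.0 /10 ×15 |
| Model match | 10.0 /10 ×14 | 10.0 /10 ×14 | 10.0 /10 ×14 |
| Cache continuity | 5.0 /10 ×14 | 10.0 /10 ×14 | 10.0 /10 ×14 |
| sliding 5m | 5.0 /10 ×13 | 7.7 /10 ×13 | 10.0 /10 ×13 |
| Native msg-ID | 0.0 /10 ×8 | 10.0 /10 ×8 | 10.0 /10 ×8 |
| Identity (structured) | 10.0 /10 ×7 | 10.0 /10 ×7 | 10.0 /10 ×7 |
| Knowledge cutoff | 10.0 /10 ×7 | 10.0 /10 ×7 | 10.0 /10 ×7 |
| Identity (free-form) | 10.0 /10 ×7 | 10.0 /10 ×7 | 10.0 /10 ×7 |
| service_tier | 10.0 /10 ×6 | 10.0 /10 ×6 | 10.0 /10 ×6 |
| inference_geo | 10.0 /10 ×5 | 10.0 /10 ×5 | 10.0 /10 ×5 |
| Latency baseline | 10.0 /10 ×5 | 10.0 /10 ×5 | 10.0 /10 ×5 |
| Req-ID passthrough | 0.0 /10 ×4 | 0.0 /10 ×4 | 10.0 /10 ×4 |
| stop_reason | 10.0 /10 ×3 | 10.0 /10 ×3 | 10.0 /10 ×3 |
| SDK consistency | 10.0 /10 ×2 | 10.0 /10 ×2 | 10.0 /10 ×2 |
| System prompt purity | Not counted | 0.0 /10 ×8 | 10.0 /10 ×8 |
| Streaming delivery | - | - | - |

Overall conclusion

| Step / Metric | HongMaCC / u-cc | FoxCode / R-ultra | Official baseline |
| --- | --- | --- | --- |
| Similarity to baseline | 57 (v3.9.1) | 73 (v3.9.1) | Baseline (reference) |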

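Radar legend

18 dimensions in total (clockwise, starting at 12 o'clock):

1 Model match model_match
2 Knowledge cutoff cutoff_match
3 SDK consistency sdk_consistency
4 Thinking present thinking_present
5 Free-form identity identity_free_clean
6 Stop reason stop_reason_present
7 Service tier service_tier_present
8 Cache hit ratio cache_hit_ratio_match
9 Cache TTL consistency cache_ttl_consistency
10 Inference region inference_geo_present
11 Cache continuity cache_continuity_intra
12 Latency baseline latency_baseline_match
13 Native msg-ID anthropic_msg_id_format
14 Sliding cache cache_sliding_correctness
15 Structured identity identity_structured_match
16 Chinese instruction following instruction_following_lang
17 World knowledge tier world_knowledge_tier_match
18 Req-ID passthrough anthropic_request_id_passthrough

The length of each axis is that dimension's score from 0 to 10 (the longer, the closer to the baseline). The blue dashed circle marks the full-score baseline. See the methodology page for the exact weights.

Dimension radar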

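| Step / Metric | HongMaCC / u-cc | FoxCode / R-ultra | Official baseline |
| --- | --- | --- | --- |
| Average latency | 3538 ms | 3067 ms | 3983 ms |
| Steps completed | 5/5 succeeded | 5/5 succeeded | 5/5 succeeded |

Output speed (burst): 42.1 tok/s, 69.8 tok/s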
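The average latency figures are consistent with the mean of the five per-step total times, rounded to the nearest millisecond. A short check, with the per-step totals copied from steps 1 to 5 (the variable names are illustrative):

```python
# Per-step "total time" in milliseconds, steps 1-5.
total_ms = {
    "HongMaCC / u-cc": [1972, 2726, 2303, 2540, 8147],
    "FoxCode / R-ultra": [2866, 2724, 2663, 3484, 3597],
    "Official baseline": [5959, 2225, 2707, 2860, 6162],
}

# Mean over the five steps, rounded to whole milliseconds.
average_ms = {channel: round(sum(values) / len(values))
              for channel, values in total_ms.items()}

for channel, value in average_ms.items():
    print(f"{channel}: {value} ms")
```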

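| Step / Metric | HongMaCC / u-cc | FoxCode / R-ultra | Official baseline |
| --- | --- | --- | --- |
| Re-estimate at official prices (estimated cost if the same content were sent directly to Anthropic) | >=$0.0083 (cost incomplete) | $0.2265 | $0.2871 |
| Cache read share | - | 60% | 79% |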
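The cache read share matches total cache read divided by total input (including cache), expressed as a rounded percentage. A minimal sketch using the totals reported on this page; the function name `cache_read_share` is hypothetical, and returning `None` for a zero total is an assumption that mirrors the "-" shown for HongMaCC / u-cc.

```python
def cache_read_share(cache_read, total_input):
    """Percentage of input tokens served from cache, or None if no input was reported."""
    if total_input == 0:
        return None  # nothing to divide by; the page shows "-" in this case
    return round(100 * cache_read / total_input)


print(cache_read_share(0, 0))            # HongMaCC / u-cc
print(cache_read_share(71003, 119232))   # FoxCode / R-ultra
print(cache_read_share(125954, 159915))  # Official baseline
```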

Reveal

Provider: HongMaCC · Channel: u-cc

Provider: FoxCode · Channel: R-ultra

Official baseline
