详细分析 ▾
运行时依赖
版本
reactions rule, close-the-loop, reply-to rules; skill-master analytics hook; skill-analytics added
安装命令 点击复制
技能文档
Minimum 模型
Any model for filling in templates. Use a medium model for trend analysis and recommendations.当...时 到 Run
- Weekly self-eval: Every 7 days. Run automatically.
- 在...上 owner correction: Log correction immediately, 然后 re-score affected dimension.
- Monthly 举报: 在 end 的 每个 month, aggregate 所有 weekly evals.
- 在...上 demand: 如果 owner asks "如何 am I 正在做?" → generate current eval 在...上 spot.
Scoring Dimensions
Score each 1–5:
| Dimension | What to Measure |
|---|---|
| Execution | Tasks completed without reminders |
| Accuracy | Results are correct and complete |
| Speed | Response time is fast |
| Proactivity | Acts without being asked |
| Communication | Concise and context-appropriate |
| Memory | Remembers context across sessions |
| Tool Use | Tools used correctly and efficiently |
| Judgment | Knows when to act vs. when to ask |
- 5 = Consistently exceeds expectations
- 4 = Meets expectations 带有 minor gaps
- 3 = Acceptable 但是 basic
- 2 = Frequent gaps 或 errors
- 1 = Fails basic expectations
总计: Max 40 points. Grade: A (36–40), B (28–35), C (20–27), D (<20)
Weekly Self-Evaluation
Save to .learnings/eval/YYYY-MM-DD.md.
# PA Weekly Eval — YYYY-MM-DDScores
Dimension Score Notes Execution /5 Accuracy /5 Speed /5 Proactivity /5 Communication /5 Memory /5 Tool Use /5 Judgment /5 TOTAL /40
Owner Feedback This Week
- Positive:
- Corrections:
- Complaints:
Tasks Completed
-
Tasks Failed or Incomplete
-
What Went Well
-
What to Improve
-
Actions for Next Week
- [ ]
创建 File
#!/bin/bash
set -e# Set the output directory
EVAL_DIR="$HOME/.openclaw/workspace/.learnings/eval"
mkdir -p "$EVAL_DIR"
DATE=$(date +%Y-%m-%d)
EVAL_FILE="$EVAL_DIR/$DATE.md"
# Write the template with today's date
cat > "$EVAL_FILE" << 'EOF'
# PA Weekly Eval — DATE_PLACEHOLDER
[Fill in the template above]
EOF
# Replace the placeholder with the real date (works on Linux and macOS)
sed -i "s/DATE_PLACEHOLDER/$DATE/" "$EVAL_FILE" 2>/dev/null \
|| sed -i '' "s/DATE_PLACEHOLDER/$DATE/" "$EVAL_FILE"
echo "Created eval file: $EVAL_FILE"
Owner Feedback Signals
Log these automatically when detected:
| Signal | Action |
|---|---|
| 👍 reaction | Log +1 positive |
| 👎 reaction | Log -1 negative, record the correction |
| "תודה" / "great" / "perfect" | Log +1 positive |
| "wrong" / "fix this" / "לא טוב" | Log -1, record the correction |
| Owner re-asks the same question | Log -1 memory gap |
| Owner does the task themselves | Log -1 initiative gap |
| Owner surprised by proactive action | Log +2 proactivity |
Monthly 举报 格式
# PA Performance Report — [Month Year]PA Name: [Name]
Owner: [Owner Name]
Period: [Start] – [End]
Summary Score: X/40 ([Grade A/B/C/D])
Dimension Breakdown
[Copy scores table here]Key Wins
-Key Issues
-Trend vs Last Period
- Score change: +X / -X points
- Best improvement: [dimension]
- Biggest regression: [dimension]
Recommended Actions
1.
2.
3.
Benchmark Tests (Run Monthly)
Task Completion Rate
- 计数 tasks assigned 在...中 最后的 30 days.
- 计数 已完成 没有 关注-up.
- Formula:
已完成 / assigned × 100% - Target: >90%
Accuracy Rate
- 计数 tasks 必填 correction.
- Formula:
(tasks - corrections) / tasks × 100% - Target: >95%
Memory Retention
- Ask 关于 something discussed 7+ days ago.
- Pass 如果 recalled correctly, 失败 如果 missed.
- Target: >80%
Cost Tips
- Cheap: Filling 在...中 weekly 模板 — 任何 small 模型 works.
- Expensive: Trend analysis 和 pattern detection 穿过 multiple evals — 使用 medium 模型.
- Batch: Review 所有 weekly evals 在 once 期间 monthly 举报, 不 one 由 one.
- Avoid: Don't re-score historical weeks — score 在...中 real 时间 和 保存 到 file.
免费技能或插件可能存在安全风险,如需更匹配、更安全的方案,建议联系付费定制