MiniMax Vision Captcha — 技能工具

Name: MiniMax Vision Captcha — 技能工具
Author: Nurmamat

Nurmamat

MiniMax Vision Captcha — 技能工具

v1.0.0

使用MiniMax视觉模型识别图片中的验证码、滑块位置、文字内容等。适用于需要AI视觉分析的场景，如微信验证码识别、网页截图分析、图片文字提取。当需要识别图片内容、分析验证码、提取截图信息时使用此技能。

0· 1,003·1 当前·2 累计

by @ai-nurmamat (Nurmamat)·MIT-0

下载技能包

License

MIT-0

最后更新

2026/4/12

安全扫描

VirusTotal

可疑

查看报告

OpenClaw

可疑

medium confidence

Skill functionality aligns with its description, but there are security and transparency concerns (unsanitized shell invocation, implicit credentials/config expectations, and automatic local file access) that the user should review before installing.

评估建议

This skill appears to do what it claims (solve captchas using a MiniMax visual service) but you should be cautious before installing: 1) The included script runs a shell command via execSync with prompt and image path interpolated verbatim — this is vulnerable to command injection if those values include malicious characters. Prefer a version that calls mcporter with an argument array (spawn/execFile) or properly escapes inputs. 2) The script will read the latest PNG from /root/.openclaw/media/b...

详细分析 ▾

✓ 用途与能力

Name/description, SKILL.md, and the included script all consistently implement an image/captcha recognition helper that calls a MiniMax visual MCP via mcporter and uses OpenClaw browser screenshots. The files and commands requested are coherent with the stated purpose.

⚠ 指令范围

Runtime instructions tell the agent to take screenshots and call mcporter.minimax-coding-plan.understand_image — that matches purpose. However the included script will automatically look in /root/.openclaw/media/browser for screenshots if none provided (accessing local user/root files), and it constructs a shell command by interpolating user-supplied prompt and imagePath directly into a single string passed to execSync. That creates a command-injection risk and means the skill can read and submit arbitrary local images to the MCP.

✓ 安装机制

No install spec or remote downloads — the skill is instruction-only with a small local Node script. Nothing is fetched from arbitrary URLs during install, which lowers supply-chain risk.

ℹ 凭证需求

The skill declares no environment variables or credentials, but SKILL.md says 'ensure MiniMax MCP is configured' and the script uses mcporter. Credentials/config required to call the MCP are not documented in requires.env or marketplace.json — an omission that reduces transparency. No unrelated credentials are requested.

✓ 持久化与权限

Skill is user-invocable, not always:true, and does not request elevated or persistent platform privileges nor modify other skills' configurations.

安全有层次，运行前请审查代码。

License

MIT-0

可自由使用、修改和再分发，无需署名。

查看条款 ↗

运行时依赖

无特殊依赖

版本

latestv1.0.02026/3/2

首次发布

● 可疑

安装命令点击复制

官方npx clawhub@latest install minimax-vision-captcha

镜像加速npx clawhub@latest install minimax-vision-captcha --registry https://cn.clawhub-mirror.com

技能文档

使用MiniMax MCP的视觉理解能力识别图片中的内容，解决验证码、滑块分析等问题。

快速开始

1. 截图

使用OpenClaw浏览器截图：

browser action=screenshot targetId=<页面ID>

或使用OpenClaw的snapshot获取页面结构后分析。

2. 调用MiniMax视觉识别

mcporter call minimax-coding-plan.understand_image prompt="描述图片内容" image_source="/path/to/screenshot.png"

3. 分析结果

根据返回结果进行下一步操作。

典型使用场景

场景1：微信滑块验证码

访问微信页面，触发验证码
截图：browser action=screenshot
发送给视觉模型分析
获取滑块位置描述

场景2：图片文字识别

截图或获取图片路径
调用视觉模型识别文字
返回文字内容

场景3：网页元素分析

使用snapshot获取页面结构
分析特定元素的可见内容和属性

注意事项

确保MiniMax MCP已配置
图片路径需要是服务器可访问的绝对路径
滑块验证码需要描述缺口位置（左侧/右侧/距离）

依赖

minimax-coding-plan MCP
OpenClaw浏览器工具

使用MiniMax MCP的视觉理解能力识别图片中的内容，解决验证码、滑块分析等问题。

快速开始

1. 截图

使用OpenClaw浏览器截图：

browser action=screenshot targetId=<页面ID>

或使用OpenClaw的snapshot获取页面结构后分析。

2. 调用MiniMax视觉识别

mcporter call minimax-coding-plan.understand_image prompt="描述图片内容" image_source="/path/to/screenshot.png"

3. 分析结果

根据返回结果进行下一步操作。

典型使用场景

场景1：微信滑块验证码

访问微信页面，触发验证码
截图：browser action=screenshot
发送给视觉模型分析
获取滑块位置描述

场景2：图片文字识别

截图或获取图片路径
调用视觉模型识别文字
返回文字内容

场景3：网页元素分析

使用snapshot获取页面结构
分析特定元素的可见内容和属性

注意事项

确保MiniMax MCP已配置
图片路径需要是服务器可访问的绝对路径
滑块验证码需要描述缺口位置（左侧/右侧/距离）

依赖

minimax-coding-plan MCP
OpenClaw浏览器工具

数据来源：ClawHub ↗ · 中文优化：龙虾技能库

OpenClaw 技能定制 / 插件定制 / 私有工作流定制

免费技能或插件可能存在安全风险，如需更匹配、更安全的方案，建议联系付费定制

了解定制服务

License

运行时依赖

版本

安装命令 点击复制

技能文档

快速开始

1. 截图

2. 调用MiniMax视觉识别

3. 分析结果

典型使用场景

场景1：微信滑块验证码

场景2：图片文字识别

场景3：网页元素分析

注意事项

依赖

快速开始

1. 截图

2. 调用MiniMax视觉识别

3. 分析结果

典型使用场景

场景1：微信滑块验证码

场景2：图片文字识别

场景3：网页元素分析

注意事项

依赖

安装命令点击复制