
📄 HTML Markdown

v0.4.0

Convert HTML to Markdown using MinerU's document processing engine. Transforms HTML pages and files into clean, readable Markdown format. Features: HTML to M...

by @mzlzyca (mzlzyCA)·MIT-0
Last updated
2026/4/3
Security scan
VirusTotal
Harmless
OpenClaw
Safe
high confidence
The skill is internally consistent: it wraps the mineru-open-api CLI to convert HTML to Markdown and only requests the MinerU token and the mineru-open-api binary that such a tool would reasonably need.
Assessment recommendations
This skill appears coherent: it calls the MinerU CLI and needs only MINERU_TOKEN. Before installing, verify the npm package and GitHub repo (https://github.com/opendatalab/MinerU-Ecosystem) to ensure you trust the upstream code. Treat MINERU_TOKEN like any secret: do not reuse high-privilege tokens, restrict its scope if possible, and run the CLI in a controlled environment when processing untrusted HTML. If you cannot or will not install third-party binaries, do not grant the skill access to an...
Detailed analysis
Purpose and capabilities
Name/description match the requested artifacts: the skill requires the mineru-open-api binary and a MINERU_TOKEN, both of which are expected for a CLI that calls MinerU's service.
Instruction scope
SKILL.md instructs the agent to run mineru-open-api extract/crawl and to set MINERU_TOKEN. It does not request other environment variables, read unrelated system files, or send data to unexpected endpoints beyond MinerU's service.
Install mechanism
Installers are npm (mineru-open-api) and a go install from a GitHub repo — common and traceable mechanisms. As with any third-party package, installing a global npm package or go binary executes upstream code; review the npm package and GitHub repo before installing.
Credential requirements
Only MINERU_TOKEN is required and declared as the primary credential. That is proportionate for a hosted document-processing API. No unrelated credentials or config paths are requested.
Persistence and permissions
always is false and the skill does not request system-wide configuration changes. It is an instruction-only skill that invokes an external CLI at runtime.
Security is layered; review the code before running it.

License

MIT-0

Free to use, modify, and redistribute; no attribution required.

Runtime dependencies

No special dependencies

Versions

latest · v0.4.0 · 2026/3/27

SEO: expand description for better ClawHub vector search discovery

● Harmless

Install command

Official: npx clawhub@latest install html-markdown
Mirror: npx clawhub@latest install html-markdown --registry https://cn.clawhub-mirror.com

Skill documentation

Convert HTML files to clean Markdown using MinerU. For local HTML files use extract; for live web pages use crawl.

Install

npm install -g mineru-open-api
# or via Go (macOS/Linux):
go install github.com/opendatalab/MinerU-Ecosystem/cli/mineru-open-api@latest

Quick Start

# Convert local HTML file to Markdown (requires token)
mineru-open-api extract page.html -o ./out/

# Convert remote HTML file to Markdown (requires token)
mineru-open-api extract https://example.com/doc.html -o ./out/

# Convert live web page to Markdown via crawl (requires token)
mineru-open-api crawl https://example.com/article -o ./out/

Authentication

Token required:

mineru-open-api auth             # Interactive token setup
export MINERU_TOKEN="your-token" # Or via environment variable

Create token at: https://mineru.net/apiManage/token
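
Since every HTML conversion needs a token, a script may want to fail early when the environment variable is missing. The sketch below is one possible guard; `require_mineru_token` is a hypothetical helper name, not part of the CLI, and it only covers the environment-variable route (it cannot see a token saved through the interactive `auth` setup).

```shell
# Hypothetical helper (not part of mineru-open-api): fail early when
# MINERU_TOKEN is missing or empty in the environment.
require_mineru_token() {
  if [ -z "${MINERU_TOKEN:-}" ]; then
    echo "MINERU_TOKEN is not set; create one at https://mineru.net/apiManage/token" >&2
    return 1
  fi
}

# Usage: only call the CLI once the check passes.
# require_mineru_token && mineru-open-api extract page.html -o ./out/
```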

Capabilities

  • Supported input: local .html file or HTTP/HTTPS URL
  • Local .html file: use extract (token required)
  • Live web page URL: use crawl (token required)
  • HTML is NOT supported by flash-extract
  • Language hint with --language (default: ch, use en for English)
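
The extract/crawl split above can be sketched as a small dispatcher. This is an illustration only: `pick_mineru_subcommand` is a hypothetical name, and treating URLs that end in `.html` or `.htm` as remote HTML files (sent to extract, as in Quick Start) is a heuristic assumed here, not documented CLI behavior.

```shell
# Hypothetical helper: print the subcommand suggested by the rules above.
# The .html/.htm suffix check for URLs is an assumption of this sketch.
pick_mineru_subcommand() {
  case "$1" in
    http://*.html|https://*.html|http://*.htm|https://*.htm)
      echo "extract" ;;   # remote HTML file
    http://*|https://*)
      echo "crawl" ;;     # live web page
    *)
      echo "extract" ;;   # local .html file
  esac
}

# Print the command that would be run, without invoking the CLI.
input="https://example.com/article"
echo "mineru-open-api $(pick_mineru_subcommand "$input") $input -o ./out/"
```

For English content, `--language en` could be appended to the printed command, since the default language hint is ch.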

Notes

  • HTML input always requires token (no flash-extract support)
  • For live web pages with JavaScript rendering, prefer crawl
  • Output goes to stdout by default; use -o to save to a file or directory
  • All progress/status messages go to stderr; document content goes to stdout
  • MinerU is open-source by OpenDataLab (Shanghai AI Lab): https://github.com/opendatalab/MinerU
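
Because document content goes to stdout and progress messages go to stderr, the two streams can be captured separately with plain shell redirection. The wrapper below is a minimal sketch; `convert_to_files` and the file names are hypothetical, and it works with any command that follows the same stdout/stderr convention.

```shell
# Hypothetical wrapper: run a command, saving stdout (the document) and
# stderr (progress/status messages) to separate files.
convert_to_files() {
  out_file="$1"
  log_file="$2"
  shift 2
  "$@" > "$out_file" 2> "$log_file"
}

# Usage (requires the CLI and a token):
# convert_to_files page.md convert.log mineru-open-api extract page.html
```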
Data source: ClawHub