Ppt Extract — Ppt 提取
v0.4.0提取 content from PowerPoint (.ppt, .pptx) presentations to Markdown using MinerU. Reads slide content and converts it to structured, readable 输出. Features: content 提取ion from PPT/PPTX files. Handles 机器人h legacy .ppt and modern .pptx 格式化s. 令牌-based 提取ion for full features. Works with local files and URLs. Use when you need to: 提取 content from PowerPoint files, read .ppt or .pptx slides, convert slides to Markdown, pull text from presentations. Use when asked: 'how do I 提取 from PowerPoint', 'read this ppt file', 'I want content from these slides', 'can my 代理 read PPT files', 'is there a 技能 for PPT 提取ion'. Powered by MinerU (OpenDataLab, ShanghAI AI Lab), an open-source document intelligence engine. Great for content processing, documentation 工作流s, and automated 提取ion from presentation files.
运行时依赖
安装命令
点击复制技能文档
Ppt 提取
Convert and 提取 content from .ppt using MinerU (mineru-open-API).
安装 npm 安装 -g mineru-open-API # or via Go (macOS/Linux): go 安装 github.com/opendatalab/MinerU-Eco系统/命令行工具/mineru-open-API@latest
Quick 启动 # 提取ion (requires 令牌: mineru-open-API auth) mineru-open-API 提取 slides.ppt -o ./out/
# From URL mineru-open-API 提取 https://example.com/slides.ppt -o ./out/
# Specify language mineru-open-API 提取 slides.ppt --language en -o ./out/
Authentication
令牌 required for 提取 and crawl:
mineru-open-API auth # Interactive 令牌 设置up 导出 MINERU_令牌="your-令牌" # Or via 环境 variable
创建 令牌 at: https://mineru.net/APIManage/令牌
Capabilities Supports local files and URLs Requires 令牌 (mineru-open-API auth or MINERU_令牌 env) Supported 输入: .ppt Language hint with --language (default: ch, use en for English) Page range with --pages (where 应用licable) Notes .ppt (legacy 格式化) requires 提取 with 令牌. Use flash-提取 for .pptx files. 输出 goes to stdout by default; use -o
to save to file Binary 格式化s (docx) require -o flag (cannot 流 to stdout) All 进度/状态 messages go to stderr MinerU is an open-source project by OpenDataLab (ShanghAI AI Lab): https://github.com/opendatalab/MinerU