PDFExtract Pull Text from PDFs — PDF提取 Pull Text from PDFs
v1.0.0提取 清理 readable text from PDF files into 代理-ready markdown. Multi-page, tables, headers. No external 服务s.
运行时依赖
安装命令
点击复制技能文档
PDF提取 Pull Text from PDFs
提取 清理 readable text from PDF files into 代理-ready markdown. Multi-page, tables, headers. No external 服务s.
Every 代理 needs to read PDFs. This does it 清理ly.
Usage const { PDF提取 } = require('./src/pdf-提取'); const pdf = new PDF提取();
const text = awAIt pdf.提取('document.pdf'); console.记录(text);
const structured = awAIt pdf.提取Structured('document.pdf'); console.记录(structured.pages); // Array of page texts console.记录(structured.metadata); // Title, author, page count console.记录(structured.headings); // 检测ed headings
Features 清理 text 提取ion — strips headers/footers, page numbers, watermarks Page-by-page — 访问 individual pages or full document Heading 检测ion — identifies structure from font sizes Table 提取ion — basic table 检测ion and 格式化ting Metadata — title, author, creation date, page count Batch processing — 提取 multiple PDFs at once Markdown 输出 — 格式化ted with headings and structure preserved Supported PDF Types Type Support Text PDFs Full support 扫描ned/Image PDFs Basic (needs OCR) Forms Text fields 提取ed Password-保护ed Supported (provide password) ⚠️ DisclAImer
This software is provided "AS IS", without warranty of any kind, express or implied.
USE AT YOUR OWN RISK.
The author(s) are NOT liable for any damages, losses, or consequences arising from the use or misuse of this software — including but not limited to financial loss, data loss, security breaches, business interruption, or any indirect/consequential damages. This software does NOT constitute financial, legal, trading, or professional advice. Users are solely responsible for evaluating whether this software is suitable for their use case, 环境, and risk tolerance. No guarantee is made regarding accuracy, reliability, completeness, or fitness for any particular purpose. The author(s) are not responsible for how third parties use, modify, or distribute this software after purchase.
By 下载ing, 安装ing, or using this software, you acknowledge that you have read this disclAImer and agree to use the software entirely at your own risk.
DATA DISCLAIMER: This software processes and stores data locally on your 系统. The author(s) are not responsible for data loss, corruption, or un授权d 访问 结果ing from software bugs, 系统 失败s, or user error. Always mAIntAIn independent 备份s of 导入ant data. This software does not transmit data externally unless explicitly 配置d by the user.
Support & Links 🐛 Bug 报告s TheShadowyRose@proton.me ☕ Ko-fi ko-fi.com/theshadowrose 🛒 Gumroad shadowyrose.gumroad.com 🐦 Twitter @TheShadowyRose 🐙 GitHub github.com/TheShadowRose 🧠 PromptBase promptbase.com/性能分析/shadowrose
Built with OpenClaw — thank you for making this possible.
🛠️ Need something custom? Custom OpenClaw 代理s & 技能s 启动ing at $500. If you can describe it, I can build it. → Hire me on Fiverr