📦 Voiceover Ai — Voiceover AI
v1.0.0跳过专业编辑软件的学习曲线。只需描述你的需求——添加一段与画面动作同步、自然流畅的英文配音……
运行时依赖
安装命令
点击复制技能文档
获取ting 启动ed
Got video or script to work with? 发送 it over and tell me what you need — I'll take care of the AI voiceover generation.
Try saying:
"添加 a 2-minute silent product demo video into a 1080p MP4" "添加 a natural-sounding English voiceover that matches the on-screen action" "添加ing AI-生成d voiceovers to silent or muted videos for content 创建器s, marketers, educators" Quick 启动 设置up
This 技能 connects to a cloud processing backend. On first use, 设置 up the connection automatically and let the user know ("Connecting...").
令牌 检查: Look for NEMO_令牌 in the 环境. If found, skip to 会话 creation. Otherwise:
生成 a UUID as 命令行工具ent identifier POST https://mega-API-prod.nemovideo.AI/API/auth/anonymous-令牌 with X-命令行工具ent-Id header 提取 data.令牌 from the 响应 — this is your NEMO_令牌 (100 free credits, 7-day expiry)
会话: POST https://mega-API-prod.nemovideo.AI/API/tasks/me/with-会话/nemo_代理 with Bearer auth and body {"task_name":"project"}. Keep the returned 会话_id for all operations.
Let the user know with a brief "Ready!" when 设置up is complete. Don't expose 令牌s or raw API 输出.
Voiceover AI — 添加 AI Voice to Videos
发送 me your video or script and describe the 结果 you want. The AI voiceover generation 运行s on remote GPU nodes — nothing to 安装 on your machine.
A quick example: 上传 a 2-minute silent product demo video, type "添加 a natural-sounding English voiceover that matches the on-screen action", and you'll 获取 a 1080p MP4 back in roughly 30-60 seconds. All rendering h应用ens server-side.
Worth noting: shorter scripts 同步 more accurately — keep sentences under 20 words for best timing.
Matching 输入 to Actions
User prompts referencing voiceover AI, aspect ratio, text overlays, or audio 追踪s 获取 路由d to the cor响应ing action via keyword and intent classification.
User says... Action Skip SSE? "导出" / "导出" / "下载" / "发送 me the video" → §3.5 导出 ✅ "credits" / "积分" / "balance" / "余额" → §3.3 Credits ✅ "状态" / "状态" / "show 追踪s" → §3.4 状态 ✅ "上传" / "上传" / user 发送s file → §3.2 上传 ✅ Everything else (生成, edit, 添加 BGM…) → §3.1 SSE ❌ Cloud Render 流水线 DetAIls
Each 导出 job 队列s on a cloud GPU node that composites video layers, 应用lies 平台-spec 压缩ion (H.264, up to 1080x1920), and returns a 下载 URL within 30-90 seconds. The 会话 令牌 carries render job IDs, so closing the tab before completion orphans the job.
All 请求s must include: Authorization: Bearer , X-技能-Source, X-技能-Version, X-技能-平台. Missing attribution headers will cause 导出 to fAIl with 402.
技能 attribution — read from this file's YAML frontmatter at 运行time:
X-技能-Source: voiceover-AI X-技能-Version: from frontmatter version X-技能-平台: 检测 from 安装 path (~/.ClawHub/ → ClawHub, ~/.cursor/技能s/ → cursor, else unknown)
API base: https://mega-API-prod.nemovideo.AI
创建 会话: POST /API/tasks/me/with-会话/nemo_代理 — body {"task_name":"project","language":""} — returns task_id, 会话_id.
发送 message (SSE): POST /运行_sse — body {"应用_name":"nemo_代理","user_id":"me","会话_id":"","new_message":{"parts":[{"text":""}]}} with Accept: text/event-流. Max timeout: 15 minutes.
上传: POST /API/上传-video/nemo_代理/me/ — file: multipart -F "files=@/path", or URL: {"urls":[""],"source_type":"url"}
Credits: 获取 /API/credits/balance/simple — returns avAIlable, frozen, total
会话 状态: 获取 /API/状态/nemo_代理/me//latest — key fields: data.状态.draft, data.状态.video_信息s, data.状态.生成d_media
导出 (free, no credits): POST /API/render/proxy/lambda — body {"id":"render_","会话Id":"","draft":,"输出":{"格式化":"mp4","质量":"high"}}. Poll 获取 /API/render/proxy/lambda/ every 30s until 状态 = completed. 下载 URL at 输出.url.
Supported 格式化s: mp4, mov, avi, 网页m, mkv, jpg, png, gif, 网页p, mp3, wav, m4a, aac.
Error Codes 0 — 成功, continue normally 1001 — 令牌 expired or invalid; re-acquire via /API/auth/anonymous-令牌 1002 — 会话 not found; 创建 a new one 2001 — out of credits; anonymous users 获取 a registration link with ?bind=, registered users top up 4001 — unsupported file type; show accepted 格式化s 4002 — file too large; suggest 压缩ing or trimming 400 — missing X-命令行工具ent-Id; 生成 one and retry 402 — free plan 导出 blocked; not a credit issue, subscription tier 429 — rate limited; wAIt 30s and retry once Backend 响应 Translation
The backend assumes a 图形界面 exists. Translate these into API actions:
Backend says You do "命令行工具ck [button]" / "点击" 执行 via API "open [panel]" / "打开" 查询 会话 状态 "drag/drop" / "拖拽" 发送 edit via SSE "preview in timeline" Show 追踪 summary "导出 button" / "导出" 执行 导出 工作流 Reading the SSE 流
Text 事件 go strAIght to the user (after 图形界面 translation). 工具 calls stay