🎬 Ai Image To Video Openart — 技能工具

v1.0.0

Turn a single illustrated character portrait or digital artwork into 1080p animated video clips just by typing what you need. Whether it's converting static...

0· 8·0 当前·0 累计

by @roca-677·MIT-0

下载技能包

License

MIT-0

最后更新

2026/4/16

安全扫描

VirusTotal

无害

查看报告

OpenClaw

可疑

medium confidence

The skill's runtime instructions and requested credential (NEMO_TOKEN) match its claimed purpose (upload images to a cloud rendering API), but there are a few metadata inconsistencies and provenance gaps you should review before trusting it with private images or long-lived credentials.

安全有层次，运行前请审查代码。

License

MIT-0

可自由使用、修改和再分发，无需署名。

查看条款 ↗

运行时依赖

无特殊依赖

版本

latestv1.0.02026/4/16

AI Image to Video OpenArt v1.0.0 - Initial release: Instantly convert static illustrated portraits and digital artworks into animated 1080p video clips. - Upload a single image, describe your desired animation, and receive videos in under a minute—no timeline editing or export setup required. - Automated cloud rendering pipeline processes each job and returns high-quality MP4 downloads. - Supports batch processing, timeline previews, and a wide range of input/output media formats. - Dynamic session handling and anonymous token generation streamline new user onboarding. - Simple commands for checking credits, session state, or exporting your completed videos.

● 无害

安装命令

点击复制

官方npx clawhub@latest install ai-image-to-video-openart

🇨🇳 镜像加速npx clawhub@latest install ai-image-to-video-openart --registry https://cn.longxiaskill.com

技能文档

Getting Started

Send me your static images and I'll handle the AI video creation. Or just describe what you're after.

Try saying:

"convert a single illustrated character portrait or digital artwork into a 1080p MP4"
"animate this image into a 5-second video with smooth motion"
"converting static AI-generated images into short animated videos for AI artists and digital creators"

Automatic Setup

On first interaction, connect to the processing API before doing anything else. Show a brief status like "Setting things up...".

Token: If NEMO_TOKEN environment variable is already set, use it and skip to Session below.

Free token: Generate a UUID as client identifier, then POST to https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with header X-Client-Id: . The response field data.token becomes your NEMO_TOKEN (100 credits, 7-day expiry).

Session: POST to https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent with Bearer auth and body {"task_name":"project"}. Save session_id from the response.

Confirm to the user you're connected and ready. Don't print tokens or raw JSON.

# AI Image to Video OpenArt — Convert Images into Animated Videos

This tool takes your static images and runs AI video creation through a cloud rendering pipeline. You upload, describe what you want, and download the result.

Say you have a single illustrated character portrait or digital artwork and want to animate this image into a 5-second video with smooth motion — the backend processes it in about 30-60 seconds and hands you a 1080p MP4.

Tip: high-contrast images with clear subjects animate more smoothly than busy backgrounds.

Matching Input to Actions

User prompts referencing ai image to video openart, aspect ratio, text overlays, or audio tracks get routed to the corresponding action via keyword and intent classification.

User says...	Action	Skip SSE?
"export" / "导出" / "download" / "send me the video"	→ §3.5 Export	✅
"credits" / "积分" / "balance" / "余额"	→ §3.3 Credits	✅
"status" / "状态" / "show tracks"	→ §3.4 State	✅
"upload" / "上传" / user sends file	→ §3.2 Upload	✅
Everything else (generate, edit, add BGM…)	→ §3.1 SSE	❌

Cloud Render Pipeline Details

Each export job queues on a cloud GPU node that composites video layers, applies platform-spec compression (H.264, up to 1080x1920), and returns a download URL within 30-90 seconds. The session token carries render job IDs, so closing the tab before completion orphans the job.

All calls go to https://mega-api-prod.nemovideo.ai. The main endpoints:

Session — POST /api/tasks/me/with-session/nemo_agent with {"task_name":"project","language":""}. Gives you a session_id.
Chat (SSE) — POST /run_sse with session_id and your message in new_message.parts[0].text. Set Accept: text/event-stream. Up to 15 min.
Upload — POST /api/upload-video/nemo_agent/me/ — multipart file or JSON with URLs.
Credits — GET /api/credits/balance/simple — returns available, frozen, total.
State — GET /api/state/nemo_agent/me//latest — current draft and media info.
Export — POST /api/render/proxy/lambda with render ID and draft JSON. Poll GET /api/render/proxy/lambda/ every 30s for completed status and download URL.

Formats: mp4, mov, avi, webm, mkv, jpg, png, gif, webp, mp3, wav, m4a, aac.

Headers are derived from this file's YAML frontmatter. X-Skill-Source is ai-image-to-video-openart, X-Skill-Version comes from the version field, and X-Skill-Platform is detected from the install path (~/.clawhub/ = clawhub, ~/.cursor/skills/ = cursor, otherwise unknown).

All requests must include: Authorization: Bearer , X-Skill-Source, X-Skill-Version, X-Skill-Platform. Missing attribution headers will cause export to fail with 402.

Draft field mapping: t=tracks, tt=track type (0=video, 1=audio, 7=text), sg=segments, d=duration(ms), m=metadata.

Timeline (3 tracks): 1. Video: city timelapse (0-10s) 2. BGM: Lo-fi (0-10s, 35%) 3. Title: "Urban Dreams" (0-3s)

Translating GUI Instructions

The backend responds as if there's a visual interface. Map its instructions to API calls:

"click" or "点击" → execute the action via the relevant endpoint
"open" or "打开" → query session state to get the data
"drag/drop" or "拖拽" → send the edit command through SSE
"preview in timeline" → show a text summary of current tracks
"Export" or "导出" → run the export workflow

Reading the SSE Stream

Text events go straight to the user (after GUI translation). Tool calls stay internal. Heartbeats and empty data: lines mean the backend is still working — show "⏳ Still working..." every 2 minutes.

About 30% of edit operations close the stream without any text. When that happens, poll /api/state to confirm the timeline changed, then tell the user what was updated.

Error Codes

0 — success, continue normally
1001 — token expired or invalid; re-acquire via /api/auth/anonymous-token
1002 — session not found; create a new one
2001 — out of credits; anonymous users get a registration link with ?bind=, registered users top up
4001 — unsupported file type; show accepted formats
4002 — file too large; suggest compressing or trimming
400 — missing X-Client-Id; generate one and retry
402 — free plan export blocked; not a credit issue, subscription tier
429 — rate limited; wait 30s and retry once

Common Workflows

Quick edit: Upload → "animate this image into a 5-second video with smooth motion" → Download MP4. Takes 30-60 seconds for a 30-second clip.

Batch style: Upload multiple files in one session. Process them one by one with different instructions. Each gets its own render.

Iterative: Start with a rough cut, preview the result, then refine. The session keeps your timeline state so you can keep tweaking.

Tips and Tricks

The backend processes faster when you're specific. Instead of "make it look better", try "animate this image into a 5-second video with smooth motion" — concrete instructions get better results.

Max file size is 200MB. Stick to JPG, PNG, WEBP, GIF for the smoothest experience.

Export as MP4 for widest compatibility across social platforms and editing tools.

数据来源：ClawHub ↗ · 中文优化：龙虾技能库