Ai Video Remix — AI Tool

Name: Ai Video Remix — AI Tool
Rating: 1 (1 review)
Author: Yoki

v0.1.3

[AI-assisted] AI-driven video remix generator that uses ShotAI semantic search + LLM planning + Remotion rendering to produce styled video compositions from a user's local...

by @abu-shotai (Yoki)·MIT-0

Tags: AI model access, code generation, system tools, network tools, developer tools

License

MIT-0

Last updated

2026/4/4

Security scan

VirusTotal

Benign

OpenClaw

Safe

high confidence

The skill's requested binaries and environment variables line up with its stated purpose (local ShotAI + Remotion video remixing); the main caution is that it is instruction-only and tells you to clone and run third‑party code from GitHub, which you should review before executing.

Assessment recommendations

This skill appears to do what it says: a local ShotAI + Remotion video remix pipeline. Before installing/using it, review the GitHub repo code you will clone and the Node dependencies (npm install) because the skill instructs you to run that code locally. Keep SHOTAI_URL pointing to localhost and keep SHOTAI_TOKEN secret (it's used to query your local MCP server). If you don't want external network access, avoid using the optional yt-dlp music download and set AGENT_PROVIDER=none to use heuristi...

Detailed analysis

✓ Purpose and capability

Name/description, required binaries (ffmpeg, node), and required env vars (SHOTAI_URL, SHOTAI_TOKEN) are consistent with a local ShotAI + Remotion video pipeline. No unrelated credentials or binaries are requested.

ℹ Instruction scope

SKILL.md instructs the agent/user to git clone the repository, run npm install and execute the CLI (npx tsx src/skill/cli.ts). That is expected for an instruction-only skill that relies on external runtime code, but it means the agent will be guided to run third‑party code on the host. The instructions otherwise stay within the stated domain (local ShotAI MCP, local HTTP server on 127.0.0.1, ffmpeg processing, optional yt-dlp for music).

ℹ Install mechanism

No formal install spec in the registry (lowest platform install risk). However, runtime usage requires cloning a GitHub repository and running npm install (downloading Node packages). GitHub is a common source, but cloning and executing external code increases operational risk and should be reviewed by the user.

✓ Credential requirements

Declared env vars are minimal and appropriate: SHOTAI_URL and SHOTAI_TOKEN (primaryEnv). Additional LLM/API keys appear only in docs as optional (used if you configure AGENT_PROVIDER). There is no evidence the skill requests unrelated or excessive secrets. The docs state SHOTAI_TOKEN is sent only to the SHOTAI_URL (local MCP).

✓ Persistence and privileges

Skill is not always-on and uses default autonomous invocation behavior; it does not request system-wide config paths or modify other skills. No elevated persistence or unusual privileges are declared.

Security is layered; review the code before running it.

Runtime dependencies

No special dependencies

Versions

latest · v0.1.3 · 2026/3/19

ai-video-remix 0.1.2
- Converted this skill to an instruction-only/documentation format.
- Added explicit notice that runtime source code must be cloned separately; no code is bundled.
- Provided GitHub source and homepage links in metadata.
- Updated Quick Start instructions to clarify cloning and setup process.
- No code or functional changes included in this release.

● Benign

Install command

Official: npx clawhub@latest install ai-video-remix

Mirror: npx clawhub@latest install ai-video-remix --registry https://cn.clawhub-mirror.com

Skill documentation

This is an instruction-only skill — it provides guidance and reference documentation for the AI Video Remix CLI tool. The runtime source code lives in the GitHub repository and must be cloned separately (see Quick Start below).

Generate styled video compositions from a local ShotAI video library using natural language.

Important: Video Library Requirement

This skill can only search and use videos that have been imported into ShotAI. Videos simply stored on your hard drive are not searchable — they must be added to a ShotAI collection and fully indexed first.

Before using this skill, make sure you have:

Opened ShotAI and created a collection
Added your video files or folders to the collection
Waited for indexing to complete (shot detection + semantic analysis — progress is shown in ShotAI)

If the search returns no results or low-quality matches, the most common reason is that the relevant videos have not been imported into ShotAI yet.

Prerequisites

See references/setup.md for full installation instructions, including:

ShotAI download and setup
ffmpeg installation
yt-dlp installation (for auto music)
Node.js dependencies

Quick Start

Note: This skill does not bundle runtime code. Clone the source repository first.

git clone https://github.com/abu-ShotAI/ai-video-remix.git
cd ai-video-remix
npm install
cp .env.example .env    # fill in SHOTAI_URL, SHOTAI_TOKEN, and optionally AGENT_PROVIDER
npx tsx src/skill/cli.ts "帮我做一个旅行混剪"

Pipeline (8 steps)

Agent: parseIntent — LLM extracts theme, selects composition, optionally overrides music style
Agent: refineQueries — LLM rewrites per-slot search terms to match library content
ShotAI: pickShots — Semantic search per slot via local ShotAI MCP server (localhost only), best shot selected
Music: resolveMusic — Uses local MP3 via --bgm (recommended), or optionally downloads from YouTube via yt-dlp
ffmpeg: extractClip — Each shot trimmed to independent .mp4 clip file (local processing only)
Agent: annotateClips — LLM assigns per-clip visual effect params (tone, dramatic, kenBurns, caption)
File Server — Localhost-only HTTP server (127.0.0.1) serves clips to the Remotion renderer on the same machine
Remotion: render — Composition rendered to final MP4
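
As a rough illustration of the list above, the eight stages can be written down as data, together with a note on which ones may reach outside the machine. This is a sketch, not the tool's actual code: the `Stage` type, the `fileServer` and `render` labels, and the `externalStages` helper are hypothetical names, and the network column is inferred from the descriptions in this document.

```typescript
// Hypothetical model of the 8-step pipeline described above.
// "llm" = may call an LLM provider, "youtube" = may call yt-dlp, "none" = local only.
type Network = "none" | "llm" | "youtube";

interface Stage {
  name: string;
  network: Network;
}

const PIPELINE: readonly Stage[] = [
  { name: "parseIntent", network: "llm" },
  { name: "refineQueries", network: "llm" },
  { name: "pickShots", network: "none" }, // local ShotAI MCP server
  { name: "resolveMusic", network: "youtube" }, // local when --bgm is given
  { name: "extractClip", network: "none" }, // ffmpeg, local processing
  { name: "annotateClips", network: "llm" },
  { name: "fileServer", network: "none" }, // binds to 127.0.0.1
  { name: "render", network: "none" }, // Remotion
];

// Returns the stages that would make external network calls for a given setup.
function externalStages(opts: { llmEnabled: boolean; localBgm: boolean }): string[] {
  return PIPELINE.filter(
    (s) =>
      (s.network === "llm" && opts.llmEnabled) ||
      (s.network === "youtube" && !opts.localBgm),
  ).map((s) => s.name);
}
```

Under these assumptions, running with a local MP3 and no LLM provider leaves no stage with external access, which matches the "zero external network calls" configuration described later in this document.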

CLI Usage

After cloning the repository and running npm install:

npx tsx src/skill/cli.ts "<request>" [options]
Options:
  --composition <id>   Override composition (skip LLM selection)
  --bgm <file>         Local MP3 path (skip YouTube search)
  --output <dir>       Output directory (default: ./output)
  --lang <zh|en>       Output language: zh Chinese (default) / en English
                       Affects: video title, per-clip captions & location labels, attribution line
  --probe              Scan library first, let LLM plan slots from actual content
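
To make the option semantics concrete, here is a minimal sketch of how these flags could be parsed. It is not the repository's `cli.ts`: the `CliOptions` shape and `parseCli` function are hypothetical, and treating the request string as required is an assumption. The defaults (`./output`, `zh`) are the ones listed above.

```typescript
// Hypothetical option shape; field names mirror the flags documented above.
interface CliOptions {
  request: string;
  composition?: string;
  bgm?: string;
  output: string;
  lang: "zh" | "en";
  probe: boolean;
}

const VALUE_FLAGS = new Set(["--composition", "--bgm", "--output", "--lang"]);

function parseCli(argv: readonly string[]): CliOptions {
  const args = [...argv];
  const opts: CliOptions = { request: "", output: "./output", lang: "zh", probe: false };
  while (args.length > 0) {
    const arg = args.shift() as string;
    if (arg === "--probe") {
      opts.probe = true;
      continue;
    }
    if (VALUE_FLAGS.has(arg)) {
      const value = args.shift();
      if (value === undefined) throw new Error(`${arg} requires a value`);
      switch (arg) {
        case "--composition": opts.composition = value; break;
        case "--bgm": opts.bgm = value; break;
        case "--output": opts.output = value; break;
        case "--lang":
          if (value !== "zh" && value !== "en") throw new Error(`--lang must be zh or en, got ${value}`);
          opts.lang = value;
          break;
      }
      continue;
    }
    if (arg.startsWith("--")) throw new Error(`unknown option: ${arg}`);
    opts.request = arg; // the natural-language request
  }
  if (opts.request === "") throw new Error("a request string is required"); // assumption
  return opts;
}
```

For example, `parseCli(["travel remix", "--bgm", "./song.mp3", "--probe"])` would yield probe mode with a local music file and the default output directory and language.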

Compositions

ID	Label	Best For
`CyberpunkCity`	赛博朋克夜景	Neon city, night scenes, sci-fi
`TravelVlog`	旅行 Vlog	Multi-city travel with location cards
`MoodDriven`	情绪驱动混剪	Fast/slow emotion cuts
`NatureWild`	自然野生动物	BBC nature documentary style
`SwitzerlandScenic`	瑞士风光	Alpine/scenic travel with captions
`SportsHighlight`	体育集锦	ESPN-style with goal captions

Modes

Standard mode (default): LLM picks composition + generates search queries from registry templates.

Probe mode (--probe): Scans library videos first (names, shot samples, mood/scene tags), then LLM generates custom slots tailored to what actually exists.

Choose probe mode when: library content is unknown, user wants "best of my library", or standard slots return low-quality shots.

Environment Variables

See references/config.md for all environment variables and LLM provider setup.

Troubleshooting & Quality Tuning

See references/tuning.md for solutions to:

Clip boundary flicker / 1–2 frame flash at cuts
Red flash artifact in CyberpunkCity (GlitchFlicker on short clips)
Low-quality or off-topic shots
Music download failures

Recommended .env defaults for best quality:

MIN_SCORE=0.5    # filter short/low-quality shots
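
One way to picture what a `MIN_SCORE` threshold does is a simple filter over scored candidates. The sketch below is an assumption about the behaviour, not the tool's implementation: the `ShotCandidate` shape, both function names, and the choice of an inclusive comparison (`>=`) are hypothetical.

```typescript
// Hypothetical candidate shape: an id plus a similarity score in [0, 1].
interface ShotCandidate {
  id: string;
  score: number;
}

// Reads MIN_SCORE from an env-like object; falls back when unset or not a number.
function minScoreFrom(env: Record<string, string | undefined>, fallback: number): number {
  const parsed = Number.parseFloat(env["MIN_SCORE"] ?? "");
  return Number.isFinite(parsed) ? parsed : fallback;
}

// Keeps only candidates at or above the threshold (inclusive by assumption).
function filterByScore(shots: readonly ShotCandidate[], minScore: number): ShotCandidate[] {
  return shots.filter((s) => s.score >= minScore);
}
```

With `MIN_SCORE=0.5`, a candidate scored 0.42 would be dropped and one scored 0.5 or higher would be kept.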

Writing ShotAI Search Queries

ShotAI uses semantic search powered by AI-generated tags and embedding vectors. Query quality is the single biggest factor in shot relevance — invest time here.

Query construction rules

Always write full sentences or rich phrases, never bare keywords.

The search engine understands semantic similarity ("ocean" matches "sea", "waves", "shoreline"), so richer context produces better recall.

Quality	Example	When to use
⭐ Detailed description	`"A white seagull with spread wings gliding smoothly over calm blue ocean water, golden sunset light reflecting on the waves"`	Best precision — use for hero shots
⭐ Full sentence	`"A seagull flying gracefully over the ocean at sunset"`	Good balance of precision and recall
Short phrase	`"seagull flying over ocean"`	Acceptable fallback
Single keyword	`"seagull"`	Avoid — low precision, noisy results

What to include in a query

Describe the visual content of the ideal shot across these dimensions:

Subject: what/who is in frame (a lone hiker, city traffic at night, athlete celebrating)
Action: what is happening (walking slowly through fog, speeding through intersection, jumping with arms raised)
Environment: location, setting, time of day (rain-soaked Tokyo street, mountain meadow at golden hour, empty stadium under floodlights)
Mood / atmosphere: emotional tone (melancholic, tense, euphoric, serene)
Camera feel: implied movement or framing (wide establishing shot, tight close-up, slow pan, handheld shaky)

Not all dimensions are needed every time — include whichever are most distinctive for the shot you want.
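
The dimensions above can be combined mechanically into a rich phrase. The helper below is a minimal sketch of that idea; `ShotDescription` and `buildQuery` are hypothetical names, and the joining format (subject plus action, then comma-separated details) is just one reasonable choice.

```typescript
// Hypothetical description of the ideal shot; only the subject is required.
interface ShotDescription {
  subject: string;
  action?: string;
  environment?: string;
  mood?: string;
  camera?: string;
}

// Joins whichever dimensions are present into a single descriptive query.
function buildQuery(d: ShotDescription): string {
  const lead = [d.subject, d.action].filter(Boolean).join(" ");
  const details = [d.environment, d.mood ? `${d.mood} mood` : undefined, d.camera].filter(Boolean);
  return [lead, ...details].join(", ");
}

const query = buildQuery({
  subject: "A lone hiker",
  action: "walking slowly through fog",
  environment: "mountain meadow at golden hour",
  mood: "serene",
  camera: "wide establishing shot",
});
// query === "A lone hiker walking slowly through fog, mountain meadow at golden hour, serene mood, wide establishing shot"
```

Omitted dimensions are simply skipped, so a subject-and-action pair alone still produces a usable short phrase.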

The refineQueries step

When the agent runs refineQueries, it rewrites the composition's default slot queries to better match the user's actual library. Apply these principles:

Start from the slot's semantic intent — what emotional or narrative role does this shot play in the composition?
Incorporate any context from the user's request — location names, event names, specific subjects mentioned
Expand synonyms — if the slot says "water", try "river flowing through forest" or "lake reflecting mountains" based on what the library likely contains
Avoid negations — "not indoors" does not work; instead describe the positive version ("outdoor daylight scene")
One query per slot — make it specific rather than trying to cover multiple scenarios
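
Two of the principles above (avoid negations, avoid bare keywords) are easy to check automatically before a query is sent. The following is a small heuristic sketch; `lintQuery`, the word list in `NEGATION`, and the three-word minimum are illustrative assumptions rather than anything ShotAI or the CLI defines.

```typescript
type QueryIssue = "negation" | "too-short";

// A few common English negation words; deliberately not exhaustive.
const NEGATION = /\b(not|no|never|without)\b/i;

// Flags queries that use negations or are too short to carry visual context.
function lintQuery(query: string): QueryIssue[] {
  const issues: QueryIssue[] = [];
  const words = query.trim().split(/\s+/).filter((w) => w.length > 0);
  if (NEGATION.test(query)) issues.push("negation");
  if (words.length < 3) issues.push("too-short");
  return issues;
}
```

Under this heuristic, "not indoors" is flagged on both counts, while "outdoor daylight scene" passes cleanly.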

Examples: slot query → refined query

Slot default: "city at night"
User request: "帮我做一个东京旅行混剪"
Refined: "Neon-lit Tokyo street at night, pedestrians crossing under glowing signs, rain reflections on pavement"

Slot default: "nature landscape"
User request: "trip to Patagonia last month"
Refined: "Dramatic Patagonia mountain landscape, snow-capped peaks under stormy clouds, vast open wilderness"

Slot default: "athlete in action"
User request: "basketball highlight from last game"
Refined: "Basketball player driving to the hoop, explosive movement, crowd in background blurred"

Adding a New Composition

See references/composition-guide.md to add a new Remotion composition to the registry.

Safety and Fallback

Network & credential scope

All credentials stay local. SHOTAI_TOKEN is sent only to the local ShotAI MCP server (127.0.0.1). LLM API keys (if configured) are sent only to their respective provider endpoints — never to ShotAI, YouTube, or any other service.
The clip file server binds to 127.0.0.1 only (default port 8080). It is not accessible from other machines on the network. It serves temporary clip files to the Remotion renderer running on the same machine and shuts down after rendering completes.
yt-dlp is optional. Use --bgm /path/to/local.mp3 to skip all YouTube network access. When yt-dlp is used, it only downloads a single background music track — no other data is sent to YouTube.
LLM access is optional. Set AGENT_PROVIDER=none to run in heuristic mode with zero external network calls (aside from the local ShotAI MCP server).
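
Because the token is only meant to go to a local server, it can be worth confirming that `SHOTAI_URL` really points at the loopback interface before sending anything. The check below is a sketch under that assumption; `isLoopbackUrl` and the `LOOPBACK_HOSTS` set are hypothetical, and the set is intentionally narrow (it does not cover other 127.x.x.x addresses).

```typescript
// Hostnames treated as local. URL.hostname keeps the brackets around IPv6 literals.
const LOOPBACK_HOSTS = new Set(["127.0.0.1", "localhost", "[::1]"]);

// True only when the string parses as a URL whose host is a known loopback name.
function isLoopbackUrl(raw: string): boolean {
  try {
    return LOOPBACK_HOSTS.has(new URL(raw).hostname);
  } catch {
    return false; // not a valid absolute URL
  }
}
```

A caller could refuse to attach `SHOTAI_TOKEN` to a request whenever this returns false.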

Error handling

If SHOTAI_URL or SHOTAI_TOKEN is unset, display a warning: "ShotAI MCP server is not configured. Set SHOTAI_URL and SHOTAI_TOKEN in your .env file. Download ShotAI at https://www.shotai.io."
If the ShotAI MCP server returns an error (connection refused, HTTP 4xx/5xx), display the error message and stop — do not fabricate shot results.
Never fabricate video file paths, shot timestamps, or similarity scores.
If music download fails (yt-dlp error or network unreachable), suggest using --bgm to provide a local audio file instead.
If Remotion render fails, display the error output and suggest checking Node.js version (18+) and that all clip files were extracted successfully.
If the LLM provider is unreachable, fall back to heuristic mode: use composition default queries directly without refinement, and skip annotateClips (use composition default effect params).
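
The first and last rules above can be sketched as two small checks. This is an illustration only: `shotAiConfigWarning` and `chooseMode` are hypothetical names, and treating an unset `AGENT_PROVIDER` the same as `none` is an assumption. The warning text is the one quoted above.

```typescript
type Env = Record<string, string | undefined>;

const NOT_CONFIGURED =
  "ShotAI MCP server is not configured. Set SHOTAI_URL and SHOTAI_TOKEN in your .env file. " +
  "Download ShotAI at https://www.shotai.io.";

// Returns the warning to display, or null when both variables are set.
function shotAiConfigWarning(env: Env): string | null {
  return env["SHOTAI_URL"] && env["SHOTAI_TOKEN"] ? null : NOT_CONFIGURED;
}

type Mode = "llm" | "heuristic";

// Falls back to heuristic mode when no provider is configured or it cannot be reached.
function chooseMode(env: Env, llmReachable: boolean): Mode {
  const provider = env["AGENT_PROVIDER"];
  if (!provider || provider === "none" || !llmReachable) return "heuristic";
  return "llm";
}
```

In heuristic mode the composition's default queries and effect parameters are used directly, so no refinement or annotation step is needed.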

License

MIT-0 — Free to use, modify, and redistribute. No attribution required. See https://spdx.org/licenses/MIT-0.html

Data source: ClawHub · Chinese localization: 龙虾技能库
