Image Gen — 技能工具

Name: Image Gen — 技能工具
Author: zweien

zweien

Image Gen — 技能工具

v0.1.0

Nano-banana 图像生成与编辑 Skill。支持文生图和图生图，根据文字描述或参考图使用 AI 生成/编辑图像，自动压缩大图，保存到 workspace/img 目录，并直接在飞书中展示。触发场景：用户要求生成图片、画图、AI 绘图、创建图像、修改图片、基于图片生成、图生图。

0· 1,000·4 当前·4 累计

by @zweien·MIT-0

代码生成 AI模型访问自动化开发工具

下载技能包

License

MIT-0

最后更新

2026/4/8

安全扫描

VirusTotal

无害

查看报告

OpenClaw

可疑

medium confidence

The skill largely does what it claims (generate/edit images) but has multiple inconsistencies and missing declarations—most importantly it requires an API key and external endpoint (not declared in metadata) and depends on CLI/tools/Python packages that aren't documented; you should verify the external service and dependencies before use.

评估建议

Before installing or running this skill: - Be aware it will send prompts and any reference images to a third-party endpoint (https://api.imyaigc.top). Only provide images/prompts you are comfortable sharing with that service. - The registry metadata omits required env vars; you must create a .env containing IMAGE_API_BASE_URL and IMAGE_API_KEY. Treat that API key like a secret. - Verify and trust the external API host before providing credentials. Consider contacting the skill author or using a ...

详细分析 ▾

⚠ 用途与能力

The skill's stated purpose (text→image and image→image) matches the included scripts and compressor. However the registry metadata lists no required environment variables or credentials while the SKILL.md and scripts clearly require IMAGE_API_BASE_URL and IMAGE_API_KEY (and optional IMAGE_MODEL, IMAGE_SIZE, IMAGE_ASPECT_RATIO). This mismatch is incoherent and should have been declared as required credentials.

ℹ 指令范围

Runtime instructions convert user-provided reference images to base64 and send them (and prompts) to an external API (https://api.imyaigc.top). They save originals under ~/.openclaw/workspace/img and compress >1MB images. That behavior is expected for this skill but has privacy implications (user images and prompts are transmitted to a third party). Also SKILL.md says create .env in the workspace, while image-gen.sh sources .env from the script directory—this path mismatch may cause confusion.

⚠ 安装机制

There is no install spec (instruction-only), but code files are included. The scripts depend on external tools/libraries (curl, jq, file, base64, Python with Pillow) but the skill does not declare these dependencies or provide install steps. Lack of dependency declaration increases risk of runtime errors and hidden behaviors when you try to run it.

⚠ 凭证需求

The skill requires a single API key and base URL to call an external image-generation service—this is proportionate to its function. However the metadata does not declare these required env vars or mark IMAGE_API_KEY as the primary credential. The external base URL (api.imyaigc.top) is not a well-known provider; sending images and prompts there may be a privacy/data-exfiltration risk if you don't trust the service.

✓ 持久化与权限

The skill does not request always:true or system-wide privileges, and does not attempt to modify other skills or system configuration. It only writes image files to ~/.openclaw/workspace/img/ (as described).

安装前注意事项

and Python Pillow (PIL) but don’t declare or install them—install/verify these dependencies in a sandbox first. - Note the .env path discrepancy: SKILL.md suggests a workspace .env, while image-gen.sh sources .env from the skill directory—confirm which file the runtime will actually load to avoid leaking a key to the wrong place. - If you cannot verify the endpoint or the author, run the skill in an isolated environment or prefer an alternative skill that declares its credentials and dependencies explicitly.

安全有层次，运行前请审查代码。

License

MIT-0

可自由使用、修改和再分发，无需署名。

查看条款 ↗

运行时依赖

无特殊依赖

版本

latestv0.1.02026/3/11

image-gen-skill v0.1.0 changelog - Initial release of the Nano-banana 图像生成与编辑 Skill. - Supports text-to-image and image-to-image generation/editing via Gemini-3.1-Flash API. - Automatically compresses images larger than 1MB before sending, preserving both original and compressed versions in workspace/img. - Offers flexible aspect ratio and size settings; saves images with timestamp-based filenames. - Includes clear usage instructions and sample API calls for both modes. - Provides built-in compression utility and detailed directory/configuration guidance.

● 无害

安装命令点击复制

官方npx clawhub@latest install image-gen-skill

镜像加速npx clawhub@latest install image-gen-skill --registry https://cn.clawhub-mirror.com

技能文档

使用 Nano-banana-3.1-Flash API 进行图像生成与编辑，支持：

文生图 (text-to-image): 根据文字描述生成图像
图生图 (image-to-image): 基于参考图生成新图像

自动压缩超过 1MB 的图片，保存到 workspace，并直接在飞书中展示。

配置

在workspace目录下创建 .env 文件：

# .env
IMAGE_API_BASE_URL=https://api.imyaigc.top
IMAGE_API_KEY=your-api-key-here
IMAGE_MODEL=gemini-3.1-flash-image-preview
IMAGE_SIZE=2K
IMAGE_ASPECT_RATIO=1:1

图片保存位置

所有生成的图片保存在：

~/.openclaw/workspace/img/

文件名格式：image_YYYYMMDD_HHMMSS.png

压缩规则：

原始图片保存为：image_YYYYMMDD_HHMMSS_original.png
如果原图 > 1MB，自动压缩并保存为：image_YYYYMMDD_HHMMSS.png
发送到时使用压缩后的版本

使用方法

1. 文生图 (Text-to-Image)

直接告诉我要生成什么图片：

示例：

"生成一张夕阳下的海滩图片"
"画一只可爱的猫咪"
"创建一个赛博朋克风格的大龙虾，16:9 比例"

2. 图生图 (Image-to-Image)

提供参考图和修改描述：

示例：

"基于这张图，改成水彩风格"
"把这张图改成夜景"
"参考这张图，生成一个类似风格的建筑"
"把这张照片变成油画风格"

工作流程：

用户提供参考图（发送图片）
下载图片并转为 base64
调用 API 时传入 image 参数
返回基于参考图生成的新图像

工作流程

文生图流程

解析参数

- 提取提示词 - 识别宽高比参数（如 "16:9"、"1:1"）

调用 API

   curl -X POST "${IMAGE_API_BASE_URL}/v1/images/generations" \
       -H "Authorization: Bearer ${IMAGE_API_KEY}" \
       -H "Content-Type: application/json" \
       -d '{
           "model": "'"${IMAGE_MODEL}"'",
           "prompt": "描述内容",
           "response_format": "url",
           "aspect_ratio": "16:9"
       }'

图生图流程

获取参考图

- 用户发送图片 - 下载图片并转为 base64

调用 API

   curl -X POST "${IMAGE_API_BASE_URL}/v1/images/generations" \
       -H "Authorization: Bearer ${IMAGE_API_KEY}" \
       -H "Content-Type: application/json" \
       -d '{
           "model": "'"${IMAGE_MODEL}"'",
           "prompt": "修改描述（如：改为水彩风格）",
           "response_format": "url",
           "aspect_ratio": "16:9",
           "image": ["data:image/png;base64,iVBORw0KGgo..."]
       }'

获取图片 URL

- 从响应中提取 data[0].url

下载并保存原始图片

   OUTPUT_DIR=~/.openclaw/workspace/img
   TIMESTAMP=$(date +%Y%m%d_%H%M%S)
   ORIGINAL_FILE="${OUTPUT_DIR}/image_${TIMESTAMP}_original.png"
   FINAL_FILE="${OUTPUT_DIR}/image_${TIMESTAMP}.png"   curl -o "$ORIGINAL_FILE" "$IMAGE_URL"

检查大小并压缩（如需要）

   FILE_SIZE=$(stat -f%z "$ORIGINAL_FILE" 2>/dev/null || stat -c%s "$ORIGINAL_FILE")
   MAX_SIZE=$((1024 * 1024))  # 1MB   if [ "$FILE_SIZE" -gt "$MAX_SIZE" ]; then
       echo "图片超过 1MB，正在压缩..."
       python3 compress_image.py "$ORIGINAL_FILE" "$FINAL_FILE" 1
   else
       cp "$ORIGINAL_FILE" "$FINAL_FILE"
   fi

发送到飞书

如果在飞书对话中，注意以下发送方式

   message(
     action="send",
     channel="feishu",
     accountId="second",
     target="用户或群组ID",
     media="~/.openclaw/workspace/img/image_xxx.png"
   )

压缩工具

使用 compress_image.py 进行图片压缩：

# 用法 python3 compress_image.py [max_size_mb]

# 示例 python3 compress_image.py original.png compressed.png 1

压缩策略：

如果图片 ≤ 1MB，不压缩
如果图片 > 1MB：

- 逐步降低 JPEG 质量 (85 → 10) - 如果仍超过限制，缩小尺寸 (90% → 30%)

保留原始文件，压缩结果另存

支持的参数

宽高比 (aspect_ratio)

1:1 - 正方形（默认）
4:3, 3:4 - 标准比例
16:9, 9:16 - 宽屏/竖屏
2:3, 3:2 - 照片比例
4:5, 5:4 - 接近正方形
21:9 - 超宽屏
1:4, 4:1 - 长条形
8:1, 1:8 - 超长条形

图像尺寸 (image_size)

1K - 适合快速预览
2K - 推荐默认（默认）
4K - 高清大图
512px - 小图

参考图 (image)

类型：字符串数组
格式：URL 或 base64 编码（推荐 data:image/png;base64,... 格式）
支持：单张或多张参考图

API 信息

Endpoint: POST /v1/images/generations
认证: Authorization: Bearer {{YOUR_API_KEY}}
模型: ${IMAGE_MODEL} (默认: gemini-3.1-flash-image-preview)
Base URL: https://api.imyaigc.top

完整请求参数

参数	类型	必填	说明
model	string	✅	模型名称
prompt	string	✅	生成描述
response_format	string	❌	`url` 或 `b64_json`
aspect_ratio	string	❌	宽高比
image	array	❌	参考图数组（URL 或 base64）
image_size	string	❌	图像尺寸（仅 nano-banana-2 支持）

目录结构

~/.openclaw/workspace/
├── img/                          # 生成的图片保存目录
│   ├── image_20260302_221900_original.png  # 原始图片
│   ├── image_20260302_221900.png           # 压缩后（用于发送）
│   └── ...
└── skills/
    └── image-gen/
        ├── SKILL.md              # 本说明文件
        ├── image-gen.sh          # 生成脚本
        ├── compress_image.py     # 压缩工具
        ├── .env                  # API 配置
        └── .env.example          # 配置示例

注意事项

图片必须保存在 ~/.openclaw/workspace/img/ 目录
超过 1MB 的图片会自动压缩后再发送到飞书
原始图片保留，方便后续使用
文件名使用时间戳避免冲突
提示词会自动翻译为英文以获得更好的生成效果
API 调用可能需要 30-60 秒，请告知用户等待
图生图时：参考图会转为 base64 传入 API 的 image 参数

使用 Nano-banana-3.1-Flash API 进行图像生成与编辑，支持：

文生图 (text-to-image): 根据文字描述生成图像
图生图 (image-to-image): 基于参考图生成新图像

自动压缩超过 1MB 的图片，保存到 workspace，并直接在飞书中展示。

配置

在workspace目录下创建 .env 文件：

# .env
IMAGE_API_BASE_URL=https://api.imyaigc.top
IMAGE_API_KEY=your-api-key-here
IMAGE_MODEL=gemini-3.1-flash-image-preview
IMAGE_SIZE=2K
IMAGE_ASPECT_RATIO=1:1

图片保存位置

所有生成的图片保存在：

~/.openclaw/workspace/img/

文件名格式：image_YYYYMMDD_HHMMSS.png

压缩规则：

原始图片保存为：image_YYYYMMDD_HHMMSS_original.png
如果原图 > 1MB，自动压缩并保存为：image_YYYYMMDD_HHMMSS.png
发送到时使用压缩后的版本

使用方法

1. 文生图 (Text-to-Image)

直接告诉我要生成什么图片：

示例：

"生成一张夕阳下的海滩图片"
"画一只可爱的猫咪"
"创建一个赛博朋克风格的大龙虾，16:9 比例"

2. 图生图 (Image-to-Image)

提供参考图和修改描述：

示例：

"基于这张图，改成水彩风格"
"把这张图改成夜景"
"参考这张图，生成一个类似风格的建筑"
"把这张照片变成油画风格"

工作流程：

用户提供参考图（发送图片）
下载图片并转为 base64
调用 API 时传入 image 参数
返回基于参考图生成的新图像

工作流程

文生图流程

解析参数

- 提取提示词 - 识别宽高比参数（如 "16:9"、"1:1"）

调用 API

   curl -X POST "${IMAGE_API_BASE_URL}/v1/images/generations" \
       -H "Authorization: Bearer ${IMAGE_API_KEY}" \
       -H "Content-Type: application/json" \
       -d '{
           "model": "'"${IMAGE_MODEL}"'",
           "prompt": "描述内容",
           "response_format": "url",
           "aspect_ratio": "16:9"
       }'

图生图流程

获取参考图

- 用户发送图片 - 下载图片并转为 base64

调用 API

   curl -X POST "${IMAGE_API_BASE_URL}/v1/images/generations" \
       -H "Authorization: Bearer ${IMAGE_API_KEY}" \
       -H "Content-Type: application/json" \
       -d '{
           "model": "'"${IMAGE_MODEL}"'",
           "prompt": "修改描述（如：改为水彩风格）",
           "response_format": "url",
           "aspect_ratio": "16:9",
           "image": ["data:image/png;base64,iVBORw0KGgo..."]
       }'

获取图片 URL

- 从响应中提取 data[0].url

下载并保存原始图片

   OUTPUT_DIR=~/.openclaw/workspace/img
   TIMESTAMP=$(date +%Y%m%d_%H%M%S)
   ORIGINAL_FILE="${OUTPUT_DIR}/image_${TIMESTAMP}_original.png"
   FINAL_FILE="${OUTPUT_DIR}/image_${TIMESTAMP}.png"   curl -o "$ORIGINAL_FILE" "$IMAGE_URL"

检查大小并压缩（如需要）

   FILE_SIZE=$(stat -f%z "$ORIGINAL_FILE" 2>/dev/null || stat -c%s "$ORIGINAL_FILE")
   MAX_SIZE=$((1024 * 1024))  # 1MB   if [ "$FILE_SIZE" -gt "$MAX_SIZE" ]; then
       echo "图片超过 1MB，正在压缩..."
       python3 compress_image.py "$ORIGINAL_FILE" "$FINAL_FILE" 1
   else
       cp "$ORIGINAL_FILE" "$FINAL_FILE"
   fi

发送到飞书

如果在飞书对话中，注意以下发送方式

   message(
     action="send",
     channel="feishu",
     accountId="second",
     target="用户或群组ID",
     media="~/.openclaw/workspace/img/image_xxx.png"
   )

压缩工具

使用 compress_image.py 进行图片压缩：

# 用法 python3 compress_image.py [max_size_mb]

# 示例 python3 compress_image.py original.png compressed.png 1

压缩策略：

如果图片 ≤ 1MB，不压缩
如果图片 > 1MB：

- 逐步降低 JPEG 质量 (85 → 10) - 如果仍超过限制，缩小尺寸 (90% → 30%)

保留原始文件，压缩结果另存

支持的参数

宽高比 (aspect_ratio)

1:1 - 正方形（默认）
4:3, 3:4 - 标准比例
16:9, 9:16 - 宽屏/竖屏
2:3, 3:2 - 照片比例
4:5, 5:4 - 接近正方形
21:9 - 超宽屏
1:4, 4:1 - 长条形
8:1, 1:8 - 超长条形

图像尺寸 (image_size)

1K - 适合快速预览
2K - 推荐默认（默认）
4K - 高清大图
512px - 小图

参考图 (image)

类型：字符串数组
格式：URL 或 base64 编码（推荐 data:image/png;base64,... 格式）
支持：单张或多张参考图

API 信息

Endpoint: POST /v1/images/generations
认证: Authorization: Bearer {{YOUR_API_KEY}}
模型: ${IMAGE_MODEL} (默认: gemini-3.1-flash-image-preview)
Base URL: https://api.imyaigc.top

完整请求参数

参数	类型	必填	说明
model	string	✅	模型名称
prompt	string	✅	生成描述
response_format	string	❌	`url` 或 `b64_json`
aspect_ratio	string	❌	宽高比
image	array	❌	参考图数组（URL 或 base64）
image_size	string	❌	图像尺寸（仅 nano-banana-2 支持）

目录结构

~/.openclaw/workspace/
├── img/                          # 生成的图片保存目录
│   ├── image_20260302_221900_original.png  # 原始图片
│   ├── image_20260302_221900.png           # 压缩后（用于发送）
│   └── ...
└── skills/
    └── image-gen/
        ├── SKILL.md              # 本说明文件
        ├── image-gen.sh          # 生成脚本
        ├── compress_image.py     # 压缩工具
        ├── .env                  # API 配置
        └── .env.example          # 配置示例

注意事项

图片必须保存在 ~/.openclaw/workspace/img/ 目录
超过 1MB 的图片会自动压缩后再发送到飞书
原始图片保留，方便后续使用
文件名使用时间戳避免冲突
提示词会自动翻译为英文以获得更好的生成效果
API 调用可能需要 30-60 秒，请告知用户等待
图生图时：参考图会转为 base64 传入 API 的 image 参数

数据来源：ClawHub ↗ · 中文优化：龙虾技能库

OpenClaw 技能定制 / 插件定制 / 私有工作流定制

免费技能或插件可能存在安全风险，如需更匹配、更安全的方案，建议联系付费定制

了解定制服务

License

运行时依赖

版本

安装命令 点击复制

技能文档

配置

图片保存位置

使用方法

1. 文生图 (Text-to-Image)

2. 图生图 (Image-to-Image)

工作流程

文生图流程

图生图流程

压缩工具

支持的参数

宽高比 (aspect_ratio)

图像尺寸 (image_size)

参考图 (image)

API 信息

完整请求参数

目录结构

注意事项

配置

图片保存位置

使用方法

1. 文生图 (Text-to-Image)

2. 图生图 (Image-to-Image)

工作流程

文生图流程

图生图流程

压缩工具

支持的参数

宽高比 (aspect_ratio)

图像尺寸 (image_size)

参考图 (image)

API 信息

完整请求参数

目录结构

注意事项

安装命令点击复制