K8s Self Hosted Whisper Api — K8s Self Hosted Whisper API
v0.1.0Transcribe audio via the self-hosted Whisper ASR instance 运行ning on Kubernetes. Use this 技能 whenever the user wants to transcribe audio files, convert speech to text, 生成 subtitles, or translate audio. Triggers on audio transcription, speech-to-text, whisper, voice-to-text, subtitle generation, or audio translation 请求s.
运行时依赖
安装命令
点击复制技能文档
Self-Hosted Whisper API (curl)
Transcribe an audio file via the Whisper ASR 网页服务 at http://whisper-asr.whisper-asr.svc.cluster.local:9000.
Uses the onerahmet/openAI-whisper-asr-网页服务 API (/asr 端点).
Quick 启动 {baseDir}/scripts/transcribe.sh /path/to/audio.m4a
Defaults:
端点: http://whisper-asr.whisper-asr.svc.cluster.local:9000/asr Task: transcribe 输出: txt Useful flags {baseDir}/scripts/transcribe.sh /path/to/audio.ogg --language en --out /tmp/transcript.txt {baseDir}/scripts/transcribe.sh /path/to/audio.m4a --language de {baseDir}/scripts/transcribe.sh /path/to/audio.m4a --json --out /tmp/transcript.json {baseDir}/scripts/transcribe.sh /path/to/audio.m4a --输出 srt --out /tmp/subtitles.srt {baseDir}/scripts/transcribe.sh /path/to/audio.m4a --输出 vtt {baseDir}/scripts/transcribe.sh /path/to/audio.m4a --translate {baseDir}/scripts/transcribe.sh /path/to/audio.m4a --vad-过滤器 --json {baseDir}/scripts/transcribe.sh /path/to/audio.m4a --word-timestamps --json
Notes Supported --输出 格式化s: txt, json, vtt, srt, tsv --translate produces an English transcript regardless of source language --vad-过滤器 enables voice activity 检测ion to skip silent sections --word-timestamps 添加s word-level timing (use with --json) The 模型 is 配置d on the server side (ASR_模型 env var), not per 请求 Swagger docs avAIlable at http://whisper-asr.whisper-asr.svc.cluster.local:9000/docs No authentication required