audio-processing
تایید شدهAudio ingestion, analysis, transformation, and generation (Transcribe, TTS, VAD, Features).
(0)
۱.۰k
۴۵
۵۵
نصب مهارت
مهارتها کدهای شخص ثالث از مخازن عمومی GitHub هستند. SkillHub الگوهای مخرب شناختهشده را اسکن میکند اما نمیتواند امنیت را تضمین کند. قبل از نصب، کد منبع را بررسی کنید.
نصب سراسری (سطح کاربر):
npx skillhub install openclaw/skills/audio-processingنصب در پروژه فعلی:
npx skillhub install openclaw/skills/audio-processing --projectمسیر پیشنهادی: ~/.claude/skills/audio-processing/
بررسی هوش مصنوعی
کیفیت دستورالعمل60
دقت توضیحات40
کاربردی بودن72
صحت فنی65
Scored 59 — solid Python script with 5 audio actions and strong security validation, but SKILL.md instruction structure is flat (parameter list, not workflow steps) and description lacks trigger phrases entirely.
محتوای SKILL.md
---
name: audio-processing
description: Audio ingestion, analysis, transformation, and generation (Transcribe, TTS, VAD, Features).
metadata:
{
"openclaw":
{
"emoji": "🎙️",
"requires": {
"bins": ["ffmpeg", "python3"],
"pip": ["openai-whisper", "gTTS", "librosa", "pydub", "soundfile", "numpy", "webrtcvad-wheels"]
},
"install":
[
{
"id": "ffmpeg",
"kind": "brew",
"package": "ffmpeg",
"label": "Install ffmpeg",
},
{
"id": "python-deps",
"kind": "pip",
"package": "openai-whisper gTTS librosa pydub soundfile numpy webrtcvad-wheels",
"label": "Install Python dependencies",
}
],
},
}
---
# Audio Processing Skill
A comprehensive toolset for audio manipulation and analysis.
## Tool API
### audio_tool
Perform audio operations like transcription, text-to-speech, and feature extraction.
- **Parameters:**
- `action` (string, required): One of `transcribe`, `tts`, `extract_features`, `vad_segments`, `transform`.
- `file_path` (string, optional): Path to input audio file.
- `text` (string, optional): Text for TTS.
- `output_path` (string, optional): Path for output file (default: auto-generated).
- `model` (string, optional): Whisper model size (tiny, base, small, medium, large). Default: `base`.
**Usage:**
```bash
# Transcribe
uv run --with "openai-whisper" --with "pydub" --with "numpy" skills/audio-processing/tool.py transcribe --file_path input.wav
# TTS
uv run --with "gTTS" skills/audio-processing/tool.py tts --text "Hello world" --output_path hello.mp3
# Features
uv run --with "librosa" --with "numpy" --with "soundfile" skills/audio-processing/tool.py extract_features --file_path input.wav
```