Speech & TranscriptionDocumentedScanned

transcribe

Transcribe audio files to text using local Whisper (Docker).

Share:

Installation

npx clawhub@latest install transcribe

View the full skill documentation and source below.

Documentation

Transcribe

Local audio transcription using faster-whisper in Docker.

Installation

cd /path/to/skills/transcribe/scripts
chmod +x install.sh
./install.sh

This builds the Docker image whisper:local and installs the transcribe CLI.

Usage

transcribe /path/to/audio.mp3 [language]
  • Default language: es (Spanish)
  • Use auto for auto-detection
  • Outputs plain text to stdout

Examples

transcribe /tmp/voice.ogg          # Spanish (default)
transcribe /tmp/meeting.mp3 en     # English
transcribe /tmp/audio.m4a auto     # Auto-detect

Supported Formats

mp3, m4a, ogg, wav, webm, flac, aac

When Receiving Voice Messages

  • Save the audio attachment to a temp file

  • Run transcribe

  • Include the transcription in your response

  • Clean up the temp file
  • Files

    • scripts/transcribe - CLI wrapper (bash)
    • scripts/install.sh - Installation script (includes Dockerfile inline)

    Notes

    • Model: small (fast) - edit install.sh for large-v3 (accurate)
    • Fully local, no API key needed