kokoro-tts

Generate spoken audio from text using the local Kokoro TTS engine.

Installation

npx clawhub@latest install kokoro-tts

Documentation

Kokoro TTS

This skill allows you to generate high-quality AI speech using a local or remote Kokoro-TTS instance.

Configuration

The skill uses the KOKORO_API_URL environment variable to locate the API.

Default:
To configure: Add KOKORO_API_URL= to your .env file or environment.
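
As an illustration of the configuration step, here is a minimal sketch of how a script might resolve the endpoint from KOKORO_API_URL. The helper name resolveApiUrl and the caller-supplied fallback argument are hypothetical and not taken from the skill's source; the skill's real default value is not reproduced here.

```javascript
// Hypothetical helper: not part of the skill's published source.
// `env` is an object shaped like process.env; `fallback` is a caller-supplied
// URL used only when KOKORO_API_URL is unset or blank (an assumption).
function resolveApiUrl(env, fallback) {
  const raw = (env.KOKORO_API_URL || "").trim();
  const candidate = raw !== "" ? raw : fallback;
  if (!candidate) {
    throw new Error("KOKORO_API_URL is not set and no fallback was provided");
  }
  // new URL() throws a TypeError on malformed input, which surfaces typos early.
  const url = new URL(candidate);
  // Drop trailing slashes so callers can append paths consistently.
  return url.href.replace(/\/+$/, "");
}
```

A caller would typically pass process.env as the first argument, so that a value from the shell environment (or one loaded from a .env file by whatever loader the project uses) takes precedence over the fallback.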

Usage

To generate speech, run the included Node.js script.

Command

node skills/kokoro-tts/scripts/tts.js "<text>" [voice] [speed]

text: The text to speak. Wrap in quotes.
voice: (Optional) The voice ID. Defaults to af_heart.
speed: (Optional) Speech speed (0.25 to 4.0). Defaults to 1.0.
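
The three positional arguments and their documented defaults can be sketched as a small parser. This is an assumption about how such validation could look, not the actual contents of tts.js; the function name parseTtsArgs is hypothetical, and whether the real script rejects or clamps an out-of-range speed is not stated in this document.

```javascript
// Hypothetical argument parser mirroring the documented positional arguments.
const DEFAULT_VOICE = "af_heart"; // documented default voice
const DEFAULT_SPEED = 1.0;        // documented default speed
const MIN_SPEED = 0.25;           // documented lower bound
const MAX_SPEED = 4.0;            // documented upper bound

// `argv` is the list of arguments after the script path, e.g. process.argv.slice(2).
function parseTtsArgs(argv) {
  const [text, voice, speed] = argv;
  if (typeof text !== "string" || text.trim() === "") {
    throw new Error("text is required");
  }
  const parsedSpeed = speed === undefined ? DEFAULT_SPEED : Number(speed);
  // Assumption: out-of-range or non-numeric speeds are rejected rather than clamped.
  if (!Number.isFinite(parsedSpeed) || parsedSpeed < MIN_SPEED || parsedSpeed > MAX_SPEED) {
    throw new RangeError(`speed must be between ${MIN_SPEED} and ${MAX_SPEED}`);
  }
  return { text, voice: voice || DEFAULT_VOICE, speed: parsedSpeed };
}
```

Because the arguments are positional, a speed can only be given when a voice is also given.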

Example

node skills/kokoro-tts/scripts/tts.js "Hello Ed, this is Theosaurus speaking." af_nova
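
The same call can be made from another Node.js program. The sketch below is illustrative only: buildTtsArgs and runTts are hypothetical helper names, and the script path is copied from the command above. Passing arguments as an array to execFile avoids shell quoting problems with text that contains spaces or punctuation.

```javascript
const { execFile } = require("node:child_process");

// Script path as given in the documented command.
const SCRIPT_PATH = "skills/kokoro-tts/scripts/tts.js";

// Hypothetical helper: build the argument list for the documented command.
function buildTtsArgs(text, voice, speed) {
  if (speed !== undefined && voice === undefined) {
    // Arguments are positional, so a speed cannot be passed without a voice.
    throw new Error("speed requires a voice to be given first");
  }
  const args = [SCRIPT_PATH, text];
  if (voice !== undefined) args.push(voice);
  if (speed !== undefined) args.push(String(speed));
  return args;
}

// Hypothetical helper: run the script with the current Node binary and
// resolve with whatever it prints to stdout.
function runTts(text, voice, speed) {
  return new Promise((resolve, reject) => {
    execFile(process.execPath, buildTtsArgs(text, voice, speed), (error, stdout) => {
      if (error) reject(error);
      else resolve(stdout);
    });
  });
}
```

For the example above, the call would be runTts("Hello Ed, this is Theosaurus speaking.", "af_nova"), run from the directory that contains the skills folder.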

Output

The script prints a single line starting with MEDIA: followed by the path to the generated MP3 file. OpenClaw automatically picks this line up and sends the file as an audio attachment.

Example Output:
MEDIA: media/tts_1706745000000.mp3
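
For tooling that consumes the script's output outside OpenClaw, the MEDIA: line can be extracted with a few lines of code. This is a sketch under the assumption that the line format is exactly as shown in the example output; parseMediaLine is a hypothetical name, not something the skill provides.

```javascript
// Hypothetical consumer-side helper: extract the media path from script output.
// Returns the path after the first "MEDIA:" prefix, or null if none is found.
function parseMediaLine(stdout) {
  for (const line of stdout.split(/\r?\n/)) {
    const trimmed = line.trim();
    if (trimmed.startsWith("MEDIA:")) {
      const path = trimmed.slice("MEDIA:".length).trim();
      return path === "" ? null : path;
    }
  }
  return null;
}
```

Scanning line by line rather than matching the whole output keeps the helper tolerant of any extra log lines printed before or after the MEDIA: line.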

Available Voices

Common choices:

af_heart (Default, Female, Warm)

af_nova (Female, Professional)

am_adam (Male, Deep)

bf_alice (British Female)

For a full list, see references/voices.md or query the API.
