CommunicationDocumentedScanned
walkie-talkie
WhatsApp voice conversations: transcribe audio, respond with TTS.
Share:
Installation
npx clawhub@latest install walkie-talkieView the full skill documentation and source below.
Documentation
Walkie-Talkie Mode
This skill automates the voice-to-voice loop on WhatsApp using local transcription and local TTS.
Workflow
- Use
tools/transcribe_voice.sh to get the text.- Process the text as a normal user prompt.
- Instead of a text reply, generate speech using
bin/sherpa-onnx-tts.- Send the resulting
.ogg file back to the user as a voice note.
Triggers
- User sends an audio message.
- User says "activa modo walkie-talkie" or "hablemos por voz".
Constraints
- Use local tools only (ffmpeg, whisper-cpp, sherpa-onnx-tts).
- Maintain a fast response time (RTF < 0.5).
- Always reply with BOTH text (for clarity) and audio.
Manual Execution (Internal)
To respond with voice manually:
bin/sherpa-onnx-tts /tmp/reply.ogg "Tu mensaje aquí"Then send
/tmp/reply.ogg via message tool with filePath.