llmwhisperer

Extract text and layout from images and PDFs using LLMWhisperer API.

Installation

npx clawhub@latest install llmwhisperer

Documentation

LLMWhisperer

Extract text from images and PDFs using the LLMWhisperer API. It works well on handwriting and complex forms.

Configuration

Requires LLMWHISPERER_API_KEY, either exported in the environment or stored in ~/.clawdbot/.env:

echo "LLMWHISPERER_API_KEY=your_key_here" >> ~/.clawdbot/.env
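Appending with `>>` adds a second entry every time the command is rerun. As a minimal sketch, the hypothetical helper below (`set_env_key` is not part of the skill) replaces an existing entry instead of duplicating it and restricts the file to the current user, since it holds a secret:

```shell
#!/bin/bash
# Hypothetical helper, not part of the skill: write KEY=VALUE to an env file,
# replacing any earlier definition of the same key.
set_env_key() {
  local file="$1" key="$2" value="$3" tmp
  mkdir -p "$(dirname "$file")"
  touch "$file"
  tmp="$(mktemp)"
  # Keep every line that does not define this key (grep exits 1 if none remain).
  grep -v "^${key}=" "$file" > "$tmp" || true
  printf '%s=%s\n' "$key" "$value" >> "$tmp"
  mv "$tmp" "$file"
  # The file contains an API key, so make it readable by the owner only.
  chmod 600 "$file"
}

# Example call:
#   set_env_key ~/.clawdbot/.env LLMWHISPERER_API_KEY your_key_here
```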

Get an API Key

Get a free API key at unstract.com/llmwhisperer.
  • Free tier: 100 pages/day

Usage

llmwhisperer <file>

Script Source

The executable script is located at scripts/llmwhisperer.

#!/bin/bash
# Extract text using LLMWhisperer API

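# Fall back to ~/.clawdbot/.env when the key is not already in the environment.
if [ -z "$LLMWHISPERER_API_KEY" ]; then
  if [ -f ~/.clawdbot/.env ]; then
    # Match only the line that defines the key; xargs strips surrounding quotes.
    key_line="$(grep '^LLMWHISPERER_API_KEY=' ~/.clawdbot/.env | tail -n 1 | xargs)"
    # Guard against an empty match: a bare `export` would print every variable.
    if [ -n "$key_line" ]; then export "$key_line"; fi
  fi
fi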

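if [ -z "$LLMWHISPERER_API_KEY" ]; then
  # Errors go to stderr so they never end up in redirected output.
  echo "Error: LLMWHISPERER_API_KEY not found in env or ~/.clawdbot/.env" >&2
  exit 1
fi

FILE="$1"
if [ -z "$FILE" ]; then
  echo "Usage: $0 <file>" >&2
  exit 1
fi

# Fail early if the input file does not exist or cannot be read.
if [ ! -r "$FILE" ]; then
  echo "Error: cannot read file: $FILE" >&2
  exit 1
fi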

curl -s -X POST "" \
  -H "Content-Type: application/octet-stream" \
  -H "unstract-key: $LLMWHISPERER_API_KEY" \
  --data-binary "@$FILE"
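The script reads the key from ~/.clawdbot/.env with grep and xargs. As an alternative sketch, the hypothetical function below (`read_env_value` is an illustrative name, not part of the skill) parses the file in pure shell: it matches only lines that start with the exact key, tolerates an `export` prefix, and strips one pair of surrounding quotes. It assumes one `KEY=value` assignment per line.

```shell
#!/bin/bash
# Hypothetical helper, not part of the skill: print the value of KEY from an
# env file, or return 1 if the key is not defined there.
read_env_value() {
  local file="$1" key="$2" line value
  # Later definitions override earlier ones, so keep only the last match.
  line="$(grep -E "^(export[[:space:]]+)?${key}=" "$file" | tail -n 1)"
  [ -n "$line" ] || return 1
  value="${line#*=}"
  # Strip one matching pair of surrounding double or single quotes.
  case "$value" in
    \"*\") value="${value:1:${#value}-2}" ;;
    \'*\') value="${value:1:${#value}-2}" ;;
  esac
  printf '%s\n' "$value"
}

# Example call:
#   LLMWHISPERER_API_KEY="$(read_env_value ~/.clawdbot/.env LLMWHISPERER_API_KEY)"
```

Anchoring the pattern at the start of the line means commented-out entries and variables that merely contain the key name are ignored.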

Examples

Print text to terminal:

llmwhisperer flyer.jpg

Save output to a text file:

llmwhisperer invoice.pdf > invoice.txt

Process a handwritten note:

llmwhisperer notes.jpg
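Process a whole directory:

The command takes one file per call, so batch use needs a loop. The sketch below is one possible wrapper, not part of the skill: `extract_all` and the `LLMWHISPERER_CMD` override are hypothetical names, and the choice of .pdf and .jpg extensions is an assumption based on the examples above. Each result is written next to its input as `<file>.txt`.

```shell
#!/bin/bash
# Hypothetical helper, not part of the skill: run the extractor on every PDF
# and JPG in a directory and save each result as <file>.txt.
extract_all() {
  local dir="$1" cmd="${LLMWHISPERER_CMD:-llmwhisperer}" f
  for f in "$dir"/*.pdf "$dir"/*.jpg; do
    [ -f "$f" ] || continue   # skip glob patterns that matched nothing
    if "$cmd" "$f" > "$f.txt"; then
      echo "ok: $f" >&2
    else
      echo "failed: $f" >&2
      rm -f "$f.txt"          # do not leave partial output behind
    fi
  done
}

# Example call:
#   extract_all ./scans
```

With the free tier's limit of 100 pages/day in mind, it may be worth running this on small directories first.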