AI & LLMsDocumentedScanned
gemini-computer-use
Build and run Gemini 2.5 Computer Use browser-control agents with Playwright.
Share:
Installation
npx clawhub@latest install gemini-computer-useView the full skill documentation and source below.
Documentation
Gemini Computer Use
Quick start
cp env.example env.sh
$EDITOR env.sh
source env.sh
python -m venv .venv
source .venv/bin/activate
pip install google-genai playwright
playwright install chromium
python scripts/computer_use_agent.py \
--prompt "Find the latest blog post title on example.com" \
--start-url "" \
--turn-limit 6
Browser selection
- Default: Playwright's bundled Chromium (no env vars required).
- Choose a channel (Chrome/Edge) with
COMPUTER_USE_BROWSER_CHANNEL. - Use a custom Chromium-based executable (e.g., Brave) with
COMPUTER_USE_BROWSER_EXECUTABLE.
COMPUTER_USE_BROWSER_EXECUTABLE takes precedence.
Core workflow (agent loop)
function_call actions in the response.safety_decision is require_confirmation, prompt the user before executing.function_response objects containing the latest URL + screenshot.Operational guidance
- Run in a sandboxed browser profile or container.
- Use
--excludeto block risky actions you do not want the model to take. - Keep the viewport at 1440x900 unless you have a reason to change it.
Resources
- Script:
scripts/computer_use_agent.py - Reference notes:
references/google-computer-use.md - Env template:
env.example