tan-yong-sheng/ai-vision-mcp
π π π πͺ π§ - Multimodal AI vision MCP server for image, video, and object detection analysis. Enables UI/UX evaluation, visual regression testing, and interface understanding using Google Gemini and Vertex AI.
Quick Start
Clone the repository:
git clone https://github.com/tan-yong-sheng/ai-vision-mcpClaude Desktop configuration example:
{
"mcpServers": {
"ai-vision-mcp": {
"command": "npx",
"args": [
"-y",
"tan-yong-sheng/ai-vision-mcp"
]
}
}
}Check the repository README for specific installation and configuration instructions.