
mcp screenshot server
A lightweight MCP-compatible Python server for capturing Windows screenshots via REST API. Supports full screen, region-based, or window-specific captures. Ideal for AI agent integrations and automation workflows.
Repository Info
About This Server
A lightweight MCP-compatible Python server for capturing Windows screenshots via REST API. Supports full screen, region-based, or window-specific captures. Ideal for AI agent integrations and automation workflows.
Model Context Protocol (MCP) - This server can be integrated with AI applications to provide additional context and capabilities, enabling enhanced AI interactions and functionality.
Documentation
🖼️ MCP Screenshot Server
A lightweight, MCP-compatible screenshot microservice built with FastAPI for Windows.
It allows AI agents and automation tools to capture full-screen, region-based, or window-specific screenshots via simple HTTP calls.
!MCP Compatible !License: MIT !Python
🔧 Features
- 📸 Capture full-screen screenshots
- 🪟 Capture specific window by title
- 🔲 Capture custom regions
[x, y, width, height] - 🧠 MCP-compliant REST API
- 🖼️ Returns images as PNG or base64
- 🚀 Built with FastAPI, ready for production or LLM use
🧠 Use Cases
- Integrating with LLMs using Model Context Protocol (MCP)
- QA test automation pipelines
- Monitoring and remote capture tools
- Visual logging/debugging tools for agents
📦 Installation
git clone https://github.com/yourusername/mcp-screenshot-server.git
cd mcp-screenshot-server
python -m venv .venv
source .venv/bin/activate # or .venv\Scripts\activate on Windows
pip install -r requirements.txt
uvicorn main:app --reload
🔌 API Usage
POST /screenshot
Request JSON body:
{
"region": [0, 0, 1280, 720], // optional
"window_title": "Untitled - Notepad", // optional
"base64": true // optional (default: false)
}
Response (base64 mode):
{
"status": "ok",
"mode": "region",
"image_format": "base64",
"image": "<base64-encoded-image>"
}
🛠️ Tech Stack
- Python 3.11+
- FastAPI
mssorpyautoguifor screenshotpillowfor image processingpygetwindowfor window matching (optional)
📄 License
MIT License.
Feel free to use, fork, and integrate — commercial or personal.
See LICENSE for details.
📬 Contributing
Pull requests and issues welcome!
Open a PR to add features or improve compatibility across platforms (e.g., Mac/Linux support).
🙋 FAQ
-
Does it work on Linux/macOS?
Not yet. This version is Windows-focused, but you’re welcome to extend it. -
Is it MCP-certified?
This project aims to follow the MCP spec as closely as possible for maximum compatibility with LLM agents.
🧠 Inspired By
- Anthropic’s Model Context Protocol
- Real-world automation use cases powered by LLMs and Python
Would you like me to tailor a specific section to emphasize AI agent use (e.g., “how to use with o1 or GPT-4o via plugin”)?
Quick Start
Clone the repository
git clone https://github.com/margusmartsepp/mcp-screenshot-serverInstall dependencies
cd mcp-screenshot-server
npm installFollow the documentation
Check the repository's README.md file for specific installation and usage instructions.
Repository Details
Recommended MCP Servers
Discord MCP
Enable AI assistants to seamlessly interact with Discord servers, channels, and messages.
Knit MCP
Connect AI agents to 200+ SaaS applications and automate workflows.
Apify MCP Server
Deploy and interact with Apify actors for web scraping and data extraction.
BrowserStack MCP
BrowserStack MCP Server for automated testing across multiple browsers.
Zapier MCP
A Zapier server that provides automation capabilities for various apps.