
v
为 VLM 提供图像处理工具,支持裁剪、OCR、绘图等功能。
Repository Info
About This Server
为 VLM 提供图像处理工具,支持裁剪、OCR、绘图等功能。
Model Context Protocol (MCP) - This server can be integrated with AI applications to provide additional context and capabilities, enabling enhanced AI interactions and functionality.
Documentation
We are writing an image interface for vlm, aiding it in visual question answering tasks.
Specifically. We offer in the form of MCP servers the following visual tools:
cropping. [ocr related tools (rotating, warping an image)]. drawing and making marks on image. blacking out the selected area. generating a new cropped image spawn a subagent.
We will later optimize the interaction with movable cursor. But for this first implementation, all interactions are coordinate based.
Quick Start
Clone the repository
git clone https://github.com/jkpjkpjkp/vInstall dependencies
cd v
npm installFollow the documentation
Check the repository's README.md file for specific installation and usage instructions.
Repository Details
Recommended MCP Servers
Discord MCP
Enable AI assistants to seamlessly interact with Discord servers, channels, and messages.
Knit MCP
Connect AI agents to 200+ SaaS applications and automate workflows.
Apify MCP Server
Deploy and interact with Apify actors for web scraping and data extraction.
BrowserStack MCP
BrowserStack MCP Server for automated testing across multiple browsers.
Zapier MCP
A Zapier server that provides automation capabilities for various apps.