
document processor mcp
一个强大的文档处理工具,支持多种格式并集成Claude和Unstructured API。
Repository Info
About This Server
一个强大的文档处理工具,支持多种格式并集成Claude和Unstructured API。
Model Context Protocol (MCP) - This server can be integrated with AI applications to provide additional context and capabilities, enabling enhanced AI interactions and functionality.
Documentation
Document Processor MCP
A powerful document processing MCP (Model Context Protocol) that integrates with Claude to extract and process text from various document formats. Built with the Unstructured API for accurate document parsing and structure preservation.
🚀 Features
- Multi-format Support: Process PDFs, DOCX, PPTX, TXT, and image files.
- Intelligent Extraction: Preserve document structure and formatting.
- LLM Integration: Output formatted for Large Language Models.
- Claude Desktop Integration: Seamless processing within Claude.
🛠️ Quick Start
1️⃣ Prerequisites
- Node.js (v14+ recommended)
- npm (Node Package Manager)
- Unstructured API Key (Get it from here)
- Claude Desktop (Installed locally)
2️⃣ Installation
git clone https://github.com/yashshah1910/document-processor-mcp.git
cd document-processor-mcp
npm install
npm run build
3️⃣ Configuration
- Create a
.envfile in the root directory and add:
UNSTRUCTURED_API_KEY=your_api_key_here
- Set up Claude Desktop config:
{
"mcpServers": {
"documentProcessorMcp": {
"command": "node",
"args": ["YOUR/ABSOLUTE/PATH/document-processor-mcp/dist/index.js"],
"disabled": false
}
}
}
- Restart Claude Desktop after making changes.
🔧 Usage
- Start Claude Desktop.
- Provide the file path to your document in Claude.
- The MCP will automatically:
- Read the file from the provided path.
- Extract text content.
- Maintain document structure.
- Clean and preprocess data.
- Return LLM-ready output.
📁 Project Structure
document-processor-mcp/
├── dist/ # Compiled files
├── src/ # Source files
├── node_modules/ # Dependencies
├── .env # Environment variables
└── package.json # Project dependencies
Feel free to open pull requests to improve the project! 💙
Quick Start
Clone the repository
git clone https://github.com/yashshah1910/document-processor-mcpInstall dependencies
cd document-processor-mcp
npm installFollow the documentation
Check the repository's README.md file for specific installation and usage instructions.
Repository Details
Recommended MCP Servers
Discord MCP
Enable AI assistants to seamlessly interact with Discord servers, channels, and messages.
Knit MCP
Connect AI agents to 200+ SaaS applications and automate workflows.
Apify MCP Server
Deploy and interact with Apify actors for web scraping and data extraction.
BrowserStack MCP
BrowserStack MCP Server for automated testing across multiple browsers.
Zapier MCP
A Zapier server that provides automation capabilities for various apps.