document processor mcp

A powerful document processing tool that supports multiple formats and integrates with Claude and the Unstructured API.

Repository Info

Stars: 0
Forks: 0
Watchers: 0
Issues: 0
Language: TypeScript
License: -

About This Server

Model Context Protocol (MCP) - This server can be integrated with AI applications to provide additional context and capabilities, enabling enhanced AI interactions and functionality.

Documentation

Document Processor MCP

A document processing MCP (Model Context Protocol) server that integrates with Claude to extract and process text from various document formats. It uses the Unstructured API to parse documents and preserve their structure.

🚀 Features

  • Multi-format Support: Process PDFs, DOCX, PPTX, TXT, and image files.
  • Intelligent Extraction: Preserve document structure and formatting.
  • LLM Integration: Output formatted for Large Language Models.
  • Claude Desktop Integration: Seamless processing within Claude.
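
As a rough illustration of the multi-format support listed above, the sketch below checks a file path against the named formats before any processing is attempted. The function names and the specific image extensions are assumptions made for this example; the README only says "image files", and the repository's actual source may handle this differently.

```typescript
// Extensions for the formats named in the feature list. The image
// extensions (png, jpg, jpeg) are an assumption, not taken from the README.
const SUPPORTED_EXTENSIONS = new Set(["pdf", "docx", "pptx", "txt", "png", "jpg", "jpeg"]);

// Return the lowercase extension of a path, or "" if there is none.
// Dotfiles such as ".env" are treated as having no extension.
function extensionOf(filePath: string): string {
  const name = filePath.split(/[\\/]/).pop() ?? "";
  const dot = name.lastIndexOf(".");
  return dot > 0 ? name.slice(dot + 1).toLowerCase() : "";
}

// Hypothetical helper: true when the path looks like a supported document.
function isSupportedDocument(filePath: string): boolean {
  return SUPPORTED_EXTENSIONS.has(extensionOf(filePath));
}
```

A check like this only inspects the file name; it says nothing about whether the file contents are valid for that format.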

🛠️ Quick Start

1️⃣ Prerequisites

  • Node.js (v14+ recommended)
  • npm (Node Package Manager)
  • Unstructured API key
  • Claude Desktop (Installed locally)

2️⃣ Installation

git clone https://github.com/yashshah1910/document-processor-mcp.git
cd document-processor-mcp
npm install
npm run build

3️⃣ Configuration

  • Create a .env file in the root directory and add:
UNSTRUCTURED_API_KEY=your_api_key_here
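  • Add the server to your Claude Desktop config file (claude_desktop_config.json), replacing the placeholder with the absolute path to your clone: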
{
  "mcpServers": {
    "documentProcessorMcp": {
      "command": "node",
      "args": ["YOUR/ABSOLUTE/PATH/document-processor-mcp/dist/index.js"],
      "disabled": false
    }
  }
}
  • Restart Claude Desktop after making changes.
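
Because the server depends on UNSTRUCTURED_API_KEY, a missing or placeholder value is a likely cause of startup problems. The following is a minimal sketch of a startup check, assuming the key is read from the process environment; the function name requireApiKey and the error text are hypothetical and not taken from the repository.

```typescript
// Hypothetical startup check; the name and error message are illustrative.
// `env` is any key/value map, for example the Node.js process environment.
function requireApiKey(env: Record<string, string | undefined>): string {
  const key = (env["UNSTRUCTURED_API_KEY"] ?? "").trim();
  // Reject both a missing key and the placeholder from the example .env file.
  if (key === "" || key === "your_api_key_here") {
    throw new Error(
      "UNSTRUCTURED_API_KEY is not set. Add it to the .env file in the project root."
    );
  }
  return key;
}
```

In a Node.js process this could be called as requireApiKey(process.env) once the .env file has been loaded, for instance with the dotenv package. Whether the project loads its environment this way is an assumption.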

🔧 Usage

  1. Start Claude Desktop.
  2. Provide the file path to your document in Claude.
  3. The MCP server will automatically:
    • Read the file from the provided path.
    • Extract text content.
    • Maintain document structure.
    • Clean and preprocess data.
    • Return LLM-ready output.
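
The cleaning and structure-preserving steps above can be sketched roughly as follows. The Unstructured API returns a list of elements, each with a type (such as Title, NarrativeText, or ListItem) and a text field; the sketch assumes that shape and turns the elements into plain text. The interface and function names are hypothetical, and the Markdown-style markers are one possible output convention, not necessarily what this server emits.

```typescript
// Shape of one parsed element. The `type` and `text` fields mirror the
// elements returned by the Unstructured API; the interface name is illustrative.
interface DocElement {
  type: string;
  text: string;
}

// Collapse runs of whitespace (including newlines) and trim both ends.
function cleanText(raw: string): string {
  return raw.replace(/\s+/g, " ").trim();
}

// Render elements as plain text, marking titles and list items so that
// the document structure survives in the output.
function elementsToText(elements: DocElement[]): string {
  const lines: string[] = [];
  for (const el of elements) {
    const text = cleanText(el.text);
    if (text.length === 0) continue; // drop elements with no visible text
    if (el.type === "Title") {
      lines.push(`# ${text}`);
    } else if (el.type === "ListItem") {
      lines.push(`- ${text}`);
    } else {
      lines.push(text);
    }
  }
  return lines.join("\n");
}
```

Keeping the cleanup in a pure function like this makes it easy to test without calling the API or touching the file system.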

📁 Project Structure

document-processor-mcp/
├── dist/           # Compiled files
├── src/            # Source files
├── node_modules/   # Dependencies
├── .env            # Environment variables
└── package.json    # Project dependencies

Feel free to open pull requests to improve the project! 💙

Repository Details

Owner: yashshah1910
Repo: document-processor-mcp
Language: TypeScript
License: -
Last fetched: 8/10/2025
