MCP Serverreaandrewpublic

ee ai rag mcp demo

ee-ai-rag-mcp-demo

Repository Info

Stars

Forks

Watchers

Issues

Python

Language

License

View on GitHubGitHub Download DocumentationDocs

About This Server

ee-ai-rag-mcp-demo

Model Context Protocol (MCP) - This server can be integrated with AI applications to provide additional context and capabilities, enabling enhanced AI interactions and functionality.

Documentation

ee-ai-rag-mcp-demo

Quality Gate Status Coverage Bugs Code Smells Duplicated Lines (%) Reliability Rating

A repository with conventional commit enforcement, automatic versioning, and SonarQube code quality checks. This project implements a RAG (Retrieval Augmented Generation) pattern using AWS Services including Lambda, OpenSearch, and Amazon Bedrock.

Quality Gates with SonarQube

This repository uses SonarQube for quality checks:

PR Quality Gate: Runs SonarQube analysis on all PRs to the main branch
Main Branch Quality Gate: Runs analysis after merges to main
Release Protection: Version tags are only created if SonarQube checks pass

Setup Requirements

Create a SonarCloud account at https://sonarcloud.io/
Create or join a SonarCloud organization named "reaandrew"
Create a new project with key "reaandrew_ee-ai-rag-mcp-demo" in SonarCloud
Add the following secrets to your GitHub repository:
- SONAR_TOKEN: Your SonarCloud API token

Configuration

The SonarCloud configuration is simplified using the standard approach:

Configuration is stored in sonar-project.properties
Uses SonarSource's official GitHub Actions
Works with both PR analysis and branch analysis

Conventional Commits

This repository uses Conventional Commits to standardize commit messages and automate versioning.

Commit Format

Each commit message should follow this format:

<type>[optional scope]: <description>

[optional body]

[optional footer(s)]

Examples:

feat: add new feature
fix: resolve login bug
docs: update README
chore(deps): update dependencies

Common Types

feat: A new feature
fix: A bug fix
docs: Documentation changes
style: Code style changes (formatting, etc.)
refactor: Code changes that neither fix bugs nor add features
perf: Performance improvements
test: Adding or correcting tests
chore: Changes to the build process or auxiliary tools

Automatic Versioning

This repository uses semantic-release to automatically version and tag releases based on commit messages:

feat: commits trigger a minor version bump (1.0.0 → 1.1.0)
fix: commits trigger a patch version bump (1.0.0 → 1.0.1)
feat!: or commits with BREAKING CHANGE: in the footer trigger a major version bump (1.0.0 → 2.0.0)

Setup

Run the following to set up the commit hooks locally:

npm install

Architecture Overview

This project implements a RAG (Retrieval Augmented Generation) pipeline with the following components:

Document Ingestion:
- PDF documents are uploaded to an S3 bucket
- A Lambda function extracts text from the PDFs using AWS Textract
Text Processing:
- Extracted text is chunked into smaller segments by another Lambda function
- Each chunk is stored in a dedicated S3 bucket
Vector Generation and Storage:
- The vector_generator Lambda processes each text chunk
- Vector embeddings are generated using Amazon Bedrock's Titan model
- Embeddings are stored in Amazon OpenSearch for efficient vector search
Search and Retrieval:
- OpenSearch provides vector similarity search capabilities
- The stored vectors can be queried to find semantically similar content

Infrastructure

The infrastructure is deployed using Terraform and includes:

S3 buckets for document storage at various processing stages
Lambda functions for text extraction, chunking, and vector generation
OpenSearch domain for vector storage and similarity search
IAM roles and policies for secure service interactions
CloudWatch logging for monitoring and debugging

Using with RAG Applications

To integrate with your RAG applications:

Upload PDF documents to the raw PDFs S3 bucket
The system automatically processes documents and generates vector embeddings
Query the OpenSearch domain to perform semantic searches
Retrieve relevant text chunks to provide context to your LLM

API Access

This project includes a REST API endpoint to directly query the RAG system:

# Using the provided API script (available as a GitHub Actions artifact)
./query_api.sh "What is our password policy?"

# Manually using curl
curl -X POST \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_TOKEN" \
  -d '{"query": "What is our password policy?"}' \
  https://YOUR_API_GATEWAY_ID.execute-api.YOUR_REGION.amazonaws.com/search

The system will:

Convert your query to embeddings
Find the most relevant policy chunks
Use Claude 3 Sonnet to generate a comprehensive answer
Return the answer with source citations

You can download the ready-to-use API script from the latest successful GitHub Actions workflow as an artifact named api-query-script.

Using Bruno for API Testing

Bruno is an open-source API client that makes testing APIs simple. This project includes a Bruno collection that's automatically generated with your API endpoint.

To use it:

Install Bruno from usebruno.com
Download the bruno-collection artifact from the latest GitHub Actions workflow
Open Bruno and import the collection from the downloaded artifact
Run the "Search Policy" request to test the API

The collection includes the correct endpoint URL and is pre-configured with sample queries.

For details on the OpenSearch configuration and how to query the vectors, refer to the AWS OpenSearch documentation.

Future Improvements

Improve test coverage for utility modules:
- Create comprehensive tests for src/utils/bedrock_utils.py (current coverage: 0%)
- Create comprehensive tests for src/utils/opensearch_utils.py (current coverage: 0%)
- Target at least 80% coverage for these modules
Enhance X-Ray tracing for better request tracking and performance monitoring

Quick Start

Clone the repository

git clone https://github.com/reaandrew/ee-ai-rag-mcp-demo

Install dependencies

cd ee-ai-rag-mcp-demo
npm install

Follow the documentation

Check the repository's README.md file for specific installation and usage instructions.

Repository Details

Ownerreaandrew

Repoee-ai-rag-mcp-demo

LanguagePython

License-

Last fetched8/10/2025

Quick Links

Issues

Releases

License

Recommended MCP Servers

💬

Discord MCP

Enable AI assistants to seamlessly interact with Discord servers, channels, and messages.

integrationsdiscordchat

🔗

Knit MCP

Connect AI agents to 200+ SaaS applications and automate workflows.

integrationsautomationsaas

🕷️

Apify MCP Server

Deploy and interact with Apify actors for web scraping and data extraction.

apifycrawlerdata

🌐

BrowserStack MCP

BrowserStack MCP Server for automated testing across multiple browsers.

testingqabrowsers

⚡

Zapier MCP

A Zapier server that provides automation capabilities for various apps.

zapierautomation