foundinblank
MCP Serverfoundinblankpublic

mcp data sandbox

使用 dlt 和 DuckDB 快速搭建最小数据栈,并通过 MCP 服务器与 GitHub Copilot 实现 AI 驱动的 SQL 操作。

Repository Info

6
Stars
1
Forks
6
Watchers
0
Issues
Python
Language
-
License

About This Server

使用 dlt 和 DuckDB 快速搭建最小数据栈,并通过 MCP 服务器与 GitHub Copilot 实现 AI 驱动的 SQL 操作。

Model Context Protocol (MCP) - This server can be integrated with AI applications to provide additional context and capabilities, enabling enhanced AI interactions and functionality.

Documentation

mcp-data-sandbox

Python DuckDB dlt MCP

A quick start guide to setting up a minimal data stack using dlt and DuckDB, and then leveraging the DuckDB MCP server with VSCode + GitHub Copilot for AI-powered SQL operations. Written as a companion to my MCP Servers for Dummies blog post.

This guide will help you:

  1. Quickly set up dlt and DuckDB.
  2. Understand what an MCP server is and why it's useful.
  3. Connect GitHub Copilot to your DuckDB data using a MCP server.

Prerequisites:

  • VSCode with MCP Server enabled.
  • GitHub Copilot in VSCode
  • Python 3.9 or higher

Set Up Your Minimal Data Stack

  1. Install uv:

    curl -LsSf https://astral.sh/uv/install.sh | sh
    
  2. Create and navigate to your project directory:

    mkdir mcp-data-sandbox
    cd mcp-data-sandbox
    
  3. Initialize a new project, create a virtual environment, and install dependencies:

    uv init
    uv venv
    source .venv/bin/activate
    uv add duckdb
    uv add dlt[duckdb]
    uv sync
    
  4. Set up a dlt pipeline to ingest Chess.com data into DuckDB:

    dlt init chess duckdb
    
  5. Run the data load job. You should see a new chess_pipeline.duckdb file appear in your directory:

    python chess_pipeline.py
    
  6. Explore your dataset using DuckDB's GUI:

    duckdb chess_pipeline.duckdb -ui
    

    For a CLI-based option, see harlequin.

Now you have data to work with. While LLMs can help you write code and queries, they don't natively interface with DuckDB. Let's fix that by using a MCP server!

Set Up the MCP Server

MCP servers are standardized interfaces (like REST APIs) that let large language models (LLMs) interact with external tools such as DuckDB or Figma or Chrome.

Today, many LLM-powered apps use proprietary solutions to achieve this. For example, GitHub Copilot in VS Code can access your files and monitor your terminal. The ChatGPT desktop app for macOS can read content from apps like Notes or Terminal.

MCP is different. It’s open, programmable, and designed to be a standard. It gives LLMs a consistent way to integrate with external systems—-without vendor lock-in.

Think of it as a power-up 🍄 for your LLM. Without MCP, LLMs are boxed into their immediate environment. By using MCP servers, LLMs gain the ability to "agentically" access other tools, systems, or data sources.

It's similar to exposing a remote database through a REST API endpoint. Suddenly, your data becomes accessible to any app or user (with the right credentials), and the possibilities multiply.

Now let’s apply that idea: with the DuckDB MCP server by MotherDuck, your LLM can power up 💪 with the ability to query DuckDB and MotherDuck databases directly.

  1. Create a .vscode/mcp.json file. This saves your MCP configuration in your workspace settings (i.e., your project folder) instead of your user settings:

    mkdir .vscode
    cd .vscode
    touch mcp.json
    
  2. Add DuckDB MCP configuration by copy and pasting this code block into mcp.json:

    {
      "servers": {
        "motherduck": {
          "command": "uvx",
          "args": [
            "mcp-server-motherduck",
            "--db-path",
            "${workspaceFolder}/chess_pipeline.duckdb"
          ]
        }
      }
    }
    
  3. Open a Copilot chat in VSCode and ask it a question about your DuckDB file. For example:

    • "How many schemas and tables do I have in my DuckDB file?"
    • "Tell me something interesting about the data in my DuckDB file."
    • "Create a query that shows the top 10 chess players by rating."
    • "Create a visualization of the opening moves frequency in my chess data."

Troubleshooting

If you encounter issues:

  1. MCP Server not connecting:

    • Ensure MCP support is enabled in VSCode
    • Check that your mcp.json file has the correct path to your DuckDB file
    • Restart VSCode and try again
  2. DuckDB file not found:

    • Verify that you've run python chess_pipeline.py successfully
    • Check the path in your mcp.json file

Quick Start

1

Clone the repository

git clone https://github.com/foundinblank/mcp-data-sandbox
2

Install dependencies

cd mcp-data-sandbox
npm install
3

Follow the documentation

Check the repository's README.md file for specific installation and usage instructions.

Repository Details

Ownerfoundinblank
Repomcp-data-sandbox
LanguagePython
License-
Last fetched8/10/2025

Recommended MCP Servers

💬

Discord MCP

Enable AI assistants to seamlessly interact with Discord servers, channels, and messages.

integrationsdiscordchat
🔗

Knit MCP

Connect AI agents to 200+ SaaS applications and automate workflows.

integrationsautomationsaas
🕷️

Apify MCP Server

Deploy and interact with Apify actors for web scraping and data extraction.

apifycrawlerdata
🌐

BrowserStack MCP

BrowserStack MCP Server for automated testing across multiple browsers.

testingqabrowsers

Zapier MCP

A Zapier server that provides automation capabilities for various apps.

zapierautomation