
obsidian index service
A service that monitors an Obsidian vault for file changes (new, modified, or deleted Markdown files) and indexes each note’s metadata and content into an SQLite database, exposing it for e.g. an MCP-server
Repository Info
About This Server
A service that monitors an Obsidian vault for file changes (new, modified, or deleted Markdown files) and indexes each note’s metadata and content into an SQLite database, exposing it for e.g. an MCP-server
Model Context Protocol (MCP) - This server can be integrated with AI applications to provide additional context and capabilities, enabling enhanced AI interactions and functionality.
Documentation
Obsidian Index Service
This service monitors an Obsidian vault directory and indexes Markdown files—metadata and full content—into an SQLite database. I built it to work with my mcp-server project, but switched to an implementation that uses the Obsidian plugin API instead. I still see use for this as an agnostic note indexer or sync tool (see note under "Future Steps"), so I'm putting it up here.
Functionality
It tracks file changes (create, modify, delete) in an Obsidian vault and stores everything in SQLite, accessible via a Docker volume. It captures:
- Path: File path (unique identifier)
- Title: From filename
- Parent Folders: Relative to vault root
- Tags: From YAML frontmatter
- Created Date: Filesystem timestamp
- Modified Date: Filesystem timestamp
- Content: Full text of the note
- Status: Processing outcome (success/error)
- Error Message: Details if processing fails
Setup and Usage
Prerequisites
- Python 3.12 or later
- Docker and Docker Compose (for containerized use)
uv(optional, but recommended)
Installation
-
Clone the repo:
git clone https://github.com/pmmvr/obsidian-index-service.git cd obsidian-index-service -
Set up a virtual environment:
-
With
uv(recommended):uv venv source .venv/bin/activate # Linux/macOS .venv\Scripts\activate # Windows -
With
python(standard):python -m venv .venv source .venv/bin/activate # Linux/macOS .venv\Scripts\activate # Windows
-
-
Install dependencies:
- With
uv(recommended):uv sync # Installs from uv.lock uv pip install pytest pytest-bdd pytest-mock # For tests - With
pip:pip install -e . pip install pytest pytest-bdd pytest-mock # For tests
- With
Running Locally
Set environment variables:
export OBSIDIAN_VAULT_PATH=/path/to/vault
export DB_PATH=/path/to/notes.sqlite
Run it:
python main.py
With uv:
uv run python main.py
For a one-time scan:
python main.py --scan-only
Or with uv:
uv run python main.py --scan-only
Command-Line Options
--vault-path: Path to vault directory--db-path: Path to SQLite database--scan-only: Scan without watching
Using Docker
- Build and run:
docker-compose up -d - It mounts your vault and exposes the SQLite database.
Read-Only Access for Other Services
To let another service read the database (e.g., for scanning changes):
- Use the same volume as
obsidian-index-servicein yourdocker-compose.yml:services: your-service: image: your-image volumes: - ${DB_VOLUME_PATH:-./data}:/data:ro # Read-only mount Obsidian Index Servicewrites to/data/notes.sqlite(mounted read-write), while other services (e.g. an mcp-server) read it. SQLite's WAL mode handles concurrent access.
How It Works
The Obsidian Index Service operates through the following process:
-
Startup (
ObsidianIndexService.__init__)- Loads configuration from environment variables or command-line arguments
- Initializes the database connection (
DatabaseConnection) - Sets up the note processor (
NoteProcessor) - Establishes signal handlers for graceful shutdown
-
Database Initialization (
DatabaseConnection.__init__)- Creates/connects to an SQLite database
- Sets up the database in WAL (Write-Ahead Logging) mode for better concurrency
- Creates a 'notes' table if it doesn't exist with columns for path, title, tags, etc.
-
Initial Vault Scan (
NoteProcessor.scan_vault)- Finds all Markdown files (*.md, *.markdown) in the vault directory
- For each file, extracts metadata
- Adds all extracted metadata to the database (
NoteOperations.insert_note)
-
Continuous Monitoring (
FileWatcher.watch)- Watches the vault directory for file system events
- Processes different types of events:
- File creation: Indexes new files
- File modification: Updates index for changed files
- File deletion: Removes entries from the index
- File movement/renaming: Updates path information
-
File Processing (
NoteProcessor.process_note)- Extracts metadata from Markdown files
- Includes path, title, parent folders, tags, created/modified dates
- Updates the database with this information (
NoteOperations.upsert_note)
-
Graceful Shutdown (
ObsidianIndexService.shutdown)- Properly closes file watchers and database connections when receiving termination signals
The service operates in the background, continuously keeping the SQLite database in sync with the Obsidian vault. Other applications can then use this database to access note metadata without having to parse Markdown files directly.
Development
Run tests:
pytest
Project Status
- Done: Core indexing (metadata + content), Docker setup, file watching, database CRUD.
- Next: Planned an API, but went with the plugin approach instead.
Future Steps (Sync Tool Potential)
With some rework I can see this as a sync tool:
- Remote Backend: Add support for cloud storage (e.g., Dropbox) or a server.
- Sync Logic: Push local changes (content + metadata) to remote, pull remote updates, handle conflicts (e.g., last-write-wins).
- Database Tweaks: Add
sync_statusandremote_idcolumns. - File Watcher Updates: Queue changes for sync, not just indexing.
- CLI Option: Add
--syncto trigger it manually or continuously. - Error Handling: Retry on network fails, log issues.
Quick Start
Clone the repository
git clone https://github.com/pmmvr/obsidian-index-serviceInstall dependencies
cd obsidian-index-service
npm installFollow the documentation
Check the repository's README.md file for specific installation and usage instructions.
Repository Details
Recommended MCP Servers
Discord MCP
Enable AI assistants to seamlessly interact with Discord servers, channels, and messages.
Knit MCP
Connect AI agents to 200+ SaaS applications and automate workflows.
Apify MCP Server
Deploy and interact with Apify actors for web scraping and data extraction.
BrowserStack MCP
BrowserStack MCP Server for automated testing across multiple browsers.
Zapier MCP
A Zapier server that provides automation capabilities for various apps.