Mandrel MCP Test Harness

A comprehensive testing framework for Model Context Protocol (MCP) servers with enterprise-grade capabilities.

The Mandrel project provides the moth binary (MOdel context protocol Test Harness) for command-line testing operations.

🌟 Key Features

Protocol Compliance

✅ MCP Protocol Validation - Full MCP 2025-06-18 specification compliance
✅ Transport Testing - stdio transport validation (primary focus)
✅ Capability Detection - Automatic server capability discovery and validation
✅ Error Handling - Comprehensive MCP error code testing (-32601, -32602, -32603)

Performance & Scale

✅ Concurrent Execution - Configurable parallelism with resource limits (up to 32 concurrent tests)
✅ Performance Monitoring - Response time tracking from 0ms to 10+ seconds
✅ Real-time Metrics - Memory usage, CPU monitoring, and throughput measurement
✅ Stress Testing - High-load testing with verified performance characteristics

Validation & Quality

✅ JSONPath Validation - Flexible response validation using JSONPath expressions
✅ Custom Scripts - Python/JavaScript/Lua custom validation logic support
✅ Security Testing - Built-in security constraint validation (no passwords, API keys)
✅ Server Reality Testing - Validates only features servers actually support

Production Ready

✅ CI/CD Integration - GitHub Actions, GitLab CI, Jenkins support with auto-detection
✅ Multiple Report Formats - JSON, HTML, JUnit XML with interactive charts
✅ Configuration Profiles - Environment-specific configuration management
✅ Enterprise Features - Comprehensive validation, audit logging, compliance reporting

🚀 Quick Start

# Build from source
git clone https://github.com/rustic-ai/codeprism.git
cd codeprism
cargo build --release --bin moth

# Run verified working example (100% success rate)
cargo run --bin moth -- run codeprism-docs/docs/test-harness/examples/filesystem-server.yaml

# Validate a specification
cargo run --bin moth -- validate my-server.yaml

📚 Documentation Sections

Getting Started

Quick Start Guide - Get up and running in 5 minutes with verified examples
Installation Guide - Complete installation instructions

Reference Documentation

CLI Reference - Complete command-line documentation (run, validate, report, etc.)
Configuration Reference - Complete YAML specification format with working examples
User Guide - Comprehensive testing guide

Operations & Production

Performance Tuning - Optimize test execution and server performance
Production Deployment - Enterprise deployment guide
Troubleshooting - Common issues and solutions

Examples & Resources

Working Examples - Verified test specifications with 100% success rates
- Filesystem Server - ✅ 8/8 tests passing
- Everything Server - ✅ 8/8 tests passing
Test Results & Reports - Understanding test output

🏗️ Architecture Overview

The MCP Test Harness consists of several key components verified through extensive testing:

Core Components

Configuration Manager: YAML configuration parsing with comprehensive validation
Test Executor: Asynchronous test execution with verified concurrency control
Server Manager: MCP server lifecycle management (stdio transport)
Response Validator: JSONPath validation and custom script validation
Performance Monitor: Real-time monitoring with sub-millisecond precision
Report Generator: Multi-format reporting with interactive visualizations

📊 Test Categories

Core Testing (Verified Working)

Initialization Testing - Server startup and handshake validation
Tool Testing - Individual tool functionality validation with real examples
Resource Testing - Resource access and management validation
Error Handling - MCP error code validation (-32601, -32602, -32603)

Advanced Testing (Production Tested)

Performance Testing - Response time validation (0ms to 10+ seconds measured)
Concurrency Testing - Parallel execution up to 32 concurrent tests
Security Testing - Security constraint validation (no passwords, API keys)
Unicode Testing - International character support validation

Specialized Testing (Enterprise Ready)

Server Reality Testing - Only test capabilities servers actually support
Capability Validation - Accurate capability declaration validation
Protocol Compliance - MCP 2025-06-18 specification compliance
Regression Testing - Automated change impact detection

🎓 Learning Path

Beginner (New to MCP Test Harness)

Quick Start Guide - Run verified examples in 5 minutes
Working Examples - Learn from 100% working tests
CLI Reference - Master the moth run and moth validate commands

Intermediate (Regular User)

Configuration Reference - Advanced YAML configuration patterns
Performance Testing - Optimize test execution
Everything Server Example - Complex real-world testing

Advanced (Power User/Developer)

Production Deployment - Enterprise deployment
Custom Validation Scripts - Python/JS/Lua scripts
CI/CD Integration - Automated testing pipelines

🆘 Getting Help

📖 Documentation Issues: If you find documentation unclear or missing
🐛 Bug Reports: For software bugs and unexpected behavior
💡 Feature Requests: For new functionality suggestions
❓ Usage Questions: For help with configuration and usage

Support Channels

GitHub Issues: Report issues and bugs
GitHub Discussions: Community Q&A
Documentation: Complete reference materials (this site)

🏆 Proven Track Record

Our test harness has been thoroughly tested with real MCP servers:

✅ Filesystem Server Results

Suite: Filesystem MCP Server (MCP-Compliant)
Total Tests: 8, Passed: 8, Failed: 0
Duration: 2.3s
Success Rate: 100%

✅ Everything Server Results

Suite: Everything MCP Server (Working Tests)
Total Tests: 8, Passed: 8, Failed: 0
Duration: 10.02s
Success Rate: 100%

Real Performance Characteristics

Fast Operations: Math operations (0-1ms response time)
Text Processing: Unicode support (0-1ms response time)
Environment Access: System variables (1ms response time)
Long Operations: Progress notifications (10+ seconds)
Resource Management: Basic access validation
Error Handling: Proper MCP error codes

🎯 Server Reality Focus

Unlike documentation-driven test frameworks, we focus on server reality:

✅ Test only what works - No false capability claims
✅ Use actual tool names - Verified against real implementations
✅ Correct output formats - Based on actual server responses
✅ Realistic timeouts - Based on measured performance
✅ Proper error codes - Validated MCP error responses

Before vs After

Traditional Approach (Documentation-Based):

capabilities:
  sampling: true          # ❌ Often wrong
  prompts: true          # ❌ Usually unsupported

expected:
  path: "$.result"       # ❌ Wrong format
  value: 8               # ❌ Unrealistic

Our Approach (Server Reality):

capabilities:
  sampling: false        # ✅ Verified accurate
  prompts: false        # ✅ Tested and confirmed

expected:
  fields:
    - path: "$[0].text"  # ✅ Actual server format
      contains: "100"    # ✅ Realistic validation

Ready to get started? Begin with our Quick Start Guide to run your first verified test in under 5 minutes! 🚀

Want working examples? Check out our verified examples with 100% success rates against real MCP servers.

🌟 Key Features​

Protocol Compliance​

Performance & Scale​

Validation & Quality​

Production Ready​

🚀 Quick Start​

📚 Documentation Sections​

Getting Started​

Reference Documentation​

Operations & Production​

Examples & Resources​

🏗️ Architecture Overview​

Core Components​

📊 Test Categories​

Core Testing (Verified Working)​

Advanced Testing (Production Tested)​

Specialized Testing (Enterprise Ready)​

🎓 Learning Path​

Beginner (New to MCP Test Harness)​

Intermediate (Regular User)​

Advanced (Power User/Developer)​

🆘 Getting Help​

Support Channels​

🏆 Proven Track Record​

✅ Filesystem Server Results​

✅ Everything Server Results​

Real Performance Characteristics​

🎯 Server Reality Focus​

Before vs After​