LICENSE Initial commit 2025-09-18 23:20:33 -04:00
promptfooconfig.comparison.yaml Add configurations and examples for Docker MCP Toolkit and Promptfoo integration. 2025-09-18 23:24:17 -04:00
promptfooconfig.mcp-direct.yaml Add configurations and examples for Docker MCP Toolkit and Promptfoo integration. 2025-09-18 23:24:17 -04:00
promptfooconfig.mcp-repo-summarizer.yaml Add API key instructions and update MCP repo summarizer config. 2025-09-19 11:07:48 -04:00
readme.md Add API key instructions and update MCP repo summarizer config. 2025-09-19 11:07:48 -04:00

Docker Model Runner, Docker MCP Toolkit, and Promptfoo

This repo contains a few examples that use Docker Model Runner, Docker MCP Toolkit, and Promptfoo together to compare models, evaluate MCP servers, and run LLM red-teaming from your own dev machine.

Prerequisites

  1. Enable Docker MCP Toolkit in Docker Desktop per https://docs.docker.com/ai/mcp-catalog-and-toolkit/get-started/#enable-docker-mcp-toolkit.
  2. Enable Docker Model Runner in Docker Desktop or Docker Engine per https://docs.docker.com/ai/model-runner/#enable-docker-model-runner.
  3. Use the Docker Model Runner CLI to pull the following models:
docker model pull ai/gemma3:4B-Q4_K_M
docker model pull ai/smollm3:Q4_K_M
docker model pull ai/mxbai-embed-large:335M-F16
  4. Install Promptfoo:
npm install -g promptfoo
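
Once the prerequisites are in place, a quick check along the following lines can confirm that the tools are reachable before running any evaluation. This is a minimal sketch, not part of the repo: `have_cmd` is a hypothetical helper name, and the script assumes a POSIX shell. It only reports what it finds and does not install anything.

```shell
#!/bin/sh
# Hypothetical helper (not part of Docker or Promptfoo):
# succeeds when the named command is on PATH.
have_cmd() {
  command -v "$1" >/dev/null 2>&1
}

# Report each CLI this README relies on.
missing=0
for tool in docker promptfoo; do
  if have_cmd "$tool"; then
    echo "found: $tool"
  else
    echo "missing: $tool"
    missing=1
  fi
done

# List locally pulled models only when the docker CLI is available.
# The three models pulled above should appear in this listing.
if have_cmd docker; then
  docker model list || echo "docker model list failed; is Docker Model Runner enabled?"
fi
```

If a tool is reported as missing, or the model listing fails, revisit the corresponding prerequisite step before continuing.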

Run the model comparison evaluation

export ANTHROPIC_API_KEY=<your_api_key_here>
promptfoo eval -c promptfooconfig.comparison.yaml
promptfoo view
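
Both this evaluation and the red-team example expect ANTHROPIC_API_KEY to be exported. A small guard such as the one below can fail early with a clear message when the variable is missing. This is a sketch under assumptions: `require_env` is a hypothetical helper name, not a Promptfoo feature, and the script assumes a POSIX shell.

```shell
#!/bin/sh
# Hypothetical helper: succeeds only when the named environment
# variable is set and non-empty; otherwise prints an error to stderr.
require_env() {
  eval "value=\${$1:-}"
  if [ -z "$value" ]; then
    echo "error: $1 is not set" >&2
    return 1
  fi
}

# Run the comparison only when the key is present and promptfoo is installed.
if require_env ANTHROPIC_API_KEY && command -v promptfoo >/dev/null 2>&1; then
  promptfoo eval -c promptfooconfig.comparison.yaml
fi
```

The same guard can be placed in front of the red-team command by swapping in the other config file.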

Run the MCP Direct example

promptfoo eval -c promptfooconfig.mcp-direct.yaml

Run the MCP red-team example

export ANTHROPIC_API_KEY=<your_api_key_here>
promptfoo redteam run -c promptfooconfig.mcp-repo-summarizer.yaml