docker/docker-model-runner-and-mcp-with-promptfoo

mirror of https://github.com/docker/docker-model-runner-and-mcp-with-promptfoo synced 2026-04-05 19:45:11 +00:00

No description

Find a file

Luke Bunselmeyer 283c6356b5 Add API key instructions and update MCP repo summarizer config.		2025-09-19 11:07:48 -04:00
LICENSE	Initial commit	2025-09-18 23:20:33 -04:00
promptfooconfig.comparison.yaml	Add configurations and examples for Docker MCP Toolkit and Promptfoo integration.	2025-09-18 23:24:17 -04:00
promptfooconfig.mcp-direct.yaml	Add configurations and examples for Docker MCP Toolkit and Promptfoo integration.	2025-09-18 23:24:17 -04:00
promptfooconfig.mcp-repo-summarizer.yaml	Add API key instructions and update MCP repo summarizer config.	2025-09-19 11:07:48 -04:00
readme.md	Add API key instructions and update MCP repo summarizer config.	2025-09-19 11:07:48 -04:00

readme.md

Docker Model Runner, Docker MCP Toolkit, and Promptfoo

This repo contains a few examples of how to use Docker Model Runner, Docker MCP Toolkit, and Promptfoo together to compare models, evaluate MCP servers, and even perform LLM red-teaming from the comfort of your own dev machine.

Prerequisites

Enable Docker MCP Toolkit in Docker Desktop per https://docs.docker.com/ai/mcp-catalog-and-toolkit/get-started/#enable-docker-mcp-toolkit.
Enable Docker Model Runner in Docker Desktop or Docker Engine per https://docs.docker.com/ai/model-runner/#enable-docker-model-runner.
Use the Docker Model Runner CLI to pull the following models

docker model pull ai/gemma3:4B-Q4_K_M
docker model pull ai/smollm3:Q4_K_M
docker model pull ai/mxbai-embed-large:335M-F16

Install Promptfoo

npm install -g promptfoo

Run the model comparison evaluation

export ANTHROPIC_API_KEY=<your_api_key_here>
promptfoo eval -c promptfooconfig.comparison.yaml
promptoo view

Run the MCP Direct example

promptfoo eval -c promptfooconfig.mcp-direct.yaml

Run the MCP Red-Team Example

export ANTHROPIC_API_KEY=<your_api_key_here>
promptfoo redteam run -c promptfooconfig.mcp-repo-summarizer.yaml