Ansible Chatbot (llama) Stack

This repository contains the configuration needed to build a container image for ansible-chatbot-stack.

ansible-chatbot-stack builds on top of lightspeed-stack, which wraps Meta's llama-stack AI framework.

ansible-chatbot-stack includes various customisations for:

  • A remote vLLM inference provider (RHOSAI vLLM compatible)
  • The inline sentence transformers (Meta)
  • AAP RAG database files and configuration
  • Lightspeed external providers
  • System Prompt injection

Build/Run overview:

    flowchart TB
    %% Nodes
        LLAMA_STACK([fa:fa-layer-group llama-stack:x.y.z])
        LIGHTSPEED_STACK([fa:fa-layer-group lightspeed-stack:x.y.z])
        LIGHTSPEED_RUN_CONFIG{{fa:fa-wrench lightspeed-stack.yaml}}
        ANSIBLE_CHATBOT_STACK([fa:fa-layer-group ansible-chatbot-stack:x.y.z])
        ANSIBLE_CHATBOT_RUN_CONFIG{{fa:fa-wrench ansible-chatbot-run.yaml}}
        ANSIBLE_CHATBOT_DOCKERFILE{{fa:fa-wrench Containerfile}}
        ANSIBLE_LIGHTSPEED([fa:fa-layer-group ansible-ai-connect-service:x.y.z])
        LIGHTSPEED_PROVIDERS("fa:fa-code-branch lightspeed-providers:x.y.z")
        PYPI("fa:fa-database PyPI")

    %% Edge connections between nodes
        ANSIBLE_LIGHTSPEED -- Uses --> ANSIBLE_CHATBOT_STACK
        ANSIBLE_CHATBOT_STACK -- Consumes --> PYPI
        LIGHTSPEED_PROVIDERS -- Publishes --> PYPI
        ANSIBLE_CHATBOT_STACK -- Built from --> ANSIBLE_CHATBOT_DOCKERFILE
        ANSIBLE_CHATBOT_STACK -- Inherits from --> LIGHTSPEED_STACK
        ANSIBLE_CHATBOT_STACK -- Includes --> LIGHTSPEED_RUN_CONFIG
        ANSIBLE_CHATBOT_STACK -- Includes --> ANSIBLE_CHATBOT_RUN_CONFIG
        LIGHTSPEED_STACK -- Embeds --> LLAMA_STACK
        LIGHTSPEED_STACK -- Uses --> LIGHTSPEED_RUN_CONFIG
        LLAMA_STACK -- Uses --> ANSIBLE_CHATBOT_RUN_CONFIG

Build

Setup for Ansible Chatbot Stack

  • External Providers YAML manifests must be present in providers.d/ of your host's llama-stack directory.
  • The vector database is copied from the latest aap-rag-content image to ./vector_db.
  • The embeddings model files are copied from the latest aap-rag-content image to ./embeddings_model.

    make setup
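
After make setup completes, a quick sanity check that the expected artifacts are in place (a sketch based on the paths listed above):

    # Vector database and embeddings model fetched from aap-rag-content
    ls ./vector_db ./embeddings_model
    # External provider manifests
    ls ./llama-stack/providers.d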

Building Ansible Chatbot Stack

Builds the image ansible-chatbot-stack:$ANSIBLE_CHATBOT_VERSION.

Change the ANSIBLE_CHATBOT_VERSION value below accordingly.

    export ANSIBLE_CHATBOT_VERSION=0.0.1
    
    make build
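
To confirm the image was built (assuming podman as the container runtime; substitute docker if that is what your setup uses):

    podman images | grep ansible-chatbot-stack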

Container file structure

Files from lightspeed-stack base image

└── app-root/
    ├── .venv/
    └── src/
        ├── <lightspeed-stack files>
        └── lightspeed_stack.py

Runtime files

These are stored in a PersistentVolumeClaim for resilience.

└── .llama/
    └── data/
        └── distributions/
            └── ansible-chatbot/
                ├── aap_faiss_store.db
                ├── agents_store.db
                ├── responses_store.db
                ├── localfs_datasetio.db
                ├── trace_store.db
                └── embeddings_model/

Configuration files

└── .llama/
    ├── distributions/
    │   ├── llama-stack/
    │   │   └── config/
    │   │       └── ansible-chatbot-run.yaml
    │   └── ansible-chatbot/
    │       ├── ansible-chatbot-version-info.json
    │       ├── config/
    │       │   └── lightspeed-stack.yaml
    │       └── system-prompts/
    │           └── default.txt
    └── providers.d/
        └── <llama-stack external providers>

Run

Runs the image ansible-chatbot-stack:$ANSIBLE_CHATBOT_VERSION as a local container.

Change the ANSIBLE_CHATBOT_VERSION value and the inference parameters below accordingly.

    export ANSIBLE_CHATBOT_VERSION=0.0.1
    export ANSIBLE_CHATBOT_VLLM_URL=<YOUR_MODEL_SERVING_URL>
    export ANSIBLE_CHATBOT_VLLM_API_TOKEN=<YOUR_MODEL_SERVING_API_TOKEN>
    export ANSIBLE_CHATBOT_INFERENCE_MODEL=<YOUR_INFERENCE_MODEL>
    export ANSIBLE_CHATBOT_INFERENCE_MODEL_FILTER=<YOUR_INFERENCE_MODEL_TOOLS_FILTERING>
    
    make run
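
Once the container is up, a quick smoke test can be run against the v1/query endpoint (a minimal sketch, assuming the service listens on localhost:8080; adjust the host and port to match your run configuration):

    curl -s -X POST http://localhost:8080/v1/query \
        -H "Content-Type: application/json" \
        -d '{"query": "hello", "system_prompt": "You are a helpful assistant."}'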

Basic tests

Runs basic tests against the local container.

Change the ANSIBLE_CHATBOT_VERSION value and the inference parameters below accordingly.

    export ANSIBLE_CHATBOT_VERSION=0.0.1
    export ANSIBLE_CHATBOT_VLLM_URL=<YOUR_MODEL_SERVING_URL>
    export ANSIBLE_CHATBOT_VLLM_API_TOKEN=<YOUR_MODEL_SERVING_API_TOKEN>
    export ANSIBLE_CHATBOT_INFERENCE_MODEL=<YOUR_INFERENCE_MODEL>
    export ANSIBLE_CHATBOT_INFERENCE_MODEL_FILTER=<YOUR_INFERENCE_MODEL_TOOLS_FILTERING>
    
    make run-test

AAP quality evaluations

AAP Chatbot Quality evaluations available:

Deploy into a k8s cluster

Change the configuration in kustomization.yaml accordingly, then:

    kubectl kustomize . > my-chatbot-stack-deploy.yaml

Deploy the service

    kubectl apply -f my-chatbot-stack-deploy.yaml
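
To verify the rollout (resource names depend on your kustomization.yaml; <your-chatbot-deployment> is a placeholder):

    kubectl get pods
    kubectl logs deployment/<your-chatbot-deployment> -f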

Appendix - Google Gemini API

Using the gemini remote inference provider:

  • Set the environment variable OPENAI_API_KEY=<YOUR_API_KEY>
  • Example of a v1/query request:
    {
        "query": "hello",
        "system_prompt": "You are a helpful assistant.",
        "model": "gemini/gemini-2.5-flash",
        "provider": "gemini"
    }
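
For example, sent with curl (assuming the service listens on localhost:8080; adjust to your deployment):

    curl -s -X POST http://localhost:8080/v1/query \
        -H "Content-Type: application/json" \
        -d '{"query": "hello", "model": "gemini/gemini-2.5-flash", "provider": "gemini"}'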

Appendix - Google Vertex API

Using the gemini remote inference provider (with Vertex AI credentials):

  • Set a dummy value for the environment variable OPENAI_API_KEY (so that the gemini provider within llama-stack does not complain).
  • Set GOOGLE_APPLICATION_CREDENTIALS=<PATH_GOOGLE_CRED_JSON_FILE> to the path of your Google service account credentials JSON file; both exports are shown together after the example below.
  • Example of a v1/query request:
    {
        "query": "hello",
        "system_prompt": "You are a helpful assistant.",
        "model": "gemini-2.5-flash",
        "provider": "gemini"
    }
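
Putting the two environment variables together (the credentials path is a placeholder):

    export OPENAI_API_KEY=dummy
    export GOOGLE_APPLICATION_CREDENTIALS=<PATH_GOOGLE_CRED_JSON_FILE>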

Appendix - Host clean-up

If you need to rebuild images, apply the following clean-up first:

    make clean

Appendix - Obtain a container shell

    # Obtain a container shell for the Ansible Chatbot Stack.
    make shell

Appendix - Run from source (PyCharm)

  1. Clone the lightspeed-core/lightspeed-stack repository to your development environment.
  2. In the ansible-chatbot-stack project root, create a .env file and define the following variables:
    PYTHONDONTWRITEBYTECODE=1
    PYTHONUNBUFFERED=1
    PYTHONCOERCECLOCALE=0
    PYTHONUTF8=1
    PYTHONIOENCODING=UTF-8
    LANG=en_US.UTF-8
    VLLM_URL=(VLLM URL Here)
    VLLM_API_TOKEN=(VLLM API Token Here)
    INFERENCE_MODEL=granite-3.3-8b-instruct
    
    LIBRARY_CLIENT_CONFIG_PATH=./ansible-chatbot-run.yaml
    SYSTEM_PROMPT_PATH=./ansible-chatbot-system-prompt.txt
    EMBEDDINGS_MODEL=./embeddings_model
    VECTOR_DB_DIR=./vector_db
    PROVIDERS_DB_DIR=./work
    EXTERNAL_PROVIDERS_DIR=./llama-stack/providers.d
    
  3. Create a Python run configuration with the following values:
    • script/module: script
    • script path: (lightspeed-stack project root)/src/lightspeed_stack.py
    • arguments: --config ./lightspeed-stack_local.yaml
    • working directory: (ansible-chatbot-stack project root)
    • path to ".env" files: (ansible-chatbot-stack project root)/.env
  4. Run the created configuration from the PyCharm main menu.
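
For reference, a rough command-line equivalent of this PyCharm run configuration (run from the ansible-chatbot-stack project root; the path in parentheses is a placeholder for your local checkout):

    # Export the .env variables into the current shell, then start the stack
    set -a; source .env; set +a
    python (lightspeed-stack project root)/src/lightspeed_stack.py --config ./lightspeed-stack_local.yaml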

Note:

If you want to debug code in the lightspeed-providers project, you can add it as a local package dependency with:

    uv add --editable (lightspeed-providers project root)

This updates the pyproject.toml and uv.lock files. Remember that these changes are for debugging purposes only; avoid checking them in.
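
Before committing, the local edits can be reverted with, for example:

    git checkout -- pyproject.toml uv.lock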