Watch

mirror of https://github.com/ansible/metrics-service synced 2026-07-22 19:43:51 +00:00

No description

Python 96.3%
HTML 2.6%
Shell 0.8%
Dockerfile 0.2%
Makefile 0.1%

Find a file

Repository files (latest commit first)
Filename	Latest commit message	Latest commit date
dependabot[bot] fbfdedabc6 chore(deps): bump pip from 25.2 to 26.1.2 (#326 ) Bumps [pip](https://github.com/pypa/pip) from 25.2 to 26.1.2. - [Changelog](https://github.com/pypa/pip/blob/main/NEWS.rst) - [Commits](https://github.com/pypa/pip/compare/25.2...26.1.2) --- updated-dependencies: - dependency-name: pip dependency-version: 26.1.2 dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>		2026-07-22 14:17:26 +01:00
.github	chore(deps): bump actions/setup-python from 6.2.0 to 6.3.0 (#323 )	2026-07-22 14:12:35 +01:00
.tekton	chore: update tekton pipeline bundle digest (#271 )	2026-06-18 13:35:29 -05:00
.vscode	AAP-52509 - Changing to Default Postgres Over SQL (#11 )	2025-09-16 16:14:50 +01:00
apps	[AAP-83321] Retry initial_resource_sync on failure with short backoff (#365 )	2026-07-22 12:40:37 +01:00
licenses	Enhance Dockerfile and settings for Red Hat certification compliance (#130 )	2026-03-06 11:03:00 +00:00
metrics_service	fix: guard DYNACONF._post_hooks access to prevent AttributeError (#343 )	2026-07-08 18:16:02 +01:00
scripts	fix: expose Prometheus endpoint at /api/metrics (AAP-77819) (#251 )	2026-06-22 11:18:45 +01:00
tests	fix(AAP-77819): fix Prometheus metrics endpoint trailing slash and remove stale legacy redirects (#364 )	2026-07-22 11:09:52 +01:00
tools	fix(AAP-77819): fix Prometheus metrics endpoint trailing slash and remove stale legacy redirects (#364 )	2026-07-22 11:09:52 +01:00
.coderabbit.yaml	Remove standalone docker-compose, use unified compose (#266 )	2026-06-25 06:44:59 +00:00
.copier-answers.yml	chore: update framework templates to pin actions/checkout SHA (#342 )	2026-07-08 18:06:57 +01:00
.dockerignore	Remove legacy requirements files and add production Docker configuration (#129 )	2026-03-05 20:01:26 +00:00
.gitignore	Remove celery reference (#346 )	2026-07-14 11:06:53 +02:00
.hermeto-pip-input.json	Konflux build fix (#104 )	2026-02-16 11:20:16 +00:00
.pre-commit-config.yaml	Update pre-commit configuration and enhance development documentation (#151 )	2026-03-20 19:26:59 +00:00
.protected_files.yaml	[platform-service-framework] complete framework retrofit (#84 )	2025-12-19 10:06:49 -05:00
CLAUDE.md	feat(indirect-nodes): move collector from snapshot to daily pipeline with DB routing (#332 )	2026-07-06 09:19:35 +01:00
codecov.yml	URL tests - remove exception suppression, drop separate test URL config (#263 )	2026-06-16 18:47:15 +00:00
CONTRIBUTING.md	Update pre-commit configuration and enhance development documentation (#151 )	2026-03-20 19:26:59 +00:00
DCO	Repo initialization	2025-07-28 11:58:20 -04:00
docker-compose.prod.single.yml	Fix retry not increasing attempts, task fixups (#187 )	2026-04-16 13:31:43 +00:00
docker-compose.production.yml	consolidate task timeout into a single TASK_TIMEOUT setting (#218 )	2026-05-20 16:15:40 +00:00
Dockerfile	Fix retry not increasing attempts, task fixups (#187 )	2026-04-16 13:31:43 +00:00
Dockerfile.dev	Remove standalone docker-compose, use unified compose (#266 )	2026-06-25 06:44:59 +00:00
LICENSE	[AAP-69237] framework alignment (#267 )	2026-06-18 13:51:05 +01:00
Makefile	Remove standalone docker-compose, use unified compose (#266 )	2026-06-25 06:44:59 +00:00
manage.py	[AAP-69237] framework alignment (#267 )	2026-06-18 13:51:05 +01:00
pyproject.toml	chore(deps): bump pip from 25.2 to 26.1.2 (#326 )	2026-07-22 14:17:26 +01:00
README.md	Remove standalone docker-compose, use unified compose (#266 )	2026-06-25 06:44:59 +00:00
renovate.json	revert renovate cron fix — Renovate requires wildcard minutes (#341 )	2026-07-09 12:43:26 +00:00
SECURITY.md	Repo initialization	2025-07-28 11:58:20 -04:00
settings.local.py.example	Env var cleanup - use METRICS_SERVICE_MODE (#87 )	2026-01-16 14:57:18 +00:00
sonar-project.properties	use logger.exception() in some exception handlers, disable rule forcing it over logger.error (#279 )	2026-07-09 10:57:12 +00:00
ubi.repo	build(ci): [AAP-52401] Metrics-Service Onboarding to Konflux (#21 )	2025-10-03 10:50:24 +01:00
uv.lock	chore(deps): bump pip from 25.2 to 26.1.2 (#326 )	2026-07-22 14:17:26 +01:00

README.md

Metrics service

A modern Django-based service built for the Ansible Automation Platform (AAP) ecosystem, featuring comprehensive task management, REST APIs, and automated background job processing.

Features

🚀 Modern Django Architecture - Django 5.2+ with clean app-based structure
📊 Automated Task Management - Feature-enable controlled task groups with automatic routing
⚡ Smart Task Routing - Automatic submission to dispatcherd with no manual intervention
🔌 REST API - Versioned RESTful APIs with OpenAPI documentation
🔐 Authentication & Authorization - Django-Ansible-Base integration with RBAC
🐳 Docker Ready - Multi-container deployment with PostgreSQL
🧪 Comprehensive Testing - Unit and integration tests with coverage reporting
📝 API Documentation - Interactive Swagger/OpenAPI documentation
🔧 Metrics Collection - Integrated metrics-utility for data collection

Quick Start

Option 1: Docker + dev server (Recommended)

# Requires a sibling ../metrics-utility checkout.

# Start base containers (postgres, minio)
make compose

# In another terminal — run migrations, start dev server
tools/dev.sh --init

Your service will be available at:

Application: http://localhost:8000
API Documentation: http://localhost:8000/api/docs/
Admin Interface: http://localhost:8000/admin/

Option 2: Local Development

# Prerequisites: Python 3.12, PostgreSQL 13+

# Install dependencies (project uses uv)
uv sync --dev

# Configure (optional — for local overrides)
cp settings.local.py.example settings.local.py
# Edit settings.local.py to configure your local development environment.

# Set up database (configure via environment variables if needed)
# See Configuration section below for environment variable options
python manage.py migrate
python manage.py metrics_service init-default-settings
python manage.py metrics_service init-service-id
python manage.py metrics_service init-system-tasks
python manage.py createsuperuser

# Start complete service (Django + dispatcher + scheduler)
python manage.py metrics_service run

Option 3: Local development, with uv and metrics-utility from sources

Edit pyproject.toml such that:

 [tool.uv.sources]
 django-ansible-base = { git = "https://github.com/ansible/django-ansible-base", rev = "devel" }
+metrics-utility = { path = "../metrics-utility", editable = true }

uv sync
uv run ./manage.py migrate
uv run ./manage.py createsuperuser
uv run ./manage.py metrics_service run
uv run tools/tasks/run_task.py hello_world # debugging individual tasks

Endpoints

# List all tasks
GET /api/v1/tasks/

# Create a new task
POST /api/v1/tasks/
{
  "name": "Hello World Task",
  "function_name": "hello_world",
  "task_data": {}
}

# Get running tasks
GET /api/v1/tasks/running/

# Retry a failed task
POST /api/v1/tasks/{id}/retry/

# Available task functions
GET /api/v1/tasks/available_functions/

Built-in Task Functions

System Tasks (always enabled):

cleanup_old_tasks - Clean up completed/failed tasks
hello_world - Simple test task for dispatcherd integration Metrics Collection Tasks (controlled by METRICS_COLLECTION, default: enabled):
collect_hourly_metrics - Collect time-series metrics every hour (collector type via collector_type parameter)
collect_snapshot_metrics - Collect daily snapshot metrics (collector type via collector_type parameter)
daily_metrics_rollup - Merge hourly collections and create daily rollup summary
cleanup_metrics_data - Clean up old metrics data based on retention policies

Anonymization and Transmission Tasks (controlled by ANONYMIZED_DATA_COLLECTION, default: enabled, customer opt-out):

daily_anonymize_and_prepare - Anonymize daily rollup and prepare for transmission
send_anonymized_to_segment - Send anonymized metrics to Segment.com

Background Tasks

The service includes an automated background task system with intelligent routing:

Unified Service Management

# Start complete service (init*, then Django + dispatcher + scheduler)
python manage.py metrics_service run

# Start with custom configuration
python manage.py metrics_service run --workers 4

# Individual components
python manage.py runserver 0.0.0.0:8000  # web
python manage.py run_dispatcherd --workers 2  # worker
python manage.py run_task_scheduler  # scheduler

Automatic Task Routing

Tasks are automatically routed based on their properties:

Immediate tasks → Direct to dispatcherd
Scheduled tasks → APScheduler with DateTrigger
Recurring tasks → APScheduler with CronTrigger

No manual intervention required - create a task and it's automatically processed!

Task Groups & Feature Flags

We have these feature flags:

flag	default
`METRICS_COLLECTION`	true
`ANONYMIZED_DATA_COLLECTION`	true
`DASHBOARD_COLLECTION`	false (customer opt-in)

You can change defaults using METRICS_SERVICE_FEATURE__ prefixed environment variables.

# Pause local collectors, rollup, and metrics cleanup (default: true)
METRICS_SERVICE_FEATURE__METRICS_COLLECTION=false

# Disable anonymization and Segment transmission (default: true)
METRICS_SERVICE_FEATURE__ANONYMIZED_DATA_COLLECTION=false

Feature flags are resolved at runtime with this precedence:

DB row in dynamic_settings_setting (written via the API, dbshell, or a prior init-default-settings run) — always wins
Env var (METRICS_SERVICE_FEATURE__*) — used on fresh installs or when no DB row exists
Static default in settings.FEATURE — fallback if neither of the above is set

init-default-settings does not pre-seed feature flags into the database, so env vars take effect on fresh installs unless a DB row exists. DB rows always take precedence over env vars. A pod restart is required for env var changes to be picked up by the running service.

If system tasks are missing _feature_flag for metrics collection templates, run python manage.py metrics_service init-system-tasks to sync them.

Development

Code Quality Tools

# Format + lint + test in one step (via poe task runner)
uv run poe check

# Or individually
uv run poe format     # ruff format (includes import sorting)
uv run poe lint       # ruff check
uv run poe unit-test  # pytest

# Direct ruff commands
ruff format .
ruff check . --fix

# Type checking (optional, gradual adoption)
mypy .

Pre-commit Hooks

This project uses pre-commit hooks to ensure code quality and automatically sync requirements files:

# Install pre-commit hooks
pre-commit install

# Run hooks on all files
pre-commit run --all-files

# Run hooks manually
pre-commit run

The pre-commit configuration automatically runs:

ruff check --fix — lint and auto-fix
ruff-format — code formatting
Platform Service Framework validation

Testing

# Run all tests
pytest

# Run with coverage
pytest --cov=apps --cov=metrics_service --cov-report=html

# Run specific test categories
pytest -m unit          # Unit tests only
pytest -m integration   # Integration tests only

Database Operations

# Create migrations
python manage.py makemigrations

# Apply migrations
python manage.py migrate

# Initialize settings table with feature flag defaults
python manage.py metrics_service init-default-settings

# Remove feature flags from settings
python manage.py metrics_service remove-default-settings

# Initialize DAB ServiceID (required after first migration)
python manage.py metrics_service init-service-id

# Initialize system tasks
python manage.py metrics_service init-system-tasks

OpenAPI Schema

The OpenAPI schema files are committed to tools/openapi-schema/ and must be kept in sync with the codebase. A CI check will fail if the committed schema differs from what the code generates.

Generating the schema

Requires the database to be running:

make generate-openapi-schema

This will write/update:

tools/openapi-schema/metrics-service.yaml
tools/openapi-schema/metrics-service.json

Commit these files along with any API changes.

Validating the schema

To validate the committed schema against the OpenAPI 3.0 specification locally:

make validate-openapi-schema

Note

Schema validation runs automatically in CI on every PR and will block merging if the schema is invalid or out of sync with the code.

Configuration

Metrics Service uses Dynaconf for settings management, following the Platform Service Framework.

Quick Start

Development Mode (default):

Important

The following example assumes those values exported as environment variables, to set on the settings.local.py file remove the METRICS_SERVICE_ prefix.

# Project
DJANGO_SETTINGS_MODULE=metrics_service.settings
METRICS_SERVICE_MODE=development
METRICS_SERVICE_SECRET_KEY=dev-secret-key-change-in-production
METRICS_SERVICE_DEBUG="true"
METRICS_SERVICE_ALLOWED_HOSTS='["localhost","127.0.0.1","metrics-service","0.0.0.0"]'

# Database
METRICS_SERVICE_DATABASES__default__ENGINE=django.db.backends.postgresql
METRICS_SERVICE_DATABASES__default__HOST=postgres
METRICS_SERVICE_DATABASES__default__PORT=5432
METRICS_SERVICE_DATABASES__default__USER=metrics_service
METRICS_SERVICE_DATABASES__default__PASSWORD=metrics_service
METRICS_SERVICE_DATABASES__default__NAME=metrics_service
METRICS_SERVICE_DATABASES__default__OPTIONS__sslmode=prefer

# Task App
METRICS_SERVICE_FEATURE__ANONYMIZED_DATA_COLLECTION="true"
DISPATCHERD_CONFIG_FILE=/app/apps/settings/dispatcherd.yaml
DISPATCHERD_ENABLED="true"

python manage.py runserver

Production Mode:

# Set environment mode and required secrets
export METRICS_SERVICE_MODE=production
export METRICS_SERVICE_SECRET_KEY="your-secure-random-key"
export METRICS_SERVICE_ALLOWED_HOSTS="yourdomain.com,api.yourdomain.com"

# Override defaults as needed
export METRICS_SERVICE_DATABASES__default__HOST=prod-db.example.com
export METRICS_SERVICE_DATABASES__default__PASSWORD=secure-password

python manage.py runserver

Configuration Methods

Settings are loaded in order of precedence (lowest to highest):

Read Only (overridable)

metrics_service/settings.py - Framework defaults

Editable:

apps/settings/defaults.py - Defaults for the whole project
apps/core/settings.py - Core settings, DAB related settings
apps/*/settings.py - Each app settings in the loading order
apps/settings/{mode}.py - Settings specific to the current METRICS_SERVICE_MODE
settings.local.py - For local settings (git ignored)
/etc/ansible-automation-platform/metrics_service/ - for prod environment overrides
METRICS_SERVICE_ prefixed environment variables

Common Environment Variables

Variable	Description	Required in Production
`METRICS_SERVICE_MODE`	Environment mode (development/production)	No (defaults to development)
`METRICS_SERVICE_SECRET_KEY`	Django secret key	Yes
`METRICS_SERVICE_DEBUG`	Enable debug mode	No
`METRICS_SERVICE_LOG_LEVEL`	Logging level (DEBUG/INFO/WARNING/ERROR)	No (defaults to INFO)
`METRICS_SERVICE_DATABASES__default__HOST`	Database host	No (has default)
`METRICS_SERVICE_DATABASES__default__PASSWORD`	Database password	No (has default)
`METRICS_SERVICE_ALLOWED_HOSTS`	Allowed hosts (comma-separated)	Yes (production)

Note: Use double underscores (__) for nested settings:

# Nested database configuration
export METRICS_SERVICE_DATABASES__default__HOST=localhost
export METRICS_SERVICE_DATABASES__default__PORT=5432

Logging Configuration

Metrics Service uses a centralized logging system that integrates with Django's logging framework. All log levels are controlled by a single environment variable.

Setting Log Level:

# For development - see all debug messages
export METRICS_SERVICE_LOG_LEVEL=DEBUG

# For production - informational messages only
export METRICS_SERVICE_LOG_LEVEL=INFO

# For troubleshooting - warnings and errors
export METRICS_SERVICE_LOG_LEVEL=WARNING

# For critical issues only
export METRICS_SERVICE_LOG_LEVEL=ERROR

Quick Debug Mode:

# Run with debug logging temporarily
METRICS_SERVICE_LOG_LEVEL=DEBUG python manage.py runserver

# Or for the complete service
METRICS_SERVICE_LOG_LEVEL=DEBUG python manage.py metrics_service run

Log Output Format:

All logs use Django's configured format with timestamps, log levels, request IDs (when applicable), module names, and messages:

2025-01-18 10:15:23,456 INFO     [abc123] apps.tasks.signals New task created: Cleanup (ID: 42)
2025-01-18 10:15:24,789 WARNING  [] apps.core.utils Database connection slow: 2.3s

To inspect the full settings loading history or debug a specific variable:

export DJANGO_SETTINGS_MODULE=metrics_service.settings
uv run dynaconf inspect -m debug -f yaml   # full loading history
uv run dynaconf inspect -k VARIABLE_NAME   # single variable

Deployment

Docker Production

# Build production image
docker build -t metrics-service .

# Run with production settings
docker run -p 8000:8000 \
  -e METRICS_SERVICE_MODE=production \
  -e METRICS_SERVICE_SECRET_KEY=your-secret-key \
  -e METRICS_SERVICE_DATABASES__default__HOST=your-db-host \
  -e METRICS_SERVICE_DATABASES__default__PASSWORD=your-db-password \
  metrics-service

Contributing

Fork the repository
Create a feature branch: git checkout -b feature/my-feature
Make your changes with tests
Run the test suite: uv run pytest
Run code quality checks: uv run poe check
Submit a pull request

Development Standards

Code Style: Ruff formatting, 120 character line length
Type Hints: Required for all new code
Documentation: Docstrings for public APIs
Testing: Test coverage for new features
Commits: Clear, concise commit messages

License

This project is licensed under the Apache License - see the LICENSE file for details.

Support

Documentation: Check the CLAUDE.md file for detailed development guidance
Issues: Report bugs and feature requests via GitHub issues
API Documentation: Interactive docs available at /api/docs/ when running