Deployment

This page documents deployment options and best practices for RAG API Core. It is intended for DevOps engineers, cloud architects, and anyone responsible for running the platform in production or staging environments.

Supported Deployment Targets

Docker Compose: For local development and quick prototyping. Uses docker-compose.yml to spin up the API and dependencies locally.
Azure App Service: For production and staging. The API is built as a Docker container, pushed to Azure Container Registry (ACR), and deployed to Azure App Service. Supports environment variables, managed identity, and Azure integrations.

Note: Kubernetes and bare metal/VM deployments are not supported or recommended for this platform.

Typical Deployment Workflow

Clone the Repository
```sh
git clone <repo-url>
cd RAG_API_Core
```
Local Development (Optional)
Use docker-compose.yml to run the API and dependencies locally for development and testing.
```sh
docker-compose up --build
```
Provision Azure Infrastructure
Deploy an ARM template from a Template Spec (generated from a Bicep template) to create all required Azure resources and permissions. This includes App Service, Storage, Key Vault, AI Search, and managed identities.
Define all required variables/parameters for the environment (resource names, locations, etc.) as part of the deployment.
Build and Push Docker Image
Build the Docker image for the API.
Push the image to Azure Container Registry (ACR).
Deploy to Azure App Service
Configure App Service to pull the image from ACR.
Set environment variables and assign managed identity as needed.
Manual Steps
Check that all permissions and role assignments were created correctly by the ARM deployment.
Deploy LLM and embedding models to Azure OpenAI manually (using Azure Portal or CLI).
Update the configuration YAML files for the specific environment with the correct endpoints, deployment names, and model details.
Connect to the deployment services through a service connection (for CI/CD or automation).
Connect Repos and Run Pipelines
Connect your code repositories to Azure DevOps or GitHub.
Run the deployment pipelines to build and deploy the relevant services.
Deploy the data pipelines to the Function App and the API container to App Service.
Finalize setup and verify all services are running as expected.
Verify Health
Check /api/health or /api/v2/health/check for service status.
Monitor Logs
Use Azure monitoring tools (App Service logs, Application Insights, etc.).
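
The provisioning, build, and deploy steps above can be sketched as shell functions around the Azure CLI and Docker. This is a minimal sketch under stated assumptions, not the project's actual pipeline: the function names and every resource name in the usage comments are hypothetical, the registry is assumed to use the default `<registry>.azurecr.io` login server, and the `--container-image-name` flag assumes a recent Azure CLI (older releases name it `--docker-custom-image-name`).

```sh
#!/bin/sh
# Sketch of the provision / build / deploy steps.
# Function names and all example values are hypothetical placeholders.

# Compose the full image reference for an ACR registry.
# $1 = registry name, $2 = repository, $3 = tag
image_ref() {
  printf '%s.azurecr.io/%s:%s\n' "$1" "$2" "$3"
}

# Provision Azure infrastructure from a Template Spec.
# $1 = resource group, $2 = Template Spec resource ID, $3 = parameters file
provision() {
  az deployment group create \
    --resource-group "$1" \
    --template-spec "$2" \
    --parameters "@$3"
}

# Build the API image from the current directory and push it to ACR.
# $1 = registry name, $2 = full image reference
build_and_push() {
  az acr login --name "$1" &&
    docker build --tag "$2" . &&
    docker push "$2"
}

# Point the App Service at the pushed image.
# $1 = resource group, $2 = app name, $3 = full image reference
deploy_image() {
  az webapp config container set \
    --resource-group "$1" \
    --name "$2" \
    --container-image-name "$3"
}

# Example usage (all values are placeholders):
#   spec_id=$(az ts show --resource-group my-rg --name my-spec \
#     --version 1.0 --query id --output tsv)
#   provision my-rg "$spec_id" params.staging.json
#   image=$(image_ref myregistry rag-api-core 1.0.0)
#   build_and_push myregistry "$image"
#   deploy_image my-rg my-app "$image"
```

Wrapping each step in a function keeps the order explicit and lets a pipeline call the same steps with different parameters per environment.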

Azure-Specific Notes

ARM/Bicep Templates: Infrastructure is provisioned using an ARM template generated from a Bicep template. This automates creation of all required services and permissions.
Manual Steps: After ARM deployment, verify permissions, deploy models, and update config YAMLs as needed.
Managed Identity: Used for secure access to Azure resources (Key Vault, Storage, AI Search).
App Settings: Set environment variables via Azure Portal or ARM template parameters.
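Scaling: Scale App Service with its built-in options: scale up by changing the App Service plan tier, or scale out by adding instances, either manually or with autoscale rules.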
Monitoring: Integrate with Azure Monitor and Application Insights.
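
Several of the notes above (verifying permissions, app settings, scaling) map onto Azure CLI calls. The following is a hedged sketch rather than the project's own tooling: it assumes the App Service uses a system-assigned managed identity, and the function names, role names, and setting names in the usage comments are hypothetical examples.

```sh
#!/bin/sh
# Sketch: post-deployment checks and settings via the Azure CLI.
# Assumes a system-assigned managed identity; all names are placeholders.

# Print the principal ID of the app's managed identity.
# $1 = resource group, $2 = app name
identity_principal() {
  az webapp identity show \
    --resource-group "$1" --name "$2" \
    --query principalId --output tsv
}

# List the role names assigned to a principal, one per line.
# $1 = principal ID
list_roles() {
  az role assignment list --assignee "$1" --all \
    --query "[].roleDefinitionName" --output tsv
}

# Succeed only if the principal holds the exact role name.
# $1 = principal ID, $2 = role name
has_role() {
  list_roles "$1" | grep -Fxq "$2"
}

# Set environment variables (app settings) on the App Service.
# $1 = resource group, $2 = app name, remaining args = KEY=VALUE pairs
set_app_settings() {
  rg=$1
  app=$2
  shift 2
  az webapp config appsettings set \
    --resource-group "$rg" --name "$app" --settings "$@"
}

# Scale out by setting the instance count on the App Service plan.
# $1 = resource group, $2 = plan name, $3 = instance count
scale_out() {
  az appservice plan update \
    --resource-group "$1" --name "$2" --number-of-workers "$3"
}

# Example usage (role and setting names are illustrative only):
#   pid=$(identity_principal my-rg my-app)
#   has_role "$pid" "Key Vault Secrets User" || echo "missing Key Vault role" >&2
#   set_app_settings my-rg my-app CONFIG_ENV=staging
#   scale_out my-rg my-plan 2
```

Checking role names explicitly after the ARM deployment makes a missing assignment fail fast instead of surfacing later as an authorization error at runtime.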

Example: Azure App Service Deployment

Build and push your Docker image to Azure Container Registry (ACR)
Deploy the ARM template (from Bicep) to provision all required Azure resources
Create or update the App Service to pull from ACR
Set environment variables and assign managed identity
Deploy models and update configuration YAMLs as needed
Monitor health endpoints and logs
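
The final step, monitoring the health endpoints, can be automated with a small polling loop. This is a sketch under assumptions: `wait_healthy` is a hypothetical helper, the retry counts are arbitrary defaults, and it treats only HTTP 200 as healthy, which may not match how the API reports degraded states.

```sh
#!/bin/sh
# Sketch: poll a health endpoint until it returns HTTP 200.
# wait_healthy is a hypothetical helper; tune attempts and delay as needed.

# $1 = URL, $2 = max attempts (default 10), $3 = seconds between attempts (default 5)
wait_healthy() {
  url=$1
  attempts=${2:-10}
  delay=${3:-5}
  status=""
  i=1
  while [ "$i" -le "$attempts" ]; do
    # Capture only the HTTP status code; curl reports 000 when it cannot connect.
    status=$(curl --silent --output /dev/null \
      --write-out '%{http_code}' --max-time 10 "$url" || true)
    if [ "$status" = "200" ]; then
      echo "healthy after $i attempt(s)"
      return 0
    fi
    # Wait before the next attempt, but not after the last one.
    if [ "$i" -lt "$attempts" ]; then
      sleep "$delay"
    fi
    i=$((i + 1))
  done
  echo "not healthy after $attempts attempts (last status: $status)" >&2
  return 1
}

# Example usage (the host name is a placeholder):
#   wait_healthy "https://<app-name>.azurewebsites.net/api/health"
#   wait_healthy "https://<app-name>.azurewebsites.net/api/v2/health/check" 20 3
```

Retrying gives a container time to start after a new image is pulled, and the non-zero exit status lets a pipeline fail the deployment stage if the service never comes up.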

Value

Consistent, repeatable deployments across environments
Secure integration with Azure services
Health checks and monitoring for production readiness
Scalable and maintainable infrastructure
