Skip to content

Latest commit

 

History

History
31 lines (20 loc) · 1.56 KB

low-cost.md

File metadata and controls

31 lines (20 loc) · 1.56 KB

Reduce deployment costs

This AI RAG chat application is designed to be easily deployed using the Azure Developer CLI, which provisions the infrastructure according to the Bicep files in the infra folder. Those files describe each of the Azure resources needed, and configures their SKU (pricing tier) and other parameters. Many Azure services offer a free tier, but the infrastructure files in this project do not default to the free tier as there are often limitations in that tier.

However, if your goal is to minimize costs while prototyping your application, follow the steps below before running azd up. Once you've gone through these steps, return to the deployment steps.

  1. Log in to your Azure account using the Azure Developer CLI:

    azd auth login
  2. Create a new azd environment for the free resource group:

    azd env new

    Enter a name that will be used for the resource group. This will create a new folder in the .azure folder, and set it as the active environment for any calls to azd going forward.

  3. Use the free tier of Azure AI Search:

    azd env set AZURE_SEARCH_SERVICE_SKU free

    Limitations:

    1. You are only allowed one free search service across all regions.
    2. The free tier does not support semantic ranker. Note that will generally result in decreased search relevance.