
Intra-vendor Model Routing Support AI APIs #3353

Open · 7 of 12 tasks

AnuGayan opened this issue Nov 8, 2024 · 2 comments

AnuGayan commented Nov 8, 2024

Problem

AI API designers need to be able to route incoming requests across models within a single vendor’s ecosystem.

With the API Manager 4.4.0 release, AI API support was introduced, allowing API consumers to specify the model they wish to consume from the LLM vendor. This leaves us with the following concerns:

  • API designers have no control over which models are used, how frequently they are accessed, or the costs incurred.
  • Risks such as model exhaustion and model misuse could lead to the API being throttled out.
  • API designers cannot enforce, at design time, a model routing strategy that accounts for the use cases of the AI applications consuming the API.

Proposed Solution

Support intra-vendor model routing for AI APIs.

The proposed solution is to enforce the model routing strategy as a policy in the API request flow. Policies can be categorised as follows (a minimal sketch of the policy shape is given after the list):

  • Static routing techniques (rely only on the incoming request)
  • Dynamic routing techniques (based on metrics from previous invocations)
  • Failover
  • Custom routing strategy enforcement
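
To make the policy-based approach concrete, the following is a minimal sketch of what a routing policy could look like. It is purely illustrative: the interface name, method signature, and parameters are hypothetical and do not reflect the actual carbon-apimgt implementation.

```java
import java.util.List;

/**
 * Hypothetical sketch of a model routing policy applied in the API request
 * flow. All names here are illustrative; the real carbon-apimgt interfaces
 * may differ.
 */
public interface ModelRoutingPolicy {

    /**
     * Chooses the target model for an incoming request.
     *
     * A static policy would look only at the incoming request (e.g. the model
     * the consumer asked for), while a dynamic policy would also consult
     * metrics from previous invocations (latency, error rate, cost).
     *
     * @param requestedModel the model named by the API consumer, or null if none
     * @param allowedModels  the models the API designer has configured for this API
     * @return the model the gateway should route the request to
     */
    String selectModel(String requestedModel, List<String> allowedModels);
}
```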

Targeting the APIM 4.5.0 release, we are shipping the following policies (a sketch of weighted round-robin selection follows the list):

  • Model Round-robin Policy
  • Model Weighted Round-robin Policy
  • Model Failover Policy
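
As a rough illustration of how a Model Weighted Round-robin Policy could distribute traffic, the sketch below cycles through the configured models in proportion to their weights. The class, model names, and weights are made up for the example and are not the shipped implementation; a failover policy would instead retry the request against the next candidate model when the current one fails.

```java
import java.util.List;
import java.util.concurrent.atomic.AtomicLong;

/**
 * Hypothetical sketch of weighted round-robin selection across models of a
 * single AI vendor. Illustrative only; not the actual carbon-apimgt code.
 */
public class WeightedRoundRobinSelector {

    /** A candidate model and its relative weight (share of traffic). */
    public record ModelEntry(String model, int weight) { }

    private final List<ModelEntry> entries;
    private final int totalWeight;
    private final AtomicLong counter = new AtomicLong();

    public WeightedRoundRobinSelector(List<ModelEntry> entries) {
        this.entries = List.copyOf(entries);
        this.totalWeight = entries.stream().mapToInt(ModelEntry::weight).sum();
    }

    /** Picks the next model; weights control how often each model is chosen. */
    public String next() {
        long slot = counter.getAndIncrement() % totalWeight;
        for (ModelEntry entry : entries) {
            slot -= entry.weight();
            if (slot < 0) {
                return entry.model();
            }
        }
        throw new IllegalStateException("weights must be positive");
    }

    public static void main(String[] args) {
        // Illustrative model names and weights only.
        WeightedRoundRobinSelector selector = new WeightedRoundRobinSelector(List.of(
                new ModelEntry("gpt-4o", 1),
                new ModelEntry("gpt-4o-mini", 3)));
        for (int i = 0; i < 8; i++) {
            // gpt-4o-mini receives 3 of every 4 requests.
            System.out.println(selector.next());
        }
    }
}
```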

Task Breakdown

  • Feature design and UI wireframes
  • Admin REST API changes for AI Vendor model list maintenance
  • Publisher REST API changes for multi-endpoint support
  • Revamp the existing flow to handle AI APIs via a separate velocity template
  • Onboard round-robin policy
  • Onboard failover policy
  • Admin UI changes to handle model list under the AI vendor
  • Handle APICTL flows: import/export APIs with multiple endpoints
  • Write unit tests
  • Write integration tests
  • Write CTL tests
  • Write feature documentation

Version

4.5.0

ashera96 changed the title from "Multiple Backend Support for APIs" to "Intra-vendor Model Routing Support AI APIs" on Feb 3, 2025
ashera96 self-assigned this on Feb 3, 2025
ashera96 commented:

Progress Update:

The task breakdown in the issue description has been updated to reflect the completed tasks.

The following PRs were merged targeting the alpha release:

  1. Carbon-apimgt changes: Intra-vendor model routing support for AI APIs carbon-apimgt#12871
  2. Apim-apps changes: Intra-vendor model routing support for AI APIs apim-apps#883
  3. Product-apim changes: Intra-vendor model routing support for AI APIs product-apim#13661

ashera96 commented:

Progress Update:

The following PRs were merged targeting the beta release:

  1. Carbon-apimgt changes: Intra-vendor model routing feature enhancements carbon-apimgt#12987
  2. Apim-apps changes: Intra-vendor model routing feature enhancements apim-apps#929
  3. Product-apim changes: Intra-vendor model routing feature enhancements product-apim#13695
