[Feature]: Expose Lora lineage information from /v1/models #6274

Closed
Jeffwan opened this issue Jul 10, 2024 · 1 comment
Labels
feature request New feature or request

Comments


Jeffwan commented Jul 10, 2024

🚀 The feature, motivation and pitch

python -m vllm.entrypoints.openai.api_server \
    --model /workspace/meta-llama/Llama-2-7b-hf \
    --enable-lora \
    --lora-modules sql-lora=~/.cache/huggingface/hub/models--yard1--llama-2-7b-sql-lora-test/

The /v1/models response from the above setup cannot expose the lineage between LoRA adapters and the base model. In the example below, root always points to the base model.
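For reference, the response shown below can be fetched directly from the server, assuming it is running on the default localhost:8000:

curl http://localhost:8000/v1/models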

Current Status

  1. The base model name comes from either --model or --served-model-name. If the user passes a local path, then id and root are that path rather than a model id as in OpenAI's API.

  2. The LoRA model card information comes from LoRARequest, which does not carry a base_model field at the moment. Technically, we can assume they are all adapters of the single base model, but this assumption may break once the engine supports multiple models.

{
  "object": "list",
  "data": [
    {
      "id": "/workspace/meta-llama/Llama-2-7b-hf",
      "object": "model",
      "created": 1715644056,
      "owned_by": "vllm",
      "root": "/workspace/meta-llama/Llama-2-7b-hf",
      "parent": null,
      "permission": [
        {
          .....
        }
      ]
    },
    {
      "id": "sql-lora",
      "object": "model",
      "created": 1715644056,
      "owned_by": "vllm",
      "root": "/workspace/meta-llama/Llama-2-7b-hf",
      "parent": null,
      "permission": [
        {
          ....
        }
      ]
    }
  ]
}
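For context, here is a minimal sketch of how the list appears to be assembled today. ModelCard, show_available_models, and the shape of lora_requests are illustrative names for this issue, not vLLM's exact internals:

import time
from dataclasses import dataclass, field
from typing import Optional

@dataclass
class ModelCard:
    id: str
    root: str
    parent: Optional[str] = None
    object: str = "model"
    owned_by: str = "vllm"
    created: int = field(default_factory=lambda: int(time.time()))

def show_available_models(served_model: str, lora_requests) -> list[ModelCard]:
    cards = [ModelCard(id=served_model, root=served_model)]
    for lora in lora_requests:
        # root points at the base model and parent stays None,
        # so both the adapter's lineage and its own path are lost
        cards.append(ModelCard(id=lora.lora_name, root=served_model))
    return cards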

Expected

We can use root to represent the model path and parent to indicate the base model for LoRA adapters. Since these fields do not seem to be part of the OpenAI protocol, we should be able to make this change:

{
  "object": "list",
  "data": [
    {
      "id": "meta-llama/Llama-2-7b-hf",
      "object": "model",
      "created": 1715644056,
      "owned_by": "vllm",
      "root": "~/.cache/huggingface/hub/models--meta-llama--Llama-2-7b-hf/snapshots/01c7f73d771dfac7d292323805ebc428287df4f9/",
      "parent": null,
      "permission": [
        {
          .....
        }
      ]
    },
    {
      "id": "sql-lora",
      "object": "model",
      "created": 1715644056,
      "owned_by": "vllm",
      "root": "~/.cache/huggingface/hub/models--yard1--llama-2-7b-sql-lora-test/snapshots/0dfa347e8877a4d4ed19ee56c140fa518470028c/",
      "parent": meta-llama/Llama-2-7b-hf,
      "permission": [
        {
          ....
        }
      ]
    }
  ]
}
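A sketch of the proposed population logic, reusing the illustrative ModelCard above; lora_path stands in for whichever LoRARequest field carries the adapter's local path:

def show_available_models(served_model_id: str, model_path: str,
                          lora_requests) -> list[ModelCard]:
    cards = [ModelCard(id=served_model_id, root=model_path)]
    for lora in lora_requests:
        cards.append(ModelCard(
            id=lora.lora_name,
            root=lora.lora_path,      # the adapter's own path
            parent=served_model_id,   # lineage back to the base model
        ))
    return cards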

I am drafting a PR to address this issue; please help review whether the above looks good.

Alternatives

No response

Additional context

No response


Jeffwan commented Sep 27, 2024

As #6315 has been merged, we can close this issue.

Jeffwan closed this as completed Sep 27, 2024