
[feat]: build llama-cpp-python instead of using the whl #182


Open · kyteinsky opened this issue May 15, 2025 · 2 comments
Labels: enhancement (New feature or request)

Comments

@kyteinsky (Contributor)

Describe the feature you'd like to request

Instead of installing the whl file from https://github.com/abetlen/llama-cpp-python/releases, build the package inside the Docker container. Recent versions of llama-cpp-python have not shipped pre-built wheels, so building from source is the only way to get them officially.

This will also solve #126 and #178 (comment) (using app_api's env declaration for CMAKE_ARGS).
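A minimal sketch of what this could look like in the Dockerfile, assuming a Debian-based Python image; the version pin and the default CMAKE_ARGS value are placeholders, and `--no-binary` is only there to make sure pip compiles the sdist rather than picking up a wheel:

```dockerfile
# Hypothetical sketch: build llama-cpp-python from source instead of
# fetching the prebuilt whl. Version pin and flags are examples only.
FROM python:3.11-slim

# Toolchain needed to compile the bundled llama.cpp
RUN apt-get update && apt-get install -y --no-install-recommends \
        build-essential cmake git \
    && rm -rf /var/lib/apt/lists/*

# CMAKE_ARGS could later be fed from app_api's env declaration
ARG CMAKE_ARGS="-DGGML_NATIVE=OFF"
ENV CMAKE_ARGS=${CMAKE_ARGS}

# --no-binary forces pip to build from the source distribution
RUN pip install --no-cache-dir \
        --no-binary llama-cpp-python \
        llama-cpp-python==0.3.8
```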


kyteinsky added the enhancement label on May 15, 2025
@marcelklehr (Member)

I think we should limit the configurations we support. Not because I want to exclude people, but because more options give us more surface area for failure, which we then have to debug and solve. We can try to support non-AVX CPUs as well as ARM, but I wouldn't go so far as to invite arbitrary CMAKE_ARGS values.
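One way to keep that surface small would be an allow-list of named build variants baked into the Dockerfile, rather than passing CMAKE_ARGS through verbatim. A rough sketch (variant names and flags are made up for illustration):

```dockerfile
# Hypothetical sketch: map a fixed set of supported variants to known-good
# CMAKE_ARGS instead of accepting arbitrary values. Names/flags illustrative.
ARG BUILD_VARIANT=default
RUN case "${BUILD_VARIANT}" in \
        default) CMAKE_ARGS="-DGGML_NATIVE=OFF" ;; \
        noavx)   CMAKE_ARGS="-DGGML_AVX=OFF -DGGML_AVX2=OFF -DGGML_FMA=OFF" ;; \
        arm64)   CMAKE_ARGS="" ;; \
        *)       echo "unsupported BUILD_VARIANT: ${BUILD_VARIANT}" >&2; exit 1 ;; \
    esac \
    && CMAKE_ARGS="${CMAKE_ARGS}" pip install --no-cache-dir \
        --no-binary llama-cpp-python llama-cpp-python
```

Anything outside the list fails the build instead of silently producing an unsupported binary.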

@kyteinsky (Contributor, Author)

Yeah, fair enough. Building llama-cpp-python from source alone should be enough to address these.
