
Workflow update - PART 1 #1416


Merged — 87 commits merged into abetlen:main on Jun 13, 2024

Conversation

@Smartappli (Contributor) commented Apr 30, 2024

• CUDA compiled with AVX
• Remove Python 3.8
• Remove deprecated macos-11
• Add Python 3.9 where missing
• Upgrade macos-13 to macos-latest in tests
• Upgrade ubuntu-20.04 to ubuntu-latest
• Upgrade windows-2019 to windows-latest
• Refactor the Metal build
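For illustration, a hypothetical sketch (not this PR's actual diff) of the kind of runner/Python matrix these changes point toward; the job name and option values are placeholders:

    # Hypothetical matrix; runner labels and Python versions reflect the list above.
    jobs:
      build-wheels:
        runs-on: ${{ matrix.os }}
        strategy:
          matrix:
            os: [ubuntu-latest, windows-latest, macos-latest]   # upgraded runner labels
            python-version: ["3.9", "3.10", "3.11", "3.12"]     # 3.8 dropped, 3.9 added
        steps:
          - uses: actions/checkout@v4
          - uses: actions/setup-python@v5
            with:
              python-version: ${{ matrix.python-version }}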

@Smartappli changed the title from "[WIP] Cuda with AVX" to "Cuda with AVX" on Apr 30, 2024
@abetlen (Owner) commented Apr 30, 2024

#1342 (comment)

I'll paste my comment here, and maybe we can open a new discussion. Basically, I'm concerned about the size of releases ballooning with the number of prebuilt wheel variants. I had some suggestions for long-term solutions there, but I'm not sure what the right approach is.

Anecdotally, @oobabooga claims to have run into issues with GitHub throttling his prebuilt wheel repo because of this.

@oobabooga (Contributor)

If you generate too many wheels, there is a 100% chance you will reach a storage quota, and GitHub will ask you to start paying for storage or else your wheels will fail to upload. It's not too expensive (a few $ a month at most), but it's worth keeping in mind.

@Smartappli (Contributor Author)

I avoided the API rate-limit problems by adding a timer step in my YAML:

- name: ⌛ rate 1
  shell: pwsh
  run: |
    # add a random sleep since we run on a fixed schedule
    sleep (Get-Random -Maximum 1200)

    # get the currently authenticated user's rate limit info
    $rate = gh api rate_limit | ConvertFrom-Json | Select-Object -ExpandProperty rate

    # if we don't have at least 400 requests left, wait until the limit resets
    if ($rate.remaining -lt 400) {
        $wait = ($rate.reset - (Get-Date (Get-Date).ToUniversalTime() -UFormat %s))
        echo "Rate limit remaining is $($rate.remaining), waiting for $($wait) seconds to reset"
        sleep $wait
        $rate = gh api rate_limit | ConvertFrom-Json | Select-Object -ExpandProperty rate
        echo "Rate limit has reset to $($rate.remaining) requests"
    }

@Smartappli (Contributor Author)

> #1342 (comment)
>
> I'll paste my comment here, and maybe we can open a new discussion. Basically, I'm concerned about the size of releases ballooning with the number of prebuilt wheel variants. I had some suggestions for long-term solutions there, but I'm not sure what the right approach is.
>
> Anecdotally, @oobabooga claims to have run into issues with GitHub throttling his prebuilt wheel repo because of this.

https://github.com/Smartappli/serge-wheels/actions

@Smartappli (Contributor Author)

Not enabling AVX penalizes llama-cpp-python performance on both CPU and CUDA.
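For context, AVX support in llama-cpp-python wheels is toggled through CMake options passed via CMAKE_ARGS at build time. A minimal sketch of such a build step; the exact option names vary with the vendored llama.cpp version (LLAMA_AVX/LLAMA_AVX2 in older trees, GGML_AVX/GGML_AVX2 in newer ones):

    # Hypothetical build step; option names depend on the llama.cpp version in use.
    - name: Build CPU wheel with AVX/AVX2
      shell: bash
      run: |
        CMAKE_ARGS="-DLLAMA_AVX=on -DLLAMA_AVX2=on" \
          python -m pip wheel . --wheel-dir dist --no-deps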

@gaby commented May 1, 2024

Maybe the list can be shrunk down a bit. For example:

  • Not many people have AVX512; remove it until there's enough demand.
  • Make AVX support the minimum?
  • Remove Python 3.8; it's EOL in a few months.
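A hypothetical sketch of how such trimming could look, assuming the workflow had an `avx` matrix axis (the axis name and its values are made up for illustration):

    # Illustrative only: drop AVX512 variants from a hypothetical "avx" matrix axis.
    strategy:
      matrix:
        avx: ["basic", "avx", "avx2", "avx512"]
        exclude:
          - avx: "avx512"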

@gaby commented May 1, 2024

@Smartappli Your changes are adding AVX for the CUDA wheels, is that needed? At that point the user is using the GPU.

It makes sense for the basic wheels to have AVX and AVX2 variants, not so much for the CUDA ones.

@Smartappli (Contributor Author) commented May 1, 2024

Copy that, thanks @gaby.

In summary: AVX and AVX2 on CPU is enough.

@Smartappli changed the title from "Cuda with AVX" to "CPU with AVX and AVX2" on May 1, 2024
@Smartappli changed the title from "CPU with AVX and AVX2" to "[WIP] CPU with AVX and AVX2" on May 1, 2024
@Smartappli (Contributor Author)

ping @gaby

@Smartappli (Contributor Author)

@abetlen can you review, please?

@abetlen (Owner) commented Jun 4, 2024

Hey @Smartappli, thanks for your patience and the PR. It's been a busy month, so I'm just catching up on open PRs right now. Do you mind splitting this one up into two, with one that includes the following:

• CUDA compiled with AVX
• Remove Python 3.8
• Remove deprecated macos-11
• Add Python 3.9 where missing
• Upgrade macos-13 to macos-latest in tests
• Upgrade ubuntu-20.04 to ubuntu-latest
• Upgrade windows-2019 to windows-latest
• Refactor the Metal build

and another just for the CPU wheels changes?

@Smartappli changed the title from "Workflow update" to "Workflow update - PART 1" on Jun 6, 2024
@Smartappli changed the title from "Workflow update - PART 1" to "[WIP] Workflow update - PART 1" on Jun 6, 2024
@Smartappli changed the title from "[WIP] Workflow update - PART 1" to "Workflow update - PART 1" on Jun 6, 2024
@Smartappli (Contributor Author) commented Jun 6, 2024

> Hey @Smartappli, thanks for your patience and the PR. It's been a busy month, so I'm just catching up on open PRs right now. Do you mind splitting this one up into two, with one that includes the following:
>
> • CUDA compiled with AVX
> • Remove Python 3.8
> • Remove deprecated macos-11
> • Add Python 3.9 where missing
> • Upgrade macos-13 to macos-latest in tests
> • Upgrade ubuntu-20.04 to ubuntu-latest
> • Upgrade windows-2019 to windows-latest
> • Refactor the Metal build
>
> and another just for the CPU wheels changes?

@abetlen Done: #1515

@abetlen merged commit 9e396b3 into abetlen:main on Jun 13, 2024
13 checks passed
@oobabooga (Contributor)

Has anyone managed to fix the CUDA workflows? Mine keep failing with this error:

C:\Miniconda3\envs\build\include\crt/host_config.h(153): fatal error C1189: #error: -- unsupported Microsoft Visual Studio version! Only the versions between 2017 and 2022 (inclusive) are supported! The nvcc flag '-allow-unsupported-compiler' can be used to override this version check; however, using an unsupported host compiler may cause compilation failure or incorrect run time execution. Use at your own risk. [C:\Users\runneradmin\AppData\Local\Temp\tmpwbsbwtdg\build\CMakeFiles\CMakeScratch\TryCompile-uh6ciq\cmTC_cbbed.vcxproj]

See: https://github.com/oobabooga/llama-cpp-python-cuBLAS-wheels/actions/runs/9457447475/job/26051277254.

I see that @abetlen's workflow also fails with the same error: https://github.com/abetlen/llama-cpp-python/actions/runs/9457182450/job/26051175939
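A workaround commonly mentioned for this MSVC/nvcc version check (not something this thread confirms as the eventual fix) is to forward nvcc's -allow-unsupported-compiler flag through CMake. A rough sketch; the CUDA option name varies by llama.cpp version (LLAMA_CUBLAS, LLAMA_CUDA, or GGML_CUDA):

    # Hypothetical step: forwards -allow-unsupported-compiler to nvcc so the newer
    # MSVC toolset on windows-latest passes CUDA's host-compiler version check.
    - name: Build CUDA wheel
      shell: bash
      run: |
        CMAKE_ARGS="-DLLAMA_CUDA=on -DCMAKE_CUDA_FLAGS=-allow-unsupported-compiler" \
          python -m pip wheel . --wheel-dir dist --no-deps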
