Skip to content

Actions: b4rtaz/distributed-llama

Actions

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
571 workflow runs
571 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

fix: loader.
main #535: Commit 881cca0 pushed by b4rtaz
February 16, 2025 23:57 1m 30s main
February 16, 2025 23:57 1m 30s
feat: quantizeF32toQ80 avx2.
main #534: Pull request #171 synchronize by b4rtaz
February 16, 2025 23:44 1m 23s feat/f32-to-q80-avx2
February 16, 2025 23:44 1m 23s
feat: quantizeF32toQ80 avx2.
main #533: Pull request #171 synchronize by b4rtaz
February 16, 2025 23:38 51s feat/f32-to-q80-avx2
February 16, 2025 23:38 51s
feat: quantizeF32toQ80 avx2.
main #532: Pull request #171 opened by b4rtaz
February 16, 2025 23:29 1m 35s feat/f32-to-q80-avx2
February 16, 2025 23:29 1m 35s
fix: model list json.
main #531: Commit c2792a1 pushed by b4rtaz
February 16, 2025 22:55 1m 45s main
February 16, 2025 22:55 1m 45s
feat: add model name to api output. (#170)
main #530: Commit a4bd7aa pushed by b4rtaz
February 16, 2025 22:41 1m 43s main
February 16, 2025 22:41 1m 43s
dllama-api: /v1/models: return basename of the model
main #529: Pull request #170 opened by lemmi
February 16, 2025 19:02 1m 35s lemmi:api-modelName
February 16, 2025 19:02 1m 35s
fix: missing init quants in api. (#168)
main #528: Commit 6e147d6 pushed by b4rtaz
February 16, 2025 10:20 1m 41s main
February 16, 2025 10:20 1m 41s
fix: missing init quants in api.
main #527: Pull request #168 opened by b4rtaz
February 16, 2025 10:19 1m 45s fix/missing-init-quants
February 16, 2025 10:19 1m 45s
fix: api not found support.
main #526: Commit 689e6df pushed by b4rtaz
February 16, 2025 08:52 1m 15s main
February 16, 2025 08:52 1m 15s
fix: fixed inference getting stuck (#166)
main #525: Commit a4964a0 pushed by b4rtaz
February 16, 2025 01:07 1m 31s main
February 16, 2025 01:07 1m 31s
fix: fixed inference getting stuck
main #524: Pull request #166 opened by b4rtaz
February 16, 2025 01:05 1m 12s fix/atomic-uint
February 16, 2025 01:05 1m 12s
feat: use softmax_F32 for sampler. (#163)
main #523: Commit 1e73dcb pushed by b4rtaz
February 15, 2025 15:27 1m 41s main
February 15, 2025 15:27 1m 41s
feat: use softmax_F32 for sampler.
main #522: Pull request #163 opened by b4rtaz
February 15, 2025 14:39 1m 56s feat/softmax-f32
February 15, 2025 14:39 1m 56s
readme.md.
main #521: Commit 4cda910 pushed by b4rtaz
February 15, 2025 10:50 1m 37s main
February 15, 2025 10:50 1m 37s
feat: support r1 distill llama. (#161)
main #520: Commit aec85b9 pushed by b4rtaz
February 15, 2025 10:35 1m 23s main
February 15, 2025 10:35 1m 23s
feat: support r1 distill llama.
main #519: Pull request #161 synchronize by b4rtaz
February 15, 2025 10:35 52s feat/r1-distill-llama
February 15, 2025 10:35 52s
feat: support r1 distill llama.
main #518: Pull request #161 opened by b4rtaz
February 14, 2025 23:48 49s feat/r1-distill-llama
February 14, 2025 23:48 49s
readme.
main #517: Commit caea6eb pushed by b4rtaz
February 14, 2025 15:29 1m 42s main
February 14, 2025 15:29 1m 42s
fix: tokenizer utf8 support (#160)
main #516: Commit 63465c5 pushed by b4rtaz
February 14, 2025 15:20 1m 47s main
February 14, 2025 15:20 1m 47s
fix: tokenizer utf8 support
main #515: Pull request #160 synchronize by b4rtaz
February 14, 2025 15:02 1m 50s feat/tokenizer-fixes
February 14, 2025 15:02 1m 50s
fix: tokenizer utf8 support
main #514: Pull request #160 opened by b4rtaz
February 14, 2025 15:01 1m 31s feat/tokenizer-fixes
February 14, 2025 15:01 1m 31s
fix: hide mlock warning
main #513: Commit 5ce3c19 pushed by b4rtaz
February 13, 2025 20:33 1m 39s main
February 13, 2025 20:33 1m 39s
fix: position calculation
main #512: Commit b5283f7 pushed by b4rtaz
February 13, 2025 09:12 1m 34s main
February 13, 2025 09:12 1m 34s
feat: fundamental codebase refactor (#156)
main #511: Commit 121bc8c pushed by b4rtaz
February 12, 2025 22:59 1m 27s main
February 12, 2025 22:59 1m 27s