0.19.4
Services
Rate limits
You can now configure rate limits for your services running behind a gateway.
type: service
image: my-app:latest
port: 80

rate_limits:
# For /api/auth/* - 1 request per second, no bursts
- prefix: /api/auth/
  rps: 1
# For other URLs - 4 requests per second + bursts of up to 9 requests
- rps: 4
  burst: 9
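One way to reason about how `rps` and `burst` interact is a token-bucket model. The sketch below is only an illustration of the configuration above, not the gateway's actual implementation. The names `RateLimit`, `TokenBucket`, and `pick_rule` are hypothetical, and three behaviors are assumptions: a rule without `prefix` matches every path, the longest matching prefix wins, and the bucket holds `burst + 1` tokens.

```python
from dataclasses import dataclass


@dataclass
class RateLimit:
    rps: float
    burst: int = 0
    prefix: str = "/"  # assumption: a rule without a prefix matches every path


class TokenBucket:
    """Illustrative bucket: refills at `rps` tokens/second, holds at most `burst + 1`."""

    def __init__(self, limit: RateLimit, now: float = 0.0):
        self.rate = limit.rps
        self.capacity = limit.burst + 1  # assumption: one regular slot plus the burst
        self.tokens = float(self.capacity)
        self.last = now

    def allow(self, now: float) -> bool:
        # Refill for the elapsed time, capped at the bucket capacity.
        self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= 1:
            self.tokens -= 1
            return True
        return False


def pick_rule(path: str, rules: list[RateLimit]) -> RateLimit:
    # Assumption: among the rules whose prefix matches, the longest prefix wins.
    matches = [r for r in rules if path.startswith(r.prefix)]
    return max(matches, key=lambda r: len(r.prefix))


# Mirrors the rate_limits section of the configuration above.
RULES = [RateLimit(prefix="/api/auth/", rps=1), RateLimit(rps=4, burst=9)]
```

Under these assumptions, `pick_rule("/api/auth/login", RULES)` selects the 1 rps rule, so a second request arriving within the same second is rejected, while any other path falls through to the 4 rps rule and can absorb a short spike before being throttled.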
Examples
TensorRT-LLM
We added a new example on TensorRT-LLM that shows how to deploy both DeepSeek R1 and its distilled version using TensorRT-LLM and dstack.
Llama 4
The Llama example was updated to demonstrate the deployment of Llama 4 Scout using dstack.
Contributing
We continue to make contributing to dstack easier and to improve the dev experience. Since the last release, we moved from pip to uv in CI and dev pipelines. Dependency installation time went from ~70 seconds to less than 10 seconds. The Development guide was updated to show how to set up the dstack development environment with uv. The CI Build pipeline triggered on pull requests was optimized from 9 minutes to 4 minutes.
We also documented uv as one of the recommended installation options for dstack.
What's changed
- [Landing] Refactoring (WIP) by @peterschmidt85 in #2495
- Fix CloudWatchLogStorage with sparse logs by @un-def in #2501
- Migrate to uv by @colinjc in #2455
- Fix docs build with uv by @r4victor in #2504
- [Example] Update Llama 4 Examples by @Bihan in #2508
- Move to uv in dstack-server Docker image by @r4victor in #2509
- Fix dstack dependency for gateway by @r4victor in #2511
- [Docs] Add uv to Installation; Minor improvements by @peterschmidt85 in #2510
- Validate usernames by @r4victor in #2514
- Run pytest in parallel with pytest-xdist by @r4victor in #2515
- Add Llama4 AMD example by @Bihan in #2513
- Use exponentially increasing retry delays for pending runs by @r4victor in #2519
- Speed up frontend CI by @r4victor in #2520
- Service rate limits by @jvstme in #2517
- Set no-guess-dev for dev package versions by @r4victor in #2521
- Detect dstack version from file instead of git by @r4victor in #2524
- Add TensorRT-LLM Example by @Bihan in #2444
- Fix Nginx upstream name conflicts by @jvstme in #2526
- Fix detaching from dstack attach by @jvstme in #2528
New contributors
Full changelog: 0.19.3...0.19.4