Release 0.19.4 · dstackai/dstack

Services

Rate limits

You can now configure rate limits for your services running behind a gateway.

type: service
image: my-app:latest
port: 80

rate_limits:
# For /api/auth/* - 1 request per second, no bursts
- prefix: /api/auth/
  rps: 1
# For other URLs - 4 requests per second + bursts of up to 9 requests
- rps: 4
  burst: 9
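
To make the `rps` and `burst` semantics above more concrete, here is a minimal Python sketch of a token-bucket limiter. It is an illustration only, not the gateway's actual implementation. It assumes that `burst: N` allows N extra requests on top of the steady rate, and that the rule with the longest matching prefix applies. The names `TokenBucket`, `LIMITS`, and `pick_limit` are hypothetical and do not come from dstack.

```python
class TokenBucket:
    """Hypothetical token bucket: refills at `rps` tokens per second,
    holds at most 1 + burst tokens (assumed meaning of `burst`)."""

    def __init__(self, rps, burst=0):
        self.rps = rps
        self.capacity = 1 + burst
        self.tokens = float(self.capacity)  # start with a full bucket
        self.last = 0.0                     # timestamp of the previous request

    def allow(self, now):
        # Refill according to the time elapsed since the previous request.
        elapsed = now - self.last
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rps)
        self.last = now
        # Spend one token per accepted request; reject when none is left.
        if self.tokens >= 1:
            self.tokens -= 1
            return True
        return False


# The two rules from the configuration above; the rule without a prefix
# is modelled here as prefix "/" (an assumption of this sketch).
LIMITS = [
    {"prefix": "/api/auth/", "rps": 1, "burst": 0},
    {"prefix": "/", "rps": 4, "burst": 9},
]


def pick_limit(path, limits=LIMITS):
    """Return the rule with the longest matching prefix, or None."""
    matching = [rule for rule in limits if path.startswith(rule["prefix"])]
    if not matching:
        return None
    return max(matching, key=lambda rule: len(rule["prefix"]))


# Usage: ten requests arriving at the same instant on a non-auth URL.
rule = pick_limit("/api/items")
bucket = TokenBucket(rule["rps"], rule["burst"])
accepted = sum(bucket.allow(now=0.0) for _ in range(10))
```

Under these assumptions, all ten simultaneous requests to `/api/items` are accepted and an eleventh at the same instant would be rejected, while `/api/auth/` accepts only one request per second.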

Examples

TensorRT-LLM

We added a new example on TensorRT-LLM that shows how to deploy both DeepSeek R1 and its distilled version
using TensorRT-LLM and dstack.

Llama 4

The Llama example was updated to demonstrate the deployment of Llama 4 Scout using dstack.

Contributing

We continue to make contributing to dstack easier and to improve the developer experience. Since the last release, we moved from pip to uv in CI and dev pipelines, which cut dependency installation time from ~70 seconds to less than 10 seconds. The Development guide was updated to show how to set up a dstack development environment with uv. The CI build pipeline triggered on pull requests was optimized from 9 minutes to 4 minutes.

We also documented uv as one of the recommended installation options for dstack.

What's changed

[Landing] Refactoring (WIP) by @peterschmidt85 in #2495
Fix CloudWatchLogStorage with sparse logs by @un-def in #2501
Migrate to uv by @colinjc in #2455
Fix docs build with uv by @r4victor in #2504
[Example] Update Llama 4 Examples by @Bihan in #2508
Move to uv in dstack-server Docker image by @r4victor in #2509
Fix dstack dependency for gateway by @r4victor in #2511
[Docs] Add uv to Installation; Minor improvements by @peterschmidt85 in #2510
Validate usernames by @r4victor in #2514
Run pytest in parallel with pytest-xdist by @r4victor in #2515
Add Llama4 AMD example by @Bihan in #2513
Use exponentially increasing retry delays for pending runs by @r4victor in #2519
Speed up frontend CI by @r4victor in #2520
Service rate limits by @jvstme in #2517
Set no-guess-dev for dev package versions by @r4victor in #2521
Detect dstack version from file instead of git by @r4victor in #2524
Add TensorRT-LLM Example by @Bihan in #2444
Fix Nginx upstream name conflicts by @jvstme in #2526
Fix detaching from dstack attach by @jvstme in #2528

New contributors

@colinjc made their first contribution in #2455

Full changelog: 0.19.3...0.19.4
