Skip to content

Commit e5ae818

Browse files
committed
updated gpum onboarding documentation when using dd-operator for k8s deployments
1 parent 27aad04 commit e5ae818

File tree

1 file changed

+22
-0
lines changed

1 file changed

+22
-0
lines changed

gpu/README.md

Lines changed: 22 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -174,6 +174,8 @@ helm install -f values.yaml -f values-gpu.yaml datadog-gpu datadog
174174

175175
#### Datadog Operator
176176

177+
_**Minimal required operator version: 1.14**_
178+
177179
To enable the GPU feature in clusters where all the nodes have GPUs, set the `features.gpu.enabled` parameter in the DatadogAgent manifest:
178180

179181
```yaml
@@ -185,6 +187,18 @@ spec:
185187
features:
186188
gpu:
187189
enabled: true
190+
# for operator versions 1.14.x and 1.15.x add this section
191+
override:
192+
nodeAgent:
193+
containers:
194+
agent:
195+
env:
196+
# add this env var, if using operator version 1.14.x
197+
- name: DD_ENABLE_NVML_DETECTION
198+
value: "true"
199+
# add this env var, if using operator versions 1.14.x or 1.15.x
200+
- name: DD_COLLECT_GPU_TAGS
201+
value: "true"
188202
```
189203

190204
For **mixed environments**, use the [DatadogAgentProfiles feature](https://github.com/DataDog/datadog-operator/blob/main/docs/datadog_agent_profiles.md) of the operator, which allows different configurations to be deployed for different nodes. In this case, it is not necessary to modify the DatadogAgent manifest. Instead, create a profile that enables the configuration on GPU nodes only:
@@ -210,6 +224,14 @@ spec:
210224
env:
211225
- name: DD_GPU_MONITORING_ENABLED
212226
value: "true"
227+
# add this env var, if using operator version 1.14.x
228+
agent:
229+
env:
230+
- name: DD_ENABLE_NVML_DETECTION
231+
value: "true"
232+
# add this env var, if using operator versions 1.14.x or 1.15.x
233+
- name: DD_COLLECT_GPU_TAGS
234+
value: "true"
213235
```
214236

215237
<!-- xxz tab xxx -->

0 commit comments

Comments
 (0)