Commit ad8c766

ChatQnA - Adding files to deploy an application in the K8S environment using Helm
Signed-off-by: Chingis Yundunov <c.yundunov@datamonsters.com>
1 parent 8db1dd8 commit ad8c766

2 files changed: 2 additions, 2 deletions

ChatQnA/kubernetes/helm/faqgen-rocm-tgi-values.yaml

Lines changed: 1 addition & 1 deletion
@@ -21,7 +21,7 @@ tgi:
  MAX_TOTAL_TOKENS: "4096"
  PYTORCH_TUNABLEOP_ENABLED: "0"
  USE_FLASH_ATTENTION: "true"
- FLASH_ATTENTION_RECOMPUTE: "true"
+ FLASH_ATTENTION_RECOMPUTE: "false"
  HIP_VISIBLE_DEVICES: "0,1"
  MAX_BATCH_SIZE: "2"
  extraCmdArgs: [ "--num-shard","2" ]

ChatQnA/kubernetes/helm/rocm-tgi-values.yaml

Lines changed: 1 addition & 1 deletion
@@ -15,7 +15,7 @@ tgi:
  MAX_TOTAL_TOKENS: "4096"
  PYTORCH_TUNABLEOP_ENABLED: "0"
  USE_FLASH_ATTENTION: "true"
- FLASH_ATTENTION_RECOMPUTE: "true"
+ FLASH_ATTENTION_RECOMPUTE: "false"
  HIP_VISIBLE_DEVICES: "0,1"
  MAX_BATCH_SIZE: "2"
  extraCmdArgs: [ "--num-shard","2" ]
