Skip to content

Commit ba82b02

Browse files
chg: [user-manual] Improvements.
1 parent 4065193 commit ba82b02

File tree

1 file changed

+7
-2
lines changed

1 file changed

+7
-2
lines changed

content/user-manual/ai/index.md

Lines changed: 7 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -61,15 +61,20 @@ With the various vulnerability feeders of Vulnerability-Lookup (for the CVE Prog
6161
we’ve collected over a million JSON records. This allow us to generate datasets for training and building models.
6262

6363
During our explorations, we realized that we can automatically update a BERT-based text classification model daily using a dataset of approximately 600k rows from Vulnerability-Lookup.
64-
With powerful GPUs, it’s a matter of hours. We are currenly using the RoBERTa base model.
64+
With powerful GPUs, it’s a matter of hours.
65+
66+
Models are generated on our own GPUs and with our [various open source trainers](https://github.com/vulnerability-lookup/VulnTrain).
67+
68+
Similar to the datasets, model updates are performed on a regular basis.
6569

6670

6771
### Text classification
6872

6973
#### vulnerability-severity-classification-roberta-base
7074

71-
This model is a fine-tuned version of ``roberta-base`` on the dataset
75+
This model is a fine-tuned version of ``RoBERTa base`` on the dataset
7276
[CIRCL/vulnerability-scores](https://huggingface.co/datasets/CIRCL/vulnerability-scores).
77+
7378
The time of generation with two GPUs NVIDIA L40S is approximately 6 hours.
7479

7580
Try it with Python:

0 commit comments

Comments
 (0)