Commit 2fea96b

drug_normalizer doc updated. (#1775)
1 parent e1970b2 commit 2fea96b

1 file changed, +16 -7 lines changed

docs/en/licensed_annotator_entries/DrugNormalizer.md

Lines changed: 16 additions & 7 deletions
@@ -7,19 +7,28 @@ model
 {%- endcapture -%}

 {%- capture model_description -%}
-Annotator which normalizes raw text from clinical documents, e.g. scraped web pages or xml documents, from document type columns into Sentence.
-Removes all dirty characters from text following one or more input regex patterns.
-Can apply non wanted character removal which a specific policy.
-Can apply lower case normalization.
+Normalizes drug mentions in clinical text.
+
+Adds spaces between punctuation and words, as well as normalize the
+drug mentions. The `policy` parameter controls what drug information
+should be normalized, check the parameter documentation for details.
+
+For usage examples and discussion, check out
+[Blogpost](https://medium.com/spark-nlp/normalize-drug-names-and-dosage-units-with-spark-nlp-8b7ef606facf)

 Parametres:

 - `lowercase`: (boolean) whether to convert strings to lowercase. Default is False.

 - `policy`: (str) rule to remove patterns from text. Valid policy values are:
-  + **"all"**,
-  + **"abbreviations"**,
-  + **"dosages"**
+  - **`"all"`** – replaces both abbreviations and dosages
+  - **`"abbreviations"`** – replaces all abbreviations with their full forms
+    > e.g., `"oral sol"` → `"oral solution"`
+  - **`"dosages"`** – converts dosages to a standardized format
+    > e.g., `"10 million units"` → `"10000000 unt"`
+
+  **Default:** `"all"`
+


 See [Spark NLP Workshop](https://github.com/JohnSnowLabs/spark-nlp-workshop/blob/master/tutorials/Certification_Trainings/Healthcare/23.Drug_Normalizer.ipynb) for more examples of usage.
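
To make the three `policy` values in the updated description concrete, here is a toy pure-Python sketch. It is not the DrugNormalizer implementation and does not use the Spark NLP API: the function name `normalize_drug_text`, the lookup tables, and the dosage pattern are all hypothetical and cover only the two examples quoted in the doc. The punctuation-spacing step mentioned in the description is left out.

```python
import re

# Hypothetical lookup tables, for illustration only. The real annotator
# relies on its own (much larger) resources, which are not shown here.
ABBREVIATIONS = {"sol": "solution", "tab": "tablet", "inj": "injection"}
MULTIPLIERS = {"thousand": 10**3, "million": 10**6}
UNITS = {"unit": "unt", "units": "unt"}

# Assumed dosage shape: "<integer> <multiplier word> <unit word>".
_DOSAGE = re.compile(r"\b(\d+)\s+(thousand|million)\s+(units?)\b", re.IGNORECASE)
_WORD = re.compile(r"\b[A-Za-z]+\b")


def _expand_abbreviation(match):
    """Replace a known abbreviation with its full form, else keep the word."""
    word = match.group(0)
    return ABBREVIATIONS.get(word.lower(), word)


def _standardize_dosage(match):
    """Turn e.g. '10 million units' into '10000000 unt'."""
    number, multiplier, unit = match.groups()
    value = int(number) * MULTIPLIERS[multiplier.lower()]
    return f"{value} {UNITS[unit.lower()]}"


def normalize_drug_text(text, policy="all", lowercase=False):
    """Toy normalizer mirroring the documented `policy` and `lowercase` options."""
    if policy not in ("all", "abbreviations", "dosages"):
        raise ValueError(f"unknown policy: {policy!r}")
    if policy in ("all", "abbreviations"):
        text = _WORD.sub(_expand_abbreviation, text)
    if policy in ("all", "dosages"):
        text = _DOSAGE.sub(_standardize_dosage, text)
    return text.lower() if lowercase else text


if __name__ == "__main__":
    sample = "oral sol 10 million units"
    for policy in ("abbreviations", "dosages", "all"):
        print(policy, "->", normalize_drug_text(sample, policy=policy))
```

With `policy="abbreviations"` only the abbreviation table is applied, with `policy="dosages"` only the dosage rule, and the default `"all"` applies both, which matches the behavior the updated description attributes to each value.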
