You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The next step would be to pre-compute this coefficient for a range of common languages CharAugmenter(language="en") is better than CharAugmenter(natural_language_swap_correction=1.052).
It could indeed enhance readability, and make the code more easily usable for non-English languages.
Suggested Implementation Steps:
Identify a set of common languages for pre-computation.
Implement a mechanism to store and retrieve pre-computed coefficients.
Update the CharAugmenter module to use pre-computed coefficients when available.
The text was updated successfully, but these errors were encountered:
Issue description
Textnoisr uses a coefficient to take into account repetitions in consecutive letters in natural language.
As @felix-martel-prl said in #7 (review) :
It could indeed enhance readability, and make the code more easily usable for non-English languages.
Suggested Implementation Steps:
The text was updated successfully, but these errors were encountered: