Skip to content

Commit a896c5e

Browse files
Version 2.0.6
1 parent 7630487 commit a896c5e

File tree

3 files changed

+5
-2
lines changed

3 files changed

+5
-2
lines changed

README.rst

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -565,7 +565,7 @@ defined within the ``TOK`` class:
565565
| S_END | 11002 | End of sentence | |
566566
+---------------+---------+---------------------+---------------------------+
567567

568-
(*) The token types marked with an asterisk are reserved for the Reynir package
568+
(*) The token types marked with an asterisk are reserved for the Greynir package
569569
and not currently returned by the tokenizer.
570570

571571
To obtain a descriptive text for a token kind, use
@@ -752,6 +752,8 @@ can be found in the file ``test/toktest_normal_gold_expected.txt``.
752752
Changelog
753753
---------
754754

755+
* Version 2.0.6: Fixed handling of abbreviations such as *m.v.* (*miðað við*)
756+
that should not start a new sentence even if the following word is capitalized
755757
* Version 2.0.5: Fixed bug where single uppercase letters were erroneously
756758
being recognized as abbreviations, causing prepositions such as 'Í' and 'Á'
757759
at the beginning of sentences to be misunderstood in ReynirPackage

setup.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -57,7 +57,7 @@ def read(*names, **kwargs):
5757

5858
setup(
5959
name="tokenizer",
60-
version="2.0.5",
60+
version="2.0.6", # Also update src/tokenizer/__init__.py
6161
license="MIT",
6262
description="A tokenizer for Icelandic text",
6363
long_description=u"{0}\n{1}".format(

src/tokenizer/__init__.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -42,3 +42,4 @@
4242
from .abbrev import Abbreviations, ConfigError
4343

4444
__author__ = u"Miðeind ehf"
45+
__version__ = u"2.0.6" # Also update setup.py

0 commit comments

Comments
 (0)