Skip to content

Latest commit

 

History

History
36 lines (28 loc) · 1.27 KB

README.md

File metadata and controls

36 lines (28 loc) · 1.27 KB

Build Status Coverage Status Maintainability

LLTK

Lithuanian Language ToolKit

This is Python3 library for Lithuanian language analysis. Currently these functions are supported:

  • word stemming,
  • paragraph splitting into sentences,
  • text summarization based on smmry.

For summarization example please checkout example jupyter notebook.

Stemming and splitting is based on TokenMill work. So big thumbs up for these guys for opensourcing their tools.

Installing for development

  • Fork the project
  • Clone your forked project:
$ git clone git@github.com:evalkaz/lltk.git
  • Install development dependencies:
$ pip install -r requirements-dev.txt

Running tests

After cloned and installed development dependencies run tests just typing the following command:

$ pytest

Any contribution is welcome!