Skip to content

Latest commit

 

History

History
16 lines (13 loc) · 654 Bytes

README.md

File metadata and controls

16 lines (13 loc) · 654 Bytes

information-retrieval

Implementation of some of the information retrieval methods. These methods are written with pure Java by me. Some of them are need performance improvements. You need to add TextFolder and put the data in this folder. For more information, please check readFile method of Retrieval class.

Implemented Algorithms

  • Blocked sort-based indexing
  • Boolean retrival
  • Naive Bayes
  • Positional posting list
  • Posting list
  • Tf-idf
  • Rocchio (not completed)

Data

Data can be reached from here.

Test queries: New York, np vp advp pp nnp, etc.