Skip to content

Latest commit

 

History

History
12 lines (9 loc) · 387 Bytes

README.md

File metadata and controls

12 lines (9 loc) · 387 Bytes

K-Harmonic-Mean-for-PySpark

Implementation of K-Harmonic Mean for application in Pyspark

Dependencies

  1. Numpy
  2. PySpark

How to Use

An example of how to use the algorithm can be found in test.py.

The rdd passed to the train method should be a list of tuples or a list of lists. The lists or tuples are the feature vectors and should all be floating point numbers or integers.