Skip to content

Machine Learning using PySpark

Awantik Das edited this page Oct 8, 2018 · 3 revisions
  1. Fundamentals of Spark

    • Spark Infrastructure
    • Framework
    • RDD
    • DataFrames
    • Vector - Sparse,Dense
    • Matrix - Sparse,Dense
  2. Fundamentals of Machine Learning

    • Types of Machine Learning
    • Machine Learning Pipeline
    • Data Pipeline
    • Deployment
  3. Data Wrangling & Visualization

    • Connecting CSV,JSON & Databases
    • Statistical Analysis
    • Accessing Data by rows, columns & filters
  4. Data Pre-processing

  5. Feature Selection & Extraction

  6. Linear Models for Classification & Regression

  7. Trees & Ensemble Methods

  8. Clustering

  9. Recommendation Engine

  10. Deployment

Clone this wiki locally