-
Notifications
You must be signed in to change notification settings - Fork 120
Machine Learning using PySpark
Awantik Das edited this page Oct 8, 2018
·
3 revisions
-
Fundamentals of Spark
- Spark Infrastructure
- Framework
- RDD
- DataFrames
- Vector - Sparse,Dense
- Matrix - Sparse,Dense
-
Fundamentals of Machine Learning
- Types of Machine Learning
- Machine Learning Pipeline
- Data Pipeline
- Deployment
-
Data Wrangling & Visualization
- Connecting CSV,JSON & Databases
- Statistical Analysis
- Accessing Data by rows, columns & filters
-
Data Pre-processing
-
Feature Selection & Extraction
-
Linear Models for Classification & Regression
-
Trees & Ensemble Methods
-
Clustering
-
Recommendation Engine
-
Deployment