-
It's important to define which algorithm gives better performance of the machine learning solutions, we need to narrow down on the best algorithms that suit both the data and the business requirements.
-
You can see:
- Phase I
- Data Pre-processing
- Gathering the Data
- Data Exploration and Analysis
- Data Cleaning
- Outliers Detection
- Dealing with Null values
- Adding new features
- Prepare the data for ML
- Dealing with the high skewness of the data
- Data Pre-processing
- Phase II:
- Initialize the models.
- Comparing different Machine learning models.
- Compute train/test results.
- Evaluate our models using cross validation.
- Improving the top models
- Stacking the best models to get a better score
- Phase 1
- Nulls
- Skewness
- Elastic Net
- Kernel Ridge
- Lasso
- Random Forest
- SVM
- XGBoost
- LGBM
- Gradient Boosting