MachineLearning

Tutorial Datasets Analyzed with Machine Learning

  • Kaggle's Titanic Dataset (Python)
  • Kaggle's Heart Disease Dataset (R)

Data Preprocessing and Random Forest Hyperparameter Tuning

  • See RandomForestTuningExample.ipynb
  • In this notebook, I run one round of random search to narrow the hyperparameter ranges, then tune that narrower subset more thoroughly with a subsequent grid search.
  • Some basic figure generation using seaborn.
  • Some basic object-oriented programming (OOP).
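
The random-search-then-grid-search workflow described above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: it assumes scikit-learn's `RandomizedSearchCV` and `GridSearchCV`, uses synthetic data from `make_classification` in place of the preprocessed dataset, and the search ranges and the `neighbours` helper are hypothetical choices made for the example.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV, RandomizedSearchCV

# Synthetic stand-in data (assumption); the notebook works on its own dataset.
X, y = make_classification(n_samples=200, n_features=10, random_state=0)

# Stage 1: random search samples a few combinations from a wide space.
# These ranges are illustrative, not the values used in the notebook.
coarse_space = {
    "n_estimators": [20, 40, 80],
    "max_depth": [2, 4, 8, None],
    "min_samples_leaf": [1, 2, 4],
}
random_search = RandomizedSearchCV(
    RandomForestClassifier(random_state=0),
    param_distributions=coarse_space,
    n_iter=6,
    cv=3,
    random_state=0,
)
random_search.fit(X, y)
best = random_search.best_params_


def neighbours(value, candidates):
    """Hypothetical helper: the value plus its adjacent entries in candidates."""
    i = candidates.index(value)
    return candidates[max(i - 1, 0): i + 2]


# Stage 2: grid search exhaustively checks the neighbourhood of the
# best combination found by the random search.
fine_grid = {
    name: neighbours(best[name], values)
    for name, values in coarse_space.items()
}
grid_search = GridSearchCV(
    RandomForestClassifier(random_state=0),
    param_grid=fine_grid,
    cv=3,
)
grid_search.fit(X, y)

print("Random search best:", best)
print("Grid search best:", grid_search.best_params_)
```

The random search keeps the first pass cheap because it evaluates only `n_iter` combinations, while the grid search spends its budget on a much smaller region of the space.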

Future Directions

  • Look into imputing missing values
  • Consider using XGBoost instead of a random forest
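
As one possible starting point for the missing-value item above, here is a small sketch of median imputation. It assumes scikit-learn's `SimpleImputer`, which is not something the repository currently uses, and the toy array is made up for illustration (two numeric columns with a gap in each).

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.impute import SimpleImputer
from sklearn.pipeline import make_pipeline

# Made-up toy data (assumption): two numeric features with missing entries.
X = np.array([
    [22.0, 7.25],
    [38.0, 71.28],
    [np.nan, 8.05],
    [35.0, np.nan],
    [54.0, 51.86],
    [2.0, 21.08],
])
y = np.array([0, 1, 1, 0, 0, 1])

# Replace each NaN with the median of the observed values in its column.
imputer = SimpleImputer(strategy="median")
X_filled = imputer.fit_transform(X)
print(X_filled)

# Putting the imputer in a pipeline means the medians are learned only from
# the training data whenever the pipeline is fit, e.g. inside cross-validation.
model = make_pipeline(
    SimpleImputer(strategy="median"),
    RandomForestClassifier(n_estimators=50, random_state=0),
)
model.fit(X, y)
print(model.predict(X))
```

Median imputation is only one option; whether it suits these datasets would need to be checked against how the values are actually missing.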