This project utilizes Kaggle's stroke prediction dataset to develop and compare three decision tree-based machine learning models:
- XGBoost
- Random Forest
- Decision Tree (scikit-learn)
- 📜
model.ipynb
– Jupyter Notebook containing the implementation and comparison of the three models. - 📜
README.md
– Project documentation. - 📜
requirements.txt
– List of dependencies required to run the project.
Ensure you have Python installed, then install the required dependencies using:
pip install -r requirements.txt
- Open
model.ipynb
in Jupyter Notebook or JupyterLab. - Run the notebook cells to train, evaluate, and compare the models.
- Analyze the results and accuracy metrics.
Based on the accuracy results:
✅ XGBoost provides the highest accuracy.
✅ Random Forest performs well but is slightly less accurate than XGBoost.
The dataset used in this project is the Stroke Prediction Dataset from Kaggle. You can access it here.
This project is licensed under the MIT License.