"An ounce of prevention is worth a pound of cure" -Benjamin Franklin


The goal of this project is to build a machine learning model that classifies patients into heart attack risk categories.

Executive Summary

This project is a proof of concept showing that machine learning can classify patients into heart attack risk categories. It is not a diagnostic tool but a time-saving measure: it quickly identifies high-risk patients without a time-consuming manual review of records. Patients identified as high risk can then be referred to a doctor for further review and receive frequent follow-ups along with nutrition guidance.

Data Dictionary:

| Feature | Description |
| --- | --- |
| Age | Age of the patient |
| Sex | Sex of the patient |
| exang | Exercise-induced angina (1 = yes; 0 = no) |
| ca | Number of major vessels (0-3) |
| cp | Chest pain type (angina): 1 = typical angina; 2 = atypical angina; 3 = non-anginal pain; 4 = asymptomatic |
| trtbps | Resting blood pressure (in mm Hg) |
| chol | Cholesterol in mg/dl, fetched via BMI sensor |
| fbs | Fasting blood sugar > 120 mg/dl (1 = true; 0 = false) |
| rest_ecg | Resting electrocardiographic results: 0 = normal; 1 = ST-T wave abnormality (T wave inversions and/or ST elevation or depression of > 0.05 mV); 2 = probable or definite left ventricular hypertrophy by Estes' criteria |
| thalach | Maximum heart rate achieved |
| target | 0 = less chance of heart attack; 1 = more chance of heart attack |
| st-slope | Shape of the slope of the ST segment (ventricular relaxation) |
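
The coded values in the data dictionary can be checked before any modelling. The sketch below is a minimal illustration, not part of the project's code: the names `ALLOWED_VALUES`, `CP_LABELS`, `find_invalid_fields`, and `encode_fbs` are hypothetical, and the sample record is made up. It assumes the encodings exactly as listed in the dictionary (for example `cp` running from 1 to 4), so adjust the sets if your copy of the data is coded differently.

```python
# Allowed values for the coded features, as listed in the data dictionary.
# Names and layout here are illustrative assumptions, not the project's code.
ALLOWED_VALUES = {
    "exang": {0, 1},         # exercise-induced angina (1 = yes; 0 = no)
    "ca": {0, 1, 2, 3},      # number of major vessels
    "cp": {1, 2, 3, 4},      # chest pain type
    "fbs": {0, 1},           # fasting blood sugar > 120 mg/dl
    "rest_ecg": {0, 1, 2},   # resting ECG result
    "target": {0, 1},        # 0 = less chance, 1 = more chance of heart attack
}

CP_LABELS = {
    1: "typical angina",
    2: "atypical angina",
    3: "non-anginal pain",
    4: "asymptomatic",
}


def find_invalid_fields(record):
    """Return the sorted names of coded fields whose value is outside the documented set."""
    return sorted(
        name
        for name, allowed in ALLOWED_VALUES.items()
        if name in record and record[name] not in allowed
    )


def encode_fbs(fasting_blood_sugar_mg_dl):
    """Encode fbs: 1 when fasting blood sugar exceeds 120 mg/dl, otherwise 0."""
    return 1 if fasting_blood_sugar_mg_dl > 120 else 0


# Made-up record for illustration only; not taken from the dataset.
sample = {"exang": 0, "ca": 5, "cp": 2, "fbs": encode_fbs(135), "rest_ecg": 1, "target": 1}
print(find_invalid_fields(sample))  # ['ca']
print(CP_LABELS[sample["cp"]])      # atypical angina
```

Here `ca` is flagged because 5 falls outside the documented 0-3 range. Keeping the allowed sets in one dictionary makes it easy to update a single entry if an encoding turns out to differ.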

Medical Definitions

1- Angina: chest pain caused by reduced blood flow to the heart muscle. There are three types of angina: stable angina, unstable angina, and variant angina.

2- Cholesterol: a waxy substance found in the body's cells that belongs to a group of organic molecules called lipids. There are three types of cholesterol: high-density lipoprotein (HDL), known as the "good cholesterol"; low-density lipoprotein (LDL), known as the "bad cholesterol"; and very-low-density lipoprotein (VLDL), which, as the name implies, consists of low-density particles that carry triglycerides in the blood.

3- ECG: short for electrocardiogram, a routine test used to check the heart's electrical activity.

4- ST depression: a type of ST-segment abnormality. The ST segment is the flat, isoelectric part of the ECG and represents the interval between ventricular depolarization and repolarization.

5- Thalassemia: a genetic blood disorder characterized by a lower-than-normal level of hemoglobin.