Image
Code

Loading and Parsing Healthcare Dataset Stroke CSV Data

aishwarya8615

Last edited Sep 25, 2019
Created on Sep 04, 2019

Healthcare Dataset Stroke Data

Data Source: Healthcare Dataset Stroke Data is used. This is the Github Data Link for the Healthcare Dataset Stroke Data

Context: This dataset is used to predict whether a patient is likely to get stroke based on the input parameters like gender, age, and various diseases and smoking status. I have taken a subset of the original test data using the filtering method for Data Visualization purposes.

Motive: What age group, gender or smoking status are more likely to get stroke and how do they compare with respect to these factors? Is there any correlation between age and stroke?

About the Data: Output/Occurance of stroke is a categorical variable. 2 of the inputs(gender and smoking status) are categorical and ordinal respectively and age is numerical.

Notes: Unknown in Smoking status means the information is unavailable N/A in other input fields imply that it is not applicable.

MIT Licensed