Image
Code

Parallel Coordinates with Brushing (Heart_Failure_Prediction)

Sree Likhith

Last edited Mar 20, 2024
Created on Mar 20, 2024

Parallel Coordinates Visualization of Heart Disease Data

This Visualization showcases an interactive Parallel Coordinates Plot developed using D3.js, tailored for multidimensional data exploration with features like brushing for filtering and responsive design for adaptability across devices. The visualization focuses on the Heart Disease dataset, aiming to uncover patterns and relationships between various clinical parameters and the presence of heart disease.

Dataset

The dataset for this visualization is based on clinical measurements related to heart health, including age, sex, cholesterol levels, and more. You can access the dataset here: https://www.kaggle.com/datasets/fedesoriano/heart-failure-prediction.

Observations

Several interesting patterns emerged from the visualization:

  1. Age and Heart Disease: Older individuals tend to show a higher incidence of heart disease compared to younger ones. This pattern is observable as more lines corresponding to heart disease cases cluster in the higher age brackets.

  2. Cholesterol Levels: While high cholesterol levels are commonly associated with heart disease, the visualization reveals a more nuanced relationship. Some individuals with high cholesterol do not show signs of heart disease, indicating the influence of other factors.

  3. Exercise-induced Angina (Pain): A clear pattern emerges showing a higher prevalence of exercise-induced angina among individuals with heart disease. This observation supports the known link between angina and compromised heart health.

  4. Resting Blood Pressure: The data shows that resting blood pressure alone is not a definitive indicator of heart disease, as individuals with normal pressure also show instances of disease, suggesting the multifactorial nature of heart health.

Interactive Exploration

The Parallel Coordinates Plot allows users to interactively explore the data by filtering across multiple dimensions. For instance, brushing along the 'Age' axis to isolate specific age groups reveals how other parameters like 'MaxHR' (Maximum Heart Rate) and 'Cholesterol' levels distribute among those with and without heart disease within that age range.

Conclusion

This visualization project not only demonstrates the power of Parallel Coordinates for exploring multidimensional datasets but also highlights key insights into heart disease prevalence and its relationship with various clinical parameters. By allowing users to interact with the data, it fosters a deeper understanding of the complex factors that contribute to heart health.

MIT Licensed