This is a data science project that:
- analyzes and visualizes relationships between different predictor variables of a patients' health data,
- transforms predictor variables leveraging p-value analysis and dimensionality reduction, and
- trains and compate different Machine Learning models such as logistic regression, random forests and neural networks on stroke susceptibility prediction.
Please refer to the Python notebook for the code, the csv file for the data and the pdf for an in-depth report of my findings.
- Keras ðŸ§
- Matplotlib 📈
- NumPy 🧮
- scikit-learn 🤖
- statsmodels 📊