Diabetes Patient Analysis
Task:
The main objective of this project is to diagnostically predict whether a patient has diabetes or not. The dataset used for this analysis is sourced from the National Institute of Diabetes and Digestive and Kidney Diseases.
Load dataset and important libraries
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import plotly.express as px
df = pd.read_csv('diabetes.csv')
Data Exploration
df.info()
No null values
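As a complement to df.info(), the null count can also be read off per column. The sketch below uses a small made-up frame (the name sample and its values are hypothetical, not taken from diabetes.csv) with one missing value planted on purpose, so the output is easy to verify by eye.

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in for df; the real data comes from diabetes.csv.
sample = pd.DataFrame({
    "Glucose": [148, 85, np.nan, 89],   # one missing value planted on purpose
    "BMI": [33.6, 26.6, 23.3, 28.1],
    "Outcome": [1, 0, 1, 0],
})

# Number of missing values in each column.
null_counts = sample.isnull().sum()
print(null_counts)

# Total number of missing values in the whole frame.
total_nulls = int(null_counts.sum())
print("Total nulls:", total_nulls)
```

On the real dataset the same two lines, applied to df, should report zero for every column if the "No null values" reading of df.info() is right.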
df.head()
About the dataset
- Pregnancies: Number of times the patient has been pregnant
- Glucose: Glucose level of the patient
- Blood Pressure: Blood pressure level of the patient
- Skin Thickness: Thickness of the skin, measured to estimate body fat
- Insulin: Insulin level of the patient
- BMI: Body mass index
- Diabetes Pedigree Function: Related to genetics. A pedigree is a chart that shows the inheritance of a trait or health condition through the generations of a family.
- Age: Age of the patient
- Outcome: Whether the patient has diabetes or not.
  - 0 indicates that the patient does not have diabetes.
  - 1 indicates that the patient has diabetes.
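Since Outcome is the column to be predicted, it can help to see how the two classes are split. This is a minimal sketch on a made-up series (the values are hypothetical, not from the dataset); on the real data the same calls would be made on df["Outcome"].

```python
import pandas as pd

# Hypothetical Outcome values: 0 = no diabetes, 1 = diabetes.
outcome = pd.Series([0, 1, 0, 0, 1, 0, 1, 0], name="Outcome")

# Absolute number of patients in each class.
counts = outcome.value_counts()
print(counts)

# Same split expressed as a fraction of all rows.
shares = outcome.value_counts(normalize=True)
print(shares)
```

A strongly uneven split would be worth keeping in mind later, because accuracy alone can look good on imbalanced classes.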
df.duplicated().sum()
No duplicates
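To show what the duplicate check is counting, here is a small sketch on a made-up frame (sample and its values are hypothetical) in which the third row repeats the first. duplicated() marks only the later copy, so the sum is the number of rows that would be dropped.

```python
import pandas as pd

# Hypothetical frame: the third row is an exact copy of the first.
sample = pd.DataFrame({
    "Glucose": [148, 85, 148],
    "Age": [50, 31, 50],
})

# True for every row that repeats an earlier row in full.
dup_mask = sample.duplicated()
n_dup = int(dup_mask.sum())
print("Duplicate rows:", n_dup)

# If duplicates were present, they could be removed like this.
deduped = sample.drop_duplicates()
print(deduped)
```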
df.describe()
df["Pregnancies"].unique()
df["Pregnancies"].value_counts()
df["BloodPressure"].value_counts().sort_index(ascending=True)
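The difference between the calls above is mostly one of ordering. The sketch below uses a made-up BloodPressure series (values are hypothetical) to show it: unique() lists values in order of first appearance, value_counts() orders by frequency, and adding sort_index() orders by the measured value instead.

```python
import pandas as pd

# Hypothetical blood pressure readings.
bp = pd.Series([72, 66, 64, 66, 72, 72], name="BloodPressure")

# Distinct values, in order of first appearance.
distinct = bp.unique()
print(distinct)

# Counts ordered from most to least frequent.
by_freq = bp.value_counts()
print(by_freq)

# Same counts, ordered by the blood pressure value itself.
by_value = bp.value_counts().sort_index(ascending=True)
print(by_value)
```

Sorting by index makes it easier to scan the low and high ends of the range for values that look out of place.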