Skip to content
0

Medical Device Manufacturer - Customer Segmentation

Can you find a better way to segment your customers?

📖 Background

You work for a medical device manufacturer in Switzerland. Your company manufactures orthopedic devices and sells them worldwide. The company sells directly to individual doctors who use them on rehabilitation and physical therapy patients.

Historically, the sales and customer support departments have grouped doctors by geography. However, the region is not a good predictor of the number of purchases a doctor will make or their support needs.

Your team wants to use a data-centric approach to segmenting doctors to improve marketing, customer service, and product planning.

load the datasets

import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt
doctors = pd.read_csv('data/doctors.csv')
doctors.head(2)
complaints = pd.read_csv('data/complaints.csv')
complaints.info()
orders = pd.read_csv('data/orders.csv')
orders.info()
# looks like there are records with null Ranks
# Checking these records
orders[orders['Condition J'].isna()]
orders.head(2)
instructions = pd.read_csv('data/instructions.csv')
instructions.info()

Data Understanding

1- Dataframe overview 2- Checking null values. 3- Checking duplication.

Need to check for a way to done this nicely

#print("doctor df shape:" + doctors.shape + " complaints df shape:" )
print("doctor df shape:")
doctors.shape
print("complaints df shape:")
complaints.shape
doctors.info()
# Check duplicate:
doctors.duplicated().sum(),orders.duplicated().sum(),complaints.duplicated().sum(),instructions.duplicated().sum()