Data Analyst Associate Practical Exam Submission
Task 1
- The original data contained 4500 rows and 8 columns. I began by validating each column as instructed. After replacing the missing values in the other columns, I deleted the 426 rows where repeat_purchase was missing, leaving 4074 rows. The following bullet points state the changes made during data validation:
- The category column contained 75 missing values listed as '-', which were replaced with 'Unknown'. Including 'Unknown', there are 7 unique categories.
- The animal column has 4 unique values and no missing values.
- The size column had inconsistent capitalization. Each value was standardized to 'Small', 'Medium', or 'Large', giving 3 unique values with no missing values.
- The price column had 450 missing values, which were replaced with the median (29.42). Values stored as varchar or float were converted to numeric and rounded to 2 decimal places.
- The sales column had a similar data type problem, with values stored as varchar or float. These were converted to numeric and rounded to 2 decimal places.
- The rating column had 450 missing values listed as 'NA', which were changed to 0 as instructed. After the change, ratings range from 0 to 9; no rating of 10 appears.
- The repeat_purchase column had 426 missing values listed as '-0'. These rows were deleted, leaving 4074 rows.
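The validation steps above could be sketched in plain Python roughly as follows. This is a minimal illustration, not the workflow actually used for this report. The `clean_rows` and `to_number` helpers and the one-dictionary-per-row layout are hypothetical, as is treating any unparseable price as missing. The column names and the '-', 'NA', and '-0' markers come from the notes above.

```python
from statistics import median


def to_number(value):
    """Return value as a float rounded to 2 decimals, or None if it cannot be parsed."""
    try:
        return round(float(value), 2)
    except (TypeError, ValueError):
        return None


def clean_rows(rows):
    """Apply the validation steps listed above to a list of row dicts."""
    rows = [dict(r) for r in rows]  # copy so the input is left untouched

    # Median of all parseable prices, computed before any rows are dropped.
    prices = [p for p in (to_number(r.get("price")) for r in rows) if p is not None]
    fill_price = round(median(prices), 2) if prices else None

    for r in rows:
        if r.get("category") == "-":
            r["category"] = "Unknown"
        # Normalize capitalization, e.g. 'sMALL' -> 'Small'.
        r["size"] = str(r.get("size", "")).strip().capitalize()
        price = to_number(r.get("price"))
        r["price"] = fill_price if price is None else price
        r["sales"] = to_number(r.get("sales"))
        rating = r.get("rating")
        r["rating"] = 0 if rating in (None, "NA") else int(float(rating))

    # Finally, drop rows whose repeat_purchase is missing (None or '-0').
    return [r for r in rows if r.get("repeat_purchase") not in (None, "-0")]
```

Filling the price median before dropping rows mirrors the order described above, so the median is taken over the full dataset rather than the reduced one.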
Task 2
Which categories are sold the most?
There are 7 categories of pet products in this data. Equipment is the most common. Next is food, with almost 300 fewer items sold than equipment. Food is closely followed by toys, medicine, and then housing, in that order. Accessory falls well short of the others in items sold, and last is the 'Unknown' category, which holds items whose category was not specified or was missing.
What do the prices look like?
Prices of the items in this list range from about 13.00 to about 51.00. The median price is about 29.00, with the lower and upper quartiles at about 24.00 and 35.00 respectively.
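A five-number summary like the one above could be computed along these lines. This is only a sketch using Python's standard library. The `price_summary` helper is hypothetical, and the "inclusive" quartile method is an assumption, since other tools may interpolate quartiles slightly differently.

```python
from statistics import median, quantiles


def price_summary(prices):
    """Return min, lower quartile, median, upper quartile and max of prices."""
    # quantiles(..., n=4) returns the three cut points Q1, Q2, Q3.
    q1, _, q3 = quantiles(prices, n=4, method="inclusive")
    return {
        "min": min(prices),
        "q1": q1,
        "median": median(prices),
        "q3": q3,
        "max": max(prices),
    }
```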
Which category has the most repeat purchases?
Equipment has the most repeat purchases, with over 600. Food, housing, medicine, and toys are almost equal at around 450 each. Accessories are at about 200 repeat purchases, while the 'Unknown' category has fewer than 50. PetMind should look into what causes the higher number of repeat purchases for equipment compared with the rest of the categories.
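A per-category count of this kind could be produced with a short tally such as the one below. It is a sketch only. The `repeat_purchases_by_category` helper is hypothetical, and it assumes repeat_purchase is encoded as 1 for a repeat purchase and 0 otherwise, which the notes above do not state.

```python
from collections import Counter


def repeat_purchases_by_category(rows):
    """Count rows flagged as repeat purchases, grouped by category."""
    counts = Counter(
        r["category"] for r in rows if int(r["repeat_purchase"]) == 1
    )
    # most_common() orders categories from the highest count to the lowest.
    return counts.most_common()
```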
Task 3
How do total sales compare across each category?
Equipment is the highest-selling category in terms of dollars earned. Specifically, cat equipment is the largest source of sales at over $600,000 for this period. Although food, housing, medicine, and toys have nearly equal repeat purchases, their sales totals are spread out. Toys has the second-highest total sales, followed by food, medicine, housing, accessory, and unknown, in that order.
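Totals by category and animal, such as the cat equipment figure above, could be aggregated roughly as follows. This is a minimal sketch. The `total_sales` helper and the row-dictionary layout are assumptions for illustration.

```python
from collections import defaultdict


def total_sales(rows):
    """Sum sales for each (category, animal) pair, largest total first."""
    totals = defaultdict(float)
    for r in rows:
        totals[(r["category"], r["animal"])] += float(r["sales"])
    return sorted(totals.items(), key=lambda item: item[1], reverse=True)
```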
Task 4
How do repeat purchases impact sales?
As repeat purchases increase, sales also increase. There is a strong linear relationship between these two variables across all categories.
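One way to put a number on a linear relationship like this is the Pearson correlation coefficient. The sketch below is not the method used for this report. The `pearson_r` helper is hypothetical, and the choice of inputs (for example, repeat purchase counts and sales totals per group) is an assumption.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation between two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of co-deviations and sums of squared deviations from the means.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)
```

Values near 1 indicate a strong positive linear relationship, values near -1 a strong negative one, and values near 0 little linear association.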
Based on the information presented, PetMind should determine the cause of the higher sales in pet equipment. Pet accessories, on the other hand, are not as popular, and further analysis may be needed to find out why. More insight into this data could show why pet equipment has almost 100 more repeat purchases than food products, and could help increase repeat purchases of everyday items.