Data Analyst Associate Practical Exam Submission

You can use any tool that you want to do your analysis and create visualizations. Use this template to write up your summary for submission.

You can use any markdown formatting you wish. If you are not familiar with Markdown, read the Markdown Guide before you start.

Task 1

  • Product_id: There are 1500 nominal values, distinct from the other fields in the dataset. There were no missing values and the column matches the description given.
  • Category: There were 6 unique values, which corresponded with the data dictionary. 25 rows contained hyphens, which did not match the description, so they were replaced with 'Unknown'.
  • Animal: There were four unique values, matching the four given in the description. There were no missing values, so no changes were made.
  • Size: This column contained the values small, medium and large in inconsistent letter case. No values were missing, but the values were transformed to proper case.
  • Price: The values of this column were positive and continuous. There were 150 missing values, which were replaced with the median of the column, 28.07, and all values were rounded to 2 decimal places.
  • Sales: The values were rounded to 2 decimal places and matched the description given. There were no missing values.
  • Rating: The values of this column ranged from 1 to 9. There were 150 missing values, all of which were replaced with 0.
  • Repeat_purchase: All the values in this column were either 0 or 1. There were no missing values and no changes were made to this column.
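The cleaning steps described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used for the submission. It assumes each row is a dictionary with lowercase keys (`category`, `size`, `price`, `rating`), that missing values are stored as `None`, and that the hyphen placeholder is the single character `-`. The function name `clean_rows` and the sample rows are hypothetical.

```python
from statistics import median


def clean_rows(rows):
    """Return cleaned copies of the rows, following the rules listed above."""
    # Median of the prices that are present; used to fill missing prices.
    known_prices = [r["price"] for r in rows if r["price"] is not None]
    price_fill = median(known_prices)

    cleaned = []
    for r in rows:
        row = dict(r)  # copy so the input rows are left untouched
        # Hyphen placeholders in Category become 'Unknown'.
        if row["category"] == "-":
            row["category"] = "Unknown"
        # Normalise Size to proper case, e.g. 'SMALL' -> 'Small'.
        row["size"] = row["size"].strip().capitalize()
        # Fill missing Price with the median, then round to 2 decimals.
        price = row["price"] if row["price"] is not None else price_fill
        row["price"] = round(price, 2)
        # Missing Rating is replaced with 0.
        if row["rating"] is None:
            row["rating"] = 0
        cleaned.append(row)
    return cleaned


# Made-up sample rows, for illustration only.
sample = [
    {"category": "-", "size": "SMALL", "price": None, "rating": None},
    {"category": "Food", "size": "medium", "price": 10.0, "rating": 7},
    {"category": "Toys", "size": "Large", "price": 30.0, "rating": 4},
]
cleaned = clean_rows(sample)
print(cleaned[0])
```

Computing the median from the non-missing prices only, before filling, keeps the fill value from being influenced by the rows it replaces.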

Task 2

The figure below shows how many products are repeatedly purchased in each category. Equipment is the most purchased category. Food, Housing, Medicine and Toys have similar purchase counts, while the remaining categories differ widely from every other category. The number of observations is therefore not the same across all categories.
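One way to obtain the per-category counts behind such a figure is sketched below. It assumes rows stored as dictionaries with `category` and `repeat_purchase` keys, and it counts only rows where `repeat_purchase` is 1. The function name and the sample rows are hypothetical and do not reproduce the real dataset.

```python
from collections import Counter


def repeat_purchases_by_category(rows):
    """Count repeat-purchased products per category, largest first."""
    counts = Counter(r["category"] for r in rows if r["repeat_purchase"] == 1)
    return counts.most_common()


# Made-up sample rows, for illustration only.
sample_rows = [
    {"category": "Equipment", "repeat_purchase": 1},
    {"category": "Equipment", "repeat_purchase": 1},
    {"category": "Equipment", "repeat_purchase": 0},
    {"category": "Food", "repeat_purchase": 1},
    {"category": "Toys", "repeat_purchase": 0},
]
ranking = repeat_purchases_by_category(sample_rows)
print(ranking)
```

Categories with no repeat purchases do not appear in the result, so a category missing from the ranking should be read as a count of zero.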

Task 3

Sales in the last year ranged from 286.94 to 2255.96. The histogram below shows the highest concentration of products with sales between 979 and 1079. This suggests that sales were most frequent around the mean, median and third quartile values.
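The idea of locating the fullest histogram bin can be sketched as follows. The bin start (979) and width (100) are assumptions chosen so that one bin lines up with the 979 to 1079 interval mentioned above; the actual bin edges used for the histogram are not stated. The function name `busiest_bin` and the sample values are hypothetical.

```python
from collections import Counter


def busiest_bin(values, start, width):
    """Return ((low, high), count) for the fullest fixed-width bin."""
    # Map each value to the index of the bin it falls into.
    bins = Counter(int((v - start) // width) for v in values)
    index, count = bins.most_common(1)[0]
    low = start + index * width
    return (low, low + width), count


# Made-up sales values, for illustration only.
sample_sales = [300.0, 990.5, 1000.0, 1050.25, 1500.0, 2200.0]
interval, count = busiest_bin(sample_sales, start=979, width=100)
print(interval, count)
```

Bins are half-open, so a value equal to the upper edge of one bin is counted in the next bin.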

Task 4

✅ When you have finished...

  • Publish your Workspace using the option on the left
  • Check the published version of your report:
    • Can you see everything you want us to grade?
    • Are all the graphics visible?
  • Review the grading rubric. Have you included everything that will be graded?
  • Head back to the Certification Dashboard to submit your practical exam