
GoodThought NGO has been a catalyst for positive change, focusing its efforts on education, healthcare, and sustainable development to make a significant difference in communities worldwide. With this mission, GoodThought has orchestrated an array of assignments aimed at uplifting underprivileged populations and fostering long-term growth.

This project offers a hands-on opportunity to explore how data-driven insights can direct and enhance these humanitarian efforts. You'll work with the GoodThought PostgreSQL database, which holds detailed records of assignments, funding, impacts, and donor activities from 2010 to 2023. The dataset includes:

  • Assignments: Details about each project, including its name, duration (start and end dates), budget, geographical region, and the impact score.
  • Donations: Records of financial contributions, linked to specific donors and assignments, highlighting how financial support is allocated and utilized.
  • Donors: Information on individuals and organizations that fund GoodThought’s projects, including donor types.

Refer to the ERD below for a visual representation of the relationships between these tables:

You will execute SQL queries to answer two questions, as listed in the instructions. Good luck!
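
As a rough sketch of how the three tables relate, the snippet below builds a toy version of the schema in an in-memory SQLite database (a stand-in for PostgreSQL, used here only so the example runs anywhere). The columns `assignment_id`, `assignment_name`, `region`, `impact_score`, `donor_id`, `donor_type`, and `amount` appear in the queries in this notebook; `donation_id`, `start_date`, `end_date`, `budget`, the column types, and the sample row are assumptions made for illustration.

```python
import sqlite3

# In-memory SQLite database as a stand-in for the GoodThought PostgreSQL database.
conn = sqlite3.connect(":memory:")

# Hypothetical schema: donation_id, start_date, end_date, budget and all
# column types are assumptions; the real ERD may differ.
conn.executescript("""
CREATE TABLE assignments (
    assignment_id   INTEGER PRIMARY KEY,
    assignment_name TEXT,
    start_date      TEXT,
    end_date        TEXT,
    budget          REAL,
    region          TEXT,
    impact_score    REAL
);
CREATE TABLE donors (
    donor_id   INTEGER PRIMARY KEY,
    donor_type TEXT
);
CREATE TABLE donations (
    donation_id   INTEGER PRIMARY KEY,
    donor_id      INTEGER REFERENCES donors(donor_id),
    assignment_id INTEGER REFERENCES assignments(assignment_id),
    amount        REAL
);
-- Made-up sample data, one row per table.
INSERT INTO assignments VALUES
    (1, 'Clean Water', '2015-01-01', '2016-01-01', 50000, 'East', 8.5);
INSERT INTO donors VALUES (10, 'Individual');
INSERT INTO donations VALUES (100, 10, 1, 250.0);
""")

# Each donation links one donor to one assignment.
rows = conn.execute("""
    SELECT a.assignment_name, d.amount, dn.donor_type
    FROM donations AS d
    JOIN assignments AS a ON a.assignment_id = d.assignment_id
    JOIN donors AS dn ON dn.donor_id = d.donor_id
""").fetchall()
print(rows)
```

The `donations` table sits between the other two, which is why both queries below join through it.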

-- highest_donation_assignments
-- Question: List the top five assignments based on total value of donations, categorized by donor type. The output should include four columns: 1) assignment_name, 2) region, 3) rounded_total_donation_amount rounded to two decimal places, and 4) donor_type, sorted by rounded_total_donation_amount in descending order.

WITH highest_donation_assignments AS (
    SELECT
        assignment_name,
        region,
        ROUND(SUM(amount), 2) AS rounded_total_donation_amount,
        donor_type
    FROM assignments
    JOIN donations ON assignments.assignment_id = donations.assignment_id
    JOIN donors ON donations.donor_id = donors.donor_id
    GROUP BY assignment_name, region, donor_type
)
SELECT *
FROM highest_donation_assignments
ORDER BY rounded_total_donation_amount DESC
LIMIT 5;
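
To see what grouping by donor type does to the result, the query can be tried on a few made-up rows. This is a minimal sketch that assumes SQLite (via Python's standard `sqlite3` module) behaves like PostgreSQL for this query; the table contents and the trimmed-down column lists are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Hypothetical toy data; only the columns the query needs are created.
conn.executescript("""
CREATE TABLE assignments (assignment_id INTEGER, assignment_name TEXT, region TEXT);
CREATE TABLE donors (donor_id INTEGER, donor_type TEXT);
CREATE TABLE donations (donor_id INTEGER, assignment_id INTEGER, amount REAL);

INSERT INTO assignments VALUES (1, 'A', 'North'), (2, 'B', 'South');
INSERT INTO donors VALUES (1, 'Individual'), (2, 'Corporate');
INSERT INTO donations VALUES
    (1, 1, 100.456),  -- Individual -> A
    (1, 1, 50.0),     -- Individual -> A
    (2, 1, 500.0),    -- Corporate  -> A
    (2, 2, 75.25);    -- Corporate  -> B
""")

rows = conn.execute("""
    WITH highest_donation_assignments AS (
        SELECT
            assignment_name,
            region,
            ROUND(SUM(amount), 2) AS rounded_total_donation_amount,
            donor_type
        FROM assignments
        JOIN donations ON assignments.assignment_id = donations.assignment_id
        JOIN donors ON donations.donor_id = donors.donor_id
        GROUP BY assignment_name, region, donor_type
    )
    SELECT *
    FROM highest_donation_assignments
    ORDER BY rounded_total_donation_amount DESC
    LIMIT 5
""").fetchall()

for row in rows:
    print(row)
```

Assignment `A` shows up twice in the output, once per donor type, because `donor_type` is part of the `GROUP BY`. The "top five" are therefore the five largest assignment and donor-type combinations, not five distinct assignments.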
-- top_regional_impact_assignments
-- Question: Identify the assignment with the highest impact score in each region, ensuring that each listed assignment has received at least one donation. The output should include four columns: 1) assignment_name, 2) region, 3) impact_score, and 4) num_total_donations, sorted by region in ascending order. Include only the highest-scoring assignment per region, avoiding duplicates within the same region. 

WITH impact_assignments AS (
    SELECT 
        assignment_name, 
        region, 
        impact_score, 
        COUNT(amount) AS num_total_donations
    FROM assignments
    JOIN donations ON assignments.assignment_id = donations.assignment_id
    JOIN donors ON donations.donor_id = donors.donor_id
    GROUP BY assignment_name, region, impact_score
    HAVING COUNT(amount) >= 1
),
top_regional_impact_assignments AS (
    SELECT 
        *,
        ROW_NUMBER() OVER (PARTITION BY region ORDER BY impact_score DESC) AS region_rank
    FROM impact_assignments
)
SELECT assignment_name, region, impact_score, num_total_donations
FROM top_regional_impact_assignments
WHERE region_rank = 1
ORDER BY region ASC;
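
The behaviour of the inner joins and `ROW_NUMBER()` can be checked on a small made-up dataset. The sketch below assumes SQLite 3.25 or later (for window functions) as a stand-in for PostgreSQL; all rows are hypothetical. Assignment `D` has the highest score in its region but no donations, so it should be excluded.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Hypothetical toy data; only the columns the query needs are created.
conn.executescript("""
CREATE TABLE assignments (
    assignment_id INTEGER, assignment_name TEXT, region TEXT, impact_score REAL
);
CREATE TABLE donors (donor_id INTEGER, donor_type TEXT);
CREATE TABLE donations (donor_id INTEGER, assignment_id INTEGER, amount REAL);

INSERT INTO assignments VALUES
    (1, 'A', 'North', 9.0),
    (2, 'B', 'North', 7.0),
    (3, 'C', 'South', 8.0),
    (4, 'D', 'South', 9.5);  -- highest score in South, but no donations
INSERT INTO donors VALUES (1, 'Individual');
INSERT INTO donations VALUES
    (1, 1, 10.0), (1, 1, 20.0),               -- 2 donations to A
    (1, 2, 5.0),                              -- 1 donation to B
    (1, 3, 1.0), (1, 3, 2.0), (1, 3, 3.0);    -- 3 donations to C
""")

rows = conn.execute("""
    WITH impact_assignments AS (
        SELECT
            assignment_name,
            region,
            impact_score,
            COUNT(amount) AS num_total_donations
        FROM assignments
        JOIN donations ON assignments.assignment_id = donations.assignment_id
        JOIN donors ON donations.donor_id = donors.donor_id
        GROUP BY assignment_name, region, impact_score
        HAVING COUNT(amount) >= 1
    ),
    top_regional_impact_assignments AS (
        SELECT
            *,
            ROW_NUMBER() OVER (PARTITION BY region ORDER BY impact_score DESC) AS region_rank
        FROM impact_assignments
    )
    SELECT assignment_name, region, impact_score, num_total_donations
    FROM top_regional_impact_assignments
    WHERE region_rank = 1
    ORDER BY region ASC
""").fetchall()

for row in rows:
    print(row)
```

Because `ROW_NUMBER()` assigns a distinct rank to every row in a partition, exactly one assignment per region survives the `region_rank = 1` filter even when impact scores tie. Which of the tied rows is kept is not deterministic unless a tie-breaker is added to the window's `ORDER BY`.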