Skip to content

GoodThought NGO has been a catalyst for positive change, focusing its efforts on education, healthcare, and sustainable development to make a significant difference in communities worldwide. With this mission, GoodThought has orchestrated an array of assignments aimed at uplifting underprivileged populations and fostering long-term growth.

This project offers a hands-on opportunity to explore how data-driven insights can direct and enhance these humanitarian efforts. In this project, you'll engage with the GoodThought PostgreSQL database, which encapsulates detailed records of assignments, funding, impacts, and donor activities from 2010 to 2023. This comprehensive dataset includes:

  • Assignments: Details about each project, including its name, duration (start and end dates), budget, geographical region, and the impact score.
  • Donations: Records of financial contributions, linked to specific donors and assignments, highlighting how financial support is allocated and utilized.
  • Donors: Information on individuals and organizations that fund GoodThought’s projects, including donor types.

Refer to the below ERD diagram for a visual representation of the relationships between these data tables:

You will execute SQL queries to answer two questions, as listed in the instructions. Good luck!

Spinner
DataFrameas
highest_donation_assignments
variable
-- highest_donation_assignments
-- top 5 assignments based total value of donations, cat donor type
-- assignment_name (assignments), region(assignments), rounded_total_donation_amount (donations) (two dec), donor_type (donors)
-- order by rounded_total DESC

WITH rounded_total AS (
	SELECT assignment_id, ROUND(SUM(amount),2) AS rounded_total_donation_amount, donor_type
	FROM donations
	JOIN donors
		ON donations.donor_id = donors.donor_id
	GROUP BY assignment_id, donor_type
)

SELECT DISTINCT assignment_name, 
	region, 
	rounded_total_donation_amount, 
	donors.donor_type
FROM assignments AS a
JOIN rounded_total AS rt
	ON a.assignment_id = rt.assignment_id
JOIN donors
	ON rt.donor_type = donors.donor_type
ORDER BY rounded_total_donation_amount DESC
LIMIT 5;
Spinner
DataFrameas
top_regional_impact_assignments
variable
-- top_regional_impact_assignments
-- assignment w/ highest impact score each region, at least one donation
-- assignment_name (assignments), region (assignments), impact_score (assignments), num_total_donations (donations)
-- order by region asc
-- only highest scoring per region, no dupes

WITH total_donations AS (
	SELECT COUNT(*) AS num_total_donations, assignment_id
	FROM donations
	WHERE donation_id IS NOT NULL
	GROUP BY assignment_id
),

region_rank AS (
	SELECT assignment_id, assignment_name, region, impact_score,
		ROW_NUMBER() OVER(PARTITION BY region ORDER BY impact_score DESC) AS score_rank 
	FROM assignments	
)

SELECT assignment_name, 
	region,
	impact_score,
	num_total_donations
FROM region_rank AS rr
JOIN total_donations AS td
	ON rr.assignment_id = td.assignment_id
WHERE score_rank = 1
ORDER BY region ASC;