
GoodThought NGO has been a catalyst for positive change, focusing its efforts on education, healthcare, and sustainable development to make a significant difference in communities worldwide. With this mission, GoodThought has orchestrated an array of assignments aimed at uplifting underprivileged populations and fostering long-term growth.

This project offers a hands-on opportunity to explore how data-driven insights can direct and enhance these humanitarian efforts. You'll work with the GoodThought PostgreSQL database, which holds detailed records of assignments, funding, impact, and donor activity from 2010 to 2023. The dataset includes:

  • Assignments: Details about each project, including its name, duration (start and end dates), budget, geographical region, and the impact score.
  • Donations: Records of financial contributions, linked to specific donors and assignments, highlighting how financial support is allocated and utilized.
  • Donors: Information on individuals and organizations that fund GoodThought’s projects, including donor types.

Refer to the entity-relationship diagram (ERD) below for a visual representation of the relationships between these tables:

You will execute SQL queries to answer two questions, as listed in the instructions. Good luck!

-- Result stored as DataFrame variable: highest_donation_assignments
-- Top five assignment and donor-type combinations by total donation amount.
-- Because donor_type is in the GROUP BY, totals are per assignment per donor type.
SELECT
	a.assignment_name,
	a.region,
	ROUND(SUM(dt.amount), 2) AS rounded_total_donation_amount,
	dn.donor_type
FROM assignments AS a
	INNER JOIN donations AS dt
	ON a.assignment_id = dt.assignment_id
	INNER JOIN donors AS dn
	ON dt.donor_id = dn.donor_id
GROUP BY
	a.assignment_name, a.region, dn.donor_type
ORDER BY
	rounded_total_donation_amount DESC
LIMIT 5;
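As a quick sanity check on how grouping by donor_type shapes the result, here is a minimal sketch that runs the same query against a tiny in-memory database. It uses Python's built-in sqlite3 module rather than PostgreSQL, and the column types and sample rows are hypothetical, made up purely for illustration; only the table and column names come from the query above.

```python
import sqlite3

# Hypothetical toy schema and rows. Table and column names follow the query
# above; the types and values are assumptions for illustration only.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE assignments (assignment_id INTEGER PRIMARY KEY, assignment_name TEXT, region TEXT);
CREATE TABLE donors (donor_id INTEGER PRIMARY KEY, donor_type TEXT);
CREATE TABLE donations (donation_id INTEGER PRIMARY KEY, donor_id INTEGER, assignment_id INTEGER, amount REAL);
INSERT INTO assignments VALUES (1, 'Clean Water', 'East'), (2, 'School Build', 'West');
INSERT INTO donors VALUES (1, 'Individual'), (2, 'Organization');
INSERT INTO donations VALUES
    (1, 1, 1, 100.0), (2, 1, 1, 50.5), (3, 2, 1, 400.0), (4, 2, 2, 250.0);
""")

QUERY = """
SELECT a.assignment_name, a.region,
       ROUND(SUM(dt.amount), 2) AS rounded_total_donation_amount,
       dn.donor_type
FROM assignments AS a
    INNER JOIN donations AS dt ON a.assignment_id = dt.assignment_id
    INNER JOIN donors AS dn ON dt.donor_id = dn.donor_id
GROUP BY a.assignment_name, a.region, dn.donor_type
ORDER BY rounded_total_donation_amount DESC
LIMIT 5;
"""

rows = conn.execute(QUERY).fetchall()
for row in rows:
    print(row)
```

In this toy data, 'Clean Water' appears twice in the result, once per donor type, so the same assignment can occupy more than one of the five slots.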
-- Result stored as DataFrame variable: top_regional_impact_assignments
-- Counts the donations received by each assignment
WITH donations_count AS (
	SELECT
		assignment_id,
		COUNT(donation_id) AS num_total_donations
	FROM 
		donations
	GROUP BY 
		assignment_id
),
-- Gives each assignment a unique rank within its region by impact score, restricted to assignments with at least one donation (the highest impact score in each region gets row_num 1)
ranks AS (
	SELECT
		a.assignment_name,
		a.region,
		a.impact_score,
		dc.num_total_donations,
		ROW_NUMBER() OVER(PARTITION BY region ORDER BY impact_score DESC) AS row_num
	FROM 
		assignments AS a
		JOIN donations_count AS dc
		ON a.assignment_id = dc.assignment_id
	WHERE 
		dc.num_total_donations > 0
)
-- Returns the selected columns for the top-ranked assignment (row_num = 1) in each region
SELECT
	assignment_name,
	region,
	impact_score,
	num_total_donations
FROM 
	ranks
WHERE 
	row_num = 1
ORDER BY 
	region;
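One design choice worth noting: ROW_NUMBER() always assigns distinct numbers, so if two assignments in a region share the highest impact score, only one of them is returned and which one is not guaranteed. RANK() would return both. The sketch below illustrates the difference with Python's built-in sqlite3 module (window functions need SQLite 3.25 or newer) instead of PostgreSQL. The sample rows are hypothetical, and the donations join is left out to keep the example short.

```python
import sqlite3

# Hypothetical rows: two assignments in 'East' share the top impact score.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE assignments (
    assignment_id INTEGER PRIMARY KEY,
    assignment_name TEXT,
    region TEXT,
    impact_score REAL
);
INSERT INTO assignments VALUES
    (1, 'Clean Water', 'East', 9.5),
    (2, 'Health Camp', 'East', 9.5),
    (3, 'School Build', 'West', 8.0);
""")

def top_per_region(window_fn):
    """Return the rank-1 rows per region using the given window function.

    window_fn is only ever one of the fixed names passed below,
    never user input, so interpolating it into the SQL is safe here.
    """
    sql = f"""
    WITH ranks AS (
        SELECT assignment_name, region, impact_score,
               {window_fn}() OVER (PARTITION BY region ORDER BY impact_score DESC) AS rnk
        FROM assignments
    )
    SELECT assignment_name, region
    FROM ranks
    WHERE rnk = 1
    ORDER BY region, assignment_name;
    """
    return conn.execute(sql).fetchall()

with_row_number = top_per_region("ROW_NUMBER")  # one row per region
with_rank = top_per_region("RANK")              # tied leaders are all kept

print(len(with_row_number), len(with_rank))
```

If exactly one row per region is required, ROW_NUMBER() is the right tool; adding a tie-breaker column to the window's ORDER BY would make the chosen row deterministic.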