Project: Impact Analysis of GoodThought NGO Initiatives

GoodThought NGO has been a catalyst for positive change, focusing its efforts on education, healthcare, and sustainable development to make a significant difference in communities worldwide. With this mission, GoodThought has orchestrated an array of assignments aimed at uplifting underprivileged populations and fostering long-term growth.

This project offers a hands-on opportunity to explore how data-driven insights can direct and enhance these humanitarian efforts. In this project, you'll engage with the GoodThought PostgreSQL database, which encapsulates detailed records of assignments, funding, impacts, and donor activities from 2010 to 2023. This comprehensive dataset includes:

Assignments: Details about each project, including its name, duration (start and end dates), budget, geographical region, and the impact score.
Donations: Records of financial contributions, linked to specific donors and assignments, highlighting how financial support is allocated and utilized.
Donors: Information on individuals and organizations that fund GoodThought’s projects, including donor types.

Refer to the below ERD diagram for a visual representation of the relationships between these data tables:

You will execute SQL queries to answer two questions, as listed in the instructions. Good luck!

DataFrameas

highest_donation_assignments

variable

-- highest_donation_assignments
WITH top AS (SELECT d.assignment_id, ds.donor_type, ROUND(SUM(d.amount), 2) AS highest_donation
			FROM donations AS d
JOIN donors AS ds
	ON d.donor_id = ds.donor_id
GROUP BY d.assignment_id, ds.donor_type)

SELECT a.assignment_name, a.region, tp.highest_donation AS rounded_total_donation_amount, tp.donor_type
			FROM assignments AS a
JOIN top AS tp
	ON a.assignment_id = tp.assignment_id
ORDER BY rounded_total_donation_amount DESC
LIMIT 5;

DataFrameas

df

variable

SELECT assignment_id, ROW_NUMBER()OVER(PARTITION BY impact_score, region ORDER BY impact_score DESC)
	FROM assignments AS a;

DataFrameas

top_regional_impact_assignments

variable

-- CTE to calculate total impact score per region
-- Step 1: Top regions by impact
SELECT a.region, SUM(a.impact_score) AS impact_assignments
FROM assignments AS a
JOIN donations AS d
  ON a.assignment_id = d.assignment_id
GROUP BY a.region
ORDER BY impact_assignments DESC;

-- Step 2: CTE to count donations per assignment
WITH dr AS (
    SELECT assignment_id, COUNT(donation_id) AS num_total_donations
    FROM donations
    GROUP BY assignment_id
),

-- Step 3: CTE to rank assignments by impact_score within each region
sc AS (
    SELECT 
        a.assignment_id,
        a.assignment_name,
        a.impact_score,
        a.region,
        ROW_NUMBER() OVER(PARTITION BY a.region ORDER BY a.impact_score DESC) AS rank_value
    FROM assignments AS a
    JOIN dr ON a.assignment_id = dr.assignment_id
)

-- Step 4: Final selection of top assignment per region
SELECT 
    sc.assignment_name,
    sc.region,
    sc.impact_score,
    dr.num_total_donations
FROM sc
JOIN dr ON sc.assignment_id = dr.assignment_id
WHERE sc.rank_value = 1
ORDER BY sc.region ASC;