Project: Impact Analysis of GoodThought NGO Initiatives

GoodThought NGO has been a catalyst for positive change, focusing its efforts on education, healthcare, and sustainable development to make a significant difference in communities worldwide. With this mission, GoodThought has orchestrated an array of assignments aimed at uplifting underprivileged populations and fostering long-term growth.

This project offers a hands-on opportunity to explore how data-driven insights can direct and enhance these humanitarian efforts. In this project, you'll engage with the GoodThought PostgreSQL database, which encapsulates detailed records of assignments, funding, impacts, and donor activities from 2010 to 2023. This comprehensive dataset includes:

Assignments: Details about each project, including its name, duration (start and end dates), budget, geographical region, and the impact score.
Donations: Records of financial contributions, linked to specific donors and assignments, highlighting how financial support is allocated and utilized.
Donors: Information on individuals and organizations that fund GoodThought’s projects, including donor types.

Refer to the below ERD diagram for a visual representation of the relationships between these data tables:

You will execute SQL queries to answer two questions, as listed in the instructions. Good luck!

DataFrameas

df

variable

SELECT
	assignment_name, a.assignment_id,
	ROUND(SUM(amount), 2) as rounded_total
from donations as d
INNER JOIN
	assignments as a on d.assignment_id = a.assignment_id
GROUP BY
	2
ORDER BY 3 DESC
LIMIT 5;

DataFrameas

highest_donation_assignments

variable

-- highest_donation_assignments
SELECT
	assignment_name,
	region,
	ROUND(SUM(amount), 2) as rounded_total_donation_amount,
	donor_type
FROM
assignments a
INNER JOIN 
donations d ON a.assignment_id = d.assignment_id
INNER JOIN
donors d1 ON d1.donor_id = d.donor_id
GROUP BY
a.assignment_id, donor_type
ORDER BY
rounded_total_donation_amount DESC
LIMIT 5;

DataFrameas

top_regional_impact_assignments

variable

-- top_regional_impact_assignments
with donation_count as (
	select
		assignment_id,
		count(donation_id) as donation_count
	from
		donations
	group by
		assignment_id
), regional_max as (
	select
		distinct first_value(region) over(partition by region) as region,
		max(impact_score) over(partition by region) as max_impact_score,
		first_value(assignment_id) over(partition by region order by impact_score desc) as assignment_id
	from assignments
)

-- select * from regional_max;

select
	a.assignment_name,
	rm.region, 
	a.impact_score,
	dc.donation_count as num_total_donations
from assignments a
inner join
	regional_max rm on a.assignment_id = rm.assignment_id
inner join
	donation_count dc on dc.assignment_id = a.assignment_id
order by 2