GoodThought NGO has been a catalyst for positive change, focusing its efforts on education, healthcare, and sustainable development to make a significant difference in communities worldwide. With this mission, GoodThought has orchestrated an array of assignments aimed at uplifting underprivileged populations and fostering long-term growth.
This project offers a hands-on opportunity to explore how data-driven insights can direct and enhance these humanitarian efforts. You'll work with the GoodThought PostgreSQL database, which holds detailed records of assignments, funding, impacts, and donor activities from 2010 to 2023. The dataset includes three tables:
Assignments: Details about each project, including its name, duration (start and end dates), budget, geographical region, and the impact score.
Donations: Records of financial contributions, linked to specific donors and assignments, highlighting how financial support is allocated and utilized.
Donors: Information on individuals and organizations that fund GoodThought’s projects, including donor types.
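The three tables described above can be sketched as PostgreSQL DDL. This is only an illustration: the key and measure columns (assignment_id, assignment_name, region, impact_score, donation_id, donor_id, amount, donor_type) are the ones the queries in this notebook use, while start_date, end_date, and budget are assumed names for the fields the description mentions, and every column type and constraint is an assumption. The real tables may hold further columns.

```sql
-- Hypothetical schema sketch; types, constraints, and the names
-- start_date, end_date, and budget are assumptions, not the actual DDL.
CREATE TABLE assignments (
    assignment_id   INTEGER PRIMARY KEY,
    assignment_name TEXT,
    start_date      DATE,
    end_date        DATE,
    budget          NUMERIC,
    region          TEXT,
    impact_score    NUMERIC
);

CREATE TABLE donors (
    donor_id   INTEGER PRIMARY KEY,
    donor_type TEXT
);

-- Each donation links one donor to one assignment.
CREATE TABLE donations (
    donation_id   INTEGER PRIMARY KEY,
    donor_id      INTEGER REFERENCES donors (donor_id),
    assignment_id INTEGER REFERENCES assignments (assignment_id),
    amount        NUMERIC
);
```

Under this sketch, donations is the bridge table: joining it to assignments on assignment_id and to donors on donor_id is what lets a single query combine funding, region, and donor type.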
Refer to the ERD below for a visual representation of the relationships between these tables:
You will execute SQL queries to answer two questions, as listed in the instructions. Good luck!
-- Top five assignments by total donation amount
SELECT
    a.assignment_name,
    a.assignment_id,
    ROUND(SUM(d.amount), 2) AS rounded_total
FROM donations AS d
INNER JOIN assignments AS a ON d.assignment_id = a.assignment_id
GROUP BY a.assignment_id, a.assignment_name
ORDER BY rounded_total DESC
LIMIT 5;
-- highest_donation_assignments
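-- Top five (assignment, donor type) pairs by total donation amount
SELECT
    a.assignment_name,
    a.region,
    ROUND(SUM(d.amount), 2) AS rounded_total_donation_amount,
    d1.donor_type
FROM assignments AS a
INNER JOIN donations AS d ON a.assignment_id = d.assignment_id
INNER JOIN donors AS d1 ON d1.donor_id = d.donor_id
-- Every non-aggregated column is listed so the grouping does not depend on a primary key
GROUP BY a.assignment_id, a.assignment_name, a.region, d1.donor_type
ORDER BY rounded_total_donation_amount DESC
LIMIT 5;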
-- top_regional_impact_assignments
WITH donation_count AS (
    -- Number of donations received by each assignment
    SELECT
        assignment_id,
        COUNT(donation_id) AS donation_count
    FROM donations
    GROUP BY assignment_id
), regional_max AS (
    -- Highest-impact assignment in each region; ties are broken by assignment_id
    SELECT DISTINCT ON (region)
        region,
        impact_score AS max_impact_score,
        assignment_id
    FROM assignments
    ORDER BY region, impact_score DESC, assignment_id
)
SELECT
    a.assignment_name,
    rm.region,
    a.impact_score,
    dc.donation_count AS num_total_donations
FROM assignments AS a
INNER JOIN regional_max AS rm ON a.assignment_id = rm.assignment_id
-- The inner join keeps only assignments with at least one donation
INNER JOIN donation_count AS dc ON dc.assignment_id = a.assignment_id
ORDER BY rm.region;
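The query above picks the top-scoring assignment in each region first and joins to the donation counts afterwards, so a region whose top assignment has no donations drops out of the result. If the intent is instead the highest-impact assignment per region among those that received at least one donation (an assumption about the requirement, not something stated here), the ranking has to happen after the join. A minimal sketch using ROW_NUMBER(), with assignment_id as an arbitrary tie-breaker:

```sql
-- Sketch under the assumption stated above: rank within each region
-- only among assignments that have at least one donation.
WITH donation_count AS (
    SELECT
        assignment_id,
        COUNT(donation_id) AS num_total_donations
    FROM donations
    GROUP BY assignment_id
), ranked AS (
    SELECT
        a.assignment_name,
        a.region,
        a.impact_score,
        dc.num_total_donations,
        ROW_NUMBER() OVER (
            PARTITION BY a.region
            ORDER BY a.impact_score DESC, a.assignment_id
        ) AS rn
    FROM assignments AS a
    -- Inner join removes assignments without donations before ranking
    INNER JOIN donation_count AS dc ON dc.assignment_id = a.assignment_id
)
SELECT
    assignment_name,
    region,
    impact_score,
    num_total_donations
FROM ranked
WHERE rn = 1
ORDER BY region;
```

Both versions return at most one row per region. They differ only when a region's overall top-scoring assignment has no donations.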