
Does going to university in a different country affect your mental health? A Japanese international university surveyed its students in 2018 and published a study the following year that was approved by several ethical and regulatory boards.

The study found that international students have a higher risk of mental health difficulties than the general population, and that social connectedness (belonging to a social group) and acculturative stress (stress associated with joining a new culture) are predictive of depression.

Explore the students data using PostgreSQL to find out if you would come to a similar conclusion for international students and see if the length of stay is a contributing factor.

Here is a data description of the columns you may find helpful.

Field Name       Description
inter_dom        Types of students (international or domestic)
japanese_cate    Japanese language proficiency
english_cate     English language proficiency
academic         Current academic level (undergraduate or graduate)
age              Current age of student
stay             Current length of stay in years
todep            Total score of depression (PHQ-9 test)
tosc             Total score of social connectedness (SCS test)
toas             Total score of acculturative stress (ASISS test)
-- Run this code to save the CSV file as students
SELECT * 
FROM 'students.csv';

Total records in the dataset

-- checking total records
SELECT count(*) AS total_records
FROM 'students.csv';

There are 286 records in our dataset.

Count of records by student type

-- checking records by student type
SELECT inter_dom, COUNT(*) AS count_inter_dom
FROM 'students.csv'
GROUP BY inter_dom;

The result set shows that the "inter" student type is the largest group with 201 records, followed by the "Dom" student type with 67 records, and finally 18 students with unknown status (null).
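Because 18 rows have no student type, it can help to label them explicitly and sort the groups by size. The query below is a minimal sketch of one way to do that. It assumes the missing values are read as NULL, as the result above suggests, and the label 'Unknown' and the alias names are illustrative choices rather than part of the original analysis.

```sql
-- Sketch: label missing student types and sort groups by size.
-- Assumption: missing inter_dom values are NULL; 'Unknown' is an illustrative label.
SELECT
    COALESCE(inter_dom, 'Unknown') AS student_type,
    COUNT(*) AS count_inter_dom
FROM 'students.csv'
GROUP BY COALESCE(inter_dom, 'Unknown')
ORDER BY count_inter_dom DESC;
```

The three counts should still add up to the 286 records found earlier, which is a quick check that no rows were dropped.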

Summarizing the depression scores

-- Finding the summary statistics for depression scores
SELECT
    MIN(todep) AS min_phq,
    MAX(todep) AS max_phq,
    ROUND(AVG(todep), 2) AS avg_phq
FROM 'students.csv';

Across the dataset, the minimum depression score is 0, the maximum is 25, and the average is 8.19.
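These figures cover every student, so they do not yet tell us whether international and domestic students differ. A possible next step, sketched here under the assumption that rows with an unknown student type should be left out, is to compute the same summary per student type. The `n_students` alias is an illustrative name.

```sql
-- Sketch: PHQ-9 summary statistics broken down by student type.
-- Assumption: rows with a NULL inter_dom are excluded from the comparison.
SELECT
    inter_dom,
    COUNT(*) AS n_students,
    MIN(todep) AS min_phq,
    MAX(todep) AS max_phq,
    ROUND(AVG(todep), 2) AS avg_phq
FROM 'students.csv'
WHERE inter_dom IS NOT NULL
GROUP BY inter_dom
ORDER BY inter_dom;
```

Including the group size next to each average makes it easier to judge how much weight the smaller group's average can carry.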

Summarizing the social connectedness scores

-- Finding the summary statistics for social connectedness scores
SELECT
    MIN(tosc) AS min_scs,
    MAX(tosc) AS max_scs,
    ROUND(AVG(tosc), 2) AS avg_scs
FROM 'students.csv';
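
To start on the length-of-stay question posed at the top, one option is to average all three scores per value of stay for international students only. This is a sketch rather than a definitive analysis. It assumes international students are labelled "inter" in inter_dom and matches that label case-insensitively because the exact casing is not confirmed here. The alias names are illustrative.

```sql
-- Sketch: average scores by length of stay for international students.
-- Assumption: international students are labelled 'inter' in inter_dom
-- (matched case-insensitively, since the exact casing is not confirmed).
SELECT
    stay,
    COUNT(*) AS count_int,
    ROUND(AVG(todep), 2) AS avg_phq,
    ROUND(AVG(tosc), 2) AS avg_scs,
    ROUND(AVG(toas), 2) AS avg_as
FROM 'students.csv'
WHERE LOWER(inter_dom) = 'inter'
GROUP BY stay
ORDER BY stay DESC;
```

Keeping the count per group in the output matters here, since long stays are likely to have few students and their averages would then be less reliable.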