Skip to content
Project: Analyzing Electric Vehicle Charging Habits
As electronic vehicles (EVs) become more popular, there is an increasing need for access to charging stations, also known as ports. To that end, many modern apartment buildings have begun retrofitting their parking garages to include shared charging stations. A charging station is shared if it is accessible by anyone in the building.
But with increasing demand comes competition for these ports — nothing is more frustrating than coming home to find no charging stations available! In this project, you will use a dataset to help apartment building managers better understand their tenants’ EV charging habits.
The data has been loaded into a PostgreSQL database with a table named charging_sessions with the following columns:
charging_sessions
| Column | Definition | Data type |
|---|---|---|
garage_id | Identifier for the garage/building | VARCHAR |
user_id | Identifier for the individual user | VARCHAR |
user_type | Indicating whether the station is Shared or Private | VARCHAR |
start_plugin | The date and time the session started | DATETIME |
start_plugin_hour | The hour (in military time) that the session started | NUMERIC |
end_plugout | The date and time the session ended | DATETIME |
end_plugout_hour | The hour (in military time) that the session ended | NUMERIC |
duration_hours | The length of the session, in hours | NUMERIC |
el_kwh | Amount of electricity used (in Kilowatt hours) | NUMERIC |
month_plugin | The month that the session started | VARCHAR |
weekdays_plugin | The day of the week that the session started | VARCHAR |
Let’s get started!
Sources
DataFrameas
unique_users_per_garage
variable
-- Find the number of unique individuals that use each garage’s shared charging stations.
SELECT garage_id, COUNT(DISTINCT user_id) AS num_unique_users
FROM charging_sessions
WHERE user_type = 'Shared'
GROUP BY garage_id
ORDER BY num_unique_users DESC;DataFrameas
most_popular_shared_start_times
variable
-- Find the top 10 most popular charging start times (by weekday and start hour) for sessions that use shared charging stations.
SELECT weekdays_plugin, start_plugin_hour, COUNT (start_plugin_hour) AS num_charging_sessions
FROM charging_sessions
WHERE user_type = 'Shared'
GROUP BY weekdays_plugin, start_plugin_hour
ORDER BY num_charging_sessions DESC
LIMIT 10;DataFrameas
long_duration_shared_users
variable
-- Find the users whose average charging duration last longer than 10 hours when using shared charging stations.
SELECT user_id, ROUND(AVG(duration_hours),2) AS avg_charging_duration
FROM charging_sessions
WHERE user_type = 'Shared'
AND duration_hours IS NOT NULL
GROUP BY user_id
HAVING AVG(duration_hours) >= 10
ORDER BY avg_charging_duration DESC;DataFrameas
Ex_1
variable
-- Number of charging sessions per user per garage:
SELECT user_id, garage_id, COUNT(*) AS session_count
FROM charging_sessions
GROUP BY user_id, garage_id
ORDER BY session_count DESC, user_id ASC;
DataFrameas
Ex_2
variable
-- Count Unique Users Per Garage
SELECT garage_id, COUNT(DISTINCT user_id) AS unique_users
FROM charging_sessions
GROUP BY garage_id
ORDER BY unique_users DESC;
DataFrameas
Ex_3
variable
-- Total number of shared sessions to provide context for the counts
SELECT start_plugin_hour,
COUNT(start_plugin_hour) AS session_count,
ROUND(COUNT(start_plugin_hour) * 100.0 / SUM(COUNT(start_plugin_hour)) OVER (),2) AS percentage
FROM charging_sessions
WHERE user_type = 'Shared'
GROUP BY start_plugin_hour
ORDER BY session_count DESC;
DataFrameas
Ex_4
variable
-- Add Percentage of Total Hours
SELECT user_id,
ROUND(SUM(duration_hours), 2) AS total_hours,
ROUND(SUM(duration_hours) * 100.0 / SUM(SUM(duration_hours)) OVER (), 2) AS percentage
FROM charging_sessions
WHERE user_type = 'Shared' AND duration_hours IS NOT NULL
GROUP BY user_id
ORDER BY total_hours DESC;