Skip to content
Introduction to Data Science in Python
Introduction to Data Science in Python
Run the hidden code cell below to import the data used in this course.
# Importing pandas and numpy
import numpy as np
import pandas as pd
# Importing the course datasets
frequencies = pd.read_csv("datasets/all_frequencies.csv")
records = pd.read_csv("datasets/cell_phone_records.csv")
credit = pd.read_csv("datasets/credit_records.csv")
ransom = pd.read_csv("datasets/ransom.csv")
gravel = pd.read_csv("datasets/shoe_gravel_sample.csv")
Take Notes
Add notes about the concepts you've learned and code cells with code you want to keep.
Add your notes here
# Add your code snippets here
DataFrameavailable as
df
variable
-- Select the names and most recent salaries of all current employees
-- Note that the query may take a long time to run if you remove the LIMIT statement
SELECT first_name,
last_name,
hire_date,
salary
FROM employees
INNER JOIN salaries USING(emp_no)
WHERE to_date = (SELECT MAX(to_date) FROM salaries)
ORDER BY emp_no
LIMIT 5