Skip to content

Introduction to Data Science in Python

Run the hidden code cell below to import the data used in this course.

# Importing pandas and numpy
import numpy as np
import pandas as pd

# Importing the course datasets
frequencies = pd.read_csv("datasets/all_frequencies.csv")
records = pd.read_csv("datasets/cell_phone_records.csv")
credit = pd.read_csv("datasets/credit_records.csv")
ransom = pd.read_csv("datasets/ransom.csv")
gravel = pd.read_csv("datasets/shoe_gravel_sample.csv")

Take Notes

Add notes about the concepts you've learned and code cells with code you want to keep.

Add your notes here

# Add your code snippets here
Spinner
DataFrameas
df
variable
-- Select the names and most recent salaries of all current employees
-- Note that the query may take a long time to run if you remove the LIMIT statement
SELECT first_name,
        last_name,
        hire_date,
        salary
FROM employees
  INNER JOIN salaries USING(emp_no)
WHERE to_date = (SELECT MAX(to_date) FROM salaries)
ORDER BY emp_no
LIMIT 5