Introduction to Data Science in Python

Run the hidden code cell below to import the data used in this course.

# Importing pandas and numpy
import numpy as np
import pandas as pd

# Importing the course datasets
frequencies = pd.read_csv("datasets/all_frequencies.csv")
records = pd.read_csv("datasets/cell_phone_records.csv")
credit = pd.read_csv("datasets/credit_records.csv")
ransom = pd.read_csv("datasets/ransom.csv")
gravel = pd.read_csv("datasets/shoe_gravel_sample.csv")

Take Notes

Add notes about the concepts you've learned and code cells with code you want to keep.

Add your notes here

# Add your code snippets here
DataFrame available as the variable df
-- Select the names and most recent salaries of all current employees
-- Note that the query may take a long time to run if you remove the LIMIT statement
SELECT first_name,
        last_name,
        hire_date,
        salary
FROM employees
  INNER JOIN salaries USING(emp_no)
WHERE to_date = (SELECT MAX(to_date) FROM salaries)
ORDER BY emp_no
LIMIT 5
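
For comparison, the same selection can be sketched in pandas. This is only an illustration under stated assumptions: the two small tables below are invented stand-ins for the employees and salaries tables (the column names follow the query, the rows are hypothetical), and the filter is applied to salaries before the merge, which gives the same rows as the query's WHERE clause for an inner join.

```python
import pandas as pd

# Hypothetical toy tables; only the column names come from the query above.
employees = pd.DataFrame({
    "emp_no": [1, 2, 3],
    "first_name": ["Ada", "Grace", "Alan"],
    "last_name": ["Lovelace", "Hopper", "Turing"],
    "hire_date": pd.to_datetime(["1990-01-15", "1992-06-01", "1995-03-20"]),
})
salaries = pd.DataFrame({
    "emp_no": [1, 1, 2, 3],
    "salary": [60000, 65000, 70000, 55000],
    "to_date": pd.to_datetime(
        ["1995-01-15", "2002-08-01", "2002-08-01", "1999-03-20"]
    ),
})

# Equivalent of: WHERE to_date = (SELECT MAX(to_date) FROM salaries)
latest = salaries["to_date"].max()
current = salaries[salaries["to_date"] == latest]

# Equivalent of: INNER JOIN ... USING(emp_no), ORDER BY emp_no, LIMIT 5
result = (
    employees.merge(current, on="emp_no", how="inner")
    .sort_values("emp_no")
    .loc[:, ["first_name", "last_name", "hire_date", "salary"]]
    .head(5)
)
print(result)
```

With these toy rows, only the employees whose salary record carries the latest to_date remain in the result.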