Project
What are the most frequent words in Herman Melville's novel, Moby Dick, and how often do they occur?
The data science pipeline you'll build in this workspace can be used to visualize the word frequency distribution of any novel available on Project Gutenberg. The natural language processing tools used here apply to much of the data that data scientists encounter, because a vast proportion of the world's data is unstructured and includes a great deal of text.
# Import and download packages
import requests
from bs4 import BeautifulSoup
import nltk
from collections import Counter
nltk.download('stopwords')
# Start coding here...
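As a starting point, the counting step at the heart of the pipeline could look something like the sketch below. It is a minimal illustration, not the project's reference solution: the function name `most_common_words`, its parameters, the sample sentence, and the regex-based tokenizer are all assumptions made for this example, and it relies only on the standard library so it runs without any downloads.

```python
import re
from collections import Counter


def most_common_words(text, stop_words=(), n=10):
    """Return the n most frequent words in text, ignoring stop words.

    Hypothetical helper for illustration; the name and signature are assumptions.
    """
    # Lowercase first so "The" and "the" are counted as the same word,
    # then extract runs of word characters as tokens (a simple stand-in tokenizer).
    tokens = re.findall(r"\w+", text.lower())
    # Use a set for fast membership checks when filtering out stop words.
    stop = set(stop_words)
    words = [token for token in tokens if token not in stop]
    # Counter tallies occurrences; most_common returns (word, count) pairs.
    return Counter(words).most_common(n)


# Small made-up sample so the example runs without fetching the novel.
sample = "Call me Ishmael. The whale, the white whale!"
top = most_common_words(sample, stop_words={"the", "me"}, n=2)
print(top)
```

In the full pipeline, the `text` argument would presumably come from the novel's HTML, for example via `BeautifulSoup(html, "html.parser").get_text()`, and `stop_words` could be `nltk.corpus.stopwords.words("english")` once the `stopwords` corpus has been downloaded. Note that the `\w+` pattern splits contractions and possessives at the apostrophe, so a dedicated tokenizer may give cleaner counts.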