Text Mining America's Toughest Game Show

Use text mining to analyze Jeopardy! data.

Start Project

10 Tasks1,500 XP

Loved by learners at thousands of companies

Project Description

Note: this project is soft launched, which means you may experience bugs. Please click "Report an Issue" in the top-right corner of the project interface to provide feedback.

Jeopardy! (hosted by Alex Trebek) has cemented itself in TV history as one of the most iconic American game shows of all time. In this project, you will examine ten years worth of Jeopardy! episodes with text mining techniques to find the most frequently asked types of questions on the show.

The dataset used in this project is a cleaned subset of this dataset from the Datasets subreddit, uploaded by user trexmatt.

Project Tasks

1
This... is... Jeopardy!

2
A glimpse ahead
3
Corpus of categories
4
Cleaning the categories
5
Favorite topics
6
Removing unwanted words
7
Creating better tools, part 1
8
Creating better tools, part 2
9
Think!
10
A few insights

Technologies

Alexis Lee

Intern at DataCamp

Alexis is a Content Intern at DataCamp's New York City office. Currently, she is an undergraduate student at Yale University, intending to study Classics and Economics.

FAQs

What do other learners have to say?

Text Mining America's Toughest Game Show

Loved by learners at thousands of companies

Project Description

Project Tasks

FAQs

Is this project suitable for beginners?

What is the programming language of this project?

Can I add this project to my Data Portfolio?

Do I need to download any software to complete this project?

What do other learners have to say?