Who's Tweeting? Trump or Trudeau?

Build a machine learning classifier that knows whether President Trump or Prime Minister Trudeau is tweeting!

Start Project

8 Tasks1,500 XP

Loved by learners at thousands of companies

Project Description

Let's apply our natural language processing knowledge to Twitter. Tweets are notoriously difficult, as they are shorter than most texts and usually have hard-to-parse content like hashtags, mentions, links and emoji.

Despite the difficulties, tweets are fun content, so in this notebook we'll take a look at classifying two prominent North American politicians. Can we determine if it is Donald Trump or Justin Trudeau based on just a tweet? Let's see!

Project Tasks

1
Tweet classification: Trump vs. Trudeau

2
Transforming our collected data
3
Vectorize the tweets
4
Training a multinomial naive Bayes model
5
Evaluating our model using a confusion matrix
6
Trying out another classifier: Linear SVC
7
Introspecting our top model
8
Bonus: can you write a Trump or Trudeau tweet?

Technologies

Python

Topics

Data Manipulation Data Visualization Probability & Statistics

Katharine Jarmul

Founder, kjamistan

Katharine Jarmul runs a data analysis company called kjamistan that specializes in helping companies analyze data and training others on data analysis best practices, particularly with Python. She has been using Python for 8 years for a variety of data work -- including telling stories at major national newspapers, building large scale aggregation software, making decisions based on customer analytics, and marketing spend and advising new ventures on the competitive landscape.

FAQs

What do other learners have to say?

Who's Tweeting? Trump or Trudeau?

Loved by learners at thousands of companies

Project Description

Project Tasks

FAQs

Is this project suitable for beginners?

What is the programming language of this project?

Can I add this project to my Data Portfolio?

Do I need to download any software to complete this project?

What type of tweets will I be classifying?

What types of transformations will I be using?

What is a confusion matrix?

What is Linear SVC?

What do other learners have to say?