Skip to main content
HomePython

Course

Natural Language Processing with spaCy

IntermediateSkill Level
4.7+
540 reviews
Updated 07/2025
Master the core operations of spaCy and train models for natural language processing. Extract information from unstructured data and match patterns.
Start Course for Free
PythonMachine Learning
4 hr
15 videos
53 Exercises
4,450 XP
8,000
Statement of Accomplishment

Create Your Free Account

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.

Loved by learners at thousands of companies

Group

Training a Team?

Try for Business

Course Description

Meet spaCy, an Industry-Standard for NLP

In this course, you will learn how to use spaCy, a fast-growing industry-standard library, to perform various natural language processing tasks such as tokenization, sentence segmentation, parsing, and named entity recognition. spaCy can provide powerful, easy-to-use, and production-ready features across a wide range of natural language processing tasks.

Learn the Core Operations of spaCy

You will start by learning the core operations of spaCy and how to use them to parse text and extract information from unstructured data. Then, you will work with spaCy’s classes, such as Doc, Span, and Token, and learn how to use different spaCy components for calculating word vectors and predicting semantic similarity.

Train spaCy Models and Learn About Pattern Matching

You will practice writing simple and complex matching patterns to extract given terms and phrases using EntityRuler, Matcher, and PhraseMatcher from unstructured data. You will also learn how to create custom pipeline components and create training/evaluation data. From there, you will dive into training spaCy models and how to use them for inference. Throughout the course, you will work on real-world examples and solidify your understanding of using spaCy in your own NLP projects.

Prerequisites

Supervised Learning with scikit-learnPython Toolbox
1

Introduction to NLP and spaCy

This chapter will introduce you to NLP, some of its use cases such as named-entity recognition and AI-powered chatbots. You’ll learn how to use the powerful spaCy library to perform various natural language processing tasks such as tokenization, sentence segmentation, POS tagging, and named entity recognition.
Start Chapter
2

spaCy Linguistic Annotations and Word Vectors

Learn about linguistic features, word vectors, semantic similarity, analogies, and word vector operations. In this chapter you’ll discover how to use spaCy to extract word vectors, categorize texts that are relevant to a given topic and find semantically similar terms to given words from a corpus or from a spaCy model vocabulary.
Start Chapter
3

Data Analysis with spaCy

4

Customizing spaCy Models

Explore multiple real-world use cases where spaCy models may fail and learn how to train them further to improve model performance. You’ll be introduced to spaCy training steps and understand how to train an existing spaCy model or from scratch, and evaluate the model at the inference time.
Start Chapter
Natural Language Processing with spaCy
Course
Complete

Earn Statement of Accomplishment

Add this credential to your LinkedIn profile, resume, or CV
Share it on social media and in your performance review
Enroll Now

Don’t just take our word for it

*4.7
from 540 reviews
81%
16%
2%
0%
0%
  • Sergey
    3 hours ago

  • Rahma
    14 hours ago

  • Hippolyte
    15 hours ago

  • Fahad
    yesterday

  • Dina
    yesterday

  • Youssif
    2 days ago

Rahma

Hippolyte

Fahad

FAQs

What makes spaCy different from other NLP libraries like NLTK?

spaCy is designed for production use with fast processing speeds and pre-trained models. This course focuses on spaCy's industrial-strength tools for tokenization, parsing, and named entity recognition.

Will I learn to train custom NLP models with spaCy?

Yes. Chapter 4 covers training existing spaCy models further and training models from scratch, then evaluating their performance at inference time on real-world use cases.

What rule-based extraction techniques does the course teach?

Chapter 3 covers EntityRuler, Matcher, PhraseMatcher, and RegEx for pattern-based information extraction. You also learn to add custom components to the spaCy pipeline.

How many prerequisites does this advanced course require?

Seven prerequisites are listed, including Intermediate Python, Data Manipulation with pandas, Supervised Learning with scikit-learn, and Introduction to Statistics in Python.

What practical NLP tasks will I be able to perform after this course?

You will be able to tokenize text, perform POS tagging, extract named entities, compute word vectors, measure semantic similarity, build custom pipelines, and train spaCy models.

Join over 19 million learners and start Natural Language Processing with spaCy today!

Create Your Free Account

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.

Grow your data skills with DataCamp for Mobile

Make progress on the go with our mobile courses and daily 5-minute coding challenges.