Textacy: An Introduction to Text Data Cleaning and Normalization in Python
Discover how Textacy, a Python library, simplifies text data preprocessing for machine learning. Learn about features such as character normalization and data masking, and see how it compares to other libraries such as NLTK and spaCy.
Sep 18, 2023 · 5 min read
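
The two preprocessing ideas named in the summary above, character normalization and data masking, can be sketched with the Python standard library alone. This is a minimal illustration, not Textacy's own API: the helper names normalize_chars and mask_emails, the deliberately simple email pattern, and the _EMAIL_ placeholder are all assumptions chosen for the example.

```python
import re
import unicodedata

# Hypothetical helpers for illustration only; these are NOT Textacy functions.
# The email pattern is intentionally simple and will not cover every valid address.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def normalize_chars(text: str) -> str:
    """Fold compatibility characters with NFKC, then collapse whitespace runs."""
    text = unicodedata.normalize("NFKC", text)
    return re.sub(r"\s+", " ", text).strip()


def mask_emails(text: str, repl: str = "_EMAIL_") -> str:
    """Replace anything that looks like an email address with a placeholder."""
    return EMAIL_RE.sub(repl, text)


# Input with a non-breaking space (\u00a0) and the "fi" ligature (\ufb01).
raw = "Contact:\u00a0 \ufb01rst.last@example.com  for the \ufb01le"
cleaned = mask_emails(normalize_chars(raw))
print(cleaned)
```

Normalizing before masking matters here: the ligature in the address is only matched by the ASCII-based pattern once NFKC has expanded it to plain letters.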