コース
Pythonで学ぶNLPの特徴量エンジニアリング
上級スキルレベル
更新日 2024/11PythonMachine Learning4時間15 ビデオ52 演習4,200 XP28,897達成証明書
数千の企業の学習者に愛されています
2名以上のトレーニングをお考えですか?
DataCamp for Businessを試すコース説明
前提条件
Introduction to Natural Language Processing in PythonSupervised Learning with scikit-learn1
Basic features and readability scores
Learn to compute basic features such as number of words, number of characters, average word length and number of special characters (such as Twitter hashtags and mentions). You will also learn to compute readability scores and determine the amount of education required to comprehend a piece of text.
2
Text preprocessing, POS tagging and NER
In this chapter, you will learn about tokenization and lemmatization. You will then learn how to perform text cleaning, part-of-speech tagging, and named entity recognition using the spaCy library. Upon mastering these concepts, you will proceed to make the Gettysburg address machine-friendly, analyze noun usage in fake news, and identify people mentioned in a TechCrunch article.
3
N-Gram models
Learn about n-gram modeling and use it to perform sentiment analysis on movie reviews.
4
TF-IDF and similarity scores
Learn how to compute tf-idf weights and the cosine similarity score between two vectors. You will use these concepts to build a movie and a TED Talk recommender. Finally, you will also learn about word embeddings and using word vector representations, you will compute similarities between various Pink Floyd songs.
Pythonで学ぶNLPの特徴量エンジニアリング
コース完了 19百万人を超える学習者と一緒にPythonで学ぶNLPの特徴量エンジニアリングを今日から始めましょう!
DataCamp for Mobileでデータスキルを磨きましょう
モバイル コースと毎日の 5 分間のコーディング チャレンジで、外出先でも進歩できます。