Courses
Pythonで学ぶNLPの特徴量エンジニアリング
高度なスキルレベル
更新 2024/11無料でコースを始める
含まれるものプレミアム or チーム
PythonMachine Learning4時間15 videos52 Exercises4,200 XP28,581達成証明書
数千社の学習者に愛用されています
2人以上をトレーニングしますか?
DataCamp for Businessを試すコースの説明
前提条件
Introduction to Natural Language Processing in PythonSupervised Learning with scikit-learn1
Basic features and readability scores
Learn to compute basic features such as number of words, number of characters, average word length and number of special characters (such as Twitter hashtags and mentions). You will also learn to compute readability scores and determine the amount of education required to comprehend a piece of text.
2
Text preprocessing, POS tagging and NER
In this chapter, you will learn about tokenization and lemmatization. You will then learn how to perform text cleaning, part-of-speech tagging, and named entity recognition using the spaCy library. Upon mastering these concepts, you will proceed to make the Gettysburg address machine-friendly, analyze noun usage in fake news, and identify people mentioned in a TechCrunch article.
3
N-Gram models
Learn about n-gram modeling and use it to perform sentiment analysis on movie reviews.
4
TF-IDF and similarity scores
Learn how to compute tf-idf weights and the cosine similarity score between two vectors. You will use these concepts to build a movie and a TED Talk recommender. Finally, you will also learn about word embeddings and using word vector representations, you will compute similarities between various Pink Floyd songs.
Pythonで学ぶNLPの特徴量エンジニアリング
コース完了