본문으로 바로가기

강의

spaCy로 배우는 Advanced NLP

중급기술 수준

업데이트됨 2024. 11.

규칙 기반과 머신러닝 접근을 모두 활용해 spaCy로 고급 자연어 이해 시스템을 구축하는 방법을 배웁니다.

무료로 강의 시작

PythonMachine Learning

5시간

15 동영상

55 연습 문제

4,450 XP

21,661

성취 증명서

수천 개 기업의 학습자들이 사랑하는

팀을 교육하시나요?

비즈니스용으로 체험해 보세요

강의 설명

텍스트를 많이 다루다 보면 그 텍스트에 대해 더 깊이 알고 싶어지죠. 예를 들어, 주제가 무엇인지, 단어가 문맥에서 어떤 의미를 갖는지, 누가 누구에게 무엇을 하는지, 어떤 회사와 제품이 언급되는지, 서로 비슷한 텍스트는 무엇인지 등이에요. 이 강의에서는 Python의 빠르게 성장하는 업계 표준 라이브러리인 spaCy를 사용해, 규칙 기반과 Machine Learning 접근을 모두 활용하여 고급 자연어 이해 시스템을 구축하는 방법을 배웁니다.

선수 조건

Introduction to Natural Language Processing in Python

1

Finding words, phrases, names and concepts

This chapter will introduce you to the basics of text processing with spaCy. You'll learn about the data structures, how to work with statistical models, and how to use them to predict linguistic features in your text.

Introduction to spaCy

Getting Started

Documents, spans and tokens

Lexical attributes

Statistical models

Model packages

Loading models

Predicting linguistic annotations

Predicting named entities in context

Rule-based matching

Using the Matcher

Writing match patterns

2

Large-scale data analysis with spaCy

In this chapter, you'll use your new skills to extract specific information from large volumes of text. You'll learn how to make the most of spaCy's data structures, and how to effectively combine statistical and rule-based approaches for text analysis.

Data Structures (1)

Strings to hashes

Vocab, hashes and lexemes

Data Structures (2)

Creating a Doc

Docs, spans and entities from scratch

Data structures best practices

Word vectors and similarity

Inspecting word vectors

Comparing similarities

Combining models and rules

Debugging patterns (1)

Debugging patterns (2)

Efficient phrase matching

Extracting countries and relationships

3

Processing Pipelines

This chapter will show you to everything you need to know about spaCy's processing pipeline. You'll learn what goes on under the hood when you process a text, how to write your own components and add them to the pipeline, and how to use custom attributes to add your own meta data to the documents, spans and tokens.

Processing pipelines

What happens when you call nlp?

Inspecting the pipeline

Custom pipeline components

Use cases for custom components

Simple components

Complex components

Extension attributes

Setting extension attributes (1)

Setting extension attributes (2)

Entities and extensions

Components with extensions

Scaling and performance

Processing streams

Processing data with context

Selective processing

4

Training a neural network model

In this chapter, you'll learn how to update spaCy's statistical models to customize them for your use case – for example, to predict a new entity type in online comments. You'll write your own training loop from scratch, and understand the basics of how training works, along with tips and tricks that can make your custom NLP projects more successful.

Training and updating models

Purpose of training

Creating training data (1)

Creating training data (2)

The training loop

Setting up the pipeline

Building a training loop

Exploring the model

Training best practices

Good data vs. bad data

Training multiple labels

Wrapping up

spaCy로 배우는 Advanced NLP

강의
완료

수료증 획득

LinkedIn 프로필, 이력서 또는 CV에 이 인증서를 추가하세요
소셜 미디어와 성과 평가에서 공유하세요지금 등록

19백만 명 이상의 학습자와 함께 spaCy로 배우는 Advanced NLP을(를) 시작하세요!

DataCamp for Mobile을 통해 데이터 분석 능력을 향상시키세요.

모바일 강좌와 매일 5분 코딩 챌린지를 통해 이동 중에도 학습 효과를 높이세요.