Web Scraping in Python

中级技能水平

更新时间 2026年3月

Learn to retrieve and parse information from the internet using the Python library scrapy.

课程描述

The ability to build tools capable of retrieving and parsing information stored across the internet has been and continues to be valuable in many veins of data science. In this course, you will learn to navigate and parse html code, and build tools to crawl websites automatically. Although our scraping will be conducted using the versatile Python library scrapy, many of the techniques you learn in this course can be applied to other popular Python libraries as well, including BeautifulSoup and Selenium. Upon the completion of this course, you will have a strong mental model of html structure, will be able to build tools to parse html code and access desired information, and create a simple scrapy spiders to crawl the web at scale.

先决条件

Intermediate Python

Introduction to HTML

Learn the structure of HTML. We begin by explaining why web scraping can be a valuable addition to your data science toolbox and then delving into some basics of HTML. We end the chapter by giving a brief introduction on XPath notation, which is used to navigate the elements within HTML code.

Web Scraping Overview

50 XP

Web-scraping is not nonsense!

50 XP

HyperText Markup Language

课程描述

获得成就证明

加入超过.css-nklxlk{color:var(--wf-brand--main, #03EF62);}19百万学习者，今天就开始Web Scraping in Python！

创建您的免费帐户

通过 DataCamp for Mobile 提升您的数据技能

加入超过19百万学习者，今天就开始Web Scraping in Python！