- Python Web Scraping(Second Edition)
- Katharine Jarmul Richard Lawson
- 144字
- 2021-07-09 19:42:47
Scraping the Data
In the previous chapter, we built a crawler which follows links to download the web pages we want. This is interesting but not useful-the crawler downloads a web page, and then discards the result. Now, we need to make this crawler achieve something by extracting data from each web page, which is known as scraping.
We will first cover browser tools to examine a web page, which you may already be familiar with if you have a web development background. Then, we will walk through three approaches to extract data from a web page using regular expressions, Beautiful Soup and lxml. Finally, the chapter will conclude with a comparison of these three scraping alternatives.
In this chapter, we will cover the following topics:
- Analyzing a web page
- Approaches to scrape a web page
- Using the console
- xpath selectors
- Scraping results
推薦閱讀
- Puppet 4 Essentials(Second Edition)
- Learning Selenium Testing Tools with Python
- ASP.NET Core 5.0開發入門與實戰
- Rust實戰
- MySQL數據庫管理與開發(慕課版)
- Gradle for Android
- Python High Performance Programming
- MongoDB,Express,Angular,and Node.js Fundamentals
- Mastering Data Mining with Python:Find patterns hidden in your data
- LabVIEW虛擬儀器程序設計從入門到精通(第二版)
- App Inventor少兒趣味編程動手做
- C++程序設計教程
- CodeIgniter Web Application Blueprints
- 數據分析與挖掘算法:Python實戰
- Instant Apache Camel Messaging System