- Web Scraping with Python
- Richard Lawson
- 2021-07-09 21:28:49
What you need for this book
All the code used in this book has been tested with Python 2.7 and is available for download at http://bitbucket.org/wswp/code. Ideally, the examples will be ported to Python 3 in a future version of this book. For now, however, many of the required libraries (such as Scrapy/Twisted, Mechanize, and Ghost) are only available for Python 2. To help illustrate the crawling examples, we created a sample website at http://example.webscraping.com. This website limits how fast you can download content, so if you prefer to host it yourself, the source code and installation instructions are available at http://bitbucket.org/wswp/places.
We decided to build a custom website for many of the examples in this book, instead of scraping live websites, so that we have full control over the environment. This gives us stability: live websites are updated more often than books, so by the time you try a scraping example against one, it may no longer work. A custom website also lets us craft examples that illustrate specific skills and avoid distractions. Finally, the operators of a live website might not appreciate us using it to learn about web scraping and might try to block our scrapers. Using our own custom website avoids these risks; however, the skills learnt in these examples can certainly still be applied to live websites.
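Because the sample website limits how fast content can be downloaded, a crawler run against it should pause between requests. The following is a minimal sketch of one way to do that, not the book's own code: the `Throttle` class, its `wait` method, and the delay value are hypothetical names and numbers chosen for illustration, and the site's actual limit is not stated here. The sketch uses only the Python 3 standard library, whereas the book's examples target Python 2.7.

```python
import time
from urllib.parse import urlparse


class Throttle:
    """Hypothetical helper: enforce a minimum delay between requests to one domain."""

    def __init__(self, delay, clock=time.monotonic, sleep=time.sleep):
        self.delay = delay      # minimum seconds between requests (assumed value)
        self.clock = clock      # injectable so the timing can be checked without waiting
        self.sleep = sleep
        self.last_seen = {}     # domain -> clock reading at the most recent request

    def wait(self, url):
        """Sleep if the domain was requested too recently; return the seconds slept."""
        domain = urlparse(url).netloc
        last = self.last_seen.get(domain)
        pause = 0.0
        if last is not None:
            # Time still owed before this domain may be requested again.
            pause = max(0.0, self.delay - (self.clock() - last))
            if pause > 0:
                self.sleep(pause)
        self.last_seen[domain] = self.clock()
        return pause
```

In a crawl loop, you would create one instance, for example `throttle = Throttle(delay=1.0)`, and call `throttle.wait(url)` immediately before each download. Tracking the last request per domain means that links to other hosts are not delayed unnecessarily.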