Conventions

In this book, you will find a number of styles of text that distinguish between different kinds of information. Here are some examples of these styles, and an explanation of their meaning.

Code words in text are shown as follows: "Most websites define a robots.txt file to let robots know any restrictions about crawling their website."

A block of code is set as follows:

<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>http://example.webscraping.com/view/Afghanistan-1</loc></url>
  <url><loc>http://example.webscraping.com/view/Aland-Islands-2</loc></url>
  <url><loc>http://example.webscraping.com/view/Albania-3</loc></url>
  ...
</urlset>

When we wish to draw your attention to a particular part of a code block, the relevant lines or items are set in bold:

def link_crawler(..., scrape_callback=None):
    ...
    links = []
    if scrape_callback:
        links.extend(scrape_callback(url, html) or [])
    ...

Any command-line input or output is written as follows:

$ python performance.py 
Regular expressions: 5.50 seconds
BeautifulSoup: 42.84 seconds
Lxml: 7.06 seconds

New terms and important words are shown in bold. Words that you see on the screen, in menus or dialog boxes for example, appear in the text like this: "When regular users open this web page in their browser, they will enter their e-mail and password, and click on the Log In button to submit the details to the server."

Note

Warnings or important notes appear in a box like this.

Tip

Tips and tricks appear like this.