官术网_书友最值得收藏!

Is web scraping legal?

Web scraping is in the early Wild West stage, where what is permissible is still being established. If the scraped data is being used for personal use, in practice, there is no problem. However, if the data is going to be republished, then the type of data scraped is important.

Several court cases around the world have helped establish what is permissible when scraping a website. In Feist Publications, Inc. v. Rural Telephone Service Co., the United States Supreme Court decided that scraping and republishing facts, such as telephone listings, is allowed. Then, a similar case in Australia, Telstra Corporation Limited v. Phone Directories Company Pty Ltd, demonstrated that only data with an identifiable author can be copyrighted. Also, the European Union case, ofir.dk vs home.dk, concluded that regular crawling and deep linking is permissible.

These cases suggest that when the scraped data constitutes facts (such as business locations and telephone listings), it can be republished. However, if the data is original (such as opinions and reviews), it most likely cannot be republished for copyright reasons.

In any case, when you are scraping data from a website, remember that you are their guest and need to behave politely or they may ban your IP address or proceed with legal action. This means that you should make download requests at a reasonable rate and define a user agent to identify you. The next section on crawling will cover these practices in detail.

主站蜘蛛池模板: 吐鲁番市| 清涧县| 安丘市| 汉川市| 娄烦县| 青田县| 左贡县| 高阳县| 咸丰县| 大渡口区| 杭锦旗| 宜兴市| 葫芦岛市| 潮安县| 文山县| 仲巴县| 南投县| 太原市| 锦州市| 沈阳市| 张家港市| 繁昌县| 灯塔市| 江孜县| 枣庄市| 洛隆县| 南康市| 宁都县| 文化| 聂拉木县| 荣昌县| 宁安市| 甘泉县| 江陵县| 民和| 理塘县| 彭泽县| 吉木萨尔县| 吉木萨尔县| 樟树市| 当雄县|