官术网_书友最值得收藏!

Avoid making a large number of requests

Each time one of the programs that we have been discussing runs, it makes HTTP requests to a server that manages the site that you'd like to extract data from. This process happens significantly more frequently and over a shorter amount of time in a concurrent program, where multiple requests are being submitted to that server.

As mentioned before, servers nowadays have the ability to handle multiple requests simultaneously with ease. However, to avoid having to overwork and overconsume resources, servers are also designed to stop answering requests that come in too frequently. Websites of big tech companies, such as Amazon or Twitter, look for large amounts of automated requests that are made from the same IP address and implement different response protocols; some requests might be delayed, some might be refused a response, or the IP address might even be banned from making further requests for a specific amount of time.

Interestingly, making repeated, heavy-duty requests to servers is actually a form of hacking a website. In Denial of Service (DoS) and Distributed Denial of Service (DDoS) attacks, a very large number of requests are made at the same time to the server, flooding the bandwidth of the targeted server with traffic, and as a result, normal, nonmalicious requests from other clients are denied because the servers are busy processing the concurrent requests, as illustrated in the following diagram:

A of a DDoS attack

It is therefore important to space out the concurrent requests that your application makes to a server so that the application would not be considered an attacker and be potentially banned or treated as a malicious client. This could be as simple as limiting the maximum number of threads/requests that can be implemented at a time in your program or pausing the threading for a specific amount of time (for example, using the time.sleep() function) before making a request to the server.

主站蜘蛛池模板: 青浦区| 遂宁市| 塔城市| 县级市| 金阳县| 澎湖县| 香格里拉县| 宁蒗| 康乐县| 石嘴山市| 沅江市| 大理市| 诸城市| 赤城县| 巴楚县| 大竹县| 章丘市| 当涂县| 景东| 东阿县| 舞阳县| 赤城县| 广水市| 手游| 廉江市| 玉山县| 大英县| 隆昌县| 油尖旺区| 南平市| 新乡县| 清苑县| 洛宁县| 保康县| 泽库县| 稻城县| 临沧市| 广汉市| 双桥区| 福建省| 松桃|