官术网_书友最值得收藏!

Introducing parallel queries

Traditionally, a query had to run on a single CPU. While this was just fine in the OLTP world, it started to be a problem for analytical applications, which were bound to the speed provided by a single core. With PostgreSQL 9.6, parallel queries were introduced. Of course, implementing parallel queries was hard and so a lot of infrastructure has already been implemented over the years. All this infrastructure is now available to provide the end user with parallel sequential scans. The idea is to make many CPUs work on complicated WHERE conditions during a sequential scan. Version 9.6 also allowed for parallel aggregates and parallel joins. Of course, there is a lot of work left, but we are already looking at a major leap forward.

To control parallelism, there are two essential settings:

test=# SHOW max_worker_processes; 
max_worker_processes
----------------------
8
(1 row)

test=# SHOW max_parallel_workers_per_gather ;
max_parallel_workers_per_gather
---------------------------------
2
(1 row)

The first one limits the overall number of worker processes available. The second one controls the number of workers allowed per gather node.

A gather node is a new thing you will see in an execution plan. It is in charge of unifying results coming from parallel subprocesses.

In addition to those fundamental settings, there are a couple of new optimizer parameters to adjust the cost of parallel queries.

主站蜘蛛池模板: 莲花县| 迁西县| 天津市| 博爱县| 南皮县| 元朗区| 河池市| 佳木斯市| 沁源县| 九龙坡区| 五河县| 奇台县| 诸城市| 香港 | 高碑店市| 手游| 井陉县| 绥阳县| 德州市| 明星| 鹰潭市| 新建县| 华容县| 周至县| 太原市| 海门市| 孝感市| 瑞丽市| 灵武市| 富川| 微博| 肥城市| 临武县| 永仁县| 万源市| 灵武市| 大新县| 沾化县| 正蓝旗| 北京市| 陆川县|