Introducing ICU encodings

When a PostgreSQL database is created, the administrator can choose the encoding and locale used to store and sort the data. Basically, this configuration decides which characters exist and in which order they are sorted. Here is an example: de_AT.UTF-8. In this case, we will use Unicode characters, which will be sorted according to Austrian rules (Austrians speak a variety of German). So, de_AT defines the order in which the data will be sorted.

To achieve this kind of sorting, PostgreSQL relies heavily on the operating system. The trouble is that if the sort order of characters changes in the operating system (because of a bug fix or a library upgrade, for example), existing PostgreSQL indexes can end up inconsistent with the new order. A normal b-tree index is basically a sorted list, and if the sort order changes after the index has been built, lookups may no longer find entries that are actually there.
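
To make the problem concrete, here is a minimal sketch in Python. It is not how PostgreSQL implements b-trees. It only models an index as a sorted list searched by binary search. The two ordering functions, `old_order` and `new_order`, are hypothetical stand-ins for a collation before and after an operating system change.

```python
def binary_search(sorted_items, target, key):
    """Return True if target is found, assuming sorted_items is ordered by key."""
    lo, hi = 0, len(sorted_items)
    while lo < hi:
        mid = (lo + hi) // 2
        if key(sorted_items[mid]) < key(target):
            lo = mid + 1
        else:
            hi = mid
    return lo < len(sorted_items) and sorted_items[lo] == target


def old_order(s):
    # Hypothetical "old" collation: plain code point order,
    # so uppercase letters sort before lowercase ones.
    return s


def new_order(s):
    # Hypothetical "new" collation: compare case-insensitively first.
    return (s.lower(), s)


words = ["apple", "Zebra", "banana", "Cherry"]

# The "index" is built once, under the old ordering.
index = sorted(words, key=old_order)

# Searching with the same ordering the index was built with works.
found_before = binary_search(index, "Zebra", old_order)

# After the ordering changes, the same lookup on the same data misses the entry.
found_after = binary_search(index, "Zebra", new_order)

# Rebuilding the sorted list under the new ordering makes the lookup work again.
rebuilt = sorted(index, key=new_order)
found_rebuilt = binary_search(rebuilt, "Zebra", new_order)

print(found_before, found_after, found_rebuilt)
```

The entry never left the list. The search simply walks in the wrong direction because the list is no longer sorted according to the comparison being used. Rebuilding the list under the new ordering plays the role that rebuilding an index (for example, with `REINDEX`) plays in the database.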

The introduction of the ICU library is supposed to fix this problem. ICU offers stronger promises about sort order than the operating system and is, therefore, more suitable for long-term storage of data. Starting with PostgreSQL 10, ICU support can be enabled.