官术网_书友最值得收藏!

Learning Google BigQuery

BigQuery is a serverless, fully managed, and petabyte-scale data warehouse solution for structured data hosted on the Google Cloud infrastructure. BigQuery provides an easy-to-learn and easy-to-use SQL-like language to query data for analysis. In BigQuery, data is organized as Tables, Rows, and Columns. BigQuery uses columnar storage to achieve high compression ratio and is efficient in executing ad hoc queries; the execution plans are optimized on the fly by BigQuery automatically. The reason BigQuery is capable of executing ad hoc queries is that it does not support or use any index, and the storage engine component of BigQuery continuously optimizes the way data is stored and organized. There are no maintenance jobs required to improve BigQuery's performance or clean up data to get better performance.

BigQuery can be accessed via a browser, command-line utility, or API. In this chapter, we will load data into a custom table via a browser by directly uploading the file to BigQuery and also importing data from a file in Google Cloud storage.

The hierarchy in BigQuery is Project | Datasets | Tables. Under a project, datasets can be created. Datasets are containers for tables. It is a way in which tables are grouped in a project. Tables belonging to different datasets in the same project can be combined in queries.

主站蜘蛛池模板: 青神县| 黄山市| 德化县| 衡阳市| 武陟县| 察隅县| 沙洋县| 安丘市| 韩城市| 镇江市| 斗六市| 兴海县| 清丰县| 溆浦县| 峨眉山市| 玉田县| 磐安县| 航空| 亳州市| 巩义市| 栖霞市| 上虞市| 台南县| 平塘县| 井冈山市| 大姚县| 开封县| 科尔| 库伦旗| 日照市| 尼木县| 茌平县| 宁国市| 宁津县| 荥阳市| 云南省| 合作市| 曲靖市| 罗江县| 天长市| 丘北县|