書名： Hands-On Big Data Modeling
作者名： James Lee Tao Wei Suresh Kumar Mukhiya
本章字數： 132字
更新時間： 2021-06-10 18:58:54

Spark SQL

Spark SQL allows for querying structured and semi-structured data inside the Spark program, by using SQL or DataFrame APIs. DataFrames are similar to tables in a relational database. Spark SQL can be embedded into the general programs of native Spark and MLlib, in order to enable interactability between different Spark modules.

Spark SQL provides DataFrame abstractions in different programming languages, such as Python, Java, and Scala, in order to work with structured datasets. It can also read and write data in various structured formats, including JSON, Hive Tables, and Parquet. In addition to that, Spark SQL allows for querying the data by using SQL inside of the Spark program, or by using external tools, for example, connecting to Spark SQL using standard database connectors (JDBC/ODBC).

官术网_书友最值得收藏!

Hands-On Big Data Modeling

Spark SQL