官术网_书友最值得收藏!

Spark SQL

Spark SQL allows for querying structured and semi-structured data inside the Spark program, by using SQL or DataFrame APIs. DataFrames are similar to tables in a relational database. Spark SQL can be embedded into the general programs of native Spark and MLlib, in order to enable interactability between different Spark modules.

Spark SQL provides DataFrame abstractions in different programming languages, such as Python, Java, and Scala, in order to work with structured datasets. It can also read and write data in various structured formats, including JSON, Hive Tables, and Parquet. In addition to that, Spark SQL allows for querying the data by using SQL inside of the Spark program, or by using external tools, for example, connecting to Spark SQL using standard database connectors (JDBC/ODBC). 

主站蜘蛛池模板: 古蔺县| 宣武区| 灵山县| 舞钢市| 满城县| 永寿县| 土默特右旗| 嘉荫县| 平塘县| 塔城市| 托克托县| 花垣县| 元朗区| 饶河县| 根河市| 阜南县| 芦山县| 遵义县| 三穗县| 深圳市| 邓州市| 博爱县| 湖口县| 石狮市| 神池县| 敦煌市| 正安县| 湖州市| 隆昌县| 西林县| 成武县| 松原市| 疏附县| 韶关市| 新巴尔虎左旗| 嵊泗县| 蓬溪县| 手游| 沙洋县| 太白县| 临安市|