官术网_书友最值得收藏!

  • Hands-On Big Data Modeling
  • James Lee Tao Wei Suresh Kumar Mukhiya
  • 132字
  • 2021-06-10 18:58:54

Spark SQL

Spark SQL allows for querying structured and semi-structured data inside the Spark program, by using SQL or DataFrame APIs. DataFrames are similar to tables in a relational database. Spark SQL can be embedded into the general programs of native Spark and MLlib, in order to enable interactability between different Spark modules.

Spark SQL provides DataFrame abstractions in different programming languages, such as Python, Java, and Scala, in order to work with structured datasets. It can also read and write data in various structured formats, including JSON, Hive Tables, and Parquet. In addition to that, Spark SQL allows for querying the data by using SQL inside of the Spark program, or by using external tools, for example, connecting to Spark SQL using standard database connectors (JDBC/ODBC). 

主站蜘蛛池模板: 会宁县| 陇西县| 建始县| 象山县| 宝应县| 公主岭市| 温州市| 台东市| 宾川县| 邵东县| 七台河市| 积石山| 广水市| 望都县| 平安县| 监利县| 泰和县| 古浪县| 穆棱市| 于田县| 大余县| 会宁县| 历史| 枞阳县| 高台县| 苏尼特右旗| 宁津县| 武邑县| 饶阳县| 城市| 石家庄市| 福清市| 安陆市| 绥阳县| 宾川县| 曲靖市| 贵港市| 红河县| 河曲县| 禄劝| 黑龙江省|