官术网_书友最值得收藏!

Using SQL subqueries

It is also possible to use subqueries in ApacheSparkSQL. In the following example, a SQL query uses an anonymous inner query in order to run aggregations on Windows. The encapsulating query is making use of the virtual/temporal result of the inner query, basically removing empty columns:

val result = spark.sql("""
SELECT * from (
SELECT
min(temperature) over w as min_temperature,
max(temperature) over w as max_temperature,
min(voltage) over w as min_voltage,
max(voltage) over w as max_voltage,
min(flowrate) over w as min_flowrate,
max(flowrate) over w as max_flowrate,
min(frequency) over w as min_frequency,
max(frequency) over w as max_frequency,
min(hardness) over w as min_hardness,
max(hardness) over w as max_hardness,
min(speed) over w as min_speed,
max(speed) over w as max_speed
FROM washing_flat
WINDOW w AS (ORDER BY ts ROWS BETWEEN CURRENT ROW AND 10 FOLLOWING)
)
WHERE min_temperature is not null
AND max_temperature is not null
AND min_voltage is not null
AND max_voltage is not null
AND min_flowrate is not null
AND max_flowrate is not null
AND min_frequency is not null
AND max_frequency is not null
AND min_hardness is not null
AND min_speed is not null
AND max_speed is not null
""")

The result of the subqueries is as follows:

主站蜘蛛池模板: 长兴县| 宽城| 朝阳县| 肥城市| 贵德县| 布尔津县| 金坛市| 成安县| 南充市| 吉首市| 临沭县| 垦利县| 东丽区| 旬邑县| 光山县| 报价| 邻水| 绥芬河市| 科尔| 吉水县| 宜兰县| 沾化县| 合肥市| 红河县| 金山区| 贡嘎县| 西藏| 阿尔山市| 浦北县| 北川| 靖安县| 上思县| 呈贡县| 延庆县| 永兴县| 西畴县| 栾城县| 玉田县| 永济市| 烟台市| 岳西县|