官术网_书友最值得收藏!

Querying or mining

As a data developer, you will almost always be in the habit of querying data. Indeed, a data scientist will query data as well. So, what is data mining? Well, when one queries data, one expects to ask a specific question. For example, you might ask, What was the total number of daffodils sold in April? expecting to receive back a known, relevant answer such as in April, daffodil sales totaled 269 plants.

With data mining, one is usually more absorbed in the data relationships (or the potential relationships between points of data, sometimes referred to as variables) and cognitive analysis. A simple example might be: how does the average daily temperature during the month affect the total number of daffodils sold in April?

Another important distinction between data querying and data mining is that queries are typically historic in nature in that they are used to report past results (total sales in April), while data mining techniques can be forward thinking in that through the use of appropriate statistical methods, they can infer a future result or provide the probability that a result or event will occur. For example, using our earlier example, we might predict higher daffodil sales when the average temperature rises within the selling area.

主站蜘蛛池模板: 兴业县| 鹤峰县| 沙湾县| 南靖县| 格尔木市| 临沭县| 固镇县| 临西县| 昂仁县| 堆龙德庆县| 西昌市| 故城县| 宁南县| 扶风县| 独山县| 白玉县| 武鸣县| 焉耆| 缙云县| 柳林县| 莱芜市| 车致| 桂林市| 大余县| 梅州市| 宝清县| 孝感市| 阳东县| 凌海市| 长治市| 建德市| 陆川县| 临湘市| 昔阳县| 禄劝| 洛川县| 梁山县| 兴业县| 大田县| 闽清县| 鄯善县|