Data mining

In Chapter 1, Transitioning from Data Developer to Data Scientist, we said that with data mining, one is usually more absorbed in data relationships (or the potential relationships between points of data, sometimes referred to as variables) and cognitive analysis.

To further define the term, data mining is sometimes referred to more simply as knowledge discovery, or even just discovery. It involves processing or analyzing data from new or different viewpoints and summarizing it into valuable insights that can be used to increase revenue, cut costs, or both.

Using software dedicated to data mining is just one of several analytical approaches. Although there are tools dedicated to this purpose (such as IBM Cognos BI and Planning Analytics, Tableau, SAS, and so on), data mining is all about the analysis process of finding correlations or patterns among dozens of fields in the data, and that can be accomplished effectively using tools such as MS Excel or any number of open source technologies.

A common approach to data mining is to create custom scripts using tools such as R or Python. In this way, the data scientist can tailor the logic and processing to the exact needs of the project.
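
As a rough illustration of such a custom script, the following Python sketch scans every pair of fields in a small table and reports the pairs whose Pearson correlation is strong. The sample data, the field names (ad_spend, revenue, returns), the helper names pearson and strong_pairs, and the 0.8 threshold are all assumptions made for this example and are not taken from any particular project. It uses only the standard library, whereas a real project would more likely rely on a data analysis package.

```python
from itertools import combinations
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        # Correlation is undefined for a constant field; this sketch treats it as 0.
        return 0.0
    return cov / sqrt(var_x * var_y)


def strong_pairs(table, threshold=0.8):
    """Return (field_a, field_b, r) for each pair of fields with |r| >= threshold."""
    results = []
    for a, b in combinations(table, 2):
        r = pearson(table[a], table[b])
        if abs(r) >= threshold:
            results.append((a, b, round(r, 3)))
    # Strongest relationships first.
    return sorted(results, key=lambda item: abs(item[2]), reverse=True)


# Hypothetical sample data: each key is a field, each list is a column of values.
sales = {
    "ad_spend": [10, 20, 30, 40, 50],
    "revenue": [25, 41, 62, 79, 101],
    "returns": [5, 3, 6, 2, 4],
}

for field_a, field_b, r in strong_pairs(sales):
    print(f"{field_a} vs {field_b}: r = {r}")
```

Because the logic lives in the script, the threshold, the correlation measure, or the choice of fields can be changed to suit the project, which is the kind of customization that scripting makes possible.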