官术网_书友最值得收藏!

Traditional machine learning architecture

Structured data, such as transactional, customers, analytical, and market data, usually resides within a local relational database. Given a query language, such as SQL, we can query the data used for processing, as shown in the workflow in the preceding diagram. Usually, all the data can be stored in memory and further processed with a machine learning library such as Weka, Java-ML, or MALLET.

A common practice in the architecture design is to create data pipelines, where different steps in the workflow are split. For instance, in order to create a client data record, we might have to scrap the data from different data sources. The record can be then saved in an intermediate database for further processing.

To understand how the high-level aspects of big data architecture differ, let's first clarify when data is considered big.

主站蜘蛛池模板: 安康市| 武威市| 湘潭县| 宜丰县| 绥阳县| 洪雅县| 于都县| 辽中县| 同心县| 犍为县| 宕昌县| 安陆市| 龙游县| 安新县| 昔阳县| 嵊泗县| 徐州市| 察雅县| 穆棱市| 广南县| 惠水县| 邻水| 瑞金市| 泰兴市| 密山市| 滦南县| 杭州市| 阿克| 南康市| 介休市| 内丘县| 奇台县| 石城县| 宿松县| 定陶县| 班戈县| 蒙自县| 永川市| 乌海市| 甘孜| 巴彦淖尔市|