官术网_书友最值得收藏!

Traditional machine learning architecture

Structured data, such as transactional, customers, analytical, and market data, usually resides within a local relational database. Given a query language, such as SQL, we can query the data used for processing, as shown in the workflow in the preceding diagram. Usually, all the data can be stored in memory and further processed with a machine learning library such as Weka, Java-ML, or MALLET.

A common practice in the architecture design is to create data pipelines, where different steps in the workflow are split. For instance, in order to create a client data record, we might have to scrap the data from different data sources. The record can be then saved in an intermediate database for further processing.

To understand how the high-level aspects of big data architecture differ, let's first clarify when data is considered big.

主站蜘蛛池模板: 丹东市| 岚皋县| 任丘市| 临泽县| 高邮市| 云和县| 平阳县| 泊头市| 监利县| 乌海市| 景泰县| 南丹县| 舞钢市| 宣化县| 永顺县| 礼泉县| 普定县| 额尔古纳市| 乌鲁木齐县| 大洼县| 咸宁市| 贵阳市| 屏边| 彭泽县| 鄱阳县| 嘉鱼县| 桐柏县| 五华县| 乐安县| 江北区| 昭平县| 忻州市| 安平县| 广西| 博野县| 元氏县| 秦安县| 兰州市| 西盟| 田阳县| 郴州市|