官术网_书友最值得收藏!

Extract, Transform, Load

Training and testing DL models requires data. Data is usually hosted on different distributed and remote storage systems. You need them to connect to the data sources and perform data retrieval so that you can start the training phase and you would probably need to do some preparation before feeding your model. This chapter goes through the phases of the Extract, Transform, Load (ETL) process applied to DL. It covers several use cases for which the DeepLearning4j framework and Spark would be used. The use cases presented here are related to batch data ingestion. Data streaming will be covered in the next chapter.

The following topics will be covered in this chapter:

  • Training data ingestion through Spark
  • Data ingestion from a relational database
  • Data ingestion from a NoSQL database
  • Data ingestion from S3
主站蜘蛛池模板: 石棉县| 夏津县| 潍坊市| 奉化市| 云阳县| 汉沽区| 河北区| 永福县| 山东省| 定兴县| 紫云| 盐源县| 许昌市| 遂川县| 无锡市| 察哈| 宜州市| 湘潭市| 宿迁市| 沿河| 霞浦县| 虹口区| 扶绥县| 遵化市| 塘沽区| 平潭县| 饶平县| 巧家县| 门头沟区| 丽水市| 平果县| 若尔盖县| 九寨沟县| 改则县| 克东县| 平江县| 白玉县| 萨迦县| 昌平区| 和龙市| 遂川县|