- Mastering Apache Spark 2.x(Second Edition)
- Romeo Kienzler
- 122字
- 2021-07-02 18:55:24
Extended ecosystem
When examining big data processing systems, we think it is important to look at not just the system itself, but also how it can be extended and how it integrates with external systems so that greater levels of functionality can be offered. In a book of this size, we cannot cover every option, but by introducing a topic, we can hopefully stimulate the reader's interest so that they can investigate further.
We have used the H2O machine learning library, SystemML and Deeplearning4j, to extend Apache Spark's MLlib machine learning module. We have shown that Deeplearning and highly performant cost-based optimized machine learning can be introduced to Apache Spark. However, we have just scratched the surface of all the frameworks' functionality.
推薦閱讀
- 數據庫程序員面試筆試真題與解析
- Mastering Objectoriented Python
- Python入門很簡單
- 摩登創客:與智能手機和平板電腦共舞
- 架構不再難(全5冊)
- 零基礎學Scratch少兒編程:小學課本中的Scratch創意編程
- Production Ready OpenStack:Recipes for Successful Environments
- PHP+MySQL網站開發項目式教程
- 詳解MATLAB圖形繪制技術
- Extreme C
- Hands-On Nuxt.js Web Development
- 寫給程序員的Python教程
- Python語言科研繪圖與學術圖表繪制從入門到精通
- Backbone.js Testing
- Learning Jakarta Struts 1.2: a concise and practical tutorial