- Mastering Machine Learning with Spark 2.x
- Alex Tellez Max Pumperla Michal Malohlava
- 198字
- 2021-07-02 18:46:06
Summary
In this chapter, we wanted to give you a brief glimpse into the life of a data scientist, what this entails, and some of the challenges that data scientists consistently face. In light of these challenges, we feel that the Apache Spark project is ideally positioned to help tackle these topics, which range from data ingestion and feature extraction/creation to model building and deployment. We intentionally kept this chapter short and light on verbiage because we feel working through examples and different use cases is a better use of time as opposed to speaking abstractly and at length about a given data science topic. Throughout the rest of this book, we will focus solely on this process while giving best-practice tips and recommended reading along the way for users who wish to learn more. Remember that before embarking on your next data science project, be sure to clearly define the problem beforehand, so you can ask an intelligent question of your data and (hopefully) get an intelligent answer!
One awesome website for all things data science is KDnuggets (http://www.kdnuggets.com). Here's a great article on the language all data scientists must learn in order to be successful (http://www.kdnuggets.com/2015/09/one-language-data-scientist-must-master.html).
- Learn ECMAScript(Second Edition)
- Java語言程序設(shè)計
- Django開發(fā)從入門到實踐
- Vue.js快速入門與深入實戰(zhàn)
- 樂高機器人設(shè)計技巧:EV3結(jié)構(gòu)設(shè)計與編程指導
- PostgreSQL技術(shù)內(nèi)幕:事務處理深度探索
- Mastering PHP Design Patterns
- HTML5+CSS3網(wǎng)站設(shè)計教程
- HTML5入門經(jīng)典
- The Complete Coding Interview Guide in Java
- 低代碼平臺開發(fā)實踐:基于React
- 深入剖析Java虛擬機:源碼剖析與實例詳解(基礎(chǔ)卷)
- Learning Node.js for .NET Developers
- Advanced UFT 12 for Test Engineers Cookbook
- Android應用開發(fā)實戰(zhàn)(第2版)