- Mastering Java for Data Science
- Alexey Grigorev
- 308字
- 2021-07-02 23:44:28
Preface
Data science has become a quite important tool for organizations nowadays: they have collected large amounts of data, and to be able to put it into good use, they need data science--the discipline about methods for extracting knowledge from data. Every day more and more companies realize that they can benefit from data science and utilize the data that they produce more effectively and more profitably.
It is especially true for IT companies, they already have the systems and the infrastructure for generating and processing the data. These systems are often written in Java--the language of choice for many large and small companies across the world. It is not a surprise, Java offers a very solid and mature ecosystem of libraries that are time proven and reliable, so many people trust Java and use it for creating their applications.
Thus, it is also a natural choice for many data processing applications. Since the existing systems are already in Java, it makes sense to use the same technology stack for data science, and integrate the machine learning model directly in the application's production code base.
This book will cover exactly that. We will first see how we can utilize Java’s toolbox for processing small and large datasets, then look into doing initial exploration data analysis. Next, we will review the Java libraries that implement common Machine Learning models for classification, regression, clustering, and dimensionality reduction problems. Then we will get into more advanced techniques and discuss Information Retrieval and Natural Language Processing, XGBoost, deep learning, and large scale tools for processing big datasets such as Apache Hadoop and Apache Spark. Finally, we will also have a look at how to evaluate and deploy the produced models such that the other services can use them.
We hope you will enjoy the book. Happy reading!
- Google Visualization API Essentials
- 使用GitOps實現Kubernetes的持續部署:模式、流程及工具
- Creating Mobile Apps with Sencha Touch 2
- 區塊鏈:看得見的信任
- 大數據時代下的智能轉型進程精選(套裝共10冊)
- 智能數據時代:企業大數據戰略與實戰
- Python金融實戰
- 企業級數據與AI項目成功之道
- 企業級容器云架構開發指南
- 大數據與機器學習:實踐方法與行業案例
- Internet of Things with Python
- 改進的群智能算法及其應用
- 智能與數據重構世界
- 數據庫原理及應用實驗:基于GaussDB的實現方法
- Scratch Cookbook