- Mastering Java for Data Science
- Alexey Grigorev
- 182字
- 2021-07-02 23:44:33
Machine learning and data mining libraries
There are quite a few machine learning and data mining libraries available for Java and other JVM languages. Some of them are as follows:
- Weka (http://www.cs.waikato.ac.nz/ml/weka/) is probably the most famous data mining library in Java, contains a lot of algorithms and has many extensions.
- JavaML (http://java-ml.sourceforge.net/) is quite an old and reliable ML library, but unfortunately not updated anymore
- Smile (http://haifengl.github.io/smile/) is a promising ML library that is under active development at the moment and a lot of new methods are being added there.
- JSAT (https://github.com/EdwardRaff/JSAT) contains quite an impressive list of machine learning algorithms.
- H2O (http://www.h2o.ai/) is a framework for distributed ML written in Java, but is available for multiple languages, including Scala, R, and Python.
- Apache Mahout (http://mahout.apache.org/) is used for in-core (one machine) and distributed machine learning. The Mahout Samsara framework allows writing the code in a framework-independent way and then executes it on Spark, Flink, or H2O.
There are several libraries that specialize solely on neural networks:
- Encog (http://www.heatonresearch.com/encog/)
- DeepLearning4j (http://deeplearning4j.org/)
We will cover some of these libraries throughout the book.
推薦閱讀
- 數據要素安全流通
- 云數據中心基礎
- 算法競賽入門經典:習題與解答
- Python數據分析入門:從數據獲取到可視化
- Java Data Science Cookbook
- 商業分析思維與實踐:用數據分析解決商業問題
- 大數據:從概念到運營
- 從0到1:JavaScript 快速上手
- 深入淺出Greenplum分布式數據庫:原理、架構和代碼分析
- IPython Interactive Computing and Visualization Cookbook(Second Edition)
- MySQL技術內幕:SQL編程
- AndEngine for Android Game Development Cookbook
- 區塊鏈應用開發指南:業務場景剖析與實戰
- Oracle 11g數據庫管理與開發基礎教程
- 一類智能優化算法的改進及應用研究