- Mastering Java for Data Science
- Alexey Grigorev
- 182字
- 2021-07-02 23:44:33
Machine learning and data mining libraries
There are quite a few machine learning and data mining libraries available for Java and other JVM languages. Some of them are as follows:
- Weka (http://www.cs.waikato.ac.nz/ml/weka/) is probably the most famous data mining library in Java, contains a lot of algorithms and has many extensions.
- JavaML (http://java-ml.sourceforge.net/) is quite an old and reliable ML library, but unfortunately not updated anymore
- Smile (http://haifengl.github.io/smile/) is a promising ML library that is under active development at the moment and a lot of new methods are being added there.
- JSAT (https://github.com/EdwardRaff/JSAT) contains quite an impressive list of machine learning algorithms.
- H2O (http://www.h2o.ai/) is a framework for distributed ML written in Java, but is available for multiple languages, including Scala, R, and Python.
- Apache Mahout (http://mahout.apache.org/) is used for in-core (one machine) and distributed machine learning. The Mahout Samsara framework allows writing the code in a framework-independent way and then executes it on Spark, Flink, or H2O.
There are several libraries that specialize solely on neural networks:
- Encog (http://www.heatonresearch.com/encog/)
- DeepLearning4j (http://deeplearning4j.org/)
We will cover some of these libraries throughout the book.
推薦閱讀
- DB29forLinux,UNIX,Windows數據庫管理認證指南
- R數據科學實戰:工具詳解與案例分析(鮮讀版)
- Oracle 12c云數據庫備份與恢復技術
- Power BI智能數據分析與可視化從入門到精通
- Google Cloud Platform for Developers
- Hadoop 3實戰指南
- 數據庫應用系統技術
- 利用Python進行數據分析(原書第2版)
- 中國云存儲發展報告
- 企業大數據處理:Spark、Druid、Flume與Kafka應用實踐
- Access 2010數據庫應用技術教程(第二版)
- Hands-On Java Deep Learning for Computer Vision
- Hadoop與大數據挖掘
- 數據庫應用技術
- 圖解LeetCode初級算法(Python版)