- Mastering Java for Data Science
- Alexey Grigorev
- 277字
- 2021-07-02 23:44:36
AOL Cyclops React
As we already learned, Java Streams API is a very powerful way of dealing with data in a functional way. The Cyclops React library extends this API by adding new operations on streams and allows for more control of the flow execution. To include the library, add this to the pom.xml file:
<dependency>
<groupId>com.aol.simplereact</groupId>
<artifactId>cyclops-react</artifactId>
<version>1.0.0-RC4</version>
</dependency>
Some of the methods it adds are zipWithIndex and cast and convenience collectors such as toList, toSet, and toMap. What is more, it gives more control for parallel execution, for example, it is possible to provide a custom executor, which will be used for processing data or intercepting exceptions declaratively.
Also, with this library, it is easy to create a parallel stream from the iterator--it is hard to do it with the standard library.
For example, let's take words.txt, extract all POS tags from it, and then create a map that associates each tag with a unique index. For reading data, we will use LineIterator from Commons IO, which otherwise would be hard to parallelize using only standard Java APIs. Additionally, we create a custom executor, which will be used for executing the stream operations in parallel:
LineIterator it = FileUtils.lineIterator(new File("data/words.txt"), "UTF-8");
ExecutorService executor = Executors.newCachedThreadPool();
LazyFutureStream<String> stream =
LazyReact.parallelBuilder().withExecutor(executor).from(it);
Map<String, Integer> map = stream
.map(line -> line.split("t"))
.map(arr -> arr[1].toLowerCase())
.distinct()
.zipWithIndex()
.toMap(Tuple2::v1, t -> t.v2.intValue());
System.out.println(map);
executor.shutdown();
it.close();
It is a very simple example and does not come close to describing all the functionality available in this library. For more information, refer to their documentation, which can be found at https://github.com/aol/cyclops-react. We will also use it in other examples in later chapters.
- 數據庫基礎教程(SQL Server平臺)
- LibGDX Game Development Essentials
- Unity 5.x Game AI Programming Cookbook
- 云計算環境下的信息資源集成與服務
- Test-Driven Development with Mockito
- 正則表達式必知必會
- Learning JavaScriptMVC
- 數據化網站運營深度剖析
- 算法與數據中臺:基于Google、Facebook與微博實踐
- Mockito Cookbook
- 大數據Hadoop 3.X分布式處理實戰
- 深度剖析Hadoop HDFS
- MATLAB Graphics and Data Visualization Cookbook
- 圖數據實戰:用圖思維和圖技術解決復雜問題
- SQL Server 2012數據庫管理教程