- Using OpenRefine
- Ruben Verborgh, Max De Wilde
- 2021-08-06 16:57:12
Recipe 1 – installing OpenRefine
In this recipe, you will learn where to look in order to download the latest release of OpenRefine and how to get it running on your favorite operating system.
First things first: start by downloading OpenRefine from http://openrefine.org/. OpenRefine was first known as Freebase Gridworks, then as Google Refine for a few years. In October 2012, the community took over the project, which makes OpenRefine truly open. OpenRefine 2.6 is the first version carrying the new branding. If you are interested in the development version, you can also check https://github.com/OpenRefine.
OpenRefine runs on Java, which makes it platform-independent. Just make sure that you have an up-to-date version of Java installed on your machine (available from http://java.com/download), then follow the instructions below for your operating system:
Windows
- Download the ZIP archive.
- Unzip and extract the contents of the archive to a folder of your choice.
- To launch OpenRefine, double-click on openrefine.exe.
Mac
- Download the DMG file.
- Open the disk image and drag the OpenRefine icon into the Applications folder.
- Double-click on the icon to start OpenRefine.
Linux
- Download the gzipped tarball.
- Extract the folder to your home directory.
- In a terminal, change to the extracted folder and enter ./refine to start OpenRefine.
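The Java prerequisite and the per-platform launch steps above can be summarized in a small script. This is only a minimal sketch, not part of OpenRefine: the names LAUNCH_STEPS, java_on_path, and launch_step are hypothetical helpers introduced for illustration, and finding a java executable on the PATH is only a rough indication that a usable Java is installed.

```python
import platform
import shutil

# Launch steps per operating system, as listed in this recipe.
# Keys match the values returned by platform.system().
# (Hypothetical helper table, not part of OpenRefine.)
LAUNCH_STEPS = {
    "Windows": "double-click openrefine.exe in the extracted folder",
    "Darwin": "double-click the OpenRefine icon in the Applications folder",
    "Linux": "run ./refine in a terminal from the extracted folder",
}


def java_on_path():
    """Return True if a `java` executable can be found on the PATH.

    This is a heuristic: it does not check the Java version.
    """
    return shutil.which("java") is not None


def launch_step(system_name):
    """Return the launch step for a given platform.system() value."""
    try:
        return LAUNCH_STEPS[system_name]
    except KeyError:
        raise ValueError(f"no instructions for {system_name!r}") from None


if __name__ == "__main__":
    if not java_on_path():
        print("Java not found on PATH; see http://java.com/download")
    try:
        print("To start OpenRefine:", launch_step(platform.system()))
    except ValueError as err:
        print("Unsupported platform:", err)
```

Running the script on the machine where OpenRefine was unpacked prints the launch step that applies to that system; it does not start OpenRefine itself.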
Note that, by default, OpenRefine allocates only 1 GB of RAM to Java. While this is sufficient for small datasets, it soon becomes restrictive when dealing with larger collections of data. In Recipe 7 – going for more memory, we will detail how to allow OpenRefine to allocate more memory, an operation that also differs from one operating system to another.