- Using OpenRefine
- Ruben Verborgh Max De Wilde
- 272字
- 2021-08-06 16:57:12
Recipe 1 – installing OpenRefine
In this recipe, you will learn where to look in order to download the latest release of OpenRefine and how to get it running on your favorite operating system.
First things first: start by downloading OpenRefine from http://openrefine.org/. OpenRefine was previously known as Freebase Gridworks, then as Google Refine for a few years. Since October 2012, the project has been taken over by the community, which makes OpenRefine really open. OpenRefine 2.6 is the first version carrying the new branding. If you are interested in the development version, you can also check https://github.com/OpenRefine.
OpenRefine is based on the Java environment, which makes it platform-independent. Just make sure that you have an up-to-date version of Java running on your machine (available from http://java.com/download) and follow the following instructions, depending on your operating system:
Windows
- Download the ZIP archive.
- Unzip and extract the contents of the archive to a folder of your choice.
- To launch OpenRefine, double-click on
openrefine.exe
.
Mac
- Download the DMG file.
- Open the disk image and drag the OpenRefine icon into the
Applications
folder. - Double-click on the icon to start OpenRefine.
Linux
- Download the gzipped tarball.
- Extract the folder to your home directory.
- In a terminal, enter
./refine
to start.
It should be noted that, by default, OpenRefine will allocate only 1 GB of RAM to Java. While this is sufficient to handle small datasets, it soon becomes restrictive when dealing with larger collections of data. In Recipe 7 – going for more memory, we will detail how to allow OpenRefine to allocate more memory, an operation that also differs from one OS to the other.
- 大數據技術與應用基礎
- ABB工業機器人編程全集
- Hadoop 2.x Administration Cookbook
- 計算機應用復習與練習
- Dreamweaver 8中文版商業案例精粹
- WOW!Illustrator CS6完全自學寶典
- Blockchain Quick Start Guide
- Getting Started with Containerization
- Apache Hive Essentials
- Data Wrangling with Python
- OpenStack Cloud Computing Cookbook
- Docker on Amazon Web Services
- AVR單片機工程師是怎樣煉成的
- EJB JPA數據庫持久層開發實踐詳解
- JSP網絡開發入門與實踐