- Practical Data Wrangling
- Allan Visochek
- 299字
- 2021-07-02 15:16:03
Getting and reading data
The first step is to retrieve a dataset and open it with a program capable of manipulating the data. The simplest way of retrieving a dataset is to find a data file. Python and R can be used to open, read, modify, and save data stored in static files. In Chapter 3, Reading, Exploring, and Modifying Data - Part I, I will introduce the JSON data format and show how to use Python to read, write and modify JSON data. In Chapter 4, Reading, Exploring, and Modifying Data - Part II, I will walk through how to use Python to work with data files in the CSV and XML data formats. In Chapter 6, Cleaning Numerical Data - An Introduction to R and Rstudio, I will introduce R and Rstudio, and show how to use R to read and manipulate data.
Larger data sources are often made available through web interfaces called application programming interfaces (APIs). APIs allow you to retrieve specific bits of data from a larger collection of data. Web APIs can be great resources for data that is otherwise hard to get. In Chapter 8, Getting Data from the Web, I discuss APIs in detail and walk through the use of Python to extract data from APIs.
Another possible source of data is a database. I won't go into detail on the use of databases in this book, though in Chapter 9, Working with Large Datasets, I will show how to interact with a particular database using Python.
- Ansible Configuration Management
- Unreal Engine:Game Development from A to Z
- 大數據戰爭:人工智能時代不能不說的事
- 人工智能超越人類
- Hands-On Machine Learning with TensorFlow.js
- 群體智能與數據挖掘
- 樂高創意機器人教程(中級 下冊 10~16歲) (青少年iCAN+創新創意實踐指導叢書)
- C語言開發技術詳解
- Photoshop CS3圖層、通道、蒙版深度剖析寶典
- INSTANT Autodesk Revit 2013 Customization with .NET How-to
- Windows Server 2008 R2活動目錄內幕
- Microsoft System Center Confi guration Manager
- 中文版AutoCAD 2013高手速成
- 計算智能算法及其生產調度應用
- Red Hat Enterprise Linux 5.0服務器構建與故障排除