- Geospatial Data Science Quick Start Guide
- Abdishakur Hassan, Jayakrishnan Vijayaraghavan
- 2021-06-24 13:48:19
Consuming Location Data Like a Data Scientist
Location data comes in many forms, but it often arrives in a simple structured format that is easy to overlook. Most machine learning algorithms, such as random forests, are geared toward extracting insights from structured, tabular data. In this chapter, we will discuss how to work with spatial data that is masquerading as tabular data and apply machine learning techniques to it as any data scientist would. We will use New York taxi trip data to predict the duration of any given taxi trip. We chose this dataset for the following reasons:
- Predicting trip duration has the right mix of geospatial analytics and machine learning
- Finding the time it takes to travel from point A to point B is a routing problem, which is covered in Chapter 6, Let's Build a Routing Engine, so this chapter serves as a good introduction to it
We will be using fastai, a Python library built around popular machine learning libraries such as scikit-learn and PyTorch. In this chapter, we will discuss the following topics:
- Exploratory data analysis
- Processing spatial data
- Understanding and interpreting the error metric
- Building a random forest model and running inference with it
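To make the idea of spatial data masquerading as tabular data concrete, here is a minimal sketch of how coordinate columns in a trip table could be turned into a distance feature that a tabular model such as a random forest can consume. The column names (`pickup_latitude`, `pickup_longitude`, `dropoff_latitude`, `dropoff_longitude`), the sample row, and the helper names are assumptions made for illustration; they are not taken from the chapter or from the dataset's actual schema. The haversine formula is just one common choice for this kind of feature.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius, an approximation


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def add_distance_feature(rows):
    """Return copies of the rows with an extra 'distance_km' column.

    Each row is a plain dict; the column names are hypothetical.
    """
    enriched = []
    for row in rows:
        new_row = dict(row)
        new_row["distance_km"] = haversine_km(
            row["pickup_latitude"], row["pickup_longitude"],
            row["dropoff_latitude"], row["dropoff_longitude"],
        )
        enriched.append(new_row)
    return enriched


# Hypothetical sample row: roughly Times Square to JFK airport.
trips = [
    {
        "pickup_latitude": 40.7580, "pickup_longitude": -73.9855,
        "dropoff_latitude": 40.6413, "dropoff_longitude": -73.7781,
    },
]

trips_with_distance = add_distance_feature(trips)
print(round(trips_with_distance[0]["distance_km"], 1))
```

The resulting `distance_km` column is an ordinary numeric feature, so it can sit alongside the other tabular columns when a model is trained. It measures straight-line distance only, so it ignores the street network that a routing engine would account for.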