- Hands-On Artificial Intelligence for IoT
- Amita Kapoor
- 273字
- 2021-07-02 14:02:02
NoSQL data
The Not Only Structured Query Language (NoSQL) database is not a relational database; instead, data can be stored in key-value, JSON, document, columnar, or graph formats. They are frequently used in big data and real-time applications. We will learn here how to access NoSQL data using MongoDB, and we assume you have the MongoDB server configured properly and on:
- We will need to establish a connection with the Mongo daemon using the MongoClient object. The following code establishes the connection to the default host, localhost , and port (27017). And it gives us access to the database:
from pymongo import MongoClient
client = MongoClient()
db = client.test
- In this example, we try to load the cancer dataset available in scikit-learn to the Mongo database. So, we first get the breast cancer dataset and convert it to a pandas DataFrame:
from sklearn.datasets import load_breast_cancer
import pandas as pd
cancer = load_breast_cancer()
data = pd.DataFrame(cancer.data, columns=[cancer.feature_names])
data.head()
- Next, we convert this into the JSON format, use the json.loads() function to decode it, and insert the decoded data into the open database:
import json
data_in_json = data.to_json(orient='split')
rows = json.loads(data_in_json)
db.cancer_data.insert(rows)
- This will create a collection named cancer_data that contains the data. We can query the document we just created, using the cursor object:
cursor = db['cancer_data'].find({})
df = pd.DataFrame(list(cursor))
print(df)

When it comes to distributed data on the IoT, Hadoop Distributed File System (HDFS) is another popular method for providing distributed data storage and access in IoT systems. In the next section, we study how to access and store data in HDFS.
推薦閱讀
- 計(jì)算機(jī)控制技術(shù)
- Visual C# 2008開發(fā)技術(shù)實(shí)例詳解
- RPA(機(jī)器人流程自動化)快速入門:基于Blue Prism
- 在實(shí)戰(zhàn)中成長:Windows Forms開發(fā)之路
- 精通數(shù)據(jù)科學(xué):從線性回歸到深度學(xué)習(xí)
- Extending Ansible
- 自動化生產(chǎn)線安裝與調(diào)試(三菱FX系列)(第二版)
- Mastering Geospatial Analysis with Python
- SQL Server數(shù)據(jù)庫應(yīng)用基礎(chǔ)(第2版)
- 和機(jī)器人一起進(jìn)化
- JRuby語言實(shí)戰(zhàn)技術(shù)
- 數(shù)據(jù)要素:全球經(jīng)濟(jì)社會發(fā)展的新動力
- Learning Cassandra for Administrators
- Hands-On Agile Software Development with JIRA
- 三維動畫制作(3ds max7.0)