- Practical Computer Vision
- Abhinav Dadhich
- 2021-06-30 18:54:46
MSCOCO
COCO[2] stands for Common Objects in Context and is a dataset for object recognition, with 80 categories and 330K images. After Pascal VOC'12, it became a popular benchmark for training and evaluating object recognition systems. The dataset can be downloaded from http://cocodataset.org/#download.
To read the data and use it in applications, an API is available at https://github.com/cocodataset/cocoapi. To get started, clone the repository and build the Python API as follows:
git clone https://github.com/cocodataset/cocoapi.git
cd cocoapi/PythonAPI
make
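This builds the Python API (the pycocotools package) used to read the COCO dataset. The build happens in place; running make install instead installs the package into the current Python environment.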
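To make the structure of the data more concrete, here is a minimal sketch of what reading COCO-style annotations involves. It uses only the standard library and a tiny, made-up annotation snippet in the COCO JSON layout (the `images`, `annotations`, and `categories` keys, with boxes stored as `[x, y, width, height]`). The file names, image sizes, and box values are invented for illustration, and the helper `index_annotations` is a hypothetical name, not part of the API. With the real dataset, the `COCO` class from `pycocotools.coco` would typically handle this indexing.

```python
import json
from collections import defaultdict

# Made-up sample in the COCO annotation layout (illustrative values only).
SAMPLE_JSON = """
{
  "images": [
    {"id": 1, "file_name": "img_0001.jpg", "width": 640, "height": 480},
    {"id": 2, "file_name": "img_0002.jpg", "width": 640, "height": 480}
  ],
  "annotations": [
    {"id": 101, "image_id": 1, "category_id": 1, "bbox": [10, 20, 50, 100], "iscrowd": 0},
    {"id": 102, "image_id": 1, "category_id": 18, "bbox": [70, 40, 30, 30], "iscrowd": 0},
    {"id": 103, "image_id": 2, "category_id": 18, "bbox": [5, 5, 60, 40], "iscrowd": 0}
  ],
  "categories": [
    {"id": 1, "name": "person", "supercategory": "person"},
    {"id": 18, "name": "dog", "supercategory": "animal"}
  ]
}
"""


def index_annotations(dataset):
    """Group (category name, bbox) pairs by image id.

    Hypothetical helper for illustration; bbox is [x, y, width, height].
    """
    category_names = {cat["id"]: cat["name"] for cat in dataset["categories"]}
    by_image = defaultdict(list)
    for ann in dataset["annotations"]:
        name = category_names[ann["category_id"]]
        by_image[ann["image_id"]].append((name, ann["bbox"]))
    return by_image


dataset = json.loads(SAMPLE_JSON)
boxes_by_image = index_annotations(dataset)

# List the labeled boxes found in each image.
for image in dataset["images"]:
    print(image["file_name"], boxes_by_image[image["id"]])
```

Category ids in an annotation file are looked up through the `categories` list rather than assumed to be contiguous, which is why the sketch builds an explicit id-to-name mapping first.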
Many models available online for object detection or image segmentation are first trained on this dataset. If our data has object categories that differ from those in the MSCOCO dataset, a common approach, which we will see in Chapter 5, Convolutional Neural Networks, and in Chapter 6, Feature-Based Object Detection, is to first train a model on the MSCOCO dataset and then reuse part of the trained model, re-training it on the new dataset.
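The idea of reusing part of a trained model can be sketched with a deliberately tiny, pure-Python stand-in. This is not the approach used in the later chapters, only an illustration under stated assumptions: `PRETRAINED_W` plays the role of weights learned on a large dataset and is kept frozen, while a new logistic-regression "head" is trained on a small, made-up two-class dataset. All names and numbers here are hypothetical.

```python
import math

# Stand-in for weights learned on a large dataset; frozen during re-training.
PRETRAINED_W = [[0.5, -0.2], [0.1, 0.8], [-0.3, 0.4]]


def backbone(x):
    """Frozen feature extractor: ReLU(W x) with the fixed weights above."""
    return [max(0.0, sum(w * xi for w, xi in zip(row, x))) for row in PRETRAINED_W]


def train_head(samples, labels, epochs=200, lr=0.5):
    """Train only a new logistic-regression head on top of frozen features."""
    features = [backbone(x) for x in samples]
    weights = [0.0] * len(features[0])
    bias = 0.0
    for _ in range(epochs):
        for f, y in zip(features, labels):
            z = sum(w * fi for w, fi in zip(weights, f)) + bias
            p = 1.0 / (1.0 + math.exp(-z))
            grad = p - y  # gradient of the log loss with respect to z
            weights = [w - lr * grad * fi for w, fi in zip(weights, f)]
            bias -= lr * grad
    return weights, bias


def predict(x, weights, bias):
    """Return class 1 if the head's score is positive, else class 0."""
    f = backbone(x)
    return 1 if sum(w * fi for w, fi in zip(weights, f)) + bias > 0 else 0


# Made-up "new dataset" with two categories the backbone was never trained on.
new_samples = [(1.0, 0.0), (2.0, 0.5), (0.0, 1.0), (0.2, 2.0)]
new_labels = [0, 0, 1, 1]

head_w, head_b = train_head(new_samples, new_labels)
predictions = [predict(x, head_w, head_b) for x in new_samples]
print(predictions)
```

Only the head's weights change during training. In a real setting the frozen part would be the early layers of a network trained on MSCOCO, and some of those layers may also be fine-tuned rather than kept fixed.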