- Learning Microsoft Cognitive Services
- Leif Larsen
- 2021-08-13 15:40:14
Vision
APIs under the Vision flag allow your apps to understand image and video content. They let you retrieve information about faces, emotions, and other visual content. You can stabilize videos, recognize celebrities, read text in images, and generate thumbnails from videos and images.
There are five APIs contained in the vision domain, which we will look at now.
Computer vision
Using the computer vision API, you can retrieve actionable information from images. You can identify image properties and content (such as image format, image size, colors, faces, and more), and you can detect whether or not an image is adult or racy. The API can recognize text in images and extract it as machine-readable words, and it can recognize celebrities from a variety of fields. Lastly, it can generate storage-efficient thumbnails with smart-cropping functionality.
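As a rough illustration of what such a call can look like, the sketch below builds (but does not send) an image analysis request against the service's REST interface. The `/vision/v3.2/analyze` path is an assumption about the API version; the endpoint, key, and image URL are placeholders, and `build_analyze_request` is a hypothetical helper, not part of any SDK.

```python
import json
import urllib.request
from urllib.parse import urlencode


def build_analyze_request(endpoint, key, image_url,
                          features=("Description", "Faces", "Adult", "Color")):
    """Build, without sending, a request asking the service to analyze an image.

    Assumption: the resource exposes the v3.2 REST path; adjust the version
    to whatever your own resource supports.
    """
    # visualFeatures is a comma-separated list; keep the commas unescaped.
    query = urlencode({"visualFeatures": ",".join(features)}, safe=",")
    url = f"{endpoint.rstrip('/')}/vision/v3.2/analyze?{query}"
    body = json.dumps({"url": image_url}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Ocp-Apim-Subscription-Key": key,  # the resource's subscription key
            "Content-Type": "application/json",
        },
    )


if __name__ == "__main__":
    # Placeholder values; replace them with your own resource details.
    request = build_analyze_request(
        "https://example.cognitiveservices.azure.com",
        "<your-key>",
        "https://example.com/photo.jpg",
    )
    print(request.full_url)
    # To actually call the service:
    # result = json.load(urllib.request.urlopen(request))
```

Keeping request construction separate from sending makes the sketch easy to inspect without a subscription key; the response is a JSON document with one section per requested feature.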
We will look into computer vision in Chapter 2, Analyzing Images to Recognize a Face.
Face
We have already seen a very basic example of what the Face API can do. The rest of the API revolves around the detection, identification, organization, and tagging of faces in photos. Beyond face detection, you can check how likely it is that two faces belong to the same person, identify faces, and find similar-looking faces. You can also use the API to recognize emotions in images.
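To make the "same person" check more concrete, here is a minimal sketch of a verification request and of how its answer might be interpreted. It assumes the `/face/v1.0/verify` REST operation, which takes two face IDs obtained from earlier detection calls and answers with an `isIdentical` flag and a `confidence` score. The helper names and the `threshold` parameter are hypothetical choices for this illustration.

```python
import json
import urllib.request


def build_verify_request(endpoint, key, face_id_1, face_id_2):
    """Build, without sending, a request comparing two detected faces.

    Assumption: both IDs come from earlier face detection calls.
    """
    url = f"{endpoint.rstrip('/')}/face/v1.0/verify"
    body = json.dumps({"faceId1": face_id_1, "faceId2": face_id_2}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Ocp-Apim-Subscription-Key": key,
            "Content-Type": "application/json",
        },
    )


def same_person(verify_response, threshold=0.5):
    """Interpret a verify response such as {"isIdentical": true, "confidence": 0.9}.

    The extra threshold is a hypothetical, application-level safety margin
    layered on top of the service's own isIdentical decision.
    """
    return bool(verify_response["isIdentical"]) and \
        verify_response["confidence"] >= threshold
```

An application that needs fewer false matches could call `same_person(response, threshold=0.8)` and accept that some genuine matches will then be rejected.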
We will dive further into the Face API in Chapter 2, Analyzing Images to Recognize a Face.
Video indexer
Using the video indexer API, you can start indexing videos immediately upon upload. This means that you can get video insights without using experts or custom code. By utilizing the artificial intelligence of this API, you can make your content more discoverable.
The video indexer API will be covered in greater detail in Chapter 3, Analyzing Videos.
Content moderator
The content moderator API utilizes machine learning to automatically moderate content. It can detect potentially offensive and unwanted images, videos, and text in over 100 languages. In addition, it allows you to review detected material to improve the service.
The content moderator will be covered in Chapter 2, Analyzing Images to Recognize a Face.
Custom vision service
The custom vision service allows you to upload your own labeled images to a vision service. This means that you can add images that are specific to your domain, so that they can be recognized using the computer vision API.
The custom vision service will be covered in more detail in Chapter 2, Analyzing Images to Recognize a Face.