- Building Computer Vision Projects with OpenCV 4 and C++
- David Millán Escrivá, Prateek Joshi, Vinícius G. Mendonça, Roy Shilkrot
- 2021-07-02 12:28:27
Why is it difficult for machines to understand image content?
We now understand how visual data enters the human visual system and how that system processes it. The issue is that we still don't fully understand how our brain recognizes and organizes this visual data. In machine learning, we just extract some features from images and ask the computer to learn them using algorithms. Even then, the same object can vary from image to image in shape, size, perspective, angle, illumination, occlusion, and so on.
For example, the same chair looks very different to a machine when you look at it from the profile view. Humans can easily recognize that it's a chair, regardless of how it's presented to us. So, how do we explain this to our machines?
One way to do this would be to store all the different variations of an object, including sizes, angles, perspectives, and so on. But this process is cumbersome and time-consuming. Also, it's actually not possible to gather data that can encompass every single variation. The machines would consume a huge amount of memory and a lot of time to build a model that can recognize these objects.
Even with all of this data, computers still won't recognize an object that is partially occluded, because they treat it as a new object. Since no fixed collection of stored examples can cover every case, a computer vision library needs to provide underlying functional blocks that can be combined in many different ways to formulate complex algorithms.
OpenCV provides a lot of these functions, and they are highly optimized. So once we understand what OpenCV is capable of, we can use it effectively to build interesting applications.
Let's go ahead and explore that in the next section.