官术网_书友最值得收藏!

Data Science - A Birds' Eye View

Data science or machine learning is the process of giving the machines the ability to learn from a dataset without being told or programmed. For instance, it is extremely hard to write a program that can take a hand-written digit as an input image and outputs a value from 0-9 according to the image that's written. The same applies to the task of classifying incoming emails as spam or non-spam. For solving such tasks, data scientists use learning methods and tools from the field of data science or machine learning to teach the computer how to automatically recognize digits, by giving it some explanatory features that can distinguish one digit from another. The same for the spam/non-spam problem, instead of using regular expressions and writing hundred of rules to classify the incoming email, we can teach the computer through specific learning algorithms how to distinguish between spam and non-spam emails.

For the spam filtering application, you can code it by a rule-based approach, but it won't be good enough to be used in production, like the one in your mailing server. Building a learning system is an ideal solution for that.

You are probably using applications of data science on a daily basis, often without knowing it. For example, your country might be using a system to detect the ZIP code of your posted letter in order to automatically forward it to the correct area. If you are using Amazon, they often recommend things for you to buy and they do this by learning what sort of things you often search for or buy.

Building a learned/trained machine learning algorithm will require a base of historical data samples from which it's going to learn how to distinguish between different examples and to come up with some knowledge and trends from that data. After that, the learned/trained algorithm could be used for making predictions on unseen data. The learning algorithm will be using raw historical data and will try to come up with some knowledge and trends from that data.

In this chapter, we are going to have a bird's-eye view of data science, how it works as a black box, and the challenges that data scientists face on a daily basis. We are going to cover the following topics:

  • Understanding data science by an example
  • Design procedure of data science algorithms
  • Getting to learn
  • Implementing the fish recognition/detection model
  • Different learning types
  • Data size and industry needs
主站蜘蛛池模板: 朔州市| 宜城市| 博湖县| 通城县| 延长县| 静宁县| 辽宁省| 安阳县| 浪卡子县| 西乌| 本溪市| 磐安县| 青田县| 衢州市| 朝阳县| 沁水县| 调兵山市| 亳州市| 建平县| 花莲县| 青海省| 宁南县| 晴隆县| 乳山市| 金沙县| 通城县| 河津市| 南华县| 张家界市| 高安市| 台安县| 浪卡子县| 祁阳县| 巴林右旗| 富民县| 新河县| 松滋市| 积石山| 铅山县| 安达市| 凯里市|