官术网_书友最值得收藏!

Why we are talking about big data now if data has always existed

By the early 2000’s, rapid advances in computing and technologies, such as storage, allowed users to collect and store data with unprecedented levels of efficiency. The internet further added impetus to this drive by providing a platform that had an unlimited capacity to exchange information at a global scale. Technology advanced at a breathtaking pace and led to major paradigm shifts powered by tools such as social media, connected devices such as smart phones, and the availability of broadband connections, and by extension, user participation, even in remote parts of the world.

By and large, the majority of this data consists of information generated by web-based sources, such as social networks like Facebook and video sharing sites like YouTube. In big data parlance, this is also known as unstructured data; namely, data that is not in a fixed format such as a spreadsheet or the kind that can be easily stored in a traditional database system.

The simultaneous advances in computing capabilities meant that although the rate of data being generated was very high, it was still computationally feasible to analyze it. Algorithms in machine learning, which were once considered intractable due to both the volume as well as algorithmic complexity, could now be analyzed using various new paradigms such as cluster or multinode processing in a much simpler manner that would have earlier necessitated special-purpose machines.

Chart of data generated per minute. Credit: DOMO Inc.

主站蜘蛛池模板: 永德县| 东至县| 铁岭县| 嘉禾县| 晋城| 县级市| 商水县| 凭祥市| 丹巴县| 新邵县| 潼南县| 乳源| 塔河县| 葵青区| 大厂| 繁昌县| 安陆市| 祁东县| 罗田县| 阆中市| 宁德市| 通化县| 龙海市| 永顺县| 汪清县| 民权县| 曲松县| 庐江县| 锦州市| 公主岭市| 全椒县| 涿鹿县| 眉山市| 武城县| 桦川县| 尉氏县| 乌什县| 修水县| 河津市| 和林格尔县| 临猗县|