Machine Learning with Apache Spark Quick Start Guide
Everypersonandeveryorganizationintheworldmanagesdata,whethertheyrealizeitornot.Dataisusedtodescribetheworldaroundusandcanbeusedforalmostanypurpose,fromanalyzingconsumerhabitstofightingdiseaseandseriousorganizedcrime.Ultimately,wemanagedatainordertoderivevaluefromit,andmanyorganizationsaroundtheworldhavetraditionallyinvestedintechnologytohelpprocesstheirdatafasterandmoreefficiently.Butwenowliveinaninterconnectedworlddrivenbymassdatacreationandconsumptionwheredataisnolongerrowsandcolumnsrestrictedtoaspreadsheet,butanorganicandevolvingassetinitsownright.Withthisrealizationcomesmajorchallengesfororganizations:howdowemanagethesheersizeofdatabeingcreatedeverysecond(thinknotonlyspreadsheetsanddatabases,butalsosocialmediaposts,images,videos,music,blogsandsoon)?Andoncewecanmanageallofthisdata,howdowederiverealvaluefromit?ThefocusofMachineLearningwithApacheSparkistohelpusanswerthesequestionsinahands-onmanner.Weintroducethelatestscalabletechnologiestohelpusmanageandprocessbigdata.Wethenintroduceadvancedanalyticalalgorithmsappliedtoreal-worldusecasesinordertouncoverpatterns,deriveactionableinsights,andlearnfromthisbigdata.
·5.2萬字