The Data Wrangling Workshop
While a huge amount of data is readily available to us, it is not useful in its raw form. For data to be meaningful, it must be curated and refined. If you’re a beginner, then The Data Wrangling Workshop will help to break down the process for you. You’ll start with the basics and build your knowledge, progressing from the core aspects behind data wrangling, to using the most popular tools and techniques. This book starts by showing you how to work with data structures using Python. Through examples and activities, you’ll understand why you should stay away from traditional methods of data cleaning used in other languages and take advantage of the specialized pre-built routines in Python. Later, you’ll learn how to use the same Python backend to extract and transform data from an array of sources, including the internet, large database vaults, and Excel financial tables. To help you prepare for more challenging scenarios, the book teaches you how to handle missing or incorrect data, and reformat it based on the requirements from your downstream analytics tool. By the end of this book, you will have developed a solid understanding of how to perform data wrangling with Python, and learned several techniques and best practices to extract, clean, transform, and format your data efficiently, from a diverse array of sources.
· 89,000 words