Hands-On Data Science with the Command Line
The command line has existed on UNIX-based operating systems, in the form of the Bash shell, for over three decades. However, few developers know how command-line tools can be OSEMN (pronounced "awesome", and standing for Obtaining, Scrubbing, Exploring, Modeling, and iNterpreting data) for carrying out simple to advanced data science tasks at speed. This book starts with the requisite concepts and installation steps for carrying out data science tasks using the command line. You will learn to create a data pipeline to solve the problem of working with small- to medium-sized files on a single machine. You will understand the power of the command line and learn how to edit files using a text-based editor. You will learn not only how to automate jobs and scripts, but also how to visualize data using the command line. By the end of this book, you will know how to speed up your workflow and perform automated tasks using command-line tools.
· About 22,000 words