舉報

會員
Hands-On Data Science with R
Vitor Bianchi Lanzetta Nataraj Dasgupta Ricardo Anjoleto Farias 著
更新時間:2021-06-10 19:13:14
開會員,本書免費讀 >
Risthemostwidelyusedprogramminglanguage,andwhenusedinassociationwithdatascience,thispowerfulcombinationwillsolvethecomplexitiesinvolvedwithunstructureddatasetsintherealworld.Thisbookcoverstheentiredatascienceecosystemforaspiringdatascientists,rightfromzerotoalevelwhereyouareconfidentenoughtogethands-onwithreal-worlddatascienceproblems.ThebookstartswithanintroductiontodatascienceandintroducesreaderstopopularRlibrariesforexecutingdatascienceroutinetasks.Thisbookcoversalltheimportantprocessesindatasciencesuchasdatagathering,cleaningdata,andthenuncoveringpatternsfromit.Youwillexplorealgorithmssuchasmachinelearningalgorithms,predictiveanalyticalmodels,andfinallydeeplearningalgorithms.YouwilllearntorunthemostpowerfulvisualizationpackagesavailableinRsoastoensurethatyoucaneasilyderiveinsightsfromyourdata.Towardstheend,youwillalsolearnhowtointegrateRwithSparkandHadoopandperformlarge-scaledataanalyticswithoutmuchcomplexity.
最新章節
- Leave a review - let other readers know what you think
- Other Books You May Enjoy
- Meeting Stack Overflow
- Content to stay tuned to
- Gathering data
- Growing your skills
品牌:中圖公司
上架時間:2021-06-10 18:23:47
出版社:Packt Publishing
本書數字版權由中圖公司提供,并由其授權上海閱文信息技術有限公司制作發行
- Leave a review - let other readers know what you think 更新時間:2021-06-10 19:13:14
- Other Books You May Enjoy
- Meeting Stack Overflow
- Content to stay tuned to
- Gathering data
- Growing your skills
- The Road Ahead
- Quiz
- Summary
- Building an experiment that uses R
- How modules work
- Azure Machine Learning Studio
- Azure registration
- Why Azure?
- Things to look for
- Cloud types
- Cloud computing
- R on Cloud
- Quiz
- Summary
- Spark DataFrames within the RStudio IDE
- Providing interfaces to Spark packages
- Using Spark machine learning or H2O Sparking Water
- Filtering and aggregating Spark datasets
- Manipulating Spark data using both dplyr and SQL
- Installing the package and Spark
- Large Scale Data Analytics with Hadoop
- Quiz
- Summary
- Some advice about Shiny
- Approach for creating a data product from statistical modeling and web UI
- The observeEvent and eventReactive functions
- The reactive and isolate functions
- Building an application inside R
- How to build a Shiny app
- What is R Shiny?
- Going to Production with R
- Quiz
- Summary
- Crafting visualizations
- Retrieving and cleaning data
- Visualizing Data
- Quiz
- Summary
- Programming an HMM with R
- The Markov chain
- Markovian models – real-world applications
- Markovian-type models
- Markovian in R
- Quiz
- Summary
- Further tips
- Getting practical with Keras
- Getting things ready for Keras
- NNs with Keras
- Training algorithms
- Layers
- Activation functions
- ANN nodes
- Neuroscience inspiration
- Overview – NNs and deep learning
- Daily neural nets
- Neural Networks and Deep Learning
- Quiz
- Summary
- Application details
- Forecasting machine learning application
- The UI and server
- Forecasting and ML App with R
- Quiz
- Summary
- Introduction to feedforward neural networks with R
- Neural networks
- Hierarchical and k-means clustering
- What about regressions?
- Support vector machines
- Random forests – a collection of trees
- Growing trees with tree and rpart
- Starting with decision trees
- The Chilean plebiscite data
- Strengths and weakness
- Tree models
- Tricks for lm
- Linear regression with R
- Generic problems solved by machine learning
- Machine learning vocabulary
- Machine learning everywhere
- What is machine learning?
- Machine Learning with R
- Quiz
- Summary
- Rocker
- Checkpoint
- Packrat
- Saving analysis for future work
- Summarizing data
- Heatmaps
- Bar charts
- Boxplots
- Scatter plots
- Line plots
- Histograms
- Types of charts – basic primer
- Data visualisation
- Printing results
- Simple pattern matching and replacement with R
- Combining strings
- Reading data
- Handling strings in R
- Handling dates using POSIXct or POSIXlt
- Handling strings and dates
- Missing data
- Mixed data types
- Managing data issues
- Reading data
- Data types in R
- Data categories
- Preparing data for analysis
- Data Analysis with R
- Quiz
- Summary
- Cluster analysis
- Visualizing data
- Peeking data
- Looking for patterns – peeking visualizing and clustering data
- Cleaning and transforming data
- Fetching the number of tweets
- Creating your Twitter application
- Retrieving tweets from R community
- Web scraping made easy with rvest
- Legality of web scraping
- Retrieving text from the web
- Scraping a dwarf name
- Stages of KDD
- Good practices of KDD and data mining
- KDD Data Mining and Text Mining
- Quiz
- Summary
- Tutorial – looking at airline flight times data
- Web APIs
- Working with web data
- On-disk formats
- Reading other file formats – Excel SAS and other data sources
- Checking data quality
- Miscellaneous topics
- A special note on dates and/or time
- Reading and writing files with data.table
- The melt functionality
- Pivots on data.table
- Deleting a column
- Creating new columns in data.table
- What is the advantage of searching using key by?
- Ordering columns
- Adding a column
- Grouping operations
- Using data.table for data manipulation
- dbplyr – databases and dplyr
- Joining tables
- Converting wide tables into long tables
- Converting wide tables into long tables
- The tidyr package
- Sampling data
- Summarise
- Using arrange for sorting
- Filtering with filter
- Using select
- Basic dplyr usage
- Using tibble and dplyr for data manipulation
- Merging DataFrames
- Aggregation functions
- Applying families of functions
- Using base R for data manipulation and analysis
- Basic tools of data wrangling
- Data extraction transformation and load
- Data types formats and sources
- Introduction to data wrangling with R
- Data Wrangling with R
- Quiz
- Summary
- A/B testing – a brief introduction and a practical example with R
- Elaborating a little longer
- Running z-tests with R
- Be careful
- Decision rule – a brief overview of the p-value approach
- Running t-tests with R
- Statistical hypothesis testing
- Useful functions to draw automated summaries
- Measures of dispersion
- Calculating mean median and mode with base R
- Measures of central tendency
- Measures of central tendency and dispersion
- Descriptive and Inferential Statistics
- Quiz
- Summary
- UN development index
- Our first R program
- Key features of R
- Using R for data science
- Solving problems with data science
- Other industries
- Web industry
- Manufacturing and retail
- Government
- Pharmaceuticals
- Healthcare
- Finance
- Active domains of data science
- Domain knowledge
- Predictive analytics (machine learning)
- Computer science
- Key components of data science
- Introduction to data science
- Getting Started with Data Science and R
- Reviews
- Get in touch
- Conventions used
- Download the color images
- Download the example code files
- To get the most out of this book
- What this book covers
- Who this book is for
- Preface
- Packt is searching for authors like you
- About the reviewer
- About the authors
- Contributors
- Packt.com
- Why subscribe?
- About Packt
- Title Page
- coverpage
- coverpage
- Title Page
- About Packt
- Why subscribe?
- Packt.com
- Contributors
- About the authors
- About the reviewer
- Packt is searching for authors like you
- Preface
- Who this book is for
- What this book covers
- To get the most out of this book
- Download the example code files
- Download the color images
- Conventions used
- Get in touch
- Reviews
- Getting Started with Data Science and R
- Introduction to data science
- Key components of data science
- Computer science
- Predictive analytics (machine learning)
- Domain knowledge
- Active domains of data science
- Finance
- Healthcare
- Pharmaceuticals
- Government
- Manufacturing and retail
- Web industry
- Other industries
- Solving problems with data science
- Using R for data science
- Key features of R
- Our first R program
- UN development index
- Summary
- Quiz
- Descriptive and Inferential Statistics
- Measures of central tendency and dispersion
- Measures of central tendency
- Calculating mean median and mode with base R
- Measures of dispersion
- Useful functions to draw automated summaries
- Statistical hypothesis testing
- Running t-tests with R
- Decision rule – a brief overview of the p-value approach
- Be careful
- Running z-tests with R
- Elaborating a little longer
- A/B testing – a brief introduction and a practical example with R
- Summary
- Quiz
- Data Wrangling with R
- Introduction to data wrangling with R
- Data types formats and sources
- Data extraction transformation and load
- Basic tools of data wrangling
- Using base R for data manipulation and analysis
- Applying families of functions
- Aggregation functions
- Merging DataFrames
- Using tibble and dplyr for data manipulation
- Basic dplyr usage
- Using select
- Filtering with filter
- Using arrange for sorting
- Summarise
- Sampling data
- The tidyr package
- Converting wide tables into long tables
- Converting wide tables into long tables
- Joining tables
- dbplyr – databases and dplyr
- Using data.table for data manipulation
- Grouping operations
- Adding a column
- Ordering columns
- What is the advantage of searching using key by?
- Creating new columns in data.table
- Deleting a column
- Pivots on data.table
- The melt functionality
- Reading and writing files with data.table
- A special note on dates and/or time
- Miscellaneous topics
- Checking data quality
- Reading other file formats – Excel SAS and other data sources
- On-disk formats
- Working with web data
- Web APIs
- Tutorial – looking at airline flight times data
- Summary
- Quiz
- KDD Data Mining and Text Mining
- Good practices of KDD and data mining
- Stages of KDD
- Scraping a dwarf name
- Retrieving text from the web
- Legality of web scraping
- Web scraping made easy with rvest
- Retrieving tweets from R community
- Creating your Twitter application
- Fetching the number of tweets
- Cleaning and transforming data
- Looking for patterns – peeking visualizing and clustering data
- Peeking data
- Visualizing data
- Cluster analysis
- Summary
- Quiz
- Data Analysis with R
- Preparing data for analysis
- Data categories
- Data types in R
- Reading data
- Managing data issues
- Mixed data types
- Missing data
- Handling strings and dates
- Handling dates using POSIXct or POSIXlt
- Handling strings in R
- Reading data
- Combining strings
- Simple pattern matching and replacement with R
- Printing results
- Data visualisation
- Types of charts – basic primer
- Histograms
- Line plots
- Scatter plots
- Boxplots
- Bar charts
- Heatmaps
- Summarizing data
- Saving analysis for future work
- Packrat
- Checkpoint
- Rocker
- Summary
- Quiz
- Machine Learning with R
- What is machine learning?
- Machine learning everywhere
- Machine learning vocabulary
- Generic problems solved by machine learning
- Linear regression with R
- Tricks for lm
- Tree models
- Strengths and weakness
- The Chilean plebiscite data
- Starting with decision trees
- Growing trees with tree and rpart
- Random forests – a collection of trees
- Support vector machines
- What about regressions?
- Hierarchical and k-means clustering
- Neural networks
- Introduction to feedforward neural networks with R
- Summary
- Quiz
- Forecasting and ML App with R
- The UI and server
- Forecasting machine learning application
- Application details
- Summary
- Quiz
- Neural Networks and Deep Learning
- Daily neural nets
- Overview – NNs and deep learning
- Neuroscience inspiration
- ANN nodes
- Activation functions
- Layers
- Training algorithms
- NNs with Keras
- Getting things ready for Keras
- Getting practical with Keras
- Further tips
- Summary
- Quiz
- Markovian in R
- Markovian-type models
- Markovian models – real-world applications
- The Markov chain
- Programming an HMM with R
- Summary
- Quiz
- Visualizing Data
- Retrieving and cleaning data
- Crafting visualizations
- Summary
- Quiz
- Going to Production with R
- What is R Shiny?
- How to build a Shiny app
- Building an application inside R
- The reactive and isolate functions
- The observeEvent and eventReactive functions
- Approach for creating a data product from statistical modeling and web UI
- Some advice about Shiny
- Summary
- Quiz
- Large Scale Data Analytics with Hadoop
- Installing the package and Spark
- Manipulating Spark data using both dplyr and SQL
- Filtering and aggregating Spark datasets
- Using Spark machine learning or H2O Sparking Water
- Providing interfaces to Spark packages
- Spark DataFrames within the RStudio IDE
- Summary
- Quiz
- R on Cloud
- Cloud computing
- Cloud types
- Things to look for
- Why Azure?
- Azure registration
- Azure Machine Learning Studio
- How modules work
- Building an experiment that uses R
- Summary
- Quiz
- The Road Ahead
- Growing your skills
- Gathering data
- Content to stay tuned to
- Meeting Stack Overflow
- Other Books You May Enjoy
- Leave a review - let other readers know what you think 更新時間:2021-06-10 19:13:14