- Hands-On Data Science with Anaconda
- Dr. Yuxing Yan James Yan
- 370字
- 2021-06-25 21:08:52
Review questions and exercises
- What is the difference between open data and proprietary databases?
- Is it enough for learners in the area of data science to use open data?
- Where can we access open public data?
- From The UCI Data Depository, http://archive.ics.uci.edu/ml/index.php, download a dataset called Wine. Write a program in R to import it.
- From the UCI Data Depository, download a dataset called Forest Fire. Write a program in Python to import it.
- From the UCI Data Depository, download a dataset called Bank Marketing. Write a program in Octave to import it. Answer the following questions: 1) How many banks? and 2) What is the cost?
- How can we find all R functions with read. as their leading letters? (Note that there is a dot after read.)
- How can we find more information on an R function called read.xls()?
- Explain the differences between two R functions: save() and saveRDS().
- Find more information about the read_clipboard() function included in the Python pandas package.
- What is the Quandl platform? What kinds of data could we download from Quandl?
- Write both R and Python programs to download GDP (Gross Domestic Product) data from the Quandl platform.
- When loading an R dataset, what is the difference between using the load() function and the readRDS() function?
- After uploading the Python pandas package, explain why we have the following error message:

- First, download a ZIP file called bank-fall.zip at http://archive.ics.uci.edu/ml/datasets/Bank+Marketing. Unzip the file to get a CSV file; see the related code that follows:

Generate an R dataset called bank.Rata and bank.rds and answer the following questions: a) What is the average age? b) What percentage of people are married? c) Is the default probability of those who are married higher than those who are single?
- How do we merge two datasets in R?
- Write a Python program to download IBM's daily data from Quandl and merge it with Fama-French three-factor. To get a Fama-French daily factor time series we could go to http://mba.tuck.dartmouth.edu/pages/faculty/ken.french/data_library.html or download a dataset at http://canisius.edu/~yany/python/data/ffDaily.pkl.
- Generate both R and Python datasets for monthly Fama-French-Charhart four factors. Both time series, can be downloaded from Professor French's data library.
- Write a Python program to merge FRED/GDP data with market index data.
推薦閱讀
- 工業(yè)機器人虛擬仿真實例教程:KUKA.Sim Pro(全彩版)
- 大學(xué)計算機基礎(chǔ):基礎(chǔ)理論篇
- Python Artificial Intelligence Projects for Beginners
- 錯覺:AI 如何通過數(shù)據(jù)挖掘誤導(dǎo)我們
- 程序設(shè)計語言與編譯
- PyTorch深度學(xué)習(xí)實戰(zhàn)
- AWS Certified SysOps Administrator:Associate Guide
- 工業(yè)機器人操作與編程
- 網(wǎng)絡(luò)綜合布線設(shè)計與施工技術(shù)
- Salesforce for Beginners
- 分析力!專業(yè)Excel的制作與分析實用法則
- Ansible 2 Cloud Automation Cookbook
- DynamoDB Applied Design Patterns
- Red Hat Enterprise Linux 5.0服務(wù)器構(gòu)建與故障排除
- 從祖先到算法:加速進化的人類文化