- Bioinformatics with Python Cookbook
- Tiago Antao
- 2021-06-10 19:01:47
Getting ready
As discussed in the previous recipe, we will use data from the 1000 Genomes Project. We will use the exome alignment for chromosome 20 of female NA18489. This file is just 312 MB. The whole exome alignment for this individual is 14.2 GB, and the whole genome alignment (at a low coverage of 4x) is 40.1 GB. The data is paired-end, with reads of 76 bp. Paired-end data is common nowadays, but it is slightly more complex to process, and the recipe takes this into account. If your data is not paired, just simplify the following recipe appropriately.
As usual, if you use Jupyter Notebook, the cell at the top of Chapter02/Working_with_BAM.ipynb will download the data for you. If you don't use notebooks, get the data from our dataset list at https://github.com/PacktPublishing/Bioinformatics-with-Python-Cookbook-Second-Edition/blob/master/Datasets.ipynb. The files you will want are NA18490_20_exome.bam and NA18490_20_exome.bam.bai.
We will use pysam, a Python wrapper for the SAMtools C API. It was installed in Chapter 1, Python and the Surrounding Software Ecology.
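Before starting the recipe, it can be useful to confirm that the file opens and that the reads really are paired. The following is a minimal sketch, not the recipe's own code: the helper names summarize_pairing and summarize_bam and the limit parameter are hypothetical, and it assumes that NA18490_20_exome.bam sits in the current directory. It relies only on pysam.AlignmentFile, its fetch method, and the is_paired, is_read1, and is_read2 attributes of the reads it yields.

```python
from collections import Counter


def summarize_pairing(reads, limit=1000):
    """Count paired-end flags over at most `limit` reads.

    `reads` is any iterable of objects exposing the boolean attributes
    is_paired, is_read1 and is_read2, as pysam's aligned reads do.
    """
    counts = Counter()
    for i, read in enumerate(reads):
        if i >= limit:
            break
        counts["total"] += 1
        if read.is_paired:
            counts["paired"] += 1
        if read.is_read1:
            counts["read1"] += 1
        if read.is_read2:
            counts["read2"] += 1
    return counts


def summarize_bam(path, limit=1000):
    """Open a BAM file with pysam and summarize its first `limit` reads."""
    import pysam  # imported lazily so summarize_pairing works without pysam

    with pysam.AlignmentFile(path, "rb") as bam:
        # until_eof=True streams reads in file order without using the index
        return summarize_pairing(bam.fetch(until_eof=True), limit=limit)
```

Calling summarize_bam("NA18490_20_exome.bam") returns a Counter. For paired-end data, the "paired" count should match "total", with the reads split between "read1" and "read2".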