
Studying genome accessibility and filtering SNP data

While the previous recipes gave an overview of the Python libraries for working with alignment and variant call data, in this recipe we will concentrate on using them with a clear purpose in mind.

If you are using NGS data, chances are that the most important file you will analyze is a VCF file, which is produced by a genotype caller such as SAMtools mpileup or GATK. The quality of your VCF calls may need to be assessed and filtered. Here, we will put in place a framework to filter SNP data. Rather than giving you filtering rules (which cannot be specified in a way that holds for every dataset), we will give you procedures to assess the quality of your data. With these, you can devise your own filters. Be sure to check Chapter 11, Advanced NGS Processing, for more tips on filtering.
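To make the idea of assessing before filtering concrete, here is a minimal sketch using only the standard library. It is not the procedure this recipe goes on to build. It assumes a tab-separated VCF in which depth is recorded as `DP` in the INFO column. The function name `summarize_vcf` and the toy records are hypothetical and exist only for illustration.

```python
import io
import statistics


def summarize_vcf(handle):
    """Collect QUAL and INFO/DP values from biallelic SNP records.

    Hypothetical helper: assumes a tab-separated VCF body with the
    standard eight fixed columns and depth stored as DP in INFO.
    """
    quals, depths = [], []
    for line in handle:
        if line.startswith("#") or not line.strip():
            continue  # skip header lines and blank lines
        fields = line.rstrip("\n").split("\t")
        ref, alt, qual, info = fields[3], fields[4], fields[5], fields[7]
        # Keep only biallelic SNPs: one reference base, one alternate base.
        if len(ref) != 1 or len(alt) != 1 or alt == ".":
            continue
        if qual != ".":
            quals.append(float(qual))
        for entry in info.split(";"):
            if entry.startswith("DP="):
                depths.append(int(entry[3:]))
    return quals, depths


# Toy records, invented for this example (not real data).
toy_vcf = "\n".join([
    "##fileformat=VCFv4.2",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO",
    "chr1\t100\t.\tA\tG\t50.0\tPASS\tDP=20",
    "chr1\t200\t.\tC\tT\t10.0\tPASS\tDP=5",
    "chr1\t300\t.\tG\tGA\t99.0\tPASS\tDP=30",  # insertion, skipped
    "chr1\t400\t.\tT\tC\t30.0\tPASS\tDP=35",
])

quals, depths = summarize_vcf(io.StringIO(toy_vcf))
print("median QUAL:", statistics.median(quals))
print("median DP:", statistics.median(depths))
```

Looking at distributions like these on your own data, rather than adopting a fixed cutoff, is what lets you choose thresholds that suit a particular dataset.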
