官术网_书友最值得收藏!

Processing NGS data with HTSeq

HTSeq (https://htseq.readthedocs.io) is an alternative library that's used for processing NGS data. Most of the functionality made available by HTSeq is actually available in other libraries covered in this book, but you should be aware of it as an alternative way of processing NGS data. HTSeq supports, among others, FASTA, FASTQ, SAM (via pysam), VCF, GFF, and Browser Extensible Data (BED) file formats. It also includes a set of abstractions for processing (mapped) genomic data, encompassing concepts like genomic positions and intervals or alignments. A complete examination of the features of this library is beyond our scope, so we will concentrate on a small subset of features. We will take this opportunity to also introduce the BED file format.

The BED format allows for the specification of features for annotations tracks. It has many uses, but it's common to load BED files into genome browsers to visualize features. Each line includes information about at least the position (chromosome, start and end) and also optional fields such as name or strand. Full details about the format can be found at https://genome.ucsc.edu/FAQ/FAQformat.html#format1.

主站蜘蛛池模板: 新营市| 郓城县| 都江堰市| 红安县| 怀化市| 什邡市| 仪陇县| 分宜县| 玉环县| 临潭县| 山阳县| 南丹县| 大余县| 门头沟区| 拉萨市| 普宁市| 柘荣县| 长丰县| 甘孜县| 长汀县| 牟定县| 浦县| 洪泽县| 丁青县| 昭觉县| 昌都县| 元朗区| 邹平县| 峨眉山市| 宁河县| 襄樊市| 图片| 平顶山市| 麻栗坡县| 新闻| 巴塘县| 元阳县| 惠水县| 汾西县| 松江区| 社旗县|