preprintbioRxiv (Cold Spring Harbor Laboratory)Feb 20, 2014GREEN OA

HTSeq – A Python framework to work with high-throughput sequencing data

European Molecular Biology Laboratory

Indexed incrossref

Abstract

ABSTRACT Motivation: A large choice of tools exists for many standard tasks in the analysis of high-throughput sequencing (HTS) data. However, once a project deviates from standard work flows, custom scripts are needed. Results: We present HTSeq, a Python library to facilitate the rapid development of such scripts. HTSeq offers parsers for many common data formats in HTS projects, as well as classes to represent data such as genomic coordinates, sequences, sequencing reads, alignments, gene model information, variant calls, and provides data structures that allow for querying via genomic coordinates. We also present htseq-count, a tool developed with HTSeq that preprocesses RNA-Seq data for differential…

Citation impact

1,214
total citations
FWCI
Percentile
References
20
Citations per year

Authors

3

Topics & keywords

Keywords
  • Python (programming language)
  • Computer science
  • Throughput
  • Programming language
  • Operating system
No related works found for this paper.