Pairtools: From sequencing data to chromosome contacts
University of Massachusetts Chan Medical School · University of Southern California · +4 more institutions
Abstract
The field of 3D genome organization produces large amounts of sequencing data from Hi-C and a rapidly-expanding set of other chromosome conformation protocols (3C+). Massive and heterogeneous 3C+ data require high-performance and flexible processing of sequenced reads into contact pairs. To meet these challenges, we present pairtools-a flexible suite of tools for contact extraction from sequencing data. Pairtools provides modular command-line interface (CLI) tools that can be flexibly chained into data processing pipelines. The core operations provided by pairtools are parsing of.sam alignments into Hi-C pairs, sorting and removal of PCR duplicates. In addition, pairtools provides auxiliary tools for building…
Citation impact
- FWCI
- 43.26
- Percentile
- 100%
- References
- 68
Authors
8- OOpen2CCorresponding
University of Massachusetts Chan Medical School
- NANezar AbdennurCorresponding
University of Southern California
- GFGeoffrey FudenbergCorresponding
Friedrich Miescher Institute
- IMIlya M. FlyamerCorresponding
Institute of Molecular Biotechnology, Austrian Academy of Sciences, Massachusetts Institute of Technology
- AAAleksandra A. GalitsynaCorresponding
Institute of Molecular Biotechnology, Austrian Academy of Sciences
Topics & keywords
- Computer science
- Python (programming language)
- Suite
- Modular design
- Benchmarking
- Metagenomics
- Pipeline transport
- Protocol (science)