Detecting sequence signals in targeting peptides using deep learning
Technical University of Denmark · Stockholm University · +5 more institutions
Abstract
In bioinformatics, machine learning methods have been used to predict features embedded in the sequences. In contrast to what is generally assumed, machine learning approaches can also provide new insights into the underlying biology. Here, we demonstrate this by presenting TargetP 2.0, a novel state-of-the-art method to identify N-terminal sorting signals, which direct proteins to the secretory pathway, mitochondria, and chloroplasts or other plastids. By examining the strongest signals from the attention layer in the network, we find that the second residue in the protein, that is, the one following the initial methionine, has a strong influence on the classification. We observe that two-thirds of…
Citation impact
- FWCI
- 38.11
- Percentile
- 100%
- References
- 41
Authors
7- JJJosé Juan Almagro Armenteros
Technical University of Denmark
- MSMarco Salvatore
Stockholm University, Science for Life Laboratory
- OEOlof Emanuelsson
Science for Life Laboratory, KTH Royal Institute of Technology
- OWOle Winther
University of Copenhagen, Copenhagen University Hospital, Rigshospitalet, Technical University of Denmark
- GVGunnar von Heijne
Stockholm University, Science for Life Laboratory
Topics & keywords
- Protein targeting
- Plastid
- Computational biology
- Chloroplast
- Methionine
- Amino acid
- Transit Peptide
- Alanine