Article · Nucleic Acids Research · Apr 19, 2022 · Gold OA

DeepLoc 2.0: multi-label subcellular localization prediction using protein language models

Indian Institute of Technology Madras · University of Copenhagen · +5 more institutions

Indexed in: Crossref, DOAJ, PubMed

Abstract

The prediction of protein subcellular localization is of great relevance for proteomics research. Here, we propose an update to the popular tool DeepLoc with multi-localization prediction and improvements in both performance and interpretability. For training and validation, we curate eukaryotic and human multi-location protein datasets with stringent homology partitioning and enriched with sorting signal information compiled from the literature. We achieve state-of-the-art performance in DeepLoc 2.0 by using a pre-trained protein language model. It has the further advantage that it uses sequence input rather than relying on slower protein profiles. We provide two means of better interpretability: an attention…

Funding