preprintbioRxiv (Cold Spring Harbor Laboratory)Feb 26, 2022GREEN OA

Dictionary learning for integrative, multimodal, and scalable single-cell analysis

New York Genome Center · New York University · +2 more institutions

Indexed incrossref

Abstract

Abstract Mapping single-cell sequencing profiles to comprehensive reference datasets represents a powerful alternative to unsupervised analysis. Reference datasets, however, are predominantly constructed from single-cell RNA-seq data, and cannot be used to annotate datasets that do not measure gene expression. Here we introduce ‘bridge integration’, a method to harmonize singlecell datasets across modalities by leveraging a multi-omic dataset as a molecular bridge. Each cell in the multi-omic dataset comprises an element in a ‘dictionary’, which can be used to reconstruct unimodal datasets and transform them into a shared space. We demonstrate that our procedure can accurately harmonize transcriptomic data…

Citation impact

281
total citations
FWCI
Percentile
References
71
Citations per year

Authors

11

Topics & keywords

Keywords
  • Computer science
  • Scalability
  • Mass cytometry
  • Modalities
  • Computational biology
  • Information retrieval
  • Chromatin
  • Data mining
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.