Self-referencing embedded strings (SELFIES): A 100% robust molecular string representation

Vector Institute · University of Toronto · +3 more institutions

Indexed indatacite

Abstract

The discovery of novel materials and functional molecules can help to solve some of society's most urgent challenges, ranging from efficient energy harvesting and storage to uncovering novel pharmaceutical drug candidates. Traditionally matter engineering -- generally denoted as inverse design -- was based massively on human intuition and high-throughput virtual screening. The last few years have seen the emergence of significant interest in computer-inspired designs based on evolutionary or deep learning methods. The major challenge here is that the standard strings molecular representation SMILES shows substantial weaknesses in that task because large fractions of strings do not correspond to valid…

Citation impact

726
total citations
FWCI
36.49
Percentile
100%
References
66
Citations per year

Authors

5

Topics & keywords

Keywords
  • Computer science
  • String (physics)
  • Representation (politics)
  • Intuition
  • Theoretical computer science
  • Generative model
  • Artificial intelligence
  • Generative grammar
UN Sustainable Development Goals
  • Affordable and clean energy
No related works found for this paper.

Funding