articleMachine Learning Science and TechnologyJul 27, 2020GOLD OA

Self-referencing embedded strings (SELFIES): A 100% robust molecular string representation

MKMario KrennFHFlorian HäseANAkshatKumar NigamPFPascal FriederichAAAlan Aspuru-Guzik

University of Toronto · Vector Institute · +3 more institutions

Indexed inarxivcrossrefdoaj

Abstract

Abstract The discovery of novel materials and functional molecules can help to solve some of society’s most urgent challenges, ranging from efficient energy harvesting and storage to uncovering novel pharmaceutical drug candidates. Traditionally matter engineering–generally denoted as inverse design–was based massively on human intuition and high-throughput virtual screening. The last few years have seen the emergence of significant interest in computer-inspired designs based on evolutionary or deep learning methods. The major challenge here is that the standard strings molecular representation SMILES shows substantial weaknesses in that task because large fractions of strings do not correspond to valid…

Citation impact

582
total citations
FWCI
20.20
Percentile
100%
References
26
Citations per year

Authors

5
  • MK
    Mario KrennCorresponding

    University of Toronto, Vector Institute

  • FH
    Florian Häse

    Harvard University, University of Toronto, Vector Institute

  • AN
    AkshatKumar Nigam

    University of Toronto

  • PF
    Pascal Friederich

    Karlsruhe Institute of Technology, University of Toronto

  • AA
    Alan Aspuru-GuzikCorresponding

    Canadian Institute for Advanced Research, University of Toronto, Vector Institute

Topics & keywords

Keywords
  • String (physics)
  • Intuition
  • Representation (politics)
  • Interpretation (philosophy)
  • Task (project management)
  • Generative model
No related works found for this paper.

Funding