In silico fragmentation for computer assisted identification of metabolite mass spectra
Leibniz Association · Leibniz Institute of Plant Biochemistry · +1 more institution
Abstract
Mass spectrometry has become the analytical method of choice in metabolomics research. The identification of unknown compounds is the main bottleneck. In addition to the precursor mass, tandem MS spectra carry informative fragment peaks, but the coverage of spectral libraries of measured reference compounds are far from covering the complete chemical space. Compound libraries such as PubChem or KEGG describe a larger number of compounds, which can be used to compare their in silico fragmentation with spectra of unknown metabolites.
We created the MetFrag suite to obtain a candidate list from compound libraries based on the precursor mass, subsequently ranked by the agreement between measured and in silico fragments. In the evaluation MetFrag was able to rank most of the correct compounds within the top 3 candidates returned by an exact mass query in KEGG. Compared to a previously published study, MetFrag obtained better results than the commercial MassFrontier software. Especially for large compound libraries, the candidates with a good score show a high structural similarity or just different stereochemistry, a subsequent clustering based on chemical distances reduces this redundancy. The in silico fragmentation requires less than a second to process a molecule, and MetFrag performs a search in KEGG or PubChem on average within 30 to 300 seconds, respectively, on an average desktop PC.
Citation impact
- FWCI
- 11.98
- Percentile
- 100%
- References
- 19
Authors
4- SISebastian I. WolfCorresponding
Leibniz Association, Leibniz Institute of Plant Biochemistry
- SSStephan Schmidt
Leibniz Institute of Plant Biochemistry, Leibniz Association
- MMMatthias Müller‐Hannemann
Martin Luther University Halle-Wittenberg
- SNSteffen Neumann
Leibniz Institute of Plant Biochemistry, Leibniz Association
Topics & keywords
- PubChem
- In silico
- Fragmentation (computing)
- Computer science
- Chemical space
- KEGG
- chEMBL
- Tandem mass spectrometry