12 Data Processing, Metabolomic Databases and Pathway Analysis

Annual Plant Reviews book series, Volume 43: Biology of Plant Metabolomics
Oliver Fiehn

Oliver Fiehn

UC Davis Genome Center, 451 Health Sci Dr, Davis, CA, 95616 USA

Search for more papers by this author
Tobias Kind

Tobias Kind

UC Davis Genome Center, 451 Health Sci Dr, Davis, CA, 95616 USA

Search for more papers by this author
Dinesh Kumar Barupal

Dinesh Kumar Barupal

UC Davis Genome Center, 451 Health Sci Dr, Davis, CA, 95616 USA

Search for more papers by this author
First published: 19 April 2018
Citations: 1
This article was originally published in 2011 in Biology of Plant Metabolomics, Volume 43 (ISBN 9781405199544) of the Annual Plant Reviews book series, this volume edited by Robert D. Hall. The article was republished in Annual Plant Reviews online in April 2018.

Abstract

Metabolomics relies on a variety of bioinformatics tools aiming to derive information from data. Data acquisition with chromatography-coupled mass spectrometry yields high volumes of raw data that need to be processed to achieve de-noised and meaningful biochemical data that subsequently can be presented for statistics and pathway analysis in a coherent manner. A variety of new tools and algorithms have recently been developed that help in this process. Nevertheless, a surprisingly low number of metabolic signals can be unambiguously identified. If cutting-edge methods are used, we can confidently interpret signals that can neither be identified by reference standards nor annotated by database matching as ‘novel metabolites’. Such methods may comprise high-resolution, high-accurate mass data in conjunction with good data alignment programs that perform peak picking and deconvolution, calculation of elemental formulas, database queries and constraining hit lists by interpreting mass spectral fragmentations and retention-based metabolomic libraries. In an effort to improve standardizations for metabolomic reports, we propose that not only metabolites must be named but also, for all reported metabolites, identifiers or structure codes (InChI keys) from public (bio)chemical databases need to be detailed. Machine-readable structures will vastly accelerate our knowledge of the magnitude and importance of the metabolome in various plant species, organs and physiological conditions. We detail how well-annotated metabolome data can then be used to visualize and interrogate biochemical pathways by matching information to the broad plant databases that currently exist.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.