BinDiscover

Publication

Article in the Journal of Cheminformatics.

Production Website

http://bindiscover.metabolomics.us/

Technologies

DB->Pipeline->DB<-API->Frontend

asdfasdfasdfasdf.png

Introduction

The Fiehnlab/West Coast Metabolomics Center (WCMC) has performed metabolomics analysis on 160,000 samples over the last 21 years. We wanted to 1) make those data accessible to everyone and 2) transform the mountains of data into simple, human-readable results.

To do this, we transformed the data from the WCMC database, then made it accessible via a webtool. This webtool also implements our novel approach to harmonize disparate differential analysis.

Results

Production website

The main result is the production website

Wrangling metadata into hierachies

image%20%286%29.png
  1. BinBase records observations from 156,174 metabolomic samples run on a GC-TOF mass spectrometer from 2005-2021. Corresponding biological metadata were curated and the resulting annotation table formed the basis of the exploratory webtool BinDiscover.

  2. BinDiscover associates metabolite intensities across species, organs, and diseases. Established ontologies are used to order biological metadata for queries. For metabolites, we used the ClassyFire ontology to enable compound class-level queries.

  3. Biological metadata are associated with all samples and are represented and can be queried via different ontology levels, such as “digestive system” or “bacteria”. Species, organ and disease ontologies are highlighted by colors. We call this Ontologically Grouped Differential Analysis (OGDA).

Example Investigation

image%20%287%29.png

An example of how to use different aspects of the webtool to bring a complete story about the common apple.

  1. Comparing (Apple, Fruit, No Disease) vs. (Fig, Fruit, No Disease). We obtain a plethora of information comparing these two sample groups.

  2. Comparing (Apple, Fruit, No Disease) vs. (All Species, Fruit, No Disease) with Ontologically Grouped Differential Analysis. Based on the complexity of the first results, apples are compared to all fruit. Here, only a small handful of compounds are observed as being unique to the apple’s metabolome. An example compound, tagatose, is labeled.

  3. Tagatose intensity across all metadata combinations. Further investigation of tagatose with a sunburst diagram reveals that it has the greatest observed intensity in apples among of any type of sample in BinDiscover.

  4. Tagatose compound information.

Explanation of Ontologically Grouped Differential Analysis (OGDA)

Here, we explain how we generate results involving wide groupings of species or organs.

image%20%288%29.png
  1. We perform two OGDA - first, between (Human, Digestive System, Cancer) vs (Human, Blood, No disease) then between (Human, Digestive System, No Disease) vs (Human, Blood, No disease). Combining data across experiments and organs is challenging, and the OGDA extracts robust candidates by filtering out inconsistent metabolites. Two lists of conservatively estimated fold-change and significance values are produced for each compound. Those compounds with both positive and negative fold-changes in the pairwise comparison are assigned a fold change value of zero.

  2. Our approach has successfully produced a tractable list of candidates to explore. By including multiple combinations in our groups, we reduce the list of results to include only high-confidence compounds.