2 Easy Steps To Create Your Standard Metadata File

Metadata is information about data. For a bioinformatician who deals with metabolomics data, making a metadata file is a standard and repetitive step in the day-to-day workflow. While performing a mass spectrometry experiment in a metabolomics lab, the bioinformatician inputs the samples' relative concentrations into the machine. The machine's output is fed to tools like El-MAVEN or Multiquant, which map out intensities for those samples.

However, some downstream analyses, such as Kinetic Flux Analysis, need the absolute concentrations of the unknown samples (biosamples). That's where Polly QuantFit comes in handy.

Polly QuantFit is a cloud-based app that helps with the absolute quantification of metabolites. For the standard (known) samples, intensities are mapped to concentrations to find the best curve fit. The intensities of the unknown samples are then mapped onto this curve to obtain their concentrations.
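To make the idea of a calibration curve concrete, here is a minimal sketch in Python. It assumes a straight-line fit, which is only one possible curve model; the post does not say which models QuantFit actually uses. The function names and the sample values are hypothetical and are not part of Polly.

```python
def fit_line(concentrations, intensities):
    """Ordinary least-squares fit of: intensity = slope * concentration + intercept."""
    n = len(concentrations)
    mean_x = sum(concentrations) / n
    mean_y = sum(intensities) / n
    sxx = sum((x - mean_x) ** 2 for x in concentrations)
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(concentrations, intensities))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def intensity_to_concentration(intensity, slope, intercept):
    """Invert the fitted line to estimate the concentration of an unknown sample."""
    return (intensity - intercept) / slope


# Hypothetical standards for one metabolite: known concentrations and measured intensities.
std_concentrations = [1.0, 2.0, 5.0, 10.0]
std_intensities = [2100.0, 4100.0, 10100.0, 20100.0]

slope, intercept = fit_line(std_concentrations, std_intensities)

# Hypothetical unknown sample with a measured intensity of 8100.
unknown_concentration = intensity_to_concentration(8100.0, slope, intercept)
print(round(unknown_concentration, 3))
```

The standard metadata file supplies exactly the `std_concentrations` side of this fit, once per metabolite and per standard sample.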

Why Do We Need Standard Metadata?

Standard metadata tells us two things:
a. Among all samples, which samples are the standard samples?
b. What is the concentration of all the metabolites in all standard samples?

Fig 3. What the metadata file looks like

The Problem in the Traditional Way of Creating a Metadata File

For a typical metabolomic dataset, the number of samples and metabolites is as given in Fig 4.

Once the user has identified the standard samples, the next step is to fill in their concentrations in an Excel file.

Now, let’s do some maths to understand what this means for a user in two scenarios:
a. Best case: 5 standard samples and 100 metabolites
Number of cells to be filled: 5 * 100 = 500
b. Worst case: 10 standard samples and 500 metabolites
Number of cells to be filled: 10 * 500 = 5000

Clearly, the problem is the time, the complexity, and the risk of error involved in manually filling up to 5000 cells in an Excel sheet for a single dataset.

How Polly Solved It

a. Select and move standard samples
Polly auto-detects samples whose names begin with std/STD, the common nomenclature for standards, and lists them as standard samples. It also lets the user select and move samples between the two lists, in case a sample was missed or the user follows a different naming convention.

Fig 5. Select Standard Samples
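The detection step can be pictured with a short Python sketch. This is an illustration only, not Polly's implementation: it assumes a case-insensitive match on the prefix (so it also accepts spellings such as "Std"), and the function names and sample names are hypothetical.

```python
def split_samples(sample_names, prefix="std"):
    """Split sample names into (standards, unknowns) by a case-insensitive prefix."""
    standards, unknowns = [], []
    for name in sample_names:
        if name.lower().startswith(prefix):
            standards.append(name)
        else:
            unknowns.append(name)
    return standards, unknowns


def move_sample(name, source, target):
    """Manually move one sample from one list to the other."""
    source.remove(name)
    target.append(name)


# Hypothetical sample names; "Cal_3" is a standard that follows a different convention.
samples = ["STD_1", "std_2", "Cal_3", "Liver_A", "Liver_B"]

standards, unknowns = split_samples(samples)

# The prefix rule misses "Cal_3", so the user moves it by hand.
move_sample("Cal_3", unknowns, standards)

print(standards)
print(unknowns)
```

The manual move is what covers naming conventions that a prefix rule cannot anticipate.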

b. Fill Concentration
The next step is to fill in the concentration for all the metabolites.

Fig 6. Fill Concentrations of all metabolites for all samples

This step differs from filling concentrations in an Excel file in three ways:

1. A very common use case is to fill in the same concentration for all metabolites within each standard sample. In Excel, the user would need to either type these concentrations one by one or copy and paste the values across all columns multiple times. Polly gives the user the option to apply the same concentrations to all or selected metabolites in just one click.
2. Polly offers the option to fill the remaining empty cells with 0 or NA: 0 when the concentration of the standard is zero, and NA when the standard concentrations apply to only a pool of metabolites.
3. To find a metabolite towards the last columns, the user can type its name in the search bar instead of scrolling all the way, and the screen slides to that metabolite automatically.
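The first two options can be sketched in Python as operations on a grid of standard samples by metabolites. This is a hypothetical model of the behaviour described above, not Polly's code: the grid layout, the function names, and the metabolite names are all assumptions made for illustration.

```python
def empty_grid(standards, metabolites):
    """One row per standard sample, one cell per metabolite, all initially empty."""
    return {sample: {m: None for m in metabolites} for sample in standards}


def apply_concentrations(grid, per_sample, metabolites=None):
    """Apply one concentration per sample to all metabolites, or to a selected subset."""
    for sample, concentration in per_sample.items():
        targets = list(grid[sample]) if metabolites is None else metabolites
        for metabolite in targets:
            grid[sample][metabolite] = concentration


def fill_blanks(grid, value):
    """Fill every cell that is still empty with a single value such as 0 or "NA"."""
    for row in grid.values():
        for metabolite in row:
            if row[metabolite] is None:
                row[metabolite] = value


standards = ["STD_1", "STD_2"]
metabolites = ["citrate", "lactate", "pyruvate"]
grid = empty_grid(standards, metabolites)

# The standards apply only to a pool of metabolites (citrate and lactate).
apply_concentrations(grid, {"STD_1": 1.0, "STD_2": 10.0},
                     metabolites=["citrate", "lactate"])

# Everything outside that pool is marked as not applicable.
fill_blanks(grid, "NA")

print(grid["STD_2"])
```

With 10 standard samples and 500 metabolites, the same two calls would cover all 5000 cells.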

Polly has reduced the time and UX complexity of filling 5000 cells in an Excel sheet to just a few clicks.

What Next?

a. This format will be implemented in other Metadata interfaces like the Sample Cohort and Normalization Factor Interface.

b. Work is under way to implement the interface for MS/MS data as well.

Fig 9. Metadata Interface for MS/MS Data

c. We will continue to act on the critical insights gained from our users to further improve the user experience.

At Elucidata, we are working to solve more such challenges faced by scientists through our platform, Polly, to accelerate the process of target discovery. To know more about Polly, click here.
