To effectively train predictive models and drug discovery, we need large volumes of high-quality, clean, and linked data. However, preparing these data can be a costly and time-consuming task. In this webinar, Mya Steadman elucidates on the theme- “Building a Solid Data Foundation for Cutting Edge ML” and discusses strategies for training and enhancing the accuracy of predictive models.
Key Points Addressed
To effectively train predictive models and drug discovery, we need large volumes of high-quality, clean, and linked data. However, preparing these data can be a costly and time-consuming task. In this webinar, Mya Steadman elucidates on the theme- “Building a Solid Data Foundation for Cutting Edge ML” and discusses strategies for training and enhancing the accuracy of predictive models.
Key Points Addressed
Scaling clinico-genomic data integration: Large pharmaceutical organizations working with external data providers used Polly to build interoperable clinico-genomic data products 6x faster.
Although purchased datasets are often labeled as "clean," they still lack interoperability—Polly's pipelines bridge this gap with robust integration and harmonization.
Information Retrieval: Drug safety monitoring teams used Polly's Knowledge Graph powered co-scientist to conversationally retrieve the right cohorts & assess drug response—cutting discovery time by 70%.
If you’re working with complex biological data, you may be asking:
Can generative AI truly assist in scientific reasoning, not just data analysis?
What does it mean for hypothesis generation, literature review, or even designing experiments?
Could this accelerate—not replace—my discovery pipeline?
Whether you're skeptical, curious, or already experimenting with AI in your lab—this is a session designed to ground your understanding in evidence, not speculation.