Genomic Data in Python

Uploaded: 2021-04-21T08:33:32+0530

admin

Abstract:

As the number of available sequenced datasets grows, analysis of heterogeneously processed data can answer increasingly relevant biological and clinical questions. Scientists find it challenging to build efficient and reproducible data extraction and analysis pipelines over such datasets. Available software packages are suitable for analyzing the experimental files of these datasets one by one, but they do not scale to thousands of experiments. Moreover, they lack proper support for metadata manipulation.

The PyGMQL package integrates scalable data extraction, which runs on the Apache Spark engine underlying the GMQL implementation, with native Python support for interactive data analysis and visualization. It supports data interoperability, resolving the impedance mismatch between executing set-oriented queries and programming in Python. PyGMQL provides distribution transparency (the ability to address a remote dataset) and query outsourcing (the ability to assign processing to a remote service) in an orthogonal way: where the data resides and where the query runs can be chosen independently. Outsourced processing can address cloud-based installations of the GMQL engine.
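To make the idea of a set-oriented query with metadata support more concrete, here is a minimal pure-Python sketch. It is not the PyGMQL API: the `Sample` class, the `select` function, and the example data are all hypothetical, chosen only to illustrate how one query can filter many samples by their metadata and their genomic regions in a single step, instead of processing experiment files one by one.

```python
from dataclasses import dataclass, field


# Toy model, NOT the PyGMQL API: every name below is hypothetical.
@dataclass
class Sample:
    metadata: dict                               # e.g. {"cell": "HeLa", "assay": "ChIP-seq"}
    regions: list = field(default_factory=list)  # (chrom, start, stop, score) tuples


def select(dataset, meta_predicate, region_predicate):
    """Set-oriented selection: one query applied to every sample at once.

    A sample is kept when its metadata satisfies meta_predicate; within the
    kept samples, only regions satisfying region_predicate survive.
    """
    return [
        Sample(s.metadata, [r for r in s.regions if region_predicate(r)])
        for s in dataset
        if meta_predicate(s.metadata)
    ]


# Made-up example data: three samples with different metadata.
dataset = [
    Sample({"cell": "HeLa", "assay": "ChIP-seq"},
           [("chr1", 100, 200, 0.9), ("chr2", 50, 80, 0.2)]),
    Sample({"cell": "K562", "assay": "ChIP-seq"},
           [("chr1", 300, 400, 0.7)]),
    Sample({"cell": "HeLa", "assay": "RNA-seq"},
           [("chr3", 10, 20, 0.5)]),
]

# Keep HeLa samples only, and within them regions scoring at least 0.5.
result = select(
    dataset,
    meta_predicate=lambda m: m["cell"] == "HeLa",
    region_predicate=lambda r: r[3] >= 0.5,
)

for sample in result:
    print(sample.metadata, sample.regions)
```

In this toy version everything runs in local memory. The point of the package described above is that the same kind of query can be expressed from Python while the data and the execution live on a scalable, possibly remote, engine.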
