gpuZoo: Cost-effective estimation of gene regulatory networks using the Graphics Processing Unit.
Marouen Ben Guebila, Daniel C Morgan, Kimberly Glass, Marieke L. Kuijjer, Dawn L. DeMeo, John Quackenbush
NAR Genomics and Bioinformatics
doi.org/10.1093/nargab/lqac002
Gene regulatory network inference allows for the modeling of genome-scale regulatory processes that are altered during development, in disease, and in response to perturbations. Our group has developed a collection of tools to model various regulatory processes, including transcriptional (PANDA, SPIDER) and post-transcriptional (PUMA) gene regulation, as well as gene regulation in individual samples (LIONESS). These methods work by postulating a network structure and then optimizing that structure to be consistent with multiple lines of biological evidence through repeated operations on data matrices. Although our methods are widely used, the corresponding computational complexity, and the associated costs and run times, do limit some applications. To improve the cost/time performance of these algorithms, we developed gpuZoo which implements GPU-accelerated calculations, dramatically improving the performance of these algorithms. The runtime of the gpuZoo implementation in MATLAB and Python is up to 61 times faster and 28 times less expensive than multi-core CPU implementation of the same methods. gpuZoo is available in MATLAB through the netZooM package https://github.com/netZoo/netZooM and in Python through the netZooPy package https://github.com/netZoo/netZooPy.
This is the data used for the benchmarks of gpuZoo.
Model | Motif | PPI | Expression | Mode | TFs | Genes |
---|---|---|---|---|---|---|
Small network | download | download | download | Intersection | 652 | 1000 |
Protein-coding genes network | download | download | download | Union | 652 | 27149 |
Transcript network | download | download | download | Union | 1603 | 43698 |
To reproduce the benchmarks, check the github repository gpuZoo.