SciELO - Scientific Electronic Library Online

 
vol.26 número4Distributed Geometric Multigrid Method: Analysis of a V Cycle Truncation Level CriteriaTowards an Active Foveated Approach to Computer Vision índice de autoresíndice de materiabúsqueda de artículos
Home Pagelista alfabética de revistas  

Servicios Personalizados

Revista

Articulo

Indicadores

Links relacionados

  • No hay artículos similaresSimilares en SciELO

Compartir


Computación y Sistemas

versión On-line ISSN 2007-9737versión impresa ISSN 1405-5546

Resumen

CRUZ, Lucas et al. Parallel Performance and I/O Profiling of HPC RNA-Seq Applications. Comp. y Sist. [online]. 2022, vol.26, n.4, pp.1625-1633.  Epub 17-Mar-2023. ISSN 2007-9737.  https://doi.org/10.13053/cys-26-4-4437.

Transcriptomics experiments are often expressed as scientific workflows and benefit from high-performance computing environments. In these environments, workflow management systems can allow handling independent or communicating tasks across nodes, which may be heterogeneous. Specifically, transcriptomics workflows may treat large volumes of data. ParslRNA-Seq is a workflow for analyzing RNA-Seq experiments, which efficiently manages the estimation of differential gene expression levels from raw sequencing reads and can be executed in varied computational environments, ranging from personal computers to high-performance computing environments with parallel scripting library Parsl. In this work, we aim to investigate CPU and I/O metrics critical for improving the efficiency and resilience of current and upcoming RNA-Seq workflows. Based on the resulting profiling of CPU and I/O data collection, we demonstrate that we can correctly identify anomalies of transcriptomics workflow performance that is an essential resource to optimize its use of high-performance computing systems.

Palabras llave : Supercomputing; sorkflow; RNA-seq.

        · texto en Inglés     · Inglés ( pdf )