Skip to Main Content (Press Enter)

Logo UNITO
  • ×
  • Home
  • Pubblicazioni
  • Progetti
  • Persone
  • Competenze
  • Settori
  • Strutture
  • Terza Missione

UNI-FIND
Logo UNITO

|

UNI-FIND

unito.it
  • ×
  • Home
  • Pubblicazioni
  • Progetti
  • Persone
  • Competenze
  • Settori
  • Strutture
  • Terza Missione
  1. Pubblicazioni

Porting the Variant Calling Pipeline for NGS data in cloud-HPC environment

Contributo in Atti di convegno
Data di Pubblicazione:
2023
Abstract:
In recent years we have understood the importance of analyzing and sequencing human genetic variation. A relevant aspect that emerged from the Covid-19 pandemic was the need to obtain results very quickly; this involved using High-Performance Computing (HPC) environments to execute the Next Generation Sequencing (NGS) pipeline. However, HPC is not always the most suitable environment for the entire execution of a pipeline, especially when it involves many heterogeneous tools. The ability to execute parts of the pipeline on different environments can lead to higher performance but also cheaper executions. This work shows the design and optimization process that led us to a state-of-the-art Variant Calling hybrid workflow based on the StreamFlow Workflow Management System (WfMS). We also compare StreamFlow with Snakemake, an established WfMS targeting HPC facilities, observing comparable performance on single environments and satisfactory improvements with a hybrid cloud-HPC configuration.
Tipologia CRIS:
04A-Conference paper in volume
Keywords:
cloud computing, High Performance Computing, Hybrid workflow, StreamFlow
Elenco autori:
Alberto Mulone; Sherine Awad; Davide Chiarugi; Marco Aldinucci
Autori di Ateneo:
ALDINUCCI Marco
MULONE ALBERTO
Link alla scheda completa:
https://iris.unito.it/handle/2318/1919364
Link al Full Text:
https://iris.unito.it/retrieve/handle/2318/1919364/1166786/paper.pdf
Titolo del libro:
2023 IEEE 47th Annual Computers, Software, and Applications Conference (COMPSAC)
Progetto:
Third Party - "ACROSS - HPC Big DAta ArtifiCial Intelligence cross Stack PlatfoRm TOwards ExaScale" (EuroHPC-02-2019)
  • Aree Di Ricerca

Aree Di Ricerca

Settori (4)


PE6_2 - Distributed systems, parallel computing, sensor networks, cyber-physical systems - (2022)

CIBO, AGRICOLTURA e ALLEVAMENTI - Farmacologia Veterinaria

ECONOMIA, AZIENDE E ORGANIZZAZIONI - Sistemi e metodologie per la Qualità

INFORMATICA, AUTOMAZIONE e INTELLIGENZA ARTIFICIALE - Industria X.0
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 25.4.2.0