Data di Pubblicazione:
2025
Abstract:
Due to new workloads arising with workflows and AI applications, dynamic resource allocation can improve productivity across all system and application levels by adapting the applications' configurations to the system's resources. However, HPC system software is not suited nowadays to provide dynamic resource management (DRM) for computing or I/O resources in runtime. This paper presents the ADMIRE framework and its mechanisms for dynamic resource management. ADMIRE's efforts enable an integrated stack in which storage, compute, and orchestration layers were co-designed and validated across a shared, heterogeneous testbed.
Tipologia CRIS:
04A-Conference paper in volume
Keywords:
Adaptivity; Dynamic resource management; High-Performance Computing; Intelligent control; Performance models
Elenco autori:
Carretero, Jesus; Singh, David E.; Fernandez-Muñoz, Javier; Sedona, Rocco; Pernice, Simone; Cantalupo, Barbara; Aldinucci, Marco; Torquati, Massimo; Tarraf, Ahmad; Jeannot, Emmanuel; Wolf, Felix
Link alla scheda completa:
Titolo del libro:
Procedia Computer Science