• Guilherme R. Pretto
  • Bruno L. Dalmazo
  • Jonatas A. Marques
  • Zhongke Wu
  • Xingce Wang
  • Vladimir Korkhov
  • Philippe O.A. Navaux
  • Luciano Paschoal Gaspary

Data centers, clusters, and grids have historically supported High-Performance Computing (HPC) applications. Due to the high capital and operational expenditures associated with such infrastructures, we have witnessed consistent efforts to run HPC applications in the cloud in the recent past. The potential advantages of this shift include higher scalability and lower costs. If, on the one hand, app instantiation—through customized Virtual Machines (VMs)—is a well-solved issue, on the other, the network still represents a significant bottleneck. When switching HPC applications to be executed on the cloud, we lose control of where VMs will be positioned and of the paths that will be traversed for processes to communicate with one another. To bridge this gap, we present Janus, a framework for dynamic, just-in-time path provisioning in cloud infrastructures. By leveraging emerging software-defined networking principles, the framework allows for an HPC application, once deployed, to have interprocess communication paths configured upon usage based on least-used network links (instead of resorting to shortest, pre-computed paths). Janus is fully configurable to cope with different operating parameters and communication strategies, providing a rich ecosystem for application execution speed up. Through an extensive experimental evaluation, we provide evidence that the proposed framework can lead to significant gains regarding runtime. Moreover, we show what one can expect in terms of system overheads, providing essential insights on how better benefiting from Janus.

Original languageEnglish
Pages (from-to)947-964
Number of pages18
JournalCluster Computing
Volume25
Issue number2
Early online date23 Nov 2021
DOIs
StatePublished - Apr 2022

    Research areas

  • Cloud infrastructures, Framework, HPC applications, Link usage-aware path provisioning, NETWORK

    Scopus subject areas

  • Software
  • Computer Networks and Communications

ID: 89177930