TU Darmstadt / ULB / TUbiblio

Holistic Resource Scheduling for Data Center In-Network Computing

Blöcher, Marcel ; Wang, Lin ; Eugster, Patrick ; Schmidt, Max (2022)
Holistic Resource Scheduling for Data Center In-Network Computing.
In: IEEE/ACM Transactions on Networking, (Early Access)
doi: 10.1109/TNET.2022.3174783
Artikel, Bibliographie

Kurzbeschreibung (Abstract)

The recent trend towards more programmable switching hardware in data centers opens up new possibilities for distributed applications to leverage in-network computing (INC). Literature so far has largely focused on individual application scenarios of INC, leaving aside the problem of coordinating usage of potentially scarce and heterogeneous switch resources among multiple INC scenarios, applications, and users. Alas, the traditional model of resource pools of isolated compute containers does not fit an INC-enabled data center. This paper describes HIRE, a holistic INC-aware resource manager which allows for server-local and INC resources to be coordinated in unison. HIRE introduces a novel flexible resource (meta-)model to address heterogeneity and resource interchangeability, and includes two approaches for INC scheduling: (a) retrofitting existing schedulers; (b) designing a new one. For (a), HIRE presents a retrofitting API and demonstrates it with four state-of-the-art schedulers. For (b), HIRE proposes a flow-based scheduler, cast as a min-cost max-flow problem, where a unified cost model is used to integrate the different costs. Experiments with a workload trace of a 4000 machine cluster show that HIRE makes better use of INC resources by serving 8-30% more INC requests, while simultaneously reducing network detours by 20% and reducing tail placement latency by 50%.

Typ des Eintrags: Artikel
Erschienen: 2022
Autor(en): Blöcher, Marcel ; Wang, Lin ; Eugster, Patrick ; Schmidt, Max
Art des Eintrags: Bibliographie
Titel: Holistic Resource Scheduling for Data Center In-Network Computing
Sprache: Englisch
Publikationsjahr: 3 Juni 2022
Verlag: IEEE
Titel der Zeitschrift, Zeitung oder Schriftenreihe: IEEE/ACM Transactions on Networking
(Heft-)Nummer: Early Access
DOI: 10.1109/TNET.2022.3174783
URL / URN: https://ieeexplore.ieee.org/document/9787791
Kurzbeschreibung (Abstract):

The recent trend towards more programmable switching hardware in data centers opens up new possibilities for distributed applications to leverage in-network computing (INC). Literature so far has largely focused on individual application scenarios of INC, leaving aside the problem of coordinating usage of potentially scarce and heterogeneous switch resources among multiple INC scenarios, applications, and users. Alas, the traditional model of resource pools of isolated compute containers does not fit an INC-enabled data center. This paper describes HIRE, a holistic INC-aware resource manager which allows for server-local and INC resources to be coordinated in unison. HIRE introduces a novel flexible resource (meta-)model to address heterogeneity and resource interchangeability, and includes two approaches for INC scheduling: (a) retrofitting existing schedulers; (b) designing a new one. For (a), HIRE presents a retrofitting API and demonstrates it with four state-of-the-art schedulers. For (b), HIRE proposes a flow-based scheduler, cast as a min-cost max-flow problem, where a unified cost model is used to integrate the different costs. Experiments with a workload trace of a 4000 machine cluster show that HIRE makes better use of INC resources by serving 8-30% more INC requests, while simultaneously reducing network detours by 20% and reducing tail placement latency by 50%.

Fachbereich(e)/-gebiet(e): 20 Fachbereich Informatik
20 Fachbereich Informatik > Telekooperation
DFG-Sonderforschungsbereiche (inkl. Transregio)
DFG-Sonderforschungsbereiche (inkl. Transregio) > Sonderforschungsbereiche
DFG-Sonderforschungsbereiche (inkl. Transregio) > Sonderforschungsbereiche > SFB 1053: MAKI – Multi-Mechanismen-Adaption für das künftige Internet
DFG-Sonderforschungsbereiche (inkl. Transregio) > Sonderforschungsbereiche > SFB 1053: MAKI – Multi-Mechanismen-Adaption für das künftige Internet > B: Adaptionsmechanismen
DFG-Sonderforschungsbereiche (inkl. Transregio) > Sonderforschungsbereiche > SFB 1053: MAKI – Multi-Mechanismen-Adaption für das künftige Internet > B: Adaptionsmechanismen > Teilprojekt B2: Koordination und Ausführung
Hinterlegungsdatum: 16 Aug 2022 08:10
Letzte Änderung: 23 Nov 2022 13:41
PPN: 501937676
Export:
Suche nach Titel in: TUfind oder in Google
Frage zum Eintrag Frage zum Eintrag

Optionen (nur für Redakteure)
Redaktionelle Details anzeigen Redaktionelle Details anzeigen