Game-Theoretic Approach for Grace-Period Policy in Supercomputers

Job scheduling at supercomputing facilities is important for achieving high utilization of these valuable resources while ensuring effective execution of jobs submitted by users. The jobs are scheduled according to their specified resource demands such as expected job completion times, and the avail...

Full description

Saved in:

Bibliographic Details
Published in:	2021 IEEE 24th International Conference on Information Fusion (FUSION) pp. 1 - 7
Main Authors:	He, Fei, Rao, Nageswara S. V., Ma, Chris Y. T.
Format:	Conference Proceeding
Language:	English
Published:	International Society of Information Fusion (ISIF) 01-11-2021
Subjects:	Closed-form solutions Conferences Cost accounting game theory Games grace-period job completion times Nash equilibrium Resource management Supercomputers under- and over-requested time
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	Job scheduling at supercomputing facilities is important for achieving high utilization of these valuable resources while ensuring effective execution of jobs submitted by users. The jobs are scheduled according to their specified resource demands such as expected job completion times, and the available resources based on allocations. Jobs that overrun their allocated times are terminated, for example, after a grace-period. It is non-trivial and often very complex for users to accurately estimate the completion times of their jobs, and consequently they face a dilemma: underestimate the job time to have a higher priority and risk job termination due to overrun, or overestimate it to ensure its completion and risk its delayed execution. In this paper, we investigate whether providing grace-period can benefit facility performance by developing a game- theoretic model between a facility provider and multiple users for a simplified scheduling scenario based on job execution times. We present closed-form expressions for the provider's and user's best-response strategies to maximize their respective utility functions. We describe conditions under which offering a grace-period is advantageous to both facility provider and users by deriving the Nash equilibrium of the game.
DOI:	10.23919/FUSION49465.2021.9626952