Papers
Topics
Authors
Recent
Search
2000 character limit reached

Finish-Time Fairness in Scheduling

Updated 10 May 2026
  • Finish-time fairness is a scheduling approach that quantifies equity by comparing actual job completion times to ideal baselines, ensuring fair resource distribution.
  • It employs robust metrics such as relative fairness and discrimination indices to assess and enhance user-centric performance in multi-job environments.
  • Advanced algorithms like UWFQ and runtime partitioning demonstrate measurable improvements in response times and fairness bounds in distributed systems.

Finish-time fairness is a principled approach to resource scheduling in multi-user, multi-job computational environments, centering fairness metrics on the actual or ideal completion times of users’ workloads. Unlike traditional task- or job-level fairness, finish-time fairness quantifies equity from the perspective of user objectives—typically the completion time (makespan) of all jobs submitted by each user—providing rigorous foundations and quantitative indices for measuring and enforcing fairness in large-scale systems such as batch clusters or distributed analytics frameworks (Kažemaks et al., 17 Oct 2025, Dwibedy et al., 2020).

1. Formal Definitions and Quantitative Measures

Finish-time fairness is defined in terms of per-user performance metrics, most commonly the makespan Cmaxr(A)C^r_{\max}(A)—that is, the maximal job completion time for user UrU_r under scheduling algorithm AA. The user-centric approach employs both absolute and relative measures:

  • Relative Fairness (RF): For user UrU_r,

RF(CAr)=COPTrCArRF(C^r_A) = \frac{C^r_{\rm OPT}}{C^r_A}

where COPTrC^r_{\rm OPT} is a lower bound on the user’s achievable makespan, typically 1mi=1nrpir\frac{1}{m}\sum_{i=1}^{n_r} p^r_i for mm identical machines and jobs JirJ^r_i of lengths pirp^r_i.

  • Global Fairness (GF):

UrU_r0

These indices reside in UrU_r1, with 1 denoting optimal fairness (Dwibedy et al., 2020).

Additional discrimination and fairness indices track the degree to which users are advantaged or disadvantaged:

  • User Discrimination Index (UDI): UrU_r2.
  • Global Discrimination Index (GDI): UrU_r3.
  • Relative Discrimination Index (RDI): UrU_r4 if UrU_r5, zero otherwise.
  • Absolute Fairness: Algorithm UrU_r6 achieves absolute fairness if UrU_r7 is identical for all users.

These definitions generalize straightforwardly to other user-centric objectives such as sum of completion times, weighted flow time, or other composite metrics (Dwibedy et al., 2020).

2. Bounded-Fairness Model and Ideal Schedulers

Perfect fairness is often formalized using the fluid Generalized Processor Sharing (GPS) model. Under GPS with per-job shares UrU_r8, a job UrU_r9 arriving at AA0 and requiring total slot time AA1 ideally finishes at

AA2

An ideal finish-time fair scheduler would complete each job exactly at AA3, but this is unattainable in practical, discrete systems. Consequently, the bounded-fairness model admits bounded deviation:

AA4

where AA5 is determined by system granularity and maximum task length. This ensures jobs are not delayed indefinitely beyond fluid-ideal baselines (Kažemaks et al., 17 Oct 2025).

In two-level batch systems (e.g., Spark clusters), the canonical resource allocation is:

  • Total resources AA6.
  • AA7 active users, each entitled to AA8.
  • For AA9 active jobs of user UrU_r0, each job UrU_r1 is assigned UrU_r2.

User-Job Fairness (UJF): A scheduler is ideally finish-time fair if each job executes with its canonical UrU_r3 share at each instant. Bounded UJF requires actual completion within constant UrU_r4 of the ideal (Kažemaks et al., 17 Oct 2025).

3. Scheduling Algorithms for Finish-Time Fairness

Advanced scheduling algorithms have been developed to operationalize finish-time fairness under bounded deviations:

  • User Weighted Fair Queuing (UWFQ): Extends classic Weighted Fair Queuing (WFQ) to two levels—users and jobs—using virtual time calculations. Each user maintains a virtual time; jobs are assigned user-virtual deadlines

UrU_r5

with global deadlines constructed via cumulative user-virtual deadlines. Dispatching proceeds by selecting the job with the smallest global virtual deadline, directly minimizing response time while maintaining bounded finish-time fairness (Kažemaks et al., 17 Oct 2025).

  • Runtime Partitioning: To minimize the maximum scheduling granularity (and thus the fairness constant UrU_r6), tasks within a stage are dynamically split based on a target advisory runtime (ATR), ensuring no single task is excessively large. This reduces skew, alleviates priority inversion, and tightens the finish-time bound by minimizing the impact of straggler tasks (Kažemaks et al., 17 Oct 2025).

Algorithmic runtime and insertion/dispatch operations can be efficiently maintained—UWFQ’s virtual time and update logic run in UrU_r7 per event.

4. Lower Bounds and Impossibility Results

Even under idealized conditions (identical machines, equal-length jobs), a hard lower bound exists for achievable fairness:

  • Let UrU_r8 be the number of users, UrU_r9 machines, RF(CAr)=COPTrCArRF(C^r_A) = \frac{C^r_{\rm OPT}}{C^r_A}0 total jobs.
  • Theorem: For RF(CAr)=COPTrCArRF(C^r_A) = \frac{C^r_{\rm OPT}}{C^r_A}1, RF(CAr)=COPTrCArRF(C^r_A) = \frac{C^r_{\rm OPT}}{C^r_A}2 in the model RF(CAr)=COPTrCArRF(C^r_A) = \frac{C^r_{\rm OPT}}{C^r_A}3, every algorithm RF(CAr)=COPTrCArRF(C^r_A) = \frac{C^r_{\rm OPT}}{C^r_A}4 is RF(CAr)=COPTrCArRF(C^r_A) = \frac{C^r_{\rm OPT}}{C^r_A}5-fair, i.e.,

RF(CAr)=COPTrCArRF(C^r_A) = \frac{C^r_{\rm OPT}}{C^r_A}6

for all RF(CAr)=COPTrCArRF(C^r_A) = \frac{C^r_{\rm OPT}}{C^r_A}7 (Dwibedy et al., 2020). This bound follows both when RF(CAr)=COPTrCArRF(C^r_A) = \frac{C^r_{\rm OPT}}{C^r_A}8 is a multiple of RF(CAr)=COPTrCArRF(C^r_A) = \frac{C^r_{\rm OPT}}{C^r_A}9 (by direct resource division) and otherwise (via global job balancing), demonstrating the inherent limits of absolute finish-time fairness in resource-constrained scenarios. This is buttressed by algorithm-independent induction arguments and adversarial lower bound constructions.

5. Empirical Evaluation and Metrics

Finish-time fairness is evaluated with metrics that align with user-centric objectives:

  • Response Time: COPTrC^r_{\rm OPT}0 (max stage end) COPTrC^r_{\rm OPT}1 (min stage start) per job.
  • Slowdown: COPTrC^r_{\rm OPT}2.
  • Deadline Violation Ratio (DVR): Mean normalized exceedance over ideal finish times

COPTrC^r_{\rm OPT}3

  • Deadline Slack Ratio (DSR): Symmetrically, for jobs completing before their ideal finish.

Performance benchmarks (e.g., synthetic workloads, scaled Google traces) consistently confirm that bounded-fairness scheduling (such as UWFQ) closes the deviation gap to the ideal:

  • UWFQ reduces average response time by up to 74% over current Spark schedulers (Kažemaks et al., 17 Oct 2025).
  • DVR under UWFQ is several times lower than under Cluster Fair Queuing (CFQ) or traditional Fair, demonstrating much tighter adherence to user-centric finish-time bounds.
  • For small to medium jobs, UWFQ speeds up completion by 60–70%, while maintaining theoretical fairness guarantees for large jobs and heterogeneous loads.

6. Generalizations and Connections to Broader Fairness Models

The finish-time fairness formalism generalizes directly to other measurable user objectives:

  • Sum of Completion Times: Replace makespan by COPTrC^r_{\rm OPT}4 for each user.
  • Weighted Sums: Support for user- or task-dependent weights COPTrC^r_{\rm OPT}5.
  • Flow Time Objectives: Minimize COPTrC^r_{\rm OPT}6.

All fairness and discrimination indices extend verbatim to these settings, enabling a consistent user-objective-centric assessment of algorithmic fairness for an array of workload types and optimization goals (Dwibedy et al., 2020).

7. Significance and Applications in Distributed Systems

Finish-time fairness provides a robust, quantitative framework for designing, analyzing, and benchmarking scheduling algorithms in high-throughput, multi-user environments. It directly addresses limitations of task/job-level schedulers, in particular:

  • Countering user gaming (e.g., submitting many small jobs for higher effective throughput).
  • Explicitly quantifying the cost of discrete scheduling in terms of predictable, bounded deviations from ideality.
  • Enabling new algorithmic approaches, like UWFQ and runtime partitioning, that demonstrably improve user-level equity while optimizing performance.

The model has been implemented and evaluated in Apache Spark, a widely used data processing ecosystem, with substantial gains in empirical and theoretical fairness without sacrificing system throughput (Kažemaks et al., 17 Oct 2025). Its quantitative foundation also facilitates analysis, comparison, and verification of broader fairness properties across various scheduling platforms and resource allocation paradigms.

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Finish-Time Fairness.