Cook Versions Save

Fair job scheduler on Kubernetes and Mesos for batch workloads and Spark

v1.63.4

1 year ago

Fixed

  • Fixed bug in parallelized Kubernetes watch processing, from @scrosby

Changed

  • Make prometheus JVM metrics use compute cluster name, from @samincheva

v1.63.3

1 year ago

Changed

  • Parallelize Kubernetes watch processing, from @scrosby

v1.63.2

1 year ago

Fixed

  • Do not set scalar-requests to pool specific resources for Kenzo pods, from @ahaysx

v1.63.1

1 year ago

Changed

  • Reverted parallel Kubernetes watch processing for future release

v1.63.0

1 year ago

Added

  • Prometheus metrics
    • JVM metrics, from @samincheva
    • Ring metrics, from @samincheva
    • Parity for remaining codahale metrics, from @samincheva
  • Direct-to-Kubernetes scheduler (Kenzo)
    • Use backpressure of scheduling pods to moderate launching new pods for real jobs, from @ahaysx
    • Prometheus metrics parity, from @ahaysx

Changed

  • Parallelize Kubernetes watch processing, from @scrosby
  • Optimize getting the nodename from a node, from @scrosby

v1.62.6

1 year ago

Added

  • Initial implementation for submitting jobs directly to Kubernetes Scheduler, from @ahaysx
  • Better error handling in scheduler/write functions, from @ahaysx
  • Launch tasks similarly for both Fenzo and K8s Scheduler pools, from @ahaysx
  • Adding prometheus metrics to remaining modules, from @samincheva

Fixed

  • Optimizing total pod count metric and fixing mismatched metric labels, from @samincheva

v1.62.5

1 year ago

Added

  • Make Fenzo config pool-specific and set up for other schedulers, from @ahaysx
  • Prometheus metrics for the kubernetes, API, and tools module, from @samincheva
  • Update of synthetic pods counter metric even if the current match cycle doesn't autoscale, from @samincheva

Changed

  • Performance optimization for add-starting-pods, from @scrosby
  • Remove metatransaction filter from match, from @scrosby
  • Make job resource lookup more efficient in miss path, from @scrosby
  • Relazy some list generation in rank cycle, from @scrosby
  • Parallelize autoscale to run at the same time as main job launches, from @scrosby

v1.62.4

1 year ago

Added

  • Add new JobSubmissionModifier and refactor JobRouter, from @laurameng

Changed

  • Prometheus metrics
    • Updated match cycle metric logic for 0 considerable case, from @samincheva
    • Added prometheus metric for synthetic pods count, from @samincheva
  • Use a factory fn for creating (future) different types of pool handlers, from @ahaysx

v1.62.3

1 year ago

Changed

  • Configured the /metrics endpoint to have a separate rate limit, from @samincheva

v1.62.2

1 year ago

Added

  • Prometheus, from @samincheva
  • Adding match cycle metrics to prometheus, from @samincheva
  • Adding prometheus metric for jobs launch count, from @samincheva

Fixed

  • Use pools & submit pools in /jobs list endpoint, from @laurameng