Volcano Versions Save

A Cloud Native Batch System (Project under CNCF)

v1.4.0-Beta

2 years ago

What's New

1. Support multi-scheduler

In Kubernetes cluster with multiple schedulers, different kinds of workloads should be mapped to certain scheduler sometimes. For example, K8s native workloads such as deployment in namespace kube-system are mapped to default-sheduler while AI and Big data jobs are mapped to Volcano. This feature aims to implements that automaticallty. More details please refer to https://github.com/volcano-sh/volcano/blob/master/docs/design/multi-scheduler.md. (https://github.com/volcano-sh/volcano/pull/1576, https://github.com/volcano-sh/volcano/pull/1521, @huone1 @william-wang )

2. Support proportion of resources for GPU node

In order to make full use of scarce resources such as GPU, one solution is to bind them with other resources as shares. For example, it is common to see a lot of CPU-intensive workloads are scheduled to GPU nodes. When GPU-intensive workloads come, they cannot be scheduled because of lack of CPU or Memory in GPU nodes. If workloads requiring both GPU, CPU, Memory at certatin range can be scheduled to GPU nodes first, it is possible to make full use of GPUs. More details please refer to https://github.com/volcano-sh/volcano/blob/master/docs/design/proportional.md. (https://github.com/volcano-sh/volcano/pull/1527, @king-jingxiang )

3. Support CPU NUMA-Aware scheduling

As to CPU-intensive workloads especially in AI, Big Data and HPC fileds, It will result in a significant performance improvement if CPU NUMA is enabled. More details please refer to https://github.com/volcano-sh/volcano/blob/master/docs/design/numa-aware.md. (https://github.com/volcano-sh/volcano/pull/1493, @huone1 )

4. Provide framework of stress test

In this release, A kind of framework for Volcano stress test is provided. (https://github.com/volcano-sh/volcano/pull/1516, @rudeigerc )

Other Notable Changes

Bug Fixes

v1.3.0

2 years ago

What's New

1. Support minAvailable at task level

Just as the minAvailable at job level, minAvailable at task level will regard replicases at the same task as group and decide whether to schedule pods at the task. Only when the minAvailable is meet will the pods will be scheduled together. More details please refer to https://github.com/volcano-sh/volcano/blob/master/docs/design/task-minavailable.md. (https://github.com/volcano-sh/volcano/pull/1459, @shinytang6 )

2. Support minSuccess for Job

Support to configure the least number of pods belonging to the job. It's useful to mark the status of job when minsuccess reaches or not and accelerates the job status judgement. (https://github.com/volcano-sh/volcano/pull/1384, @zen-xu )

3. Support task-topology

In big data processing jobs like Tensorflow & Spark, tasks transmitted a large amount of data between each other, causing transmission delay took a large proportion in job execution time. So task topology plugin was proposed to modify scheduling strategy according to transmission topology inside a job, so as to cut the data amount to be transmitted between nodes, decrease transmission delay proportion in job execution time, and improve resource utilization. More details please refer to https://github.com/volcano-sh/volcano/blob/master/docs/design/task-topology-plugin.md. (https://github.com/volcano-sh/volcano/pull/1353, @jiangkaihua )

4. Create new repository volcano.sh/apis

Separate apis from volcano.sh/volcanosh. Any downstream projects can introduce the CRD clientset/lister/informer with the K8s version it needs. (https://github.com/volcano-sh/apis, @Thor-wl )

Other Notable Changes

Bug Fixes

v1.2.0

3 years ago

What's New

1. Add TDM plugin

TDM(Time Division Multiplexing) plugin aims to provide a mechanism for nodes, which can be used for K8S and other cluster(such as Yarn) in separate time.(https://github.com/volcano-sh/volcano/pull/1269, @yahaa )

2. Add SLA plugin

SLA(Service Level Agreement) plugin works for job resource reservation feature. Users can set SLA for jobs to ensure specified jobs to be scheduled in time. It provides an better design and implementation for job resource reservation. (https://github.com/volcano-sh/volcano/pull/1303, @jiangkaihua )

Other Notable Changes

Bug Fixes

v1.1.2

3 years ago

Changes since v1.1.1

v1.1.1

3 years ago

What's New

1. support vc-scheduler loading custom plugins

Separate plugin implementation with scheduler. Support implement custom plugins and load to vc-scheduler dynamically.(https://github.com/volcano-sh/volcano/pull/1218, @zen-xu)

2. add MaxRequeueNum as a controller-manager param

Support configure MaxRequeueNum in config file of vc-scheduler, default to 15 times.(https://github.com/volcano-sh/volcano/pull/1087, @shinytang6)

3. add design documentation of CPU careful regulation

Give the design of CPU careful regulation in socket level.(https://github.com/volcano-sh/volcano/pull/1051, @ProgramerGu)

Other Notable Changes

Bug Fixes

v1.1.0

3 years ago

What's New

1. Add monitor compontent

Monitor compontent added support display some base metrics about Volcano.(https://github.com/volcano-sh/volcano/pull/1066, @alcorj-mizar)

2. Support resource reservation for big job automatically

Reserve resource for pending job which is at highest priority among pending jobs and waits for a long time. The big job is recognized by scheduler automatically.(https://github.com/volcano-sh/volcano/pull/1044, @Thor-wl)

3. Support HDRF

Hierarchical dominant resource fairness is configured with a weighted tree, such that each node in the tree has a positive weight value.(https://github.com/volcano-sh/volcano/pull/928, @ggaaooppeenngg)

Other Notable Changes

Bug Fixes

v0.4.2

3 years ago
  • Fix queue capability validation failed when some running jobs finished or deleted (#959, @Thor-wl)

v1.0.1

3 years ago

Changelog since v1.0.0

v0.3.0

3 years ago

This is to support go get

v0.4.1

3 years ago

v0.4.1 (2020-07-14)