Koordinator Versions

A QoS-based scheduling system that brings optimal layout and status to workloads such as microservices, web services, big data jobs, and AI jobs.

v1.4.1

2 months ago

What's Changed

  • descheduler: fix bug of nodeAnomalyDetector in LowNodeLoad plugin by @bogo-y in #1858
  • koordlet: decouple CUDA dependency when Accelerators feature is disabled by @saintube in #1876
  • koordlet: fix cache GC not being started by @saintube in #1849
  • koordlet: fix core sched conflicts with GI and revise API by @saintube in #1829
  • koordlet: fix metric record err by @ls-2018 in #1868
  • koordlet: fix misleading query helper by @saintube in #1893
  • koordlet: fix proxy container service and revise cpu.max updater by @saintube in #1885
  • koordlet: fix recursively disabling bvt by @saintube in #1848
  • koordlet: fix struct type lost of node slo extension by @zwzhang0107 in #1847
  • koordlet: revise base collectors and system status check by @saintube in #1877
  • koordlet: remove logging field from the kubelet configz by @zwzhang0107 in #1907
  • metrics: separate metrics as internal and external for slo-controller and koordlet by @zwzhang0107 in #1807
  • scheduler: support disabling runtime quota by @xulinfei1996 in #1839
  • scheduler: ElasticQuota runtime is no longer calculated when not needed by @eahydra in #1855
  • scheduler: correct NodeNUMAResource handling with NodeFullPCPUsOnly and preferredCPUBindPolicy by @eahydra in #1891
  • scheduler: fix ElasticQuota sync status missing decorate used by @jasonliu747 in #1852
  • scheduler: fix ElasticQuota creation failure on startup by @eahydra in #1845
  • scheduler: fix initializing built-in quota objects by @eahydra in #1917
  • scheduler: fix missing tree id in GetQuotaSummary() by @jasonliu747 in #1853
  • scheduler: fix the method of calculating desiredCount in DeviceShare ApplyForAll strategy by @eahydra in #1870
  • scheduler: improve Coscheduling PreFilter status by @eahydra in #1861
  • scheduler: improve plugin args by @eahydra in #1857
  • scheduler: optimize ElasticQuota status/metrics sync flow by @eahydra in #1850
  • webhook: add node affinity for pod when MultiQuotaTree on by @shaloulcy in #1864
  • webhook: fix validateQuotaTopology for root quota by @shaloulcy in #1918
  • webhook: fix webhook for root quota creation by @shaloulcy in #1916
  • webhook: improve node affinity performance for MultiQuotaTree by @shaloulcy in #1872

Full Changelog: https://github.com/koordinator-sh/koordinator/compare/v1.4.0...v1.4.1

v1.4.0

3 months ago

Full Changelog: https://github.com/koordinator-sh/koordinator/compare/v1.3.0...v1.4.0

v1.3.0

8 months ago

Full Changelog: https://github.com/koordinator-sh/koordinator/compare/v1.2.0...v1.3.0

v1.2.0

1 year ago

Full Changelog: https://github.com/koordinator-sh/koordinator/compare/v1.1.0...v1.2.0

v1.1.1

1 year ago

Full Changelog: https://github.com/koordinator-sh/koordinator/compare/v1.1.0...v1.1.1

v1.1.0

1 year ago

Full Changelog: https://github.com/koordinator-sh/koordinator/compare/v1.0.0...v1.1.0

v1.0.0

1 year ago

Full Changelog: https://github.com/koordinator-sh/koordinator/compare/v0.7.0...v1.0.0

v0.7.0

1 year ago

Full Changelog: https://github.com/koordinator-sh/koordinator/compare/v0.6.2...v0.7.0

v0.6.2

1 year ago

Changelog

  • 54ed9a5 Add pod uid to pod meta when failover (#344)
  • 492a861 Update ci.yaml
  • 1328009 Use the structure as the key of the map instead of string. (#349)
  • f81c89c [koord-runtime-proxy]: fix panic when no hook registered (#355)
  • 42d695f add PodMigrationJob CRD proposal (#358)
  • d1fb8c5 add descheduler framework proposal (#371)
  • 7d46fad add fine-grained device scheduling proposal (#322)
  • 82dc2ac add koord-descheduler (#425)
  • 37a3aec add logs for proxy server (#329)
  • 05a8c11 add pod annotations and labels to container request and cache (#362)
  • 827bd6b add reservation plugin (#353)
  • 78a4ebb add schedule gang md (#333)
  • 993fc21 add scheduling framework extender (#365)
  • 1cf37d0 add xiaohongshu as koordinator adopter (#424)
  • c9cf1a4 api: add PodMigrationJob API (#375)
  • 91cacc4 api: add device crd in scheduling group (#376)
  • dab5a92 api: add device info into NodeMetric CRD (#378)
  • 47e7189 api: update PodMigrationJob and Reservation CRD (#399)
  • 74de8bd api: update reservation api (#384)
  • bb3065a apis: add Gang api definition (#409)
  • 0faf65e bugfix: always need to reset cpuset when cpu suppress (#403)
  • 9626a21 bugfix: avoid pod terminating in docker (#445)
  • 24ecb9f bugfix: fix exec exit with 126 (#446)
  • f133061 bugfix: skip when pod sandbox not found (#444)
  • fbf4d97 change qos func name for old format adaption reason (#418)
  • 5b1ce9d clear cpuset of BE container to avoid conflict with kubelet static policy, using the value of besteffort dir (#412)
  • 6e0d88f cri-runtime-proxy: fix containerErr error when failOver pods and containers (#414)
  • 6918290 feat(deps): bump github.com/stretchr/testify from 1.7.5 to 1.8.0 (#326)
  • 3fce836 feat(deps): bump google.golang.org/protobuf from 1.28.0 to 1.28.1 (#419)
  • d763879 feat(deps): bump gorm.io/driver/sqlite from 1.3.4 to 1.3.6 (#347)
  • f32a0ba feat(deps): bump gorm.io/gorm from 1.23.6 to 1.23.8 (#351)
  • bed2191 feat(deps): bump sigs.k8s.io/yaml from 1.2.0 to 1.3.0 (#427)
  • 5b320c0 feat: add gpu metrics to crd (#397)
  • 4301cc9 feat: collect gpu metrics (#361)
  • 488f8d5 feature: report pod alloc of Guaranteed pod and cpu manager policy (#386)
  • 9cf28bb feature: support read args from KubeletConfiguration (#470)
  • b54bb0c fix auditor test in macOS (#379)
  • cdc41a0 fix go mod
  • 5bcb7a7 fix koord-descheduler initialize profile error (#432)
  • ecead7c fix reservation on multi-scheduler (#431)
  • 9e8fc01 fix reservation on pod patch failed (#428)
  • b2fcc22 fix the loss of new updated resources from UpdateContainerResources request (#363)
  • 0523d60 fix: consider lse/lsr when cpu suppress (#234) (#372)
  • bf308ed fix: remove inline tag for corev1.ResourceList to fix #390 (#391)
  • 6ac04d4 improve koordlet log verbosity (#338)
  • a89cd98 koord-descheduler: implement PodMigrationJob controller (#404)
  • 78afa0a koord-descheduler: implement descheduling configuration (#422)
  • 49fa42c koord-descheduler: implement descheduling framework (#423)
  • 3ed131c koord-descheduler: release Reservation when PodMigrationJob completes or is deleted (#438)
  • 535bc21 koord-runtime-proxy: regard empty string for CpusetCpus and CpusetMem… (#487)
  • 9eb7b7d koord-scheduler: compatible with Pods using kubelet static CPU manager policy (#433)
  • c9ad604 koord-scheduler: improve reservation validation (#442)
  • b78243b koord-scheduler: support CPU exclusive policy (#359)
  • 8179245 koord-scheduler: support Node CPU orchestration API (#360)
  • 1ab5c99 koord-scheduler: support default preferredCPUBindPolicy for LSE/LSR Pod if not specified (#354)
  • 1e77f1f koord-scheduler: support kubelet cpu manager policy (#434)
  • 171ad3e koordlet: define GPU metric struct (#343)
  • 7442bc5 koordlet: fix build error on macOS caused by GPU (#413)
  • 779ac80 koordlet: introduce Accelerators feature gate for GPU related features (#393)
  • 91d2a4b koordlet: optimize auditor UT with httptest.Server (#382)
  • 283c883 koordlet: refine initJiffies with default value (#367)
  • 7510a3a make slo configmap name configurable (#415)
  • b8dd567 rename resourceQoS to resourceQOS (#339)
  • 0d9d9d4 style: unify the command parameter style of koordlet (#348)
  • d0194b2 turn on pleg (#394)

v0.6.1

1 year ago

Changelog

  • 54ed9a5 Add pod uid to pod meta when failover (#344)
  • 1328009 Use the structure as the key of the map instead of string. (#349)
  • f81c89c [koord-runtime-proxy]: fix panic when no hook registered (#355)
  • 42d695f add PodMigrationJob CRD proposal (#358)
  • d1fb8c5 add descheduler framework proposal (#371)
  • 7d46fad add fine-grained device scheduling proposal (#322)
  • 82dc2ac add koord-descheduler (#425)
  • 37a3aec add logs for proxy server (#329)
  • 05a8c11 add pod annotations and labels to container request and cache (#362)
  • 827bd6b add reservation plugin (#353)
  • 78a4ebb add schedule gang md (#333)
  • 993fc21 add scheduling framework extender (#365)
  • 1cf37d0 add xiaohongshu as koordinator adopter (#424)
  • c9cf1a4 api: add PodMigrationJob API (#375)
  • 91cacc4 api: add device crd in scheduling group (#376)
  • dab5a92 api: add device info into NodeMetric CRD (#378)
  • 47e7189 api: update PodMigrationJob and Reservation CRD (#399)
  • 74de8bd api: update reservation api (#384)
  • bb3065a apis: add Gang api definition (#409)
  • 0faf65e bugfix: always need to reset cpuset when cpu suppress (#403)
  • f0daee1 bugfix: avoid pod terminating in docker (#445)
  • 1c44a0a bugfix: skip when pod sandbox not found (#444)
  • fbf4d97 change qos func name for old format adaption reason (#418)
  • 5b1ce9d clear cpuset of BE container to avoid conflict with kubelet static policy, using the value of besteffort dir (#412)
  • 6e0d88f cri-runtime-proxy: fix containerErr error when failOver pods and containers (#414)
  • 6918290 feat(deps): bump github.com/stretchr/testify from 1.7.5 to 1.8.0 (#326)
  • 3fce836 feat(deps): bump google.golang.org/protobuf from 1.28.0 to 1.28.1 (#419)
  • d763879 feat(deps): bump gorm.io/driver/sqlite from 1.3.4 to 1.3.6 (#347)
  • f32a0ba feat(deps): bump gorm.io/gorm from 1.23.6 to 1.23.8 (#351)
  • bed2191 feat(deps): bump sigs.k8s.io/yaml from 1.2.0 to 1.3.0 (#427)
  • 5b320c0 feat: add gpu metrics to crd (#397)
  • 4301cc9 feat: collect gpu metrics (#361)
  • 488f8d5 feature: report pod alloc of Guaranteed pod and cpu manager policy (#386)
  • b54bb0c fix auditor test in macOS (#379)
  • 5bcb7a7 fix koord-descheduler initialize profile error (#432)
  • ecead7c fix reservation on multi-scheduler (#431)
  • 9e8fc01 fix reservation on pod patch failed (#428)
  • b2fcc22 fix the loss of new updated resources from UpdateContainerResources request (#363)
  • 0523d60 fix: consider lse/lsr when cpu suppress (#234) (#372)
  • bf308ed fix: remove inline tag for corev1.ResourceList to fix #390 (#391)
  • 6ac04d4 improve koordlet log verbosity (#338)
  • a89cd98 koord-descheduler: implement PodMigrationJob controller (#404)
  • 78afa0a koord-descheduler: implement descheduling configuration (#422)
  • 49fa42c koord-descheduler: implement descheduling framework (#423)
  • 3ed131c koord-descheduler: release Reservation when PodMigrationJob completes or is deleted (#438)
  • 9eb7b7d koord-scheduler: compatible with Pods using kubelet static CPU manager policy (#433)
  • c9ad604 koord-scheduler: improve reservation validation (#442)
  • b78243b koord-scheduler: support CPU exclusive policy (#359)
  • 8179245 koord-scheduler: support Node CPU orchestration API (#360)
  • 1ab5c99 koord-scheduler: support default preferredCPUBindPolicy for LSE/LSR Pod if not specified (#354)
  • 1e77f1f koord-scheduler: support kubelet cpu manager policy (#434)
  • 171ad3e koordlet: define GPU metric struct (#343)
  • 7442bc5 koordlet: fix build error on macOS caused by GPU (#413)
  • 779ac80 koordlet: introduce Accelerators feature gate for GPU related features (#393)
  • 91d2a4b koordlet: optimize auditor UT with httptest.Server (#382)
  • 283c883 koordlet: refine initJiffies with default value (#367)
  • 7510a3a make slo configmap name configurable (#415)
  • b8dd567 rename resourceQoS to resourceQOS (#339)
  • 0d9d9d4 style: unify the command parameter style of koordlet (#348)
  • d0194b2 turn on pleg (#394)