Katib Versions Save

Automated Machine Learning on Kubernetes

v0.16.0

6 months ago

This is the Katib v0.16.0 release.

Breaking Changes

  • Implement KatibConfig API (#2176 by @tenzen-y)
  • Drop Kubernetes v1.24 and support Kubernetes v1.27 (#2182 by @tenzen-y)
  • Drop Kubernetes v1.23 and support Kubernetes v1.26 (#2177 by @tenzen-y)
  • Change failurePolicy to Fail for Katib Webhooks (#2018 by @andreyvelich)

New Features

Core Features

  • Consolidate the Katib Cert Generator to the Katib Controller (#2185 by @tenzen-y)
  • Containerize tests for Katib Conformance (#2146 by @nagar-ajay)

UI Improvements

  • [UI] Default Resume Policy to never from UI (#2195 by @mChowdhury-91)
  • [UI] Remove Deprecated Katib UI (#2179 by @andreyvelich)
  • [UI] Fix Trial Logs when Kubernetes Job Fails (#2164 by @andreyvelich)
  • kwa(front): Support all namespaces (#2119 by @elenzio9)
  • kwa(front): Update the use of SnackBarService (#2113 by @orfeas-k)
  • UI: Remove an unsed import, EventV1beta1Api (#2116 by @tenzen-y)

SDK Improvements

  • [SDK] Enable resource specification for trial containers (#2192 by @droctothorpe)
  • [SDK] Add namespace parameter to KatibClient (#2183 by @droctothorpe)
  • [SDK] Import all Kubernetes Models (#2148 by @andreyvelich)

Bug fixes

  • Bug: Wait for the certs to be mounted inside the container (#2213 by @tenzen-y)
  • Start waiting for certs to be ready before sending data to the channel (#2215 by @tenzen-y)
  • E2E: Add additional checks to verify if the components are ready (#2212 by @tenzen-y)
  • Remove a katib-webhook-cert Secret from components (#2214 by @tenzen-y)
  • Skip to inject the metrics-collector pods to the Katib controller (#2211 by @tenzen-y)
  • Sending an empty data to the certsReady channel (#2196 by @tenzen-y)
  • Fix conformance docker image (#2147 by @nagar-ajay)

Documentation

  • Add PITS Global Data Recovery Services to the adopters list (#2160 by @ghost)
  • Add SDK Breaking Change to Changelog (#2133 by @andreyvelich)
  • Add Changelog for Katib v0.15.0 (#2129 by @andreyvelich)
  • Add Changelog for Katib v0.15.0-rc.1 (#2123 by @andreyvelich)
  • Add Changelog for Katib v0.15.0-rc.0 (#2106 by @andreyvelich)

Misc

  • Upgrade Tensorflow version to v2.13.0 (#2216 by @tenzen-y)
  • Upgrade Go version to v1.20 (#2190 by @tenzen-y)
  • Replace grpc_health_probe with the built-in gRPC container probe feature (#2189 by @tenzen-y)
  • Allow install binaries for the arm64 in the envtest (#2188 by @tenzen-y)
  • Replace action to setup minikube with medyagh/setup-minikube (#2178 by @tenzen-y)
  • Remove Charmed Operators for Katib (#2161 by @ca-scribner)
  • Namespace and trial pod annotations as CLI argument (#2138 by @nagar-ajay)
  • Relax dependencies restriction for the gRPC libraries (#2140 by @tenzen-y)
  • Add SDK Breaking Change to Changelog (#2133 by @andreyvelich)
  • Increase the free spaces in CI (#2131 by @tenzen-y)
  • Reformat katib-operators (#2114 by @tenzen-y)

v0.15.0

1 year ago

This is the Katib v0.15.0 release.

Breaking Changes

  • Use Never Resume Policy as Default (#2102 by @andreyvelich)
  • Chocolate Suggestion Service is removed (#2071 by @tenzen-y)
  • request_number is removed from the GRPC APIs (#1994 by @johnugeorge)
  • Enabling Authorization in Katib UI (#1983 and #2041 by @apo-ger)
  • The new improved and refactored Katib SDK is not backward compatible (#2075 by @andreyvelich)

New Features

Major Features

  • Narrow down Katib RBAC rules (#2091 by @johnugeorge)
  • Support Postgres as a Katib DB (#1921 by @anencore94)
  • More Suggestion container fields in Katib Config (#2000 by @fischor)
  • Katib UI: Create the LOGS tab of Trial's details page (#2117 by @elenzio9)
  • Katib UI: Enable pagination/sorting/filtering (#2017 and #2040 by @elenzio9)
  • [SDK] Create Tune API in the Katib SDK (#1951 by @andreyvelich)
  • [SDK] Get Trial Metrics from Katib DB (#2050 by @andreyvelich)

Core Features

  • Add Conformance Program Doc for AutoML and Training WG (#2048 by @andreyvelich)
  • Support for grid search algorithm in Optuna Suggestion Service (#2060 by @tenzen-y)
  • Add Trial Labels During Pod Mutation (#2047 by @andreyvelich)
  • Support for k8s v1.25 in CI (#1997 by @johnugeorge)
  • Add the CI to build multi-platform container images (#1956 by @tenzen-y)
  • Drop Kubernetes v1.21 and introduce Kubernetes v1.24 (#1953 by @tenzen-y)
  • Add --connect-timeout flag to katib-db-manager (#1937 by @tenzen-y)
  • Implement validations for DARTS suggestion service (#1926 by @tenzen-y)
  • Implement validation for Optuna suggestion service (#1924 by @tenzen-y)

UI Improvements

  • Make links in KWA's tables actual links (#2090 by @elenzio9)
  • frontend: Rework the trial graph using ECharts in KWA (#2089 by @elenzio9)
  • kwa(front): Add UI tests with Cypress (#2088 by @orfeas-k)
  • frontend: Enable actions in experiment graph (#2065 by @elenzio9)
  • frontend: Show message in case of uncompleted trial instead of the graph (#2063 by @elenzio9)
  • frontend: Add source maps in the browser (#2043 by @elenzio9)
  • Backend for getting logs of a trial (#2039 by @d-gol)
  • frontend: Show the successful trials in the experiment graph (#2013) (#2033 by @elenzio9)
  • frontend: Migrate from tslint to eslint in KWA (#2042 by @elenzio9)
  • Dedicated yaml tab for Trials (#2034 by @elenzio9)
  • KWA: Use new Editor component (Monaco) (#2023 by @orfeas-k)
  • kwa(build): Introduce COMMIT file for building KWA (#2014 by @orfeas-k)
  • frontend: Fix 500 error after detail page refresh (#1967) (#2001 by @elenzio9)
  • Introduce KWA's frontend component for kfp links (#1991 by @elenzio9)
  • UI: Rename and right align the age column (#1989 by @elenzio9)
  • Show the trials table's status column first (#1990 by @elenzio9)
  • UI: Make KWA's main table responsive and add toolbar (#1982 by @elenzio9)
  • UI: Fix unit tests (#1977 by @elenzio9)
  • UI: Format code (#1979 by @orfeas-k)
  • Recreate the Experiments Parallel Coordinates Graph (#1974 by @elenzio9)
  • Improve UI API/controller logging to ease troubleshooting (#1966 by @lukeogg)

SDK Improvements

  • [SDK] Use Katib SDK for E2E Tests (#2075 by @andreyvelich)
  • [SDK] Use Katib Client without Kube Config (#2098 by @andreyvelich)
  • [SDK] Fix namespace parameter in tune API (#1981 by @andreyvelich)
  • [SDK] Remove Final Keyword from constants (#1980 by @andreyvelich)

Bug fixes

  • Fix Release Script for Updating SDK Version (#2104 by @andreyvelich)
  • [Fix] add early stopped trials in converter (#2004 by @shaowei-su)
  • [bugfix] Fix value passing bug in New Experiment form (#2027 by @orfeas-k)
  • Fix main process retrieve logic for early stopping (#1988 by @shaowei-su)
  • [hotfix]: filter by name of experiment (#1920 by @anencore94)
  • Fix push script to include new images (#1911 by @johnugeorge)
  • fix: only validate Kubernetes Job (#2025 by @zhixian82)
  • Upgrade grpc-health-probe version to fix some security issues (#2093 by @tenzen-y)
  • Format Katib Charm Operator (#2115 by @tenzen-y)

Documentation

  • Add CERN to adopters (#2010 by @d-gol)
  • Add More Katib Presentations 2022 (#2009 by @andreyvelich)
  • Add the documentation for simple-pbt (#1978 by @tenzen-y)
  • Add the license to pbt (#1958 by @tenzen-y)
  • Update the Katib version in docs (#1950 by @tenzen-y)
  • Update CHANGELOG for v0.14.0 release (#1932 by @johnugeorge)

Misc

  • Update Training operator Image in CI (#2103 by @johnugeorge)
  • Upgrade Go libraries to resolve security issues (#2094 by @tenzen-y)
  • Run e2e with various Python versions to verify Python SDK (#2092 by @tenzen-y)
  • Add a --prefer-binary flag to 'pip install' command (#2096 by @tenzen-y)
  • Upgrade PyTorch version to v1.13.0 (#2082 by @tenzen-y)
  • Upgrade Tensorflow version (#2079 by @tenzen-y)
  • Upgrade Python version to 3.10 (#2057 by @tenzen-y)
  • Pin the NumPy version with v1.23.5 in some images (#2070 by @tenzen-y)
  • Upgrade the actions-setup-minikube version to v2.7.2 (#2064 by @tenzen-y)
  • Remove Certificate Chain from Cert Generator (#2045 by @andreyvelich)
  • Add resources to earlystopping container (#2038 by @zhixian82)
  • Add scripts to verify generated codes and Go Modules (#1999 by @tenzen-y)
  • [Test] Reduce Katib GitHub Action Runs (#2036 by @andreyvelich)
  • gh-actions: Extend action to run Frontend Unit tests (#1998 by @orfeas-k)
  • [chore] Upgrade docker/metadata-action, actions/checkout, and actions/setup-python version (#1996 by @tenzen-y)
  • [chore] Upgrade Go version to v1.19 (#1995 by @tenzen-y)
  • Support for arm64 in simple-pbt image (#1948 by @tenzen-y)
  • Support arm64 in darts-cnn-cifar10 image (#1947 by @tenzen-y)
  • Support for arm64 in enas-cnn-cifar10 image (#1944 by @tenzen-y)
  • Support for arm64 in pytorch-mnist image (#1943 by @tenzen-y)
  • Support for arm64 in mxnet-mnist image (#1940 by @tenzen-y)
  • Use the katib-new-ui for Charmed gh-actions (#1987 by @tenzen-y)
  • [feat] health check for katib-controller (#1934 by @anencore94)
  • Upgrade Optuna from v2.x.x to v3.0.0 (#1942 by @keisuke-umezawa)
  • Add validation webhooks for maxFailedTrialCount and parallelTrialCount (#1936 by @tenzen-y)
  • Introduce Automatic platform ARGs (#1935 by @tenzen-y)
  • Update training operator image in CI (#1933 by @johnugeorge)
  • Update Katib SDK version (#1931 by @johnugeorge)
  • [chore] Upgrade Go version to v1.18 (#1925 by @tenzen-y)
  • Add the pytorch-mnist with GPU support container image (#1916 by @tenzen-y)

Changelog

Full Changelog

v0.14.0

1 year ago

This is the Katib v0.14.0 release.

New Features

Core Features

  • Population based training (#1833 by @a9p)
  • Support JSON format logs in file-metrics-collector (#1765 by @tenzen-y)
  • Include MetricsUnavailable condition to Complete in Trial (#1877 by @tenzen-y)
  • Allow running examples on Apple Silicon M1 and fix image build errors for arm64 (#1898 by @tenzen-y)
  • Configurable job name and service name for cert generator (#1889 by @shaowei-su)

UI Features and Enhancements

  • Add PBT to experiment creation form (#1909 by @a9p)
  • Distinct page for each Trial in the UI (#1783 by @d-gol)

Bug fixes

Documentation

Misc

  • Updating the training operator image in CI (#1910 by @johnugeorge)
  • Upgrade Python and Pytorch versions for some examples (#1906 by @tenzen-y)
  • Linting for K8s YAML files (#1901 by @Rishit-dagli)
  • Change integration test sysytem from KinD Cluster to Minikube Cluster (#1899 by @tenzen-y)
  • Upgrade mysql version to v8.0.29 (#1897 by @tenzen-y)
  • Upgrade tensorflow-aarch64 version to v2.9.1 (#1891 by @tenzen-y)
  • chore: Upgrade Go libraries to resolve some security issues in the katib-controller (#1888 by @tenzen-y)
  • Migrate kubeflow-katib-presubmit to GitHub Actions (#1882 by @tenzen-y)
  • Add semicolon when using command command in Makefile (#1885 by @tenzen-y)
  • Fix HAS_SHELLCHECK and HAS_SETUP_ENVTEST in Makefile (#1884 by @tenzen-y)
  • Remove presubmit tests depending on optional-test-infra (#1871 by @aws-kf-ci-bot)
  • Upgrade the Tensorflow version to address some security issues (#1870 by @tenzen-y)
  • Upgrade the grpc_health_probe version to v0.4.11 to resolve security vulnerability CVE-2022-27191 (#1875 by @tenzen-y)
  • additional metric names should not include objective metric name (#1874 by @henrysecond1)
  • Upgrade the Kubernetes Python client to 22.6.0 (#1869 by @tenzen-y)
  • Upgrade the kubebuilder to v3.2.0 and Kubernetes Go libraries to v1.22.2 (#1861 by @tenzen-y)
  • Update FPGA XGBoost example (#1865 by @eliaskoromilas)
  • Fix kubeflowkatib/mxnet-mnist image (#1866 by @tenzen-y)
  • pins pip and setuptools versions operators to avoid installation issues (#1867 by @DnPlas)
  • Add shellcheck (#1857 by @tenzen-y)
  • Bump kubeflow-katib and kfp version in notebook examples (#1849 by @tenzen-y)
  • Add prometheus scraping and grafana support to charmed katib-controller operator (#1839 by @jardon)
  • Upgrade Black to fix linting (#1842 by @jardon)

Change Log

Check the Full Change Log.

v0.14.0-rc.0

1 year ago

v0.13.0

2 years ago

This is the Katib v0.13.0 release.

Breaking changes:

  1. Namespace label for Metrics collector enabled Katib namespaces is changed to katib.kubeflow.org/metrics-collector-injection=enabled #1740
  2. Current request number field in gRPC API is renamed to current_request_number #1728
  3. training.kubeflow.org prefix is added to the default primary pod labelsjob-role and replica-type of the Training Operators #1813

New Features

Algorithms and Components

  • Implement validation for Early Stopping (#1709 by @tenzen-y)
  • Change namespace label for Metrics Collector injection (#1740 by @andreyvelich)
  • Modify gRPC API with Current Request Number (#1728 by @andreyvelich)
  • Allow to remove each resource in Katib config (#1729 by @andreyvelich)
  • Support leader election for Katib Controller (#1713 by @tenzen-y)
  • Change default Metrics Collect format (#1707 by @anencore94)
  • Bump Python version to 3.9 (#1731 by @tenzen-y)
  • Update Go version to 1.17 (#1683 by @andreyvelich)
  • Create Python script to run e2e Argo Workflow (#1674 by @andreyvelich)
  • Reimplement Katib Cert Generator in Go (#1662 by @tenzen-y)
  • SDK: change list apis to return objects as default (#1630 by @anencore94)

UI Features

  • Enhance Katib UI feasible space (#1721 by @seong7)
  • Handle missing TrialTemplates in Katib UI (#1652 by @kimwnasptd)
  • Add Prettier devDependency in Katib UI (#1629 by @seong7)

Documentation

  • Fix a link for GRPC API documentation (#1786 by @tenzen-y)
  • Add my presentations that include Katib (#1753 by @terrytangyuan)
  • Add Akuity to list of adopters (#1749 by @terrytangyuan)
  • Change Argo -> Argo Workflows (#1741 by @terrytangyuan)
  • Update Algorithm Service Doc for the new CI script (#1724 by @andreyvelich)
  • Update link to Training Operator (#1699 by @terrytangyuan)
  • Refactor examples folder structure (#1691 by @andreyvelich)
  • Fix README in examples directory (#1687 by @tenzen-y)
  • Add Kubeflow MXJob example (#1688 by @andreyvelich)
  • Update FPGA examples (#1685 by @eliaskoromilas)
  • Refactor README (#1667 by @andreyvelich)
  • Change the minimal Kustomize version in the developer guide (#1675 by @tenzen-y)
  • Add Katib release process guide (#1641 by @andreyvelich)

Bug Fixes

  • Remove unrecognized keys from metadata.yaml in Charmed operators (#1759 by @DnPlas)
  • Fix the default Metrics Collector regex (#1755 by @andreyvelich)
  • Fix Status Handling in Charmed Operators (#1743 by @DomFleischmann)
  • Fix bug on list type HP in Katib UI (#1704 by @seong7)
  • Fix Range for Int and Double values in Grid search (#1732 by @andreyvelich)
  • Check if parameter references exist in Experiment parameters (#1726 by @henrysecond1)
  • Fix same set for HyperParameters in Bayesian Optimization algorithm (#1701 by @fabianvdW)
  • Close MySQL statement and rows resources when SQL exec ends (#1720 by @chenwenjun-github)
  • Fix Cluster Role of Katib Controller to access image pull secrets (#1725 by @henrysecond1)
  • Emit events when fails to reconcile all Trials (#1706 by @henrysecond1)
  • Missing metrics port annotation (#1715 by @alexeykaplin)
  • Fix absolute value in Katib UI (#1676 by @anencore94)
  • Add missing omitempty parameter to APIs (#1645 by @andreyvelich)
  • Reconcile semantics for Suggestion Algorithms (#1633 by @johnugeorge)
  • Fix default label for Training Operators (#1813 by @andreyvelich)
  • Update supported Python version for Katib SDK (#1798 by @tenzen-y)

Misc

  • Use release tags for Trial images (#1757 by @andreyvelich)
  • Upgrade cert-manager API from v1alpha2 to v1 (#1752 by @haoxins)
  • Add Workflow to Publish Katib Images (#1746 by @andreyvelich)
  • Update Charmed Katib Operators + CI to 0.12 (#1717 by @knkski)
  • Updating Katib CI to use Training Operator (#1710 by @midhun1998)
  • Update OWNERS for Charmed operators (#1718 by @ca-scribner)
  • Implement some unit tests for the Katib Config package (#1690 by @tenzen-y)
  • Add GitHub Actions for Python unit tests (#1677 by @andreyvelich)
  • Add OWNERS file for the Katib new UI (#1681 by @kimwnasptd)
  • Add envtest to check reconcileRBAC (#1678 by @tenzen-y)
  • Use golangci-lint as linter for Go (#1671 by @tenzen-y)

v0.12.0

2 years ago

This is the Katib v0.12.0 release.

The major advantages:

  • Optuna Suggestion service with the new algorithms, big thanks to @g-votte and @c-bata.
  • Sobol's Quasirandom Sequence algorithm and IPOP-CMA-ES or BIPOP-CMA-ES restart strategies, big thanks to @c-bata.
  • Katib can perform Argo Workflows, big thanks to @andreyvelich.

New Features

Algorithms and Components

  • Add Optuna based suggestion service (#1613 by @g-votte)
  • Support Sobol's Quasirandom Sequence using Goptuna. (#1523 by @c-bata)
  • Bump the Goptuna version up to v0.8.0 with IPOP-CMA-ES and BIPOP-CMA-ES support. (#1519 by @c-bata)
  • Validate possible operations for Grid suggestion (#1205 by @andreyvelich)
  • Validate for Bayesian Optimization algorithm settings (#1600 by @anencore94)
  • Add Support for Argo Workflows (#1605 by @andreyvelich)
  • Add Support for XGBoost Operator with LightGBM example (#1603 by @andreyvelich)
  • Allow empty resources for CPU and Memory in Katib config (#1564 by @andreyvelich)
  • Add kustomization overlay: katib-openshift (#1513 by @maanur)
  • Switch to SDI in Katib Charm (#1555 by @knkski)

UI Features

  • Add Multivariate TPE to Katib UI (#1625 by @andreyvelich)
  • Update Katib UI with Optuna Algorithm Settings (#1626 by @andreyvelich)
  • Change the default image for the new Katib UI (#1608 by @andreyvelich)

Documentation

  • Add Katib 2021 ROADMAP (#1524 by @andreyvelich)
  • Add AutoML and Training WG Summit July 2021 (#1615 by @andreyvelich)
  • Add the new Katib presentations 2021 (#1539 by @andreyvelich)
  • Add Doc checklist to PR template (#1568 by @andreyvelich)
  • Fix typo in operators/README (#1557 by @evilnick)
  • Adds docs on how to use Katib Charm within KF (#1556 by @RFMVasconcelos)
  • Fix a link to Kustomize manifest for new Katib UI (#1521 by @c-bata)

Bug Fixes

  • Fix UI for handling missing params (#1657 by @kimwnasptd)
  • Reconcile semantics for Suggestion Algorithms (#1644 by @johnugeorge)
  • Fix Metrics Collector error in case of non-existing Process (#1614 by @andreyvelich)
  • Fix mysql version in docker image (#1594 by @munagekar)
  • Fix grep in Tekton Experiment Doc (#1578 by @andreyvelich)
  • Error messages corrected (#1522 by @himanshu007-creator)
  • Install charmcraft 1.0.0 (#1593 by @DomFleischmann)

Misc

  • Modify XGBoostJob example for the new Controller (#1623 by @andreyvelich)
  • Modify Labels for controller resources (#1621 by @andreyvelich)
  • Modify Labels for Katib Components (#1611 by @andreyvelich)
  • Upgrade CRDs to apiextensions.k8s.io/v1 (#1610 by @andreyvelich)
  • Update Katib SDK with OpenAPI generator (#1572 by @andreyvelich)
  • Disable default PV for Experiment with resume from volume (#1552 by @andreyvelich)
  • Remove PV from MySQL component (#1527 by @andreyvelich)
  • feat: add naming regex check on validating webhook (#1541 by @anencore94)

Change Log

Check the Full Change Log.

v0.11.1

2 years ago

This is the Katib v0.11.1 release.

Bug fixes

Enhancements

Check the Full Change Log.

v0.11.0

3 years ago

This is the Katib v0.11.0 release. The major advantages:

  • Katib is now supporting Kubernetes >= 1.18
  • Possibility to deploy a new Katib UI, big thanks to @kimwnasptd!
  • Juju operator support, big thanks to @DomFleischmann, @knkski and @RFMVasconcelos!

New Features

Core Features

UI Features and Enhancements

Katib Juju operator

Bug fixes

Documentation

Misc

Change Log

Check the Full Change Log.

v0.10.1

3 years ago

v0.10.0

3 years ago

This is the Katib 0.10 release for the Kubeflow 1.2. The new Katib v1beta1 API version has been released.

New Features

Core Features

UI Features and Enhancements

Bug fixes

Misc

Change Log

See the Full Change Log.