MLServer Versions Save

An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more

1.5.0

2 months ago

What's Changed

Update CHANGELOG by @github-actions in https://github.com/SeldonIO/MLServer/pull/1592
build: Migrate away from Node v16 actions by @jesse-c in https://github.com/SeldonIO/MLServer/pull/1596
build: Bump version and improve release doc by @jesse-c in https://github.com/SeldonIO/MLServer/pull/1602
build: Upgrade stale packages (fastapi, starlette, tensorflow, torch) by @sakoush in https://github.com/SeldonIO/MLServer/pull/1603
fix(ci): tests and security workflow fixes by @sakoush in https://github.com/SeldonIO/MLServer/pull/1608
Re-generate License Info by @github-actions in https://github.com/SeldonIO/MLServer/pull/1612
fix(ci): Missing quote in CI test for all_runtimes by @sakoush in https://github.com/SeldonIO/MLServer/pull/1617
build(docker): Bump dependencies by @jesse-c in https://github.com/SeldonIO/MLServer/pull/1618
docs: List supported Python versions by @jesse-c in https://github.com/SeldonIO/MLServer/pull/1591
fix(ci): Have separate smaller tasks for release by @sakoush in https://github.com/SeldonIO/MLServer/pull/1619

Notes

We remove support for python 3.8, check https://github.com/SeldonIO/MLServer/pull/1603 for more info. Docker images for mlserver are already using python 3.10.

Full Changelog: https://github.com/SeldonIO/MLServer/compare/1.4.0...1.5.0

1.4.0

2 months ago

What's Changed

Free up some space for GH actions by @adriangonz in https://github.com/SeldonIO/MLServer/pull/1282
Introduce tracing with OpenTelemetry by @vtaskow in https://github.com/SeldonIO/MLServer/pull/1281
Update release CI to use Poetry by @adriangonz in https://github.com/SeldonIO/MLServer/pull/1283
Re-generate License Info by @github-actions in https://github.com/SeldonIO/MLServer/pull/1284
Add support for white-box explainers to alibi-explain runtime by @ascillitoe in https://github.com/SeldonIO/MLServer/pull/1279
Update CHANGELOG by @github-actions in https://github.com/SeldonIO/MLServer/pull/1294
Fix build-wheels.sh error when copying to output path by @lc525 in https://github.com/SeldonIO/MLServer/pull/1286
Fix typo by @strickvl in https://github.com/SeldonIO/MLServer/pull/1289
feat(logging): Distinguish logs from different models by @vtaskow in https://github.com/SeldonIO/MLServer/pull/1302
Make sure we use our Response class by @adriangonz in https://github.com/SeldonIO/MLServer/pull/1314
Adding Quick-Start Guide to docs by @ramonpzg in https://github.com/SeldonIO/MLServer/pull/1315
feat(logging): Provide JSON-formatted structured logging as option by @vtaskow in https://github.com/SeldonIO/MLServer/pull/1308
Bump in conda version and mamba solver by @dtpryce in https://github.com/SeldonIO/MLServer/pull/1298
feat(huggingface): Merge model settings by @jesse-c in https://github.com/SeldonIO/MLServer/pull/1337
feat(huggingface): Load local artefacts in HuggingFace runtime by @vtaskow in https://github.com/SeldonIO/MLServer/pull/1319
Document and test behaviour around NaN by @adriangonz in https://github.com/SeldonIO/MLServer/pull/1346
Address flakiness on 'mlserver build' tests by @adriangonz in https://github.com/SeldonIO/MLServer/pull/1363
Bump Poetry and lockfiles by @adriangonz in https://github.com/SeldonIO/MLServer/pull/1369
Bump Miniforge3 to 23.3.1 by @adriangonz in https://github.com/SeldonIO/MLServer/pull/1372
Re-generate License Info by @github-actions in https://github.com/SeldonIO/MLServer/pull/1373
Improved huggingface batch logic by @ajsalow in https://github.com/SeldonIO/MLServer/pull/1336
Add inference params support to MLFlow's custom invocation endpoint (… by @M4nouel in https://github.com/SeldonIO/MLServer/pull/1375
Increase build space for runtime builds by @adriangonz in https://github.com/SeldonIO/MLServer/pull/1385
Fix minor typo in sklearn README by @krishanbhasin-gc in https://github.com/SeldonIO/MLServer/pull/1402
Add catboost classifier support by @krishanbhasin-gc in https://github.com/SeldonIO/MLServer/pull/1403
added model_kwargs to huggingface model by @nanbo-liu in https://github.com/SeldonIO/MLServer/pull/1417
Re-generate License Info by @github-actions in https://github.com/SeldonIO/MLServer/pull/1456
Local response cache implementation by @SachinVarghese in https://github.com/SeldonIO/MLServer/pull/1440
fix link to custom runtimes by @kretes in https://github.com/SeldonIO/MLServer/pull/1467
Improve typing on Environment class by @krishanbhasin-gc in https://github.com/SeldonIO/MLServer/pull/1469
build(dependabot): Change reviewers by @jesse-c in https://github.com/SeldonIO/MLServer/pull/1548
MLServer changes from internal fork - deps and CI updates by @sakoush in https://github.com/SeldonIO/MLServer/pull/1588

New Contributors

@vtaskow made their first contribution in https://github.com/SeldonIO/MLServer/pull/1281
@lc525 made their first contribution in https://github.com/SeldonIO/MLServer/pull/1286
@strickvl made their first contribution in https://github.com/SeldonIO/MLServer/pull/1289
@ramonpzg made their first contribution in https://github.com/SeldonIO/MLServer/pull/1315
@jesse-c made their first contribution in https://github.com/SeldonIO/MLServer/pull/1337
@ajsalow made their first contribution in https://github.com/SeldonIO/MLServer/pull/1336
@M4nouel made their first contribution in https://github.com/SeldonIO/MLServer/pull/1375
@nanbo-liu made their first contribution in https://github.com/SeldonIO/MLServer/pull/1417
@kretes made their first contribution in https://github.com/SeldonIO/MLServer/pull/1467

Full Changelog: https://github.com/SeldonIO/MLServer/compare/1.3.5...1.4.0

1.3.5

10 months ago

What's Changed

Rename HF codec to hf by @adriangonz in https://github.com/SeldonIO/MLServer/pull/1268
Publish is_drift metric to Prom by @joshsgoldstein in https://github.com/SeldonIO/MLServer/pull/1263

New Contributors

@joshsgoldstein made their first contribution in https://github.com/SeldonIO/MLServer/pull/1263

Full Changelog: https://github.com/SeldonIO/MLServer/compare/1.3.4...1.3.5

1.3.4

10 months ago

What's Changed

Silent logging by @dtpryce in https://github.com/SeldonIO/MLServer/pull/1230
Fix mlserver infer with BYTES by @RafalSkolasinski in https://github.com/SeldonIO/MLServer/pull/1213

New Contributors

@dtpryce made their first contribution in https://github.com/SeldonIO/MLServer/pull/1230

Full Changelog: https://github.com/SeldonIO/MLServer/compare/1.3.3...1.3.4

1.3.3

11 months ago

What's Changed

Add default LD_LIBRARY_PATH env var by @adriangonz in https://github.com/SeldonIO/MLServer/pull/1120
Adding cassava tutorial (mlserver + seldon core) by @edshee in https://github.com/SeldonIO/MLServer/pull/1156
Add docs around converting to / from JSON by @adriangonz in https://github.com/SeldonIO/MLServer/pull/1165
Document SKLearn available outputs by @adriangonz in https://github.com/SeldonIO/MLServer/pull/1167
Fix minor typo in alibi-explain tests by @ascillitoe in https://github.com/SeldonIO/MLServer/pull/1170
Add support for .ubj models and improve XGBoost docs by @adriangonz in https://github.com/SeldonIO/MLServer/pull/1168
Fix content type annotations for pandas codecs by @adriangonz in https://github.com/SeldonIO/MLServer/pull/1162
Added option to configure the grpc histogram by @cristiancl25 in https://github.com/SeldonIO/MLServer/pull/1143
Add OS classifiers to project's metadata by @adriangonz in https://github.com/SeldonIO/MLServer/pull/1171
Don't use qsize for parallel worker queue by @adriangonz in https://github.com/SeldonIO/MLServer/pull/1169
Fix small typo in Python API docs by @krishanbhasin-gc in https://github.com/SeldonIO/MLServer/pull/1174
Fix star import in mlserver.codecs.* by @adriangonz in https://github.com/SeldonIO/MLServer/pull/1172

New Contributors

@cristiancl25 made their first contribution in https://github.com/SeldonIO/MLServer/pull/1143
@krishanbhasin-gc made their first contribution in https://github.com/SeldonIO/MLServer/pull/1174

Full Changelog: https://github.com/SeldonIO/MLServer/compare/1.3.2...1.3.3

1.3.2

1 year ago

What's Changed

Use default initialiser if not using a custom env by @adriangonz in https://github.com/SeldonIO/MLServer/pull/1104
Add support for online drift detectors by @ascillitoe in https://github.com/SeldonIO/MLServer/pull/1108
added intera and inter op parallelism parameters to the hugggingface … by @saeid93 in https://github.com/SeldonIO/MLServer/pull/1081
Fix settings reference in runtime docs by @adriangonz in https://github.com/SeldonIO/MLServer/pull/1109
Bump Alibi libs requirements by @adriangonz in https://github.com/SeldonIO/MLServer/pull/1121
Add default LD_LIBRARY_PATH env var by @adriangonz in https://github.com/SeldonIO/MLServer/pull/1120
Ignore both .metrics and .envs folders by @adriangonz in https://github.com/SeldonIO/MLServer/pull/1132

New Contributors

@ascillitoe made their first contribution in https://github.com/SeldonIO/MLServer/pull/1108

Full Changelog: https://github.com/SeldonIO/MLServer/compare/1.3.1...1.3.2

1.3.1

1 year ago

What's Changed

Move OpenAPI schemas into Python package (#1095)

1.3.0

1 year ago

WARNING :warning: : The 1.3.0 has been yanked from PyPi due to a packaging issue. This should have been now resolved in >= 1.3.1.

What's Changed

Custom Model Environments

More often that not, your custom runtimes will depend on external 3rd party dependencies which are not included within the main MLServer package - or different versions of the same package (e.g. scikit-learn==1.1.0 vs scikit-learn==1.2.0). In these cases, to load your custom runtime, MLServer will need access to these dependencies.

In MLServer 1.3.0, it is now possible to load this custom set of dependencies by providing them, through an environment tarball, whose path can be specified within your model-settings.json file. This custom environment will get provisioned on the fly after loading a model - alongside the default environment and any other custom environments.

Under the hood, each of these environments will run their own separate pool of workers.

Custom Metrics

The MLServer framework now includes a simple interface that allows you to register and keep track of any custom metrics:

[mlserver.register()](https://mlserver.readthedocs.io/en/latest/reference/api/metrics.html#mlserver.register): Register a new metric.
[mlserver.log()](https://mlserver.readthedocs.io/en/latest/reference/api/metrics.html#mlserver.log): Log a new set of metric / value pairs.

Custom metrics will generally be registered in the [load()](https://mlserver.readthedocs.io/en/latest/reference/api/model.html#mlserver.MLModel.load) method and then used in the [predict()](https://mlserver.readthedocs.io/en/latest/reference/api/model.html#mlserver.MLModel.predict) method of your custom runtime. These metrics can then be polled and queried via Prometheus.

OpenAPI

MLServer 1.3.0 now includes an autogenerated Swagger UI which can be used to interact dynamically with the Open Inference Protocol.

The autogenerated Swagger UI can be accessed under the /v2/docs endpoint.

Alongside the general API documentation, MLServer also exposes now a set of API docs tailored to individual models, showing the specific endpoints available for each one.

The model-specific autogenerated Swagger UI can be accessed under the following endpoints:

/v2/models/{model_name}/docs
/v2/models/{model_name}/versions/{model_version}/docs

HuggingFace Improvements

MLServer now includes improved Codec support for all the main different types that can be returned by HugginFace models - ensuring that the values returned via the Open Inference Protocol are more semantic and meaningful.

Massive thanks to @pepesi for taking the lead on improving the HuggingFace runtime!

Support for Custom Model Repositories

Internally, MLServer leverages a Model Repository implementation which is used to discover and find different models (and their versions) available to load. The latest version of MLServer will now allow you to swap this for your own model repository implementation - letting you integrate against your own model repository workflows.

This is exposed via the model_repository_implementation flag of your settings.json configuration file.

Thanks to @jgallardorama (aka @jgallardorama-itx ) for his effort contributing this feature!

Batch and Worker Queue Metrics

MLServer 1.3.0 introduces a new set of metrics to increase visibility around two of its internal queues:

Adaptive batching queue: used to accumulate request batches on the fly.
Parallel inference queue: used to send over requests to the inference worker pool.

Many thanks to @alvarorsant for taking the time to implement this highly requested feature!

Image Size Optimisations

The latest version of MLServer includes a few optimisations around image size, which help reduce the size of the official set of images by more than ~60% - making them more convenient to use and integrate within your workloads. In the case of the full seldonio/mlserver:1.3.0 image (including all runtimes and dependencies), this means going from 10GB down to ~3GB.

Python API Documentation

Alongside its built-in inference runtimes, MLServer also exposes a Python framework that you can use to extend MLServer and write your own codecs and inference runtimes. The MLServer official docs now include a reference page documenting the main components of this framework in more detail.

New Contributors

@rio made their first contribution in https://github.com/SeldonIO/MLServer/pull/864
@pepesi made their first contribution in https://github.com/SeldonIO/MLServer/pull/692
@jgallardorama made their first contribution in https://github.com/SeldonIO/MLServer/pull/849
@alvarorsant made their first contribution in https://github.com/SeldonIO/MLServer/pull/860
@gawsoftpl made their first contribution in https://github.com/SeldonIO/MLServer/pull/950
@stephen37 made their first contribution in https://github.com/SeldonIO/MLServer/pull/1033
@sauerburger made their first contribution in https://github.com/SeldonIO/MLServer/pull/1064

1.2.4

1 year ago

Full Changelog: https://github.com/SeldonIO/MLServer/compare/1.2.3...1.2.4

1.2.3

1 year ago

Full Changelog: https://github.com/SeldonIO/MLServer/compare/1.2.2...1.2.3