Production infrastructure for machine learning at scale
New features
c6 and g5 families can be found https://github.com/cortexlabs/cortex/issues/2414 (RobertLucian)
Bug fixes
cortex CLI https://github.com/cortexlabs/cortex/pull/2443 (RobertLucian)
Misc
New features
Bug fixes
/) https://github.com/cortexlabs/cortex/pull/2407 (RobertLucian)
cortex cluster export command https://github.com/cortexlabs/cortex/pull/2415 https://github.com/cortexlabs/cortex/issues/2414 (RobertLucian)
Misc
New features
pre_stop command for containers https://github.com/cortexlabs/cortex/pull/2403 (docs) (deliahu)
Misc
Bug fixes
cortex get --env command https://github.com/cortexlabs/cortex/pull/2404 (deliahu)
cortex cluster up for spot node groups with on-demand base capacity https://github.com/cortexlabs/cortex/pull/2406 (RobertLucian)
We have released v0.1.0 of the Nucleus model server!
Nucleus is a model server for TensorFlow and generic Python models. It is compatible with Cortex clusters, Kubernetes clusters, and any other container-based deployment platform. Nucleus can also be run locally via Docker Compose.
Some of Nucleus's features include:
New features
max_concurrency field) https://github.com/cortexlabs/cortex/pull/2376 https://github.com/cortexlabs/cortex/issues/2200 (miguelvr)
cortex describe API_NAME command for Realtime and Async APIs https://github.com/cortexlabs/cortex/pull/2368 https://github.com/cortexlabs/cortex/issues/2320 https://github.com/cortexlabs/cortex/issues/2359 (RobertLucian)
Misc
Bug fixes
api_load_balancer_cidr_white_list and operator_load_balancer_cidr_white_list https://github.com/cortexlabs/cortex/pull/2363 (RobertLucian)
New features
cortex cluster health command to show the health of the cluster's components https://github.com/cortexlabs/cortex/pull/2313 https://github.com/cortexlabs/cortex/issues/2029 (miguelvr)
Reliability
cortex get to improve scalability and reliability of the command https://github.com/cortexlabs/cortex/pull/2333 https://github.com/cortexlabs/cortex/issues/2319 (vishalbollu)
Bug fixes
Content-Type header is properly set to application/json for responses to Async API submissions https://github.com/cortexlabs/cortex/pull/2323 (vishalbollu)
node_groups selector to be updated in a running API https://github.com/cortexlabs/cortex/pull/2354 (RobertLucian)
Docs
Misc
cortex cluster info --print-config displays YAML that is consumable by cortex cluster configure https://github.com/cortexlabs/cortex/pull/2324 (vishalbollu)
New features
ssl_certificate_arn, api_load_balancer_cidr_white_list, and operator_load_balancer_cidr_white_list to be updated on an existing cluster (via the cortex cluster configure command) https://github.com/cortexlabs/cortex/pull/2305 https://github.com/cortexlabs/cortex/issues/2107 (vishalbollu)
Bug fixes
Misc
max_instances to 0 https://github.com/cortexlabs/cortex/pull/2310 (RobertLucian)
New features
cortex cluster configure command to add, remove, or scale nodegroups on a running cluster https://github.com/cortexlabs/cortex/pull/2246 https://github.com/cortexlabs/cortex/issues/2096 (RobertLucian)
cortex cluster info --print-config command to print the current configuration of a running cluster https://github.com/cortexlabs/cortex/pull/2246 (RobertLucian)
cortex refresh command for Async APIs https://github.com/cortexlabs/cortex/pull/2265 https://github.com/cortexlabs/cortex/issues/2237 (deliahu)
Breaking changes
cortex cluster scale command has been replaced by the cortex cluster configure command.
Bug fixes
cortex cluster up more permissive (to avoid unnecessarily failing checks on GovCloud) https://github.com/cortexlabs/cortex/pull/2248 (vishalbollu)
Docs
Misc
cortex cluster info for running batch jobs https://github.com/cortexlabs/cortex/pull/2270 (deliahu)
New features
Breaking changes
predictor section has been removed, the pod section has been added, and the autoscaling parameters have been modified slightly (depending on the workload type). See updated docs for Realtime, Async, Batch, and Task. If you'd like to see examples of Dockerizing Python applications, see our test/apis folder.
cortex prepare-debug command has been removed; Cortex now exclusively runs Docker containers, which can be run locally via docker run.
cortex patch command has been removed; its behavior is now identical to cortex deploy.
cortex logs command now prints a CloudWatch Insights URL with a pre-populated query that can be executed to show logs from your workloads, since this is the recommended approach in production. If you wish to stream logs from a pod at random, you can use cortex logs --random-pod (keep in mind that these logs will not include some system logs related to your workload).
Bug fixes
Docs
Misc
cortex logs https://github.com/cortexlabs/cortex/issues/2085 (vishalbollu)
New features
cortex env rename command https://github.com/cortexlabs/cortex/pull/2165 https://github.com/cortexlabs/cortex/issues/1773 (deliahu)
Breaking changes
deploy() method. Now, deploy() is used only to deploy project folders, and deploy_realtime_api(), deploy_async_api(), deploy_batch_api(), and deploy_task_api() are for deploying Python classes. (docs)
--keep-aws-resources flag is provided to cortex cluster down). Users' files should not be stored in the Cortex internal bucket.
Bug fixes
cortex cluster info https://github.com/cortexlabs/cortex/pull/2140 https://github.com/cortexlabs/cortex/issues/2129 (RobertLucian)
Misc
--keep-aws-resources flag with cortex cluster down to preserve AWS resources https://github.com/cortexlabs/cortex/pull/2161 https://github.com/cortexlabs/cortex/issues/1612 (RobertLucian)
--min-instances or --max-instances with cortex cluster scale https://github.com/cortexlabs/cortex/pull/2149 (RobertLucian)