Trains Versions Save

ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution

v1.12.2

8 months ago

New Features and Improvements

  • Add always_create_from_code argument to PipelineController (default True) to allow remote execution to create the Pipeline DAG at runtime (as opposed to adhering to the existing structure stored on the task when running locally)

Bug Fixes

  • Fix using OutputModel reporting and other methods fail if OutputModel.update_weights() is not called before that (#1078)
  • Fix hash for Pandas objects artifacts
  • Fix Hydra/OmegaConf --overrides support
  • Fix Pipeline metrics not forwarded to pipeline controller at the end of a pipeline step

v1.12.1

9 months ago

New Features and Improvements

  • Raise error if dataset name is empty (#1083, thanks @mathematicalmichael!)
  • Add support for spawn start method for Python multiprocess. This should help circumvent issues like this
  • Improve docstrings

Bug Fixes

  • Fix S3 uploads fail with LazyEvalWrapper type error (#1081)

v1.12.0

9 months ago

Breaking Changes

  • As a result of fixing #1051 and similar issues with retrieving task data/models/artifacts/samples stored in GCS we introduced some breaking changes. To access resources registered to GCS from ClearML SDK versions 1.11.0 to 1.11.1, see the instructions here.

New Features and Improvements

  • Add include_archive parameter to Dataset.list_datasets(), now possible to include archived datasets in the returned list (#1069, thanks @natephysics!)
  • Add support to specify the multipart threshold and chunk size using the aws.boto3.multipart_chunksize and aws.boto3.multipart_threshold configuration options (#1059, thanks @cgaudreau-ubisoft!)
  • Add PipelineController.get_pipeline() for retrieving previously run pipelines

Bug Fixes

  • Fix AWS driver issues (#1000, thanks @achaiah!)
    • Fix credential authentication failure when attempting to use token
    • Fix instantiation within VPC without AvailabilityZones
  • Fix continue_last_task=0 is ignored in pipelines run with retry_on_failure (#1054)
  • Fix Task.connect_configuration() doesn’t handle dictionaries with special characters
  • Fix pipeline steps created with PipelineDecorator aren't cached
  • Fix Task.get_by_name() doesn't return the most recent task when multiple tasks have same name
  • Fix boto3 certificate verification ignores custom self-signed certificates

v1.11.1

10 months ago

New Features and Improvements

  • Support lightning>=2.0 (#1033, thanks @aweinmann!)
  • Add clearml-init support for standard HTTP and HTTPS ports in webapp in conjunction with non-default api/files server ports (#1031, thanks @pktiuk!)
  • Improve docstrings

Bug Fixes

  • Fix remote execution does not log configuration values from modified jsonargparse configuration files
  • Fix ValueError when setting task properties at the start of a pipeline component
  • Fix OS exceptions on passing Path where str expected
  • Fix infinite recursion when remotely updating a dict added to task with Task.connect()
  • Fix Task.connect race condition overwriting task description with connected input model

v1.11.0

11 months ago

New Features and Improvements

  • Add offline mode for ClearML Datasets (#854)
  • Improve performance of Dataset.add_external_files() (#962, thanks @john-zielke-snkeos!)
  • Add Task.launch_multi_node() for distributed experiment execution
  • Add Task.get_all_reported_scalars() to fetch all available scalar data
  • Add Task.remove_input_models() to disassociate input models from a task
  • Improve Dataset.list_datasets() to include dataset version in the result
  • Add support for PyTorch Lightning CLI >=2.0
  • Add support for automatic Dataset content preview for JSON files
  • Improve docstrings

Bug Fixes

  • Fix multiple local parallel experiments conflicting on storage access validation (#897)
  • Fix code running with joblib hangs (#1009)
  • Fix comment for storage scheme gs:// (#1018, thanks @pzarfos!)
  • Fix typo "Categories per column" (#1021, thanks @antonaleks!)
  • Fix downloading artifacts from GCP when the download URI includes unsafe characters
  • Fix pipeline step cache is not cleared when the only change to the step is that the docker image changed
  • Fix Task.report_text() sometimes reporting to an incorrect task when multiple tasks run simultaneously
  • Fix Task.set_offline(offline_mode=False) raising an UnboundLocalError
  • Fix scikit-image package (skimage) is sometimes not detected as a dependency
  • Fix PipelineDecorator sometimes causing a race condition when starting a remote execution

v1.10.4-fix

11 months ago

New Features and Improvements

  • Update references to GetAllRequest in Task docstrings (#982, thanks @Make42!)

Bug Fixes

  • Fix issue with JSON conversion when using jsonargparse during remote execution (clearml-agent #153 and #1010)
  • Fix ProxyDictPostWrite.update() not triggering a write-back to the backend (#985)
  • Fix standard streams being patched in subtasks, which was causing the streams to be reported to the wrong task
  • Fix TypeError when using Task.query_tasks() without specifying the task_filter parameter
  • Fix Tensorboard negative iterations, they are now zeroed
  • Fix Task.init(continue_last_task=0) when running remotely, now no longer ignored
  • Fix urllib3 import error when using urllib3>=2.0.0

v1.10.3

1 year ago

Bug Fixes

  • Fix mixed-up logs when using pipelines. Now pipeline controller logs won't be visible in pipeline steps' logs.

v1.10.2

1 year ago

New Features and Improvements

  • Allow task filtering with Task.query_tasks() using TaskTypes (#938 )
  • Allow adding tags to individual pipeline steps and to the PipelineController
  • Restrict Dataset.create() usage for old server API versions
  • Improve docstrings

Bug Fixes

  • Fix Task.get_tasks docstring (#937)
  • Fix Task.get_reported_scalars() docstring for x_axis parameter when set to "timestamp" (#964, thanks @jday1!)
  • Fix missing plots on models in Windows
  • Fix Pipeline version auto-bumping
  • Fix APIClient.events.debug_images() for latest server API version

v1.10.1

1 year ago

New Features and Improvements

  • Add cast parameter to Task.get_parameter method (#958, thanks @harry-optimised!)

Bug Fixes

  • Fix backward compatibility issue when using datetime filtering with APIClient().models.get_all and APIClient().tasks.get_all requests
  • Fix docstrings explaining tags filtering on Model and Task

v1.10.0

1 year ago

Breaking Changes

  • When using MinIO or Azure storage, downloading using StorageManager.download_file() and StorageManager.download_folder() will not create a subfolder with the bucket name in the generated download path (#709)

New Features and Improvements

  • Improve the Task.mark_completed, Task.close, and the hyperparameter example (#927, thanks @Make42!)
  • Add Task.get_debug_samples() to retrieve task's debug samples (#761)
  • Add custom artifact serialization for pipelines using the new artifact_{serialization,deserialization}_function parameters in PipelineController and PipelineDecorator (#689)
  • Allow registering offline models (#653)
  • Add table layout configuration to Logger.report_table() through extra_data parameter, solving (#796)
  • Add support for attaching plots and scalars to models using the new Model.report_* methods
  • Add force_download argument to Model.get_local_copy() to force the downloading of a new copy of a model even if already in the cache
  • Add PipelineController.connect_configuration() to add configuration dictionaries to a pipeline
  • Add Dataset.delete configurable hosted contents deletion support, allowing deletion of both ClearML file server hosted files, and external storage hosted files
  • Add PipelineController.is_successful() criteria customization
  • Adjust CLEARML_VCS_DIFF environment variable behavior, setting it to an empty string now forces the task to not log uncommitted changes
  • Add support for API v2.23
  • Add Getting Started Notebooks
  • Add the output_uri=True argument in the hook config for the OpenMMLab example
  • Add local HPO command line execution with the --local argument
  • Add support for python package namespace detection

Bug Fixes

  • Fix logging dependencies that use the subdirectory argument when pip installing from a git repo (#947, thanks @jday1!)
  • Fix missing comma leading to a SyntaxError (#959, thanks @jday1!)
  • Fix auto-logging for multiple models trained by the same framework in the same task (#767)
  • Fix wrong overriding of custom server URLs when running clearml-init (clearml-server #181 and #910)
  • Fix parallel coordinates plot display of categorical variables (#907)
  • Fix task.connect() docstring (ClearML Docs issue #473)
  • Fix auto-publishing in Dataset.finalize() raises exception due to backward compatibility issue (#908)
  • Fix auto-detection of Jupyter Notebooks in Sagemaker
  • Fix Gradio binding being triggered even when gradio is not imported
  • Fix reconnection following an offline session. Only allow closed tasks to be reconnected
  • Fix python-fire support
  • Fix mishandling nested dictionaries in Task.connect() inside PipelineDecorator
  • Fix get_or_create_project crashes when run in parallel
  • Fix task.get_user_properties(value_only=True), previously raising an exception
  • Fix silence_errors parameter not working as expected in StorageManager.get_file_size_bytes()
  • Fix setting Dataset metadata name to a string prefixed with data_ breaking Dataset.get
  • Fix Pipeline fails when creating tasks from multi-argument functions, due to a parsing error when working with Python comma slices