Trains Versions Save

ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution

v1.15.1

1 week ago

Bug Fixes

Fix auto-scaler should recheck that the worker is still IDLE before shutting it down (#1240, thanks @cthorey!)
Fix resource monitor fails to get GPU stats in some edge cases

v1.15.0

3 weeks ago

New Features

Add draft option to pipeline steps (#1226, thanks @CharlesFrankum!)
Add support for custom working directory for pipelines (#1194)
Add Task.get_requirements() method returning the task’s requirements
Allow controlling the number of threads used by StorageManager.download_folder() using the max_workers argument
Update examples dependencies
Improve auto populate in Task.init()
Documentation:
- Add docstrings for model properties
- Improve docstring for force_requirements_env_freeze
- Add Task.set_packages() docstring notes

Bug Fixes

Fix UTF-8 script code encoding issue (#1208, thanks @ae-ae!)
Fix Colab docs (#1220, thanks @tkukurin!)
Fix metrics reporting with OutputModel while in offline mode results in an error (#1172)
Fix task running in Google Colab doesn't properly get the notebook diff (#1204)
Fix hydra binds break in offline mode (#1215)
Fix HPO crashes when optimizing for single value scalars (#1221)
Fix GPU info such as gpu_memory and gpu_type is not being collected in some cases
Fix clearml-data CLI tool will move non-dataset tasks to a .dataset project if the respective task is not a dataset
Fix pandas DataFrame artifacts with hierarchical indices get mangled by the CSV round-trip
Fix urllib3 sends deprecation warning when setting ssl_version
Fix Task.connect(dict) return value is not dict-compatible
Fix jsonargparse sub-command config parsing
Fix Lightning integration crashes when a config entry contains . in its name
Fix Python 3.5 compatibility

v1.14.4

2 months ago

Bug Fixes

Fix AWS S3 configuration secure property not forwarded in StorageHelper (#1205, thanks @odulcy-mindee!)
Fix NVIDIA_VISIBLE_DEVICES=all env var value will result in no GPU metrics being reported
Fix setting output_uri when an AWS S3 config section with secure=false exists raises an error
Fix CacheManager attempts to clear cache when file limit has not been reached resulting in slow unnecessary checks

v1.14.3

2 months ago

New Features and Bug Fixes

Add hash check for pickle files (resolves CVE-2024-24590)
Fix clearml-task relative paths calculation when --cwd, --folder and --script are provided (#1161, thanks @dimidagd!)
Fix regression causing an error when secure is provided in AWS S3 bucket settings

v1.14.2

2 months ago

New Features and Improvements

Report upload/download with tqdm if installed
Add support for a PipelineController.version property
Add support for specifying an AWS profile when providing bucket storage credentials
Add an ignore_remote_overrides argument to Task.connect() and Task.connect_configuration() allowing users to disable overriding values when running remotely
Add support for setting a custom certificate path using the CLEARML_API_HOST_VERIFY_CERT environment variable (instead of just allowing the user to disable verification)
Add dataset reporting example (#1180)
Update README GIFs (#1191)

Bug Fixes

Fix missing component callbacks on multiple step calls (#1195, thanks @materight!)
Fix unsafe usage of tempfile.mktemp
Fix potential path traversal on file download (CVE-2024-24591)
Fix Python 3.12 support by not using distutils
Fix TaskScheduler does not save local state before running remotely
Fix pipeline should continue instead of aborting when PipelineController.abort_on_failure is set to False and a step has been aborted
Fix HPO crashes when its corresponding task is initialized with continue_last_task=True

v1.14.1

3 months ago

Bug Fixes

Fix created pipelines do not appear in the UI pipelines section

v1.14.0

3 months ago

New Features and Improvements

Add support for decorated pipeline steps (#1154)
Improve torch.distributed behavior and auto-detect Huggingface accelerate distributed execution
Improve jsonargparse integration: allow_config_file_override_from_ui argument added for enabling/disabling overriding configuration file entries
Add raise_on_error parameter to Model.update_weights() and Model.update_weights_package()
Add HPO multi-objective optimization support by specifying multiple optimization objectives to the HyperParameterOptimizer class
Improve pipelines from functions: libraries used by pipeline steps no longer have to be imported in the step itself and can be imported globally instead
Support clearing repository details in task.set_repo()
Auto detect tensorflow-intel as tensorflow
Add support for auto detecting torch and transformers accelerate distributed execution
Add tabular data multi-stage pipeline process example

Bug Fixes

Fix langchain integration (#1124, #1126)
Fix using target_project with TaskScheduler.add_task() corrupts project if used with pipelines (#1137)
Fix PipelineController.add_function_step() fails when adding custom decorated step (#1154)
Fix jsonargparse binding does not capture parameters before Task.init() is called (#1164)
Fix invalid jsonargparse arguments should be deleted
Fix handling of appended parameters to Hydra defaults list
Fix Task.set_repo() and Task.set_packages() do not wait for automatic asynchronous repository detection
Fix active GPU filtering in resource monitor to support GPU ID and not just GPU idx
Fix misleading error raised when using the SDK with no configuration file or an invalid configuration file
Fix process sometimes hangs issue by improving exit and exception handlers, unregistering signal handling in child processes except for the BackgroundMonitor

v1.13.2

5 months ago

New Features

Support parallel dataset verification (#1131, thanks @charlienewey-odin!)
Support lightgbm model loading from a string (#1136, thanks @bedapisl!)
Add support for Hydra command-line syntax for modifying omegaconf
Add support for setting output_uri in pipeline decorators and functions
Add more visibility when overriding jsonargparse arguments
Use StorageManager cache when downloading models
Support parsing GS credentials as a JSON string if they do not point to a JSON file
Support ignoring default_output_uri on remote runs when Task.init(output_uri=False) was specified and the backend task data has no output_uri specified
Add configuration files logging example
Support raising LoginError when initializing
Add HuggingFace Transformers Example (#1146)
Improve docstrings for tag filters
Add a security policy file (#1148)

Bug Fixes

Fix Google Storage subdir path in bucket URL breaks validation (#1117, thanks @materight!)
Fix jsonschema DeprecationWarning in newer versions (#1128, thanks @xquyvu!)
Fix task.set_script(diff="") does not clear the uncommitted changes (#1106)
Fix parallel dataset verification fails on older Python versions (#1144)
Fix attrs.__version__ deprecation warning (#1145)
Fix Task.get_parameters_as_dict(cast=True) raises a YAML parser error
Fix potential threading issues when setting self.lock to None
Fix pipeline version is not displayed in the table view and version list
Fix displayed pipeline run time in pipeline components is not the same as that shown in the info tab in full detail
Fix Model.connect() in remote execution might result in the wrong model being connected
Fix exception raised on ConnectionError (which actually indicates that a read has timed out, and should be retried)
Fix missing configuration error not raised in certain cases
Fix loading queues containing metadata fails due to an incorrect queue metadata scheme
Fix pyjwt requirements for Python < 2.9
Fix pipeline examples

v1.13.1

6 months ago

Bug Fixes

Fix Hydra overrides integration (#1120)
Fix GPU memory used reports 0 when memory can not be queried per process (edge case)

v1.13.0

7 months ago

New Features

Add support for recursive list, dict, and tuple ref parsing for pipeline controller.add step() parameter overrides (#1099, thanks @natephysics!)
Update PyNVML to the latest NVIDIA version for better GPU monitoring
Add force_single_script_file argument to Task.create() to avoid git repository auto detection
Use os.register_at_fork instead of monkey patching fork for python > 3.6
Add support for programmatically archive and unarchive models from the model registry using the Model.archive() and Mode.unarchive() methods (#1096)
Improve documentation

Bug Fixes

Fix Hydra overrides integration (#552)
Fix model reporting in TensorFlow 2.13 does not work properly (#1112)
Fix GPU memory used reports 0 when memory can not be queried per process
Fix FastAI binding not reporting scalars if TensorBoard is installed
Fix JSON serialization error does not raise exception
Fix OutputModel should prefer connected task name over filename stem
Fix dataset zip extraction might fail when creating folders concurrently (with Exception [Errno 17] File exists warning)
Fix import sys should not appear in pipeline component auto detected packages
Fix compatibility issue with requests < 2.28