Chaos Genius Versions Save

ML powered analytics engine for outlier detection and root cause analysis.

v0.11.0

1 year ago

Hello everyone :wave:, we’re announcing the release of Chaos Genius v.0.11.0 today! :tada::rocket: This release we bring an addition to our Anomaly Detection and Alerts module as well as resolve critical bugs.

:rotating_light: Enhancement to our Anomaly Detection and Alerts Module

To make Anomaly Detection more insightful we now provide users expected value of their time series data in addition to the expected range. Furthermore, we have updated the alerts format and dashboard to provide a better percentage of change based on expected values.

  • Add Expected Value to Anomaly and Alerts #1132 #1138 #1140 #1144

:bug: Bug fixes

  • fix(retrain): clear recent alerts data and set appropriate last_anomaly_timestamp #1128
  • chore(dev-ops): Upgrade dependencies #1131 #1127 #1126 #1124

v0.10.2

1 year ago

Hello everyone :wave:,

We’re announcing the release of Chaos Genius v.0.10.2 today! :tada::rocket: This release we focus on resolving critical bugs and adding a new connector.

:chains: Adding Clickhouse as a Data Connector

To make Chaos Genius operational for more environments, we’ve added a new data connector - Clickhouse - to our roster. Thank you @MahmoudElhalwany for your contribution!

Feature/clickhouse connector #1090

:bug: Bug fixes

  • fix(alert-report): ignore alerts of deleted/inactive alerts #1118
  • fix(datasource): use case-insensitive check for LIMIT #1116
  • fix(front-end): Bug event alert toast on test query #1105
  • fix(front-end):changed sidebar support text and icon #1092
  • chore(dev-ops): Upgrade dependencies #1109 #1108 #1107 #1096

v0.10.1

1 year ago

Hey everyone :wave:, we’re back announcing the release of Chaos Genius v0.10.1! :tada::rocket: This is a minor hotfix release to resolve some critical bugs.

:bug: Bug fixes

  • Dimension and subdimensional options should be searchable #1080
  • feat(subdim-filter): sort dimension and value lists #1069
  • feat: count column for druid supports float now #1075
  • fix: check dashboard/datasource active in filter options #1079
  • Removed xAxis legend of KPI summary chart #1067
    • Thank you Somya Jain and Ashish Sahoo for all of the above suggestions and bug reports
  • fix(docker): add analytics params to alerts worker #1078
    • Thank you Grant Xu for reporting this bug
  • fix(anomaly): ensure correct data types #1077
    • Thank you Alberto Azambuja for reporting this bug
  • fix(anomaly): convert slack into hourly value for hourly KPIs #1082
  • fix(front-end): called all alerts API when alerts are deleted #1081
  • Disable analytics endpoint #1073

v0.10.0

1 year ago

Hey everyone :wave:, we’re announcing the release of Chaos Genius v0.10.0! :tada::rocket:

This release focuses on clearing the bugs backlog and improving the quality of our root cause analysis for anomalies. Here are some brief notes on our new features and bug fixes:

:zap: Enhancements to Root Cause/Drill Down Algorithm for Anomalies

Till now, for finding the reasons behind an anomalous data point, Drilldowns would mostly sort the underlying sub-populations by the severity score of their anomalies. This often led to high severity but low impact sub-populations being ranked higher in the Drilldowns hierarchy. To fix this, Drilldowns for an anomalous point in a KPI are now sorted by a new metric called Impact Score. This metric weighs both the severity of the data point as well as the contribution the subdimension has to the overall KPI. Users will now have more relevant and clearer insights into their data.

  • Impact Score Metric #1033

:bug: Bug fixes

  • Fix anomaly subdim filter and data download for subdims with empty value #1057
  • removed check of hasdruiddropdown #1056
  • fix(drilldowns): reset count col before data loading #1055
  • fix(data-loader): fix bug when metric and count column are same #1053
  • fixed security vulnerabilites #1052
  • fix(drilldowns): use severity when impact is 0 #1051
  • fix(anomaly): when metric and count column are same for pre-aggregated data #1048
  • feat: add path safe KPI and subdim names for CSV download #1045
  • download csv name changes #1047 #1044
  • fix(anomaly-params): use .get instead of indexing, check rca_time exists #1043
  • added description and numerical range to slider values #1041
  • fix(migrations): handle case where value of subdim can be an empty value #1040
  • perf: optimize graph JSON creation #1039
  • Make anomaly CSV download work for subdims too #1038 #1036
  • feat(alerts): add CSV to email alert report, better CSV names #1035
  • Schedule Deepdrills/summary at same time as anomaly for daily running KPIs #1034
  • refactor anomaly data view for style, lints, type errors #1032
  • fix(anomaly): smaller time window, drop dups for hourly KPI drilldown #1029
  • fix: added guard condition to fix #1021 #1027

v0.9.1

1 year ago

A minor hotfix release for some critical bugs.

:bug: Fixes

  • Alert dashboard did not work when anomaly was manually disabled for some KPIs (#1020)
  • Alerts had an extra "Sub-dimensional anomalies" heading and "Reasons for change" text even when there were no sub-dimensional anomalies or relevant subdims (#1024)
  • CSS bug in dashboard filter (#1023)

v0.9.0

1 year ago

Hey everyone :wave:, we’re back announcing the release of v0.9.0 of Chaos Genius! Here are some brief notes on our new features and bug fixes:

:arrow_double_down: Sub Dimensional Filter for Anomaly Detection

Users can now browse through anomalies for any sub-dimensions (e.g. Country = UK) by just choosing the filters from the top; this has been a much requested feature and we’re happy that we’re finally supporting this.

  • Sub-Dimension Anomaly Filter #999
  • Sub-Dim Filter Front End #1002 #1004

Sub-dimensional filter in anomaly

:rotating_light: Enhancements to our Alerts Module

Sub-Dimensional alerts have returned with a new and improved format! Users can choose to receive sub dimensional alerts while adding a KPI alert. To make alerts more insightful, we now provide Reasons for Change for any anomaly as well!

  • Add relevant subdims to overall anomalies and toggle for subdim level anomalies #1000 #1010 #1012

Reasons for change in anomaly alerts

Sub-dimensional anomalies

:chains: Adding DataBricks & Athena as Data Connectors

To make Chaos Genius operational for more environments, we’ve added two new data connectors - AWS Athena & Databricks - to our roster.

  • Databricks Datasource Support #1001
  • feat(connectors): added AWS Athena Connector #964

:bug: Bug fixes

  • Fix/pyarrow dep coflict #1008
  • Fixes the bug where having strings with quotes in your data would break it #1005
  • Added condition for comparing KPI search texts #1003
  • Fix empty event alerts tab in alerts dashboard #1000
  • Dashboard link in alert does not filter by the specific alert #1000
  • Alerts and KPIs are not linked in the alerts dashboard #1000
  • fix: error handling for fetching the table info in case of permission issue #998
  • fix(edit-kpi): kpi validation triggered on edit kpi #996
  • Fix breadcrumbs in KPI screen #993
  • Fixes #991 by checking for validation success before running checks for tz-aware data #992

v0.8.0

1 year ago

Release Notes for Chaos Genius 0.8.0

Hello everyone :wave:, we’re announcing the release of Chaos Genius v0.8.0 today :tada:

We’ve focused on enriching user experience and fixing critical bugs. We briefly cover the main features & fixes here.

:rotating_light: Enhancements to our Alerts Module - More Insights, Less Noise

To make alerts more insightful, we now identify KPI behavior such as :arrow_up: Spike, :arrow_down_small: Drop and :record_button: Anomalous occurrences. Furthermore we’ve reduced noise in alert messages by switching off sub-dimensional alerts by default. However they will still be available via the Alerts Dashboard. We are currently working on making Sub-Dimensional Alerts more relevant and insightful for the users in the upcoming releases, so stay tuned!

  • Alerts Revamp #974
  • Alert Fixes Post Revamp #982
  • Added Disclaimer for Consolidated Alerts Reports #971, #986

:monocle_face: Increased focus on Anomaly Detection

After listening to all our users feedback we have decided to shift our focus and deepen our offering on Anomaly Detection. On that end, we have made the DeepDrills section optional. This feature will now be disabled by default. Users have the option to enable it, if they want. Please refer to our docs on how to enable DeepDrills.

  • DeepDrills Decoupling and Restructuring #967
  • Decouple DeepDrills Frontend #970
  • Added DEEPDRILLS_ENABLED in global config API endpoint #972

:sparkle: Druid Features Addition

The name for Count column for Druid KPIs is now configurable as part of KPI definition. This column can also be proxy for count for the rolled-up data i.e not representing the actual count. We support Druid for rollup scenarios only.

  • Druid Configurable Count Column #985
  • Druid Feature Changes Frontend #981, #983, #984

:bug: Bug fixes

  • Cannot define KPI with tables which contain ‘-’ in the table name. #793, #980
  • Fix bug in edit KPI for NoneType #977
  • A Druid data source does not need to be named “Druid”. #985
  • Fix typo in controller.py #968
  • Editing a KPI gets it removed from every dashboard except ‘All’. #965, #966
  • Fix type errors and lint issues #979

v0.7.0

2 years ago

Release Notes for Chaos Genius 0.7.0

Hello everyone, our 0.7.0 release takes care of usability & experience issues our users found operating Chaos Genius at scale. We briefly cover the main features & fixes here.

🎊 Pagination to manage 1000s of KPIs

Some of you who are monitoring a large number of KPIs found degraded performance in Home Screen, KPI Explorer, Data Sources & Alerts. We have now added pagination & server-side search so Chaos Genius UI scale wells for 1000s of KPIs. Thank you, Klub team for bringing this to our attention!

image image
  • added paginate param for data sources dropdown in event alert #953
  • added paginate params for the KPI list and changed font size in pagination #946
  • pagination frontend #935
  • added fixes for filtering of url #949

🐛 Robust KPI Editing

We now have more robust editing for KPIs, including support for removing all subdimensions during the edit process. Thanks, @GRANTOSMO & Athul!

  • [BUG] Task Failure Alert for KPIs with Empty Sub-dimension List After Editing #938
  • fix(editkpi): issue of subdim anomaly failure after removal of all subdims #939

🛡️ Vulnerabilities Fixes

Like with other releases we make sure that there are no vulnerability or security issues in any libraries or dependencies Chaos Genius uses. We have also made CORS disabled by default - and it is configurable (thanks, @rsohlot).

  • vulnerabilities fixed on dependencies #944
  • bump cryptography from 3.4.8 to 37.0.1 #942
  • configurable cors #791

v0.6.0

2 years ago

Release Notes for Chaos Genius 0.6.0

Hey everyone, we have tackled a bunch of key issues in our 0.6.0 release based on your feedback. Here are a brief notes around them.

  1. What's New
  2. Bug fixes

:tada: What's New

Optimized Metadata Loader

Many of you encountered metadata related issues with connecting to cloud data warehouses with large number of data assets. This was likely because of the amount of metadata that was being live fetched - to tackle this better we now have a system that asynchronously loads all the metadata from your datastore making this much more seamless. We have battle tested its performance with datawarehouses with over 100K tables.

Thank you, @danielefrigo, @ankneo, @sparshgupta & the klub team!

Configurable KPI Settings

You no longer need to delete & recreate a KPI to make any major changes to them. Now we support editing KPI configuration as well as anomaly settings.

Thank you, @fampay-tech, @KShivendu, @gxu-kangaroo!

Expanded TZ Support

If you are dealing with different timezones for database & reporting, this feature is relevant. We now support timezone aware columns as well as provide data transformations when your database timezone & reporting timezone differs. As a config while creating data sources, we will now ask you to select your DB timezone - as a default it is UTC.

cc: @fampay-tech, @KShivendu

Analytics Download

Many of our users have asked the option to download our analytics data for downstream ad-hoc analytics. We now support data downloads for panel metrics, DeepDrills as well as anomaly detection. Just click on the blue download button for the respective analytics report to be downloaded as a CSV.

We thank the Klub team for bringing this feature request.

:construction_worker: Other enhancements

We have also worked to improve some other features including:

  • Made hourly alerting more robust to missing data
  • Streamlined alert schedulers to be more fault tolerant
  • Fixed an issue in tabular KPI support for third party data.

:bug: Bug fixes

v0.5.2

2 years ago

🎉 Release Notes for Chaos Genius 0.5.2

We are doing a hotfix release to tackle an issue.

Bug Fixes 🐛

To upgrade your CG instance, follow the commands here.