COVID-19 Vulnerability Index
Fixes #21, #25, #26, #28 - handle cases where there is no inpatient data. Also fixed some install issues related to having different versions of various dependencies.
Fixes #24
Fixes #21 - import error when using the latest version of Pandas
This release is a significant update
This release incorporates a new model that is now appropriate for adults ages 18 and over. This model was trained on a combination of the original CMS Medicare data along with additional data provided by HealthFirst.
The original Medicare only 'xgboost' model is still available by adding a -m xgboost
option to cv19index. However, even for Medicare populations we recommend moving to the new xgboost_all_ages
model. This model is now the default.
Many thanks to HealthFirst for being one of the first users of the model and for allowing us to use their data in order to create a model for all ages.
We have simplified the library to take in just 2 files. A demographics file and a claims files. These need to have a few key columns, which is outlined in cv19index/resources/xgboost/demographics.schema.json and cv19index/resources/xgboost/claims.schema.json. The column names must match and are case sensitive. These file can have other columns. The core goal is for a simple dump of these two datasets (demographics and claims) to be a basis for building the model with minimal changes.
Converted the model name from model_medium to xgboost and added support for SageMaker