A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
Fix a bug in classification_report_imbalanced where the parameter target_names
was not taken into account when output_dict=True
. #989 by AYY7.
SMOTENC now handles mix types of data type such as bool and pd.CategoricalDtype
by delegating the conversion to scikit-learn encoder. #1002 by Guillaume Lemaitre.
Handle sparse matrices in SMOTEN and raise a warning since it requires a conversion to dense matrices. #1003 by Guillaume Lemaitre.
Remove spurious warning raised when minority class get over-sampled more than the number of sample in the majority class. #1007 by Guillaume Lemaitre.
The fitted attribute ohe_
in SMOTENC is deprecated and will be removed in version 0.13. Use categorical_encoder_
instead. #1000 by Guillaume Lemaitre.
The default of the parameters sampling_strategy
and replacement will change in BalancedRandomForestClassifier to follow the implementation of the original paper. This changes will take effect in version 0.13. #1006 by Guillaume Lemaitre.
SMOTENC now accepts a parameter categorical_encoder
allowing to specify a OneHotEncoder
with custom parameters. #1000 by Guillaume Lemaitre.
SMOTEN now accepts a parameter categorical_encoder
allowing to specify a OrdinalEncoder
with custom parameters. A new fitted parameter categorical_encoder_
is exposed to access the fitted encoder. #1001 by Guillaume Lemaitre.
RandomUnderSampler and RandomOverSampler (when shrinkage
is not None
) now accept any data types and will not attempt any data conversion. #1004 by Guillaume Lemaitre.
SMOTENC now support passing array-like of str
when passing the categorical_features
parameter. #1008 by :userGuillaume Lemaitre <glemaitre>
.
SMOTENC now support automatic categorical inference when categorical_features
is set to "auto"
. #1009 by :userGuillaume Lemaitre <glemaitre>
.
python -OO
that replaces doc by None. #953 bu Guillaume Lemaitre.feature_names_in_
as well as get_feature_names_out
for all samplers. #959 by Guillaume Lemaitre.n_jobs
has been deprecated from the classes ADASYN, BorderlineSMOTE, SMOTE, SMOTENC, SMOTEN, and SVMSMOTE. Instead, pass a nearest neighbors estimator where n_jobs is set. #887 by Guillaume Lemaitre.base_estimator
is deprecated and will be removed in version 0.12. It is impacted the following classes: BalancedBaggingClassifier, EasyEnsembleClassifier, RUSBoostClassifier. #946 by Guillaume Lemaitre.Compatibility with scikit-learn 1.1.0
Compatibility with scikit-learn 1.0.2
February 18, 2021
imblearn.metrics.macro_averaged_mean_absolute_error
returning the average across class of the MAE. This metric is used in ordinal classification. #780 by Aurélien Massiot.imblearn.metrics.pairwise.ValueDifferenceMetric
to compute pairwise distances between samples containing only categorical values. #796 by Guillaume Lemaitre.imblearn.over_sampling.SMOTEN
to over-sample data only containing categorical features. #802 by Guillaume Lemaitre.imblearn.ensemble.BalancedBaggingClassifier
unlocking the implementation of methods based on resampled bagging. #808 by Guillaume Lemaitre.output_dict
in imblearn.metrics.classification_report_imbalanced
to return a dictionary instead of a string. #770 by Guillaume Lemaitre.imblearn.under_sampling.ClusterCentroids
where voting="hard"
could have lead to select a sample from any class instead of the targeted class. #769 by Guillaume Lemaitre.imblearn.FunctionSampler
where validation was performed even with validate=False
when calling fit
. #790 by Guillaume Lemaitre.extras_require
within the setup.py
file. #816 by Guillaume Lemaitre.pydata-sphinx-theme
. #801 by Guillaume Lemaitre.imblearn.utils.testing.warns
is deprecated in 0.8 and will be removed 1.0. #815 by Guillaume Lemaitre.