Version 0.2#
January 1, 2017
Changelog#
Bug fixes#
Fixed a bug in
under_sampling.NearMisswhich was not picking the right samples during under sampling for the method 3. By Guillaume Lemaitre.Fixed a bug in
ensemble.EasyEnsemble, correction of therandom_stategeneration. By Guillaume Lemaitre and Christos Aridas.Fixed a bug in
under_sampling.RepeatedEditedNearestNeighbours, add additional stopping criterion to avoid that the minority class become a majority class or that a class disappear. By Guillaume Lemaitre.Fixed a bug in
under_sampling.AllKNN, add stopping criteria to avoid that the minority class become a majority class or that a class disappear. By Guillaume Lemaitre.Fixed a bug in
under_sampling.CondensedNeareastNeigbour, correction of the list of indices returned. By Guillaume Lemaitre.Fixed a bug in
ensemble.BalanceCascade, solve the issue to obtain a single array if desired. By Guillaume Lemaitre.Fixed a bug in
pipeline.Pipeline, solve to embedPipelinein otherPipeline. #231 by Christos Aridas.Fixed a bug in
pipeline.Pipeline, solve the issue to put to sampler in the samePipeline. #188 by Christos Aridas.Fixed a bug in
under_sampling.CondensedNeareastNeigbour, correction of the shape ofsel_xwhen only one sample is selected. By Aliaksei Halachkin.Fixed a bug in
under_sampling.NeighbourhoodCleaningRule, selecting neighbours instead of minority class misclassified samples. #230 by Aleksandr Loskutov.Fixed a bug in
over_sampling.ADASYN, correction of the creation of a new sample so that the new sample lies between the minority sample and the nearest neighbour. #235 by Rafael Wampfler.
New features#
Added AllKNN under sampling technique. By Dayvid Oliveira.
Added a module
metricsimplementing some specific scoring function for the problem of balancing. #204 by Guillaume Lemaitre and Christos Aridas.
Enhancement#
Added support for bumpversion. By Guillaume Lemaitre.
Validate the type of target in binary samplers. A warning is raised for the moment. By Guillaume Lemaitre and Christos Aridas.
Change from
cross_validationmodule tomodel_selectionmodule forsklearndeprecation cycle. By Dayvid Oliveira and Christos Aridas.
API changes summary#
size_nghhas been deprecated incombine.SMOTEENN. Usen_neighborsinstead. By Guillaume Lemaitre, Christos Aridas, and Dayvid Oliveira.size_nghhas been deprecated inunder_sampling.EditedNearestNeighbors. Usen_neighborsinstead. By Guillaume Lemaitre, Christos Aridas, and Dayvid Oliveira.size_nghhas been deprecated inunder_sampling.CondensedNeareastNeigbour. Usen_neighborsinstead. By Guillaume Lemaitre, Christos Aridas, and Dayvid Oliveira.size_nghhas been deprecated inunder_sampling.OneSidedSelection. Usen_neighborsinstead. By Guillaume Lemaitre, Christos Aridas, and Dayvid Oliveira.size_nghhas been deprecated inunder_sampling.NeighbourhoodCleaningRule. Usen_neighborsinstead. By Guillaume Lemaitre, Christos Aridas, and Dayvid Oliveira.size_nghhas been deprecated inunder_sampling.RepeatedEditedNearestNeighbours. Usen_neighborsinstead. By Guillaume Lemaitre, Christos Aridas, and Dayvid Oliveira.size_nghhas been deprecated inunder_sampling.AllKNN. Usen_neighborsinstead. By Guillaume Lemaitre, Christos Aridas, and Dayvid Oliveira.Two base classes
BaseBinaryclassSamplerandBaseMulticlassSamplerhave been created to handle the target type and raise warning in case of abnormality. By Guillaume Lemaitre and Christos Aridas.Move
random_stateto be assigned in theSamplerMixininitialization. By Guillaume Lemaitre.Provide estimators instead of parameters in
combine.SMOTEENNandcombine.SMOTETomek. Therefore, the list of parameters have been deprecated. By Guillaume Lemaitre and Christos Aridas.khas been deprecated inover_sampling.ADASYN. Usen_neighborsinstead. #183 by Guillaume Lemaitre.kandmhave been deprecated inover_sampling.SMOTE. Usek_neighborsandm_neighborsinstead. #182 by Guillaume Lemaitre.n_neighborsacceptKNeighborsMixinbased object forunder_sampling.EditedNearestNeighbors,under_sampling.CondensedNeareastNeigbour,under_sampling.NeighbourhoodCleaningRule,under_sampling.RepeatedEditedNearestNeighbours, andunder_sampling.AllKNN. #109 by Guillaume Lemaitre.
Documentation changes#
Replace some remaining
UnbalancedDatasetoccurences. By Francois Magimel.Added doctest in the documentation. By Guillaume Lemaitre.