FIX BorderlineSMOTE-2 use the full dataset to generate new sample #1023

glemaitre · 2023-07-10T19:23:43Z

closes #861

Make sure that we use the full dataset to generate new samples in BorderlineSMOTE version 2.
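As a rough illustration of what "use the full dataset" implies, here is a minimal NumPy-only sketch. It is not the imbalanced-learn implementation: `k_nearest` is a hypothetical helper standing in for a fitted nearest-neighbours estimator, and the toy data are made up. It only shows that majority neighbours become reachable once the search is fitted on all samples rather than on the minority class alone.

```python
import numpy as np


def k_nearest(X_fit, X_query, k):
    """Hypothetical helper: indices (into X_fit) of the k nearest rows."""
    # Pairwise squared Euclidean distances, shape (n_query, n_fit).
    d2 = ((X_query[:, None, :] - X_fit[None, :, :]) ** 2).sum(axis=2)
    return np.argsort(d2, axis=1, kind="stable")[:, :k]


# Toy data (assumption): class 1 is the minority, class 0 the majority.
X = np.array([[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [0.2, 0.0], [0.3, 0.1]])
y = np.array([1, 0, 1, 0, 0])

X_danger = X[[0]]  # pretend the first minority sample is "in danger"

# Search restricted to the minority class: only minority neighbours are found.
X_min = X[y == 1]
nn_min = k_nearest(X_min, X_danger, k=2)[:, 1:]  # drop the query point itself

# Search over the full dataset: majority neighbours can now be selected.
nn_full = k_nearest(X, X_danger, k=3)[:, 1:]  # drop the query point itself
neighbour_labels = y[nn_full]
```

The `[:, 1:]` slice mirrors the diff under review: because the danger samples are part of the data being searched, the first neighbour returned is the sample itself and is discarded.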

… synthetic sample

solegalli · 2023-07-11T08:50:02Z

imblearn/over_sampling/_smote/filter.py

+
+            self.nn_k_.fit(X_to_sample_from)
+            nns = self.nn_k_.kneighbors(X_danger, return_distance=False)[:, 1:]
+            X_new, y_new = self._make_samples(


This implementation does not fully reflect the description of Borderline-SMOTE2 in the paper. The paper says that when a sample is created by interpolating between a minority template and a majority neighbour, the difference is multiplied by a random factor between 0 and 0.5 (instead of 0 to 1), so that the synthetic sample stays closer to the minority class.

If I understand this code correctly, every difference is multiplied by a factor between 0 and 1. Please correct me if I am wrong.

glemaitre · 2023-07-11

No, you are right. I forgot to look at the next page of the article. I will try to propose a fix.
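The point raised in this thread can be sketched as follows. This is only an illustration under stated assumptions, not the code that was eventually merged: `interpolate` and its `neighbour_is_minority` flag are hypothetical names, and the two points are made-up toy values. The gap is drawn from [0, 1) for a minority neighbour and from [0, 0.5) for a majority neighbour.

```python
import numpy as np


def interpolate(x, neighbour, neighbour_is_minority, rng):
    """Hypothetical helper: one synthetic point between x and a neighbour."""
    # Minority neighbour: gap in [0, 1). Majority neighbour: gap in [0, 0.5),
    # which keeps the synthetic point in the half of the segment nearest x.
    upper = 1.0 if neighbour_is_minority else 0.5
    gap = rng.uniform(0.0, upper)
    return x + gap * (neighbour - x)


rng = np.random.default_rng(0)
x = np.array([0.0, 0.0])    # minority sample in danger (toy value)
maj = np.array([2.0, 0.0])  # majority neighbour (toy value)

# With a majority neighbour, no sample crosses the midpoint of the segment.
samples = np.array([interpolate(x, maj, False, rng) for _ in range(1000)])
```

With these toy points the midpoint of the segment is at x-coordinate 1.0, so every generated sample has an x-coordinate strictly below 1.0.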

FIX make sure that BorderlineSMOTE-2 use the full dataset to generate…

af5d20a

… synthetic sample

glemaitre marked this pull request as draft July 10, 2023 19:24

glemaitre added 2 commits July 10, 2023 22:33

iter

6bd1fba

iter

bcf16cc

glemaitre marked this pull request as ready for review July 10, 2023 20:39

glemaitre added 2 commits July 10, 2023 22:50

iter

c09eb5a

iter

638b0f4

glemaitre merged commit 2859cb0 into scikit-learn-contrib:master Jul 10, 2023

solegalli reviewed Jul 11, 2023

glemaitre Jul 11, 2023