[SPARK-9478] [ml] Add class weights to Random Forest #13851
…eded(getCalculator & fromString method)
…he predictions are correct with Impurity=WeightedGini
… ProbClassifier, but changing the definition of Impurity and DecTreClassifier
…Strategy 3.make classweights can be passed when reconstructing the tree, including json read/write
… and reverting getOldStrategy to old versions
Can one of the admins verify this patch?
@n-triple-a Could you please see my comment and provide your feedback on the JIRA? Also, to promote clear communication and reduce duplicated effort, it is common to comment on the JIRA that you intend to work on a task before work begins, especially on JIRAs like this one, which others have already worked on and which require a significant amount of work. Thanks!
Hi @n-triple-a, is this still active?
What changes were proposed in this pull request?
This PR adds class weight support to Random Forest (and also Decision Tree) classifiers. Class weights are useful for handling unbalanced data in classification problems.
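To illustrate why class weights help with unbalanced data, here is a minimal sketch of a Gini impurity computed over weight-scaled class counts. It is not the code from this PR: the function name `weighted_gini`, its parameters, and the example counts are hypothetical, and the scheme shown (multiplying each class count by its weight before normalizing) is only one plausible reading of a weighted Gini.

```python
from typing import Sequence


def weighted_gini(counts: Sequence[float], class_weights: Sequence[float]) -> float:
    """Gini impurity with each class count scaled by its class weight.

    Hypothetical helper for illustration only; not Spark's API.
    """
    if len(counts) != len(class_weights):
        raise ValueError("counts and class_weights must have the same length")
    # Scale the raw per-class counts by the per-class weights.
    weighted = [c * w for c, w in zip(counts, class_weights)]
    total = sum(weighted)
    if total == 0:
        return 0.0  # An empty node is treated as pure.
    # Standard Gini formula applied to the weighted class proportions.
    return 1.0 - sum((x / total) ** 2 for x in weighted)


# Assumed example node: 90 samples of class 0 and 10 samples of class 1.
counts = [90, 10]
unweighted = weighted_gini(counts, [1.0, 1.0])
balanced = weighted_gini(counts, [1.0, 9.0])
```

With uniform weights the node looks fairly pure because the majority class dominates. Up-weighting the minority class by 9 makes the two classes contribute equally, so the same node is scored as maximally impure and a split that isolates the minority class becomes more attractive.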
How was this patch tested?
Added a unit test in `DecisionTreeClassifierSuite`. Manual tests were also run locally on an unbalanced dataset.