-
Notifications
You must be signed in to change notification settings - Fork 3.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[R-package] ensure use of interaction_constraints does not lead to features being ignored #6377
[R-package] ensure use of interaction_constraints does not lead to features being ignored #6377
Conversation
Getting a failing unit test:
I will have a look at it next week (afk). |
The test used incomplete interaction constraints. Since the new functionality will add missing features to the list of interaction constraint vectors, the test failed. Now, the test uses completely specified constraints.
… are added as own group.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks! Left a few minor comments.
@
me if you need help with the failing tests. Let's also please get confirmation that excluding omitted features from training wasn't intentional (#6376 (comment)).
Co-authored-by: James Lamb <[email protected]>
Hey @mayer79 , across your recent PRs I've seen multiple "fix linting" types of commits. Totally fine to keep using Continuous Integration to get that feedback (we don't have a lot of activity going on in the repo right now), but you'd probably find it faster to run the linting locally. It only requires R and the Rscript .ci/lint_r_code.R $(pwd)/R-package |
Thanks, this is the stuff that I should have asked long time ago, but never did :-). |
Pipeline seems happy @jameslamb - but really no pressure :-) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks very much for this! And I really appreciate you keeping it up to date with master
while waiting for reviews.
I just left one suggestion about covering this a bit more thoroughly with tests. If you don't have time in the next week to get to that, let me know if you're open to me writing that test and pushing it here...I'm sorry for the time pressure (especially after not reviewing this for so long 😬 ), but I'm going to try to push for v4.4.0 to get out in the next week, ahead of the numpy
2.0 release (#6439 (comment)), and I'd love to get this change into it if we can.
Thanks so much for the review! I will try to improve the tests this week. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Changes look great to me, thanks very much @mayer79 !
This will get into v4.4.0 😁
It seems that CI here is suffering from this issue: tox-dev/filelock#337 That issue was just discovered earlier today, and fixes for it are still rolling out. To keep making progress towards the release, I've just temporarily commented out running |
This enhances the R-API of interaction constraints by adding a feature group with those features that do not appear in any of the interaction groups. Currently, these are simply dropped from training, which seems undesirable.
Additionally, it reorganizes the code of the corresponding helper function
.check_interaction_constraints()
.It solves the R-package part of #6376. I will attempt a separate PR for the Python-package.
Example
Without the PR, the result is
i.e., the last two features are silently dropped from the training.