Datagenerator #39
When sequence lengths are not uniform, the categories do not all have the same number of samples. A category can then be dropped if it falls in an unfortunate position in the indices list, and the net effect is that some tests fail at random. Since this behaviour is known (incomplete batches cannot be used), it is not reasonable to fail tests when it occurs. All sequences are therefore given the same length for the time being.
When splitting samples into categories, alignment issues can, again, cause problems. However, with an evenly divisible number of samples, and with equally large samplers, the loaders consistently exhibit the correct behaviour.
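To make the failure mode above concrete, here is a minimal sketch in plain Python. It is not the project's loader code: `complete_batches` and `samples_kept_per_category` are hypothetical helpers, and the only assumption carried over from the description is that incomplete batches are discarded.

```python
from collections import Counter


def complete_batches(indices, batch_size):
    """Split indices into consecutive batches and discard a trailing incomplete one."""
    n_full = len(indices) // batch_size
    return [indices[i * batch_size:(i + 1) * batch_size] for i in range(n_full)]


def samples_kept_per_category(labels, batch_size):
    """Count, per category, the samples that survive once incomplete batches are dropped."""
    batches = complete_batches(list(range(len(labels))), batch_size)
    return Counter(labels[i] for batch in batches for i in batch)


# Uneven case: the single "c" sample sits at the tail of the indices list,
# so it lands in the incomplete batch and the whole category disappears.
uneven_labels = ["a"] * 4 + ["b"] * 4 + ["c"] * 1
kept_uneven = samples_kept_per_category(uneven_labels, batch_size=4)

# Even case: the number of samples is divisible by the batch size,
# so every category is fully represented.
even_labels = ["a"] * 4 + ["b"] * 4 + ["c"] * 4
kept_even = samples_kept_per_category(even_labels, batch_size=4)
```

In the uneven case the category `"c"` is missing from `kept_uneven`, which is the kind of result that would make a per-category test fail depending on where the short category happens to sit.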
I'm not done with this branch yet. I haven't sent a PR or a review request. Thanks for the updates anyway.
@Saran-nns I created this new follow-up issue. Since I don't know what the weighted random sample loaders are supposed to do, I'd be grateful for a unit test that helps clarify things.
Previously, stride was an argument provided to the data generator. It lets the user select a denser sampling of the dataset, which is appropriate for category-wise sampling. The argument is now exposed to the end user.
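As an illustration of how a stride controls sampling density, the sketch below computes the start offsets of fixed-length windows over a series. The function `window_starts` and its parameters are hypothetical and are not taken from the project's data generator; it only assumes the usual sliding-window meaning of "stride".

```python
def window_starts(n_points, seq_len, stride):
    """Return the start index of every full window of length seq_len, spaced by stride."""
    if seq_len <= 0 or stride <= 0:
        raise ValueError("seq_len and stride must be positive")
    # The last valid start is n_points - seq_len; shorter series yield no windows.
    return list(range(0, n_points - seq_len + 1, stride))


# A stride equal to the window length gives non-overlapping windows.
sparse = window_starts(n_points=10, seq_len=4, stride=4)

# A smaller stride gives overlapping windows, i.e. a denser sampling.
dense = window_starts(n_points=10, seq_len=4, stride=1)
```

With these inputs the smaller stride yields more windows from the same series, which is what makes it useful when a category has few samples.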
Codecov Report
@@            Coverage Diff             @@
##           master      #39      +/-   ##
==========================================
+ Coverage   65.17%   72.40%   +7.22%
==========================================
  Files          11       25      +14
  Lines        1674     2551     +877
==========================================
+ Hits         1091     1847     +756
- Misses        583      704     +121
This PR also resolves #40.
LGTM