-
Notifications
You must be signed in to change notification settings - Fork 130
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
extend DummyDatasetMultipleSequenceLength #492
base: master
Are you sure you want to change the base?
Conversation
c773046
to
f704424
Compare
""" | ||
:rtype: list[str] | ||
""" | ||
return ["seq-%i" % i for i in range(self._num_seqs)] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
self._num_seqs
might be inf
. It should handle that correctly (fall back to the default behavior).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What is the default behaviour?
Does it even make sense for a dummy dataset to be initialized with num_seqs=inf
?
In GeneratingDataset
it is per default inf
but here it has to be set by the user.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What is the default behaviour?
Whatever super
of this function does. Just check the code.
Does it even make sense for a dummy dataset to be initialized with
num_seqs=inf
?
Yes it does. Why not? It's usually only used for testing. In any case, this is explicitly supported.
def get_seq_len(i): | ||
return self.seq_len['data'] | ||
|
||
self._seq_order = self.get_seq_order_for_epoch(epoch, self._num_seqs, get_seq_len=get_seq_len) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm not sure this makes sense to support for this dataset.
Also you are not using it, i.e. ignoring it (but maybe just because it's incomplete so far).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The self._seq_order
you mean?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
By support, I mean init_seq_order
in general.
By use, I mean _seq_order
.
Why is this specific for |
Please also see failing tests.
|
I don't see any reason why not to use it for |
Added
get_all_tags
,init_seq_order
andget_current_seq_order
for DummyDatasetMultipleSequenceLength