Skip to content

Commit

Permalink
PR #17772: [keras/datasets/reuters.py] Standardise docstring usage of…
Browse files Browse the repository at this point in the history
… "Default to"

Imported from GitHub PR #17772

This is one of many PRs. Discussion + request to split into multiple PRs @ #17748
Copybara import of the project:

--
bd352a9 by Samuel Marks <[email protected]>:

[keras/datasets/reuters.py] Standardise docstring usage of "Defaults to"

Merging this change closes #17772

FUTURE_COPYBARA_INTEGRATE_REVIEW=#17772 from SamuelMarks:keras.datasets.reuters-defaults-to bd352a9
PiperOrigin-RevId: 522176647
  • Loading branch information
tensorflower-gardener committed Apr 5, 2023
1 parent ecb4f98 commit ac38342
Showing 1 changed file with 8 additions and 8 deletions.
16 changes: 8 additions & 8 deletions keras/datasets/reuters.py
Original file line number Diff line number Diff line change
Expand Up @@ -65,20 +65,20 @@ def load_data(
ranked by how often they occur (in the training set) and only
the `num_words` most frequent words are kept. Any less frequent word
will appear as `oov_char` value in the sequence data. If None,
all words are kept. Defaults to None, so all words are kept.
all words are kept. Defaults to `None`.
skip_top: skip the top N most frequently occurring words
(which may not be informative). These words will appear as
`oov_char` value in the dataset. Defaults to 0, so no words are
skipped.
`oov_char` value in the dataset. 0 means no words are
skipped. Defaults to 0
maxlen: int or None. Maximum sequence length.
Any longer sequence will be truncated. Defaults to None, which
means no truncation.
Any longer sequence will be truncated. None means no truncation.
Defaults to `None`.
test_split: Float between 0 and 1. Fraction of the dataset to be used
as test data. Defaults to 0.2, meaning 20% of the dataset is used as
test data.
as test data. 0.2 means that 20% of the dataset is used as
test data. Defaults to 0.2
seed: int. Seed for reproducible data shuffling.
start_char: int. The start of a sequence will be marked with this
character. Defaults to 1 because 0 is usually the padding character.
character. 0 is usually the padding character. Defaults to `1`.
oov_char: int. The out-of-vocabulary character.
Words that were cut out because of the `num_words` or
`skip_top` limits will be replaced with this character.
Expand Down

0 comments on commit ac38342

Please sign in to comment.