Skip to content

Commit

Permalink
Added documentation for WikiCorpus metadata. Fix #1161 (#1163)
Browse files Browse the repository at this point in the history
  • Loading branch information
kirit93 authored and tmylk committed Feb 24, 2017
1 parent c971411 commit 5571e99
Show file tree
Hide file tree
Showing 2 changed files with 2 additions and 0 deletions.
1 change: 1 addition & 0 deletions gensim/corpora/indexedcorpus.py
Original file line number Diff line number Diff line change
Expand Up @@ -69,6 +69,7 @@ def serialize(serializer, fname, corpus, id2word=None, index_fname=None, progres
each saved document,
* the `docbyoffset(offset)` method, which returns a document
positioned at `offset` bytes within the persistent storage (file).
* metadata if set to true will ensure that serialize will write out article titles to a pickle file.
Example:
Expand Down
1 change: 1 addition & 0 deletions gensim/corpora/wikicorpus.py
Original file line number Diff line number Diff line change
Expand Up @@ -266,6 +266,7 @@ def __init__(self, fname, processes=None, lemmatize=utils.has_pattern(), diction
If `pattern` package is installed, use fancier shallow parsing to get
token lemmas. Otherwise, use simple regexp tokenization. You can override
this automatic logic by forcing the `lemmatize` parameter explicitly.
self.metadata if set to true will ensure that serialize will write out article titles to a pickle file.
"""
self.fname = fname
Expand Down

0 comments on commit 5571e99

Please sign in to comment.