v2.0.0

@broken released this 08 Nov 20:22
· 2 commits to 2.0 since this release

Major Updates

  • Added a regex_split op.
  • Fixed a bug in the case_fold_utf8 and normalize_utf8 ops where they were unable to locate the ICU data file.
  • Fixed a problem where the BertTokenizer used merge_dims, which had not yet been released in the corresponding version of TensorFlow.
  • Updated the BertTokenizer to use regex_split so that it matches the exact regex used by the original BERT implementation.
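
For readers unfamiliar with the idea behind a regex-based split, here is a minimal plain-Python sketch of splitting on a delimiter pattern while keeping some of the delimiters as tokens. It does not call the library: the helper name split_keep_delims and both example patterns are hypothetical, and the patterns are not the regex used by BERT.

```python
import re


def split_keep_delims(text, delim_pattern, keep_pattern):
    """Split text on delim_pattern, keeping delimiters that match keep_pattern.

    Hypothetical helper for illustration only; it is not the regex_split op.
    """
    pieces = []
    pos = 0
    for match in re.finditer(delim_pattern, text):
        # Emit the text between the previous delimiter and this one.
        if match.start() > pos:
            pieces.append(text[pos:match.start()])
        # Keep the delimiter itself only if it matches keep_pattern.
        delim = match.group()
        if delim and re.fullmatch(keep_pattern, delim):
            pieces.append(delim)
        pos = match.end()
    # Emit whatever follows the last delimiter.
    if pos < len(text):
        pieces.append(text[pos:])
    return pieces


# Assumed example patterns: split on whitespace and punctuation,
# but keep only the punctuation as separate tokens.
tokens = split_keep_delims("Hello, world!", r"\s|[,.!?]", r"[,.!?]")
print(tokens)
```

Here whitespace separates tokens and is dropped, while punctuation separates tokens and is kept, which is the general behavior a tokenizer needs from such a split.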