Skip to content

Commit

Permalink
new pass rates
Browse files Browse the repository at this point in the history
  • Loading branch information
pavel-esir committed Dec 18, 2024
1 parent d0b2764 commit b92b864
Show file tree
Hide file tree
Showing 3 changed files with 4,521 additions and 4,521 deletions.
42 changes: 21 additions & 21 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -512,17 +512,17 @@ This report is autogenerated and includes tokenizers and detokenizers tests. The
<tbody>
<tr>
<td >BPE</td>
<td >97.18</td>
<td >96.96</td>
<td >4544</td>
</tr>
<tr>
<td >SentencePiece</td>
<td >89.19</td>
<td >87.29</td>
<td >6633</td>
</tr>
<tr>
<td >Tiktoken</td>
<td >96.56</td>
<td >97.33</td>
<td >524</td>
</tr>
<tr>
Expand Down Expand Up @@ -620,7 +620,7 @@ This report is autogenerated and includes tokenizers and detokenizers tests. The
<tr>
<td >BPE</td>
<td >laion/CLIP-ViT-bigG-14-laion2B-39B-b160k</td>
<td >100.00</td>
<td >96.17</td>
<td >261</td>
</tr>
<tr>
Expand Down Expand Up @@ -656,19 +656,19 @@ This report is autogenerated and includes tokenizers and detokenizers tests. The
<tr>
<td >SentencePiece</td>
<td >NousResearch/Llama-2-13b-hf</td>
<td >97.55</td>
<td >100.00</td>
<td >245</td>
</tr>
<tr>
<td >SentencePiece</td>
<td >NousResearch/Llama-2-13b-hf_legacy_sp_backend</td>
<td >97.55</td>
<td >99.18</td>
<td >245</td>
</tr>
<tr>
<td >SentencePiece</td>
<td >NousResearch/Llama-2-13b-hf_sp_backend</td>
<td >94.29</td>
<td >100.00</td>
<td >245</td>
</tr>
<tr>
Expand Down Expand Up @@ -698,25 +698,25 @@ This report is autogenerated and includes tokenizers and detokenizers tests. The
<tr>
<td >SentencePiece</td>
<td >camembert-base_legacy_sp_backend</td>
<td >75.51</td>
<td >70.61</td>
<td >245</td>
</tr>
<tr>
<td >SentencePiece</td>
<td >camembert-base_sp_backend</td>
<td >52.24</td>
<td >47.35</td>
<td >245</td>
</tr>
<tr>
<td >SentencePiece</td>
<td >facebook/musicgen-small_legacy_sp_backend</td>
<td >78.37</td>
<td >73.47</td>
<td >245</td>
</tr>
<tr>
<td >SentencePiece</td>
<td >facebook/musicgen-small_sp_backend</td>
<td >83.67</td>
<td >78.78</td>
<td >245</td>
</tr>
<tr>
Expand All @@ -740,13 +740,13 @@ This report is autogenerated and includes tokenizers and detokenizers tests. The
<tr>
<td >SentencePiece</td>
<td >microsoft/deberta-v3-base_legacy_sp_backend</td>
<td >100.00</td>
<td >95.10</td>
<td >245</td>
</tr>
<tr>
<td >SentencePiece</td>
<td >microsoft/deberta-v3-base_sp_backend</td>
<td >96.73</td>
<td >91.84</td>
<td >245</td>
</tr>
<tr>
Expand Down Expand Up @@ -776,43 +776,43 @@ This report is autogenerated and includes tokenizers and detokenizers tests. The
<tr>
<td >SentencePiece</td>
<td >rinna/bilingual-gpt-neox-4b_sp_backend</td>
<td >80.41</td>
<td >77.96</td>
<td >245</td>
</tr>
<tr>
<td >SentencePiece</td>
<td >t5-base_legacy_sp_backend</td>
<td >80.00</td>
<td >75.10</td>
<td >245</td>
</tr>
<tr>
<td >SentencePiece</td>
<td >t5-base_sp_backend</td>
<td >85.31</td>
<td >80.41</td>
<td >245</td>
</tr>
<tr>
<td >SentencePiece</td>
<td >xlm-roberta-base_legacy_sp_backend</td>
<td >95.10</td>
<td >90.20</td>
<td >245</td>
</tr>
<tr>
<td >SentencePiece</td>
<td >xlm-roberta-base_sp_backend</td>
<td >95.10</td>
<td >90.20</td>
<td >245</td>
</tr>
<tr>
<td >SentencePiece</td>
<td >xlnet-base-cased_legacy_sp_backend</td>
<td >57.96</td>
<td >53.06</td>
<td >245</td>
</tr>
<tr>
<td >SentencePiece</td>
<td >xlnet-base-cased_sp_backend</td>
<td >64.49</td>
<td >59.59</td>
<td >245</td>
</tr>
<tr>
Expand All @@ -824,7 +824,7 @@ This report is autogenerated and includes tokenizers and detokenizers tests. The
<tr>
<td >Tiktoken</td>
<td >THUDM/glm-4-9b-chat</td>
<td >93.16</td>
<td >94.68</td>
<td >263</td>
</tr>
<tr>
Expand Down
2 changes: 1 addition & 1 deletion tests/pass_rates.json
Original file line number Diff line number Diff line change
@@ -1,3 +1,3 @@
{
"tests/tokenizers_test.py::test_": 0.9297414485305926
"tests/tokenizers_test.py::test_": 0.9319897221776137
}
Loading

0 comments on commit b92b864

Please sign in to comment.