Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Ported master fixes #64

Merged
merged 3 commits into from
Mar 13, 2024
Merged

Conversation

mryzhov
Copy link
Collaborator

@mryzhov mryzhov commented Mar 7, 2024

No description provided.

@mryzhov mryzhov requested a review from apaniukov March 7, 2024 12:49
@mryzhov mryzhov self-assigned this Mar 7, 2024
New special tokens was added to ChatGLM repository, that causes Sentencepiece to crash during decoding  because of indices was not added to the main vocabulary (these tokens was not marked as special in the repository and were filtered out because of it). Include tokens to a vocab and also align vocab sizes better.
Has to lower pass rate, because of ChatGLM3 decoder inserts spaces between special tokens and Sentencepiece does not. No functional difference between actual texts.
@mryzhov mryzhov enabled auto-merge (squash) March 13, 2024 11:23
@mryzhov mryzhov merged commit 976257b into openvinotoolkit:releases/2023/3 Mar 13, 2024
8 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants