Prepare a script to convert results of different models to the unified format #14

Open
dchaplinsky opened this issue Jul 1, 2021 · 3 comments

Comments

@dchaplinsky
Contributor

As discussed previously, our current idea is to train several different models and then build an ensemble model from them. The ensemble will be used to pre-annotate new texts (which an editor will proofread later) so that we can expand the corpus even further.

To implement that, we need a Python CLI script that converts the results of the different models back to BRAT format. The next stage is merging :)
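To make the target concrete, here is a minimal sketch of the conversion step, assuming each model's output can first be reduced to `(start, end, label)` character spans over the original text. The function name `spans_to_brat` and the span tuple layout are hypothetical and not taken from any existing code in this repo. It only emits BRAT text-bound (`T`) annotations and assumes no span crosses a line break.

```python
from typing import List, Tuple

# Hypothetical intermediate representation: (start, end, label),
# where start/end are character offsets into the source text.
Span = Tuple[int, int, str]


def spans_to_brat(text: str, spans: List[Span]) -> str:
    """Render entity spans as the contents of a BRAT .ann file.

    Each text-bound annotation has the form:
        T<id> TAB <label> SPACE <start> SPACE <end> TAB <covered text>
    """
    lines = []
    # Sort by offset so annotation IDs follow reading order.
    for idx, (start, end, label) in enumerate(sorted(spans), start=1):
        mention = text[start:end]
        lines.append(f"T{idx}\t{label} {start} {end}\t{mention}")
    # End with a newline when there is at least one annotation.
    return "\n".join(lines) + ("\n" if lines else "")


example_text = "Taras Shevchenko was born in Moryntsi."
example_spans = [(0, 16, "PERS"), (29, 37, "LOC")]
print(spans_to_brat(example_text, example_spans), end="")
```

A CLI wrapper (for example with `argparse`) would then only need to read a model's output, map it to spans, and write the returned string next to the corresponding `.txt` file.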

@gawy
Collaborator

gawy commented Jul 28, 2021

For model evaluation purposes, the results were already converted to IOB format, so it might be easier to use IOB as the universal format.
@dchaplinsky, what do you think?
Apart from merging model results, what other uses do you foresee?

The only downside of IOB is that it is not a sparse format (every token carries a tag, including those outside any entity), so it will take more space, but that does not seem to be a big deal.
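To illustrate the dense-versus-sparse point, here is a rough sketch of collapsing per-token IOB tags into entity spans. It assumes tokens come with character offsets and that tags look like `B-PERS`, `I-PERS`, `O`. The function name `iob_to_spans` and the input layout are hypothetical.

```python
from typing import List, Tuple


def iob_to_spans(
    offsets: List[Tuple[int, int]], tags: List[str]
) -> List[Tuple[int, int, str]]:
    """Collapse one IOB tag per token into (start, end, label) spans."""
    spans = []
    current = None  # [start, end, label] of the entity being built

    for (start, end), tag in zip(offsets, tags):
        prefix, _, label = tag.partition("-")
        # "B-" always opens an entity; a stray "I-" (no open entity, or a
        # different label) is treated as an opening tag as well.
        starts_new = prefix == "B" or (
            prefix == "I" and (current is None or current[2] != label)
        )
        if starts_new:
            if current:
                spans.append(tuple(current))
            current = [start, end, label]
        elif prefix == "I":
            current[1] = end  # extend the open entity
        else:  # "O" closes any open entity
            if current:
                spans.append(tuple(current))
            current = None

    if current:
        spans.append(tuple(current))
    return spans


# Tokens of "Taras Shevchenko was born in Moryntsi" as character offsets.
offsets = [(0, 5), (6, 16), (17, 20), (21, 25), (26, 28), (29, 37)]
tags = ["B-PERS", "I-PERS", "O", "O", "O", "B-LOC"]
print(iob_to_spans(offsets, tags))
```

In this example the IOB side stores six tags while the span side stores two entries, which is the space overhead mentioned above. Treating a stray `I-` tag as the start of an entity is a design choice made for this sketch; a stricter converter could reject such input instead.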

@dchaplinsky
Contributor Author

Great idea, works for me.

@dchaplinsky
Contributor Author

The only thing I'd like to check is whether BRAT supports IOB (I don't remember, to be honest).


2 participants