Prepare a script to convert results of different models to the unified format #14

dchaplinsky · 2021-07-01T12:28:09Z

As it was discussed previously, our current idea is to train different models and then made an ensemble model which will be used to pre-annotate new texts (which will be proofread by the editor later) to expand the corpus even more.

To implement that we need a CLI python script which will convert results of different models back to BRAT format. Next stage is merging :)

gawy · 2021-07-28T07:47:19Z

For model evaluation purposes results were converted to IOB format so it might be easier to use IOB as a universal format.
@dchaplinsky what do you think
Except for merging model results what intended usages do you foresee?

the only downside of IOB is that it is not a sparse format and so will take more space, but it does not seem to be that big of a deal.

dchaplinsky · 2021-07-28T14:29:28Z

Great idea, works for me.

dchaplinsky · 2021-07-28T14:29:58Z

The only thing that I'd like to check is if BRAT has support for IOB (I don't remember to be honest).

dchaplinsky mentioned this issue Jul 1, 2021

Please implement script for merging results of different models #15

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Prepare a script to convert results of different models to the unified format #14

Prepare a script to convert results of different models to the unified format #14

dchaplinsky commented Jul 1, 2021

gawy commented Jul 28, 2021

dchaplinsky commented Jul 28, 2021

dchaplinsky commented Jul 28, 2021

Prepare a script to convert results of different models to the unified format #14

Prepare a script to convert results of different models to the unified format #14

Comments

dchaplinsky commented Jul 1, 2021

gawy commented Jul 28, 2021

dchaplinsky commented Jul 28, 2021

dchaplinsky commented Jul 28, 2021