Running BERT QA as ensemble + performance results #50

vblagoje · 2022-09-25T22:34:26Z

vblagoje
Sep 25, 2022

Hey everyone,

Following @byshiue recommendations from #46 I made an ensemble model using BERT embeddings for preprocessing, BERT fastertransformer encoder as the main model, and finally QA head as postprocessing. I wanted to test if I can run QA task as an ensemble end-to-end.

The results are spectacular to say the least. So good that I want to first verify that I am doing everything as I should. I am providing python script comparing plain-vanilla PyTorch HuggingFace execution along with Triton/FT setup. Here are the results of running that script a few times. Here are my triton logs.

Looking forward to your comments and feedback.
Best, Vladimir

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Running BERT QA as ensemble + performance results #50

{{title}}

Replies: 0 comments

Select a reply

Running BERT QA as ensemble + performance results #50

vblagoje Sep 25, 2022

Replies: 0 comments

vblagoje
Sep 25, 2022