Extractive QA with multiple answers #406

SamPse · 2023-01-12T08:35:46Z

Hello, I am new to txtai and I want to know if I can have a list of results with the QA approach.

My code is:

import pandas as pd
from txtai.pipeline import Questions

context = ["The doctor administered a 7g dose of Acetarsol and a 16mg dose of Ibuprofen",
           "The patient took one Alosetron tablet"]

queries = ["What is the drug taken?", "What is the dose?"]

# tokenizer is defined earlier in my code (not shown here)
questions = Questions(path=("sultan/BioM-ELECTRA-Large-SQuAD2-BioASQ8B", tokenizer), gpu=True)

# One list of answers per query, aligned with the context list
results = [questions([question] * len(context), context) for question in queries]
results.append(context)

pd.DataFrame(list(zip(*results)), columns=["Drug", "Dose", "Text"])

I would like to have this result.

Drug	Dose	Text
Acetarsol	7g	The doctor administered a 7g dose of Acetarsol and a 16mg dose of Ibuprofen
Ibuprofen	16mg	The doctor administered a 7g dose of Acetarsol and a 16mg dose of Ibuprofen
Alosetron	None	The patient took one Alosetron tablet

Thank you for your help.

davidmezzetti · 2023-01-12T13:34:18Z

Thank you for taking the time to submit an issue.

Currently, the Questions pipeline only supports returning the top answer. But the underlying Hugging Face pipeline supports multiple answers using the topk argument.

See this link for more: huggingface/transformers#3207

It would be a fairly straightforward change to add this to txtai

SamPse · 2023-01-12T16:21:44Z

Thank you for your answer.
I don't know exactly how to do that. If you have a code example, I'd be interested, especially for the extractive task.
https://github.com/neuml/txtai/blob/master/examples/20_Extractive_QA_to_build_structured_data.ipynb

Once again, very good work, and the documentation is very rich :)

davidmezzetti · 2023-01-17T12:19:27Z

Thank you.

The comment here: huggingface/transformers#3207 (comment) has an example of how to apply the topk parameter and return multiple results.
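For readers following along, here is a minimal sketch of that idea, calling the Hugging Face question-answering pipeline directly rather than the txtai Questions pipeline. The helper names `top_answers` and `load_qa` and the `min_score` cutoff of 0.1 are assumptions made for illustration; they are not part of txtai or transformers. Recent transformers releases spell the argument `top_k`, while older ones used `topk`, so the keyword may need adjusting for your installed version.

```python
from typing import Callable, List


def top_answers(qa: Callable, question: str, context: str,
                top_k: int = 3, min_score: float = 0.1) -> List[str]:
    """Return up to top_k distinct answer strings scoring at least min_score.

    min_score is an illustrative cutoff, not a recommended value.
    """
    result = qa(question=question, context=context, top_k=top_k)
    # The pipeline returns a single dict when one answer comes back and a
    # list of dicts otherwise, so normalize to a list before filtering.
    candidates = [result] if isinstance(result, dict) else list(result)
    answers: List[str] = []
    for candidate in candidates:
        text = str(candidate["answer"]).strip()
        if candidate["score"] >= min_score and text not in answers:
            answers.append(text)
    return answers


def load_qa(model: str):
    """Build a Hugging Face question-answering pipeline (downloads the model)."""
    from transformers import pipeline  # lazy import; requires transformers
    return pipeline("question-answering", model=model)


# Example usage (not run here because it downloads a model):
# qa = load_qa("sultan/BioM-ELECTRA-Large-SQuAD2-BioASQ8B")
# top_answers(qa, "What is the drug taken?",
#             "The doctor administered a 7g dose of Acetarsol and a 16mg dose of Ibuprofen")
```

The score cutoff matters because extra candidates from an extractive model are often overlapping or low-confidence spans, so asking for more answers does not guarantee that each one is a distinct drug or dose.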

casafurix · 2023-01-18T12:50:21Z

Can this work for SageMaker? I am trying to deploy this pipeline in SageMaker. How can we customize the topk parameter there, given that SageMaker works a bit differently? SageMaker currently returns only one answer, and I am unable to modify the parameters to increase the number of returned items. Thank you.

This is my model and pipeline setup in SageMaker:

hub = {
    'HF_MODEL_ID': 'valhalla/t5-base-qa-qg-hl',
    'HF_TASK': 'text2text-generation'
}

davidmezzetti · 2023-01-24T03:27:36Z

Unfortunately, I'm not too familiar with the Hugging Face SageMaker interface. The Hugging Face team may be able to help on that one. I'd take a look at asking here - https://discuss.huggingface.co/

rcali21 · 2023-05-25T02:44:02Z

@SamPse This is a bit dated, but I'd like to implement this in a future PR. Do you mind providing your full code? Thanks!

davidmezzetti · 2023-08-30T18:17:07Z

Keeping this issue open, still a good issue to consider.

davidmezzetti added the "good first issue" label Feb 11, 2023

davidmezzetti mentioned this issue Feb 12, 2023

Add translation pipeline parameter to return selected models and detected language #424

Merged

rcali21 mentioned this issue May 30, 2023

Add topk functionality to qa pipeline #480

Closed

davidmezzetti removed the "good first issue" label Aug 30, 2023

davidmezzetti mentioned this issue Jan 8, 2024

Pipeline Extractor search; get multiple answers #633

Closed
