Create SQuAD metric README.md #3873

Merged
merged 10 commits into from
Mar 10, 2022
Conversation

sashavor
Contributor

@sashavor sashavor commented Mar 9, 2022

Proposal for a metrics card structure (with an example based on the SQuAD metric).
@thomwolf @lhoestq @douwekiela @lewtun -- feel free to comment on structure or content (it's an initial draft, so I realize there's stuff missing!).

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

Member

@thomwolf thomwolf left a comment

Very cool!

Member

thomwolf commented Mar 9, 2022

Oh, one last thing I almost forgot: I think I would add an "Examples" section with examples of inputs and outputs, in particular an example giving maximal values, an example giving minimal values, and maybe a standard example from SQuAD. What do you think?

updating with @thomwolf 's suggestions
@sashavor sashavor requested a review from thomwolf March 9, 2022 15:23
Member

@lhoestq lhoestq left a comment

Cool, thank you! I also imagine that we could have a widget for people to play with the metric on the website somehow.

Sasha Luccioni and others added 4 commits March 9, 2022 10:34
Co-authored-by: Quentin Lhoest <[email protected]>
Co-authored-by: Quentin Lhoest <[email protected]>
Updating structure as per @lhoestq 's suggestion
@sashavor sashavor requested a review from lhoestq March 9, 2022 16:04
Member

@lewtun lewtun left a comment

This is very cool and will really help people get a better understanding of how our metrics work!

I've left some tiny comments, but otherwise this looks great to me :)

{'exact_match': 66.66666666666667, 'f1': 66.66666666666667}

## Limitations and bias
This metric works only with the [SQuAD v.1 dataset](https://huggingface.co/datasets/squad) -- it will not work with any other dataset formats.
Member

Isn't it more accurate to say that this metric only works for datasets that have the same schema / format as SQuAD v1?

Contributor Author

Well, it's technically a dataset-specific metric, because there are ids that are internal to SQuAD, but you're right that they don't really get checked.
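
As a rough illustration of the schema point, here is a small sketch of a format check. It assumes the SQuAD v1 layout is an `id` plus `prediction_text` for predictions, and an `id` plus `answers` (with `text` and `answer_start` lists) for references. `check_squad_v1_format` is a hypothetical helper, not something in the library; note that it only compares ids between the two lists and never checks that they come from SQuAD.

```python
REQUIRED_PREDICTION_KEYS = {"id", "prediction_text"}
REQUIRED_REFERENCE_KEYS = {"id", "answers"}
REQUIRED_ANSWER_KEYS = {"text", "answer_start"}


def check_squad_v1_format(predictions, references):
    """Hypothetical helper: return a list of schema problems (empty if none)."""
    problems = []
    for i, pred in enumerate(predictions):
        missing = REQUIRED_PREDICTION_KEYS - set(pred)
        if missing:
            problems.append(f"prediction {i} is missing {sorted(missing)}")
    for i, ref in enumerate(references):
        missing = REQUIRED_REFERENCE_KEYS - set(ref)
        if missing:
            problems.append(f"reference {i} is missing {sorted(missing)}")
            continue
        missing_answer_keys = REQUIRED_ANSWER_KEYS - set(ref["answers"])
        if missing_answer_keys:
            problems.append(f"reference {i} answers are missing {sorted(missing_answer_keys)}")
    # Ids are only join keys here: they must agree across the two lists,
    # but nothing requires them to be actual SQuAD ids.
    pred_ids = {p.get("id") for p in predictions}
    ref_ids = {r.get("id") for r in references}
    if pred_ids != ref_ids:
        problems.append("prediction ids and reference ids do not match")
    return problems


# A record from some other dataset that happens to share the schema passes.
custom_predictions = [{"id": "my-dataset-0", "prediction_text": "Paris"}]
custom_references = [{"id": "my-dataset-0",
                      "answers": {"text": ["Paris"], "answer_start": [12]}}]
print(check_squad_v1_format(custom_predictions, custom_references))  # []
```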

changing the code formatting as per @lewtun 's comments
Member

@lhoestq lhoestq left a comment

Feel free to merge if it's all good for you :)

@sashavor sashavor merged commit 89f3751 into master Mar 10, 2022
@sashavor sashavor deleted the squad-metric-card branch March 10, 2022 16:45
5 participants