Update datasets task tags to align tags with models #4067

Merged on Apr 13, 2022 (12 commits)
4 changes: 2 additions & 2 deletions datasets/acronym_identification/README.md
@@ -14,9 +14,9 @@ size_categories:
source_datasets:
- original
task_categories:
- structure-prediction
- token-classification
task_ids:
- structure-prediction-other-acronym-identification
- token-classification-other-acronym-identification
paperswithcode_id: acronym-identification
pretty_name: Acronym Identification Dataset
---
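
The rename in this hunk follows a pattern that recurs across the PR: the `structure-prediction` category becomes `token-classification`, and any `task_ids` entry that carries the old category as a prefix is re-prefixed. A minimal sketch of that transformation, assuming the card metadata has already been parsed into a plain dict with flat lists (the helper name `retag` and the dict layout are hypothetical, not part of the `datasets` repository; multi-config cards that nest these keys per config would need an extra loop):

```python
# Hypothetical helper: illustrates the rename pattern seen in this diff only.
OLD = "structure-prediction"
NEW = "token-classification"


def retag(metadata: dict) -> dict:
    """Return a copy of the card metadata with the old category renamed."""
    out = dict(metadata)
    # Rename the category itself; leave every other category untouched.
    out["task_categories"] = [
        NEW if category == OLD else category
        for category in metadata.get("task_categories", [])
    ]
    # Re-prefix task ids such as "structure-prediction-other-...".
    out["task_ids"] = [
        NEW + task_id[len(OLD):] if task_id.startswith(OLD + "-") else task_id
        for task_id in metadata.get("task_ids", [])
    ]
    return out
```

Task ids without the old prefix, such as `named-entity-recognition`, pass through unchanged, which matches the hunks in this PR where only the category line changes.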
4 changes: 2 additions & 2 deletions datasets/ade_corpus_v2/README.md
@@ -22,9 +22,9 @@ task_categories:
Ade_corpus_v2_classification:
- text-classification
Ade_corpus_v2_drug_ade_relation:
- structure-prediction
- token-classification
Ade_corpus_v2_drug_dosage_relation:
- structure-prediction
- token-classification
task_ids:
Ade_corpus_v2_classification:
- fact-checking
2 changes: 1 addition & 1 deletion datasets/afrikaans_ner_corpus/README.md
@@ -14,7 +14,7 @@ size_categories:
source_datasets:
- original
task_categories:
- structure-prediction
- token-classification
task_ids:
- named-entity-recognition
paperswithcode_id: null
8 changes: 5 additions & 3 deletions datasets/air_dialogue/README.md
@@ -15,12 +15,14 @@ size_categories:
source_datasets:
- original
task_categories:
- conditional-text-generation
- sequence-modeling
- conversational
- text-generation
- fill-mask
task_ids:
- conditional-text-generation-other-dialogue-generation
- dialogue-generation
- dialogue-modeling
- language-modeling
- masked-language-modeling
paperswithcode_id: null
---

3 changes: 2 additions & 1 deletion datasets/allegro_reviews/README.md
@@ -14,9 +14,10 @@ size_categories:
source_datasets:
- original
task_categories:
- text-scoring
- text-classification
task_ids:
- sentiment-scoring
- text-scoring
paperswithcode_id: allegro-reviews
pretty_name: Allegro Reviews
---
5 changes: 2 additions & 3 deletions datasets/alt/README.md
@@ -40,10 +40,9 @@ size_categories:
source_datasets:
- original
task_categories:
- conditional-text-generation
- structure-prediction
- translation
- token-classification
task_ids:
- machine-translation
- parsing
paperswithcode_id: alt
pretty_name: Asian Language Treebank
9 changes: 5 additions & 4 deletions datasets/amazon_reviews_multi/README.md
@@ -58,15 +58,16 @@ size_categories:
source_datasets:
- original
task_categories:
- conditional-text-generation
- sequence-modeling
- summarization
- text-generation
- fill-mask
- text-classification
- text-scoring
task_ids:
- text-scoring
- language-modeling
- masked-language-modeling
- sentiment-classification
- sentiment-scoring
- summarization
- topic-classification
paperswithcode_id: null
pretty_name: The Multilingual Amazon Reviews Corpus
3 changes: 1 addition & 2 deletions datasets/ami/README.md
@@ -16,9 +16,8 @@ size_categories:
source_datasets:
- original
task_categories:
- speech-processing
task_ids:
- automatic-speech-recognition
task_ids: []
---

# Dataset Card for AMI Corpus
2 changes: 1 addition & 1 deletion datasets/amttl/README.md
@@ -14,7 +14,7 @@ size_categories:
source_datasets:
- original
task_categories:
- structure-prediction
- token-classification
task_ids:
- parsing
paperswithcode_id: null
3 changes: 2 additions & 1 deletion datasets/app_reviews/README.md
@@ -14,8 +14,9 @@ size_categories:
source_datasets:
- original
task_categories:
- text-scoring
- text-classification
task_ids:
- text-scoring
- sentiment-scoring
paperswithcode_id: null
pretty_name: AppReviews
1 change: 1 addition & 0 deletions datasets/aquamuse/README.md
@@ -20,6 +20,7 @@ source_datasets:
task_categories:
- other
- question-answering
- text2text-generation
task_ids:
- abstractive-qa
- extractive-qa
4 changes: 3 additions & 1 deletion datasets/arabic_billion_words/README.md
@@ -33,9 +33,11 @@ size_categories:
source_datasets:
- original
task_categories:
- sequence-modeling
- text-generation
- fill-mask
task_ids:
- language-modeling
- masked-language-modeling
paperswithcode_id: null
pretty_name: Arabic Billion Words
---
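
Unlike the one-to-one renames elsewhere in this PR, the `sequence-modeling` category is replaced by two categories here (`text-generation` and `fill-mask`), and `masked-language-modeling` is added next to `language-modeling` in `task_ids`. A minimal sketch of that one-to-many expansion, assuming flat lists of tags (the function name `split_sequence_modeling` is hypothetical and only mirrors what this hunk shows):

```python
# Hypothetical helper: mirrors the sequence-modeling split shown in this hunk.
def split_sequence_modeling(categories, task_ids):
    """Expand 'sequence-modeling' and pair language-modeling with its masked variant."""
    new_categories = []
    for category in categories:
        if category == "sequence-modeling":
            # One old category maps to two new ones, kept in the diff's order.
            new_categories.extend(["text-generation", "fill-mask"])
        else:
            new_categories.append(category)

    new_task_ids = list(task_ids)
    needs_masked = (
        "sequence-modeling" in categories
        and "language-modeling" in new_task_ids
        and "masked-language-modeling" not in new_task_ids
    )
    if needs_masked:
        # Insert directly after 'language-modeling' to match the diff's ordering.
        position = new_task_ids.index("language-modeling") + 1
        new_task_ids.insert(position, "masked-language-modeling")
    return new_categories, new_task_ids
```

Applied to the tags on the old side of this hunk, `split_sequence_modeling(["sequence-modeling"], ["language-modeling"])` yields the categories and task ids on the new side.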
2 changes: 1 addition & 1 deletion datasets/arabic_pos_dialect/README.md
@@ -14,7 +14,7 @@ size_categories:
source_datasets:
- extended
task_categories:
- structure-prediction
- token-classification
task_ids:
- part-of-speech-tagging
paperswithcode_id: null
3 changes: 1 addition & 2 deletions datasets/arabic_speech_corpus/README.md
@@ -16,9 +16,8 @@ size_categories:
source_datasets:
- original
task_categories:
- speech-processing
task_ids:
- automatic-speech-recognition
task_ids: []
---

# Dataset Card for Arabic Speech Corpus
5 changes: 2 additions & 3 deletions datasets/arxiv_dataset/README.md
@@ -14,15 +14,14 @@ size_categories:
source_datasets:
- original
task_categories:
- conditional-text-generation
- translation
- summarization
- text-retrieval
task_ids:
- document-retrieval
- entity-linking-retrieval
- explanation-generation
- fact-checking-retrieval
- machine-translation
- summarization
- text-simplification
paperswithcode_id: null
pretty_name: arXiv Dataset
8 changes: 4 additions & 4 deletions datasets/asset/README.md
@@ -16,12 +16,12 @@ source_datasets:
- extended|other-turkcorpus
task_categories:
ratings:
- text-scoring
- text-classification
simplification:
- conditional-text-generation
- text2text-generation
task_ids:
ratings:
- text-scoring-other-simplification-evaluation
- text-classification-other-simplification-evaluation
simplification:
- text-simplification
paperswithcode_id: asset
@@ -67,7 +67,7 @@ splitting in [HSplit](https://www.aclweb.org/anthology/D18-1081.pdf)), the simpl

### Supported Tasks and Leaderboards

The dataset supports the evaluation of `test-simplification` systems. Success in this tasks is typically measured using the [SARI](https://huggingface.co/metrics/sari) and [FKBLEU](https://huggingface.co/metrics/fkbleu) metrics described in the paper [Optimizing Statistical Machine Translation for Text Simplification](https://www.aclweb.org/anthology/Q16-1029.pdf).
The dataset supports the evaluation of `text-simplification` systems. Success in this tasks is typically measured using the [SARI](https://huggingface.co/metrics/sari) and [FKBLEU](https://huggingface.co/metrics/fkbleu) metrics described in the paper [Optimizing Statistical Machine Translation for Text Simplification](https://www.aclweb.org/anthology/Q16-1029.pdf).

### Languages

2 changes: 1 addition & 1 deletion datasets/assin/README.md
@@ -16,8 +16,8 @@ source_datasets:
- original
task_categories:
- text-classification
- text-scoring
task_ids:
- text-scoring
- natural-language-inference
- semantic-similarity-scoring
paperswithcode_id: assin
2 changes: 1 addition & 1 deletion datasets/assin2/README.md
@@ -15,8 +15,8 @@ source_datasets:
- original
task_categories:
- text-classification
- text-scoring
task_ids:
- text-scoring
- natural-language-inference
- semantic-similarity-scoring
paperswithcode_id: assin2
4 changes: 2 additions & 2 deletions datasets/atomic/README.md
@@ -15,9 +15,9 @@ size_categories:
source_datasets:
- original
task_categories:
- conditional-text-generation
- text2text-generation
task_ids:
- other-structured-to-text
- text2text-generation-other-common-sense-if-then-reasoning
paperswithcode_id: atomic
---

5 changes: 2 additions & 3 deletions datasets/autshumato/README.md
@@ -40,9 +40,8 @@ size_categories:
source_datasets:
- original
task_categories:
- conditional-text-generation
task_ids:
- machine-translation
- translation
task_ids: []
paperswithcode_id: null
pretty_name: autshumato
---
5 changes: 2 additions & 3 deletions datasets/bbaw_egyptian/README.md
@@ -16,9 +16,8 @@ size_categories:
source_datasets:
- extended|wikipedia
task_categories:
- conditional-text-generation
task_ids:
- machine-translation
- translation
task_ids: []
paperswithcode_id: null
pretty_name: BbawEgyptian
---
2 changes: 1 addition & 1 deletion datasets/bc2gm_corpus/README.md
@@ -14,7 +14,7 @@ size_categories:
source_datasets:
- original
task_categories:
- structure-prediction
- token-classification
task_ids:
- named-entity-recognition
paperswithcode_id: null
2 changes: 1 addition & 1 deletion datasets/beans/README.md
@@ -17,7 +17,7 @@ source_datasets:
task_categories:
- image-classification
task_ids:
- single-label-image-classification
- multi-class-image-classification
---

# Dataset Card for Beans
4 changes: 2 additions & 2 deletions datasets/best2009/README.md
@@ -14,9 +14,9 @@ size_categories:
source_datasets:
- original
task_categories:
- structure-prediction
- token-classification
task_ids:
- structure-prediction-other-word-tokenization
- token-classification-other-word-tokenization
paperswithcode_id: null
pretty_name: best2009
---
5 changes: 2 additions & 3 deletions datasets/bianet/README.md
@@ -27,9 +27,8 @@ size_categories:
source_datasets:
- original
task_categories:
- conditional-text-generation
task_ids:
- machine-translation
- translation
task_ids: []
paperswithcode_id: bianet
pretty_name: Bianet
---
5 changes: 2 additions & 3 deletions datasets/bible_para/README.md
@@ -115,9 +115,8 @@ size_categories:
source_datasets:
- original
task_categories:
- conditional-text-generation
task_ids:
- machine-translation
- translation
task_ids: []
paperswithcode_id: null
pretty_name: BiblePara
---
4 changes: 2 additions & 2 deletions datasets/big_patent/README.md
@@ -33,9 +33,9 @@ size_categories:
source_datasets:
- original
task_categories:
- conditional-text-generation
task_ids:
- summarization
task_ids:
- summarization-other-patent-summarization
paperswithcode_id: bigpatent
pretty_name: Big Patent
---
4 changes: 2 additions & 2 deletions datasets/billsum/README.md
@@ -14,9 +14,9 @@ size_categories:
source_datasets:
- original
task_categories:
- conditional-text-generation
task_ids:
- summarization
task_ids:
- summarization-other-bills-summarization
paperswithcode_id: billsum
pretty_name: BillSum
---
3 changes: 2 additions & 1 deletion datasets/biosses/README.md
@@ -14,8 +14,9 @@ size_categories:
source_datasets:
- original
task_categories:
- text-scoring
- text-classification
task_ids:
- text-scoring
- semantic-similarity-scoring
paperswithcode_id: biosses
pretty_name: BIOSSES
4 changes: 3 additions & 1 deletion datasets/blbooks/README.md
@@ -20,10 +20,12 @@ size_categories:
source_datasets:
- original
task_categories:
- sequence-modeling
- text-generation
- fill-mask
- other
task_ids:
- language-modeling
- masked-language-modeling
- other-other-digital-humanities-research
---

4 changes: 3 additions & 1 deletion datasets/blbooksgenre/README.md
@@ -25,11 +25,13 @@ source_datasets:
- original
task_categories:
- text-classification
- sequence-modeling
- text-generation
- fill-mask
task_ids:
- topic-classification
- multi-label-classification
- language-modeling
- masked-language-modeling
---

# Dataset Card for blbooksgenre
4 changes: 3 additions & 1 deletion datasets/bnl_newspapers/README.md
@@ -22,9 +22,11 @@ size_categories:
source_datasets:
- original
task_categories:
- sequence-modeling
- text-generation
- fill-mask
task_ids:
- language-modeling
- masked-language-modeling
---

# Dataset Card for BnL Historical Newspapers