Skip to content

mastershin/learn-llm-101

Repository files navigation

Major NLP Tasks

  • Text classification
  • Text summarization
  • Question answering
  • Speech recognition / Text to Speech
  • Language translation
  • Chatbots

LLM References

Evaluation Dataset

Training Dataset

SuperGLUE Dataset

Short Name Full Name Description
BoolQ Boolean Questions Involves answering a yes/no question based on a short passage.
CB CommitmentBank Tests understanding of entailment and contradiction in a three-sentence format.
COPA Choice of Plausible Alternatives Measures causal reasoning by asking for the cause/effect of a given sentence.
MultiRC Multi-Sentence Reading Comprehension Involves answering questions about a paragraph where each question may have multiple correct answers.
ReCoRD Reading Comprehension with Commonsense Reasoning Requires selecting the correct named entity from a passage to fill in the blank of a question.
RTE Recognizing Textual Entailment Involves identifying whether a sentence entails, contradicts, or is neutral towards another sentence.
WiC Words in Context Tests understanding of word sense disambiguation in different contexts.
WSC Winograd Schema Challenge Focuses on resolving coreference resolution within a sentence, often requiring commonsense reasoning.
AX-b Broad Coverage Diagnostic A diagnostic set to evaluate model performance on a broad range of linguistic phenomena.
AX-g Winogender Schema Diagnostics Tests for the presence of gender bias in automated coreference resolution systems.

GLUE Dataset

Short Name Full Name Description
CoLA Corpus of Linguistic Acceptability Measures the ability to determine if an English sentence is linguistically acceptable.
SST-2 Stanford Sentiment Treebank Consists of sentences from movie reviews with human annotations about their sentiment.
MRPC Microsoft Research Paraphrase Corpus Focuses on identifying whether two sentences are paraphrases of each other.
STS-B Semantic Textual Similarity Benchmark Involves determining the semantic similarity between two sentences.
QQP Quora Question Pairs Aims to identify whether two Quora questions are semantically equivalent.
MNLI Multi-Genre Natural Language Inference Features sentence pairs labeled for textual entailment across various text genres.
QNLI Question Natural Language Inference Involves determining whether a paragraph contains the answer to a question.
RTE Recognizing Textual Entailment Requires understanding whether one sentence entails another.
WNLI Winograd Natural Language Inference Tests reading comprehension by determining the correct referent of a pronoun in a sentence, depending on contextual clues from specific words or phrases.

Noteworthy Papers, including Attention Mechanisms

Noteworthy LLMs

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published