Are Gigaword examples with summary length bigger than the article length considered when the final metrics are computed? #8

Tomarchelone · 2024-01-10T09:43:35Z

In the Gigaword dataset there are some examples where the summary is longer than the source sequence. Sometimes the source is a single unk word. As far as I can see in dataclass.py, such examples are dropped from the pipeline completely.

Were the ROUGE scores reported in the paper computed without those examples? If so, it is incorrect to compare the resulting scores with the baselines, because the two sets of scores would be measured on different test sets. For example, as far as I can see, the ROUGE scores for Concept Pointer were taken directly from the original paper, where performance was measured on all test examples.
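
To make the question concrete, here is a minimal sketch of the kind of length filter described above. It is an assumption, not the code from dataclass.py: the function name `split_by_length` is hypothetical, lengths are counted in whitespace-separated tokens, and the exact condition used in the repository may differ (for example `>=` instead of `>`). The toy pairs are made up and are not taken from Gigaword.

```python
def split_by_length(pairs):
    """Split (article, summary) pairs into kept and dropped lists.

    Assumed rule: a pair is dropped when the summary has more
    whitespace-separated tokens than the article.
    """
    kept, dropped = [], []
    for article, summary in pairs:
        if len(summary.split()) > len(article.split()):
            dropped.append((article, summary))
        else:
            kept.append((article, summary))
    return kept, dropped


# Toy pairs invented for illustration only.
toy_pairs = [
    ("police arrest three suspects after bank robbery in city center",
     "three arrested after bank robbery"),
    ("unk", "markets close higher on friday"),
]

kept, dropped = split_by_length(toy_pairs)
print(f"kept={len(kept)} dropped={len(dropped)}")
```

Running a check like this over the test split would show how many examples are excluded, and therefore how far the evaluated set differs from the full test set used by the baselines.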
