-
Notifications
You must be signed in to change notification settings - Fork 27.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Pegasus- Arxiv predicts random text #7163
Comments
Seems to be a problem with the 'google/pegasus-arxiv' model, when you use 'google/pegasus-xsum' you get: |
Yes I tried different pegasus models (including alot of other models) and pegasus-large e.G. outputs this (which I think is really good1): while pegasus-multinews outputs pretty well generated texts, but unfortunately wrong in the content: Gigaword and billsum are both also outputting non useful texts. Also another question, while pegasus-large and pegasus-cnn_dailymail both only return the most important sentences, pegasus-multinews generates even new text. I was hoping the same for the arxiv model, is there a reason that it differs in that way? |
If you want to prove a bug, try running an evaluation on a public dataset from the datasets package, and posting the result #6844 . |
Environment infotransformers version: 3.1.0 To ReproduceI found unexpected behaviour when using Pegasus-Pubmed on Pubmed document.
Expected behaviourI expect a summary of the input but i received a longer (relative to the input) version of the text, input has length of 929 vs 1129 of predicted summary. Input: although the association is modest , it is important because of the increasing prevalence of metabolic syndrome and the effect that depression can have on the ability of patients to successfully make lifestyle changes and comply with medication required for hypertension and dyslipidemia . the association is demonstrated here in a general population to our knowledge for the first time , whereas earlier studies ( table 1 ) used subgroups of populations ( 813,17 ) . this distinction is important because many individuals with metabolic syndrome have diabetes , which itself is known to be associated with depression ( 5 ) . metabolic syndrome has been defined in several ways that involve quantitative anthropometric , clinical , and laboratory measurements ( 1,2 ) . for the primary assessment , we chose ncep atp iii ( 1 ) criteria , since these criteria were used in most of the previously reported studies ( 8,9,1113,17 ) . Output: ['depression is known to be associated with metabolic syndrome, but its association with metabolic syndrome has not been studied in a general population. we examined the association between depression and metabolic syndrome using ncep atp iii criteria in a population - based sample ( n = 3,018 ). metabolic syndrome was defined as having three or more of the following : body mass index 25 kg / m2, waist circumference 90 cm, and triglyceride 130 mg / dl. depression was assessed using the center for epidemiologic studies depression scale ( cesds ). multivariate logistic regression was used to estimate odds ratios ( ors ) and 95% confidence intervals ( cis ) for the association between depression and metabolic syndrome. we found a significant association between depression and metabolic syndrome in a general population. after adjustment for age, sex, race / ethnicity, education, smoking, physical activity, alcohol intake, and body mass index, metabolic syndrome was associated with increased odds of depression ( or = 1.16, 95% ci 1.041.32 ). the association was stronger in women than in men.'] Is that behaviour correct? |
Output should be < 256 tokens (not characters). |
We've now replicated that our pegasus port performs similarly well to the authors implementation on 11 datasets, including arxiv. |
Environment info
transformers
version: 3.1.0Who can help
@sshleifer
Information
Model I am using (Pegasus-Arxiv):
The problem arises when using:
To reproduce
Steps to reproduce the behavior:
Expected behavior
I expect a clear summary of the text, but I receive a text with no connection to the input, written as a scientific paper:
['this is the first of a series of papers in which we address the question of whether or not the laws of thermodynamics are valid in the limit of infinitely many degrees of freedom. we show that the laws of thermodynamics are valid in the limit of infinitely many degrees of freedom. this is the first of a series of papers in which we address the question of whether or not the laws of thermodynamics are valid in the limit of infinitely many degrees of freedom. we show that the laws of thermodynamics are valid in the limit of infinitely many degrees of freedom. [ theorem]acknowledgement [ theorem]algorithm [ theorem]axiom [ theorem]claim [ theorem]conclusion [ theorem]condition [ theorem]conjecture [ theorem]corollary [ theorem]criterion [ theorem]definition [ theorem]example [ theorem]exercise [ theorem]lemma [ theorem]notation [ theorem]problem [ theorem]proposition [ theorem]remark [ theorem]solution [ theorem]summary this is the first of a series of papers in which we address the question of whether or not the laws of thermodynamics are valid in the limit of infinitely many degrees of freedom.']
Am I doing something wrong or is it the model?
Thanks
The text was updated successfully, but these errors were encountered: