
some questions on evaluating #5

Open

ALR-alr opened this issue Aug 26, 2024 · 4 comments

Comments

ALR-alr commented Aug 26, 2024

How can the model be evaluated on GLUE tasks? These tasks are text-only, but the paper says “Similar to PLM, when prefix image is none, this task will degenerate into “text-to-image generation” task, forcing the model to generate an image with the input caption”, so how can the model complete text-only tasks?

ALR-alr (Author) commented Aug 26, 2024

Maybe I should only use the transformer encoder in DAVINCI?

shizhediao (Owner) commented Aug 27, 2024

Yes, you can just use the text encoder separately.
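Something like the following should work (a rough sketch using BART as a stand-in, since DAVINCI's text backbone follows BART; the exact module names in this codebase may differ):

```python
import torch
from transformers import BartTokenizer, BartModel

# Stand-in example: run only the encoder of an encoder-decoder model on
# a pure-text input. BartModel is used for illustration; swap in the
# DAVINCI text encoder in practice.
tok = BartTokenizer.from_pretrained("facebook/bart-base")
model = BartModel.from_pretrained("facebook/bart-base")

inputs = tok("A pure-text input sentence.", return_tensors="pt")
with torch.no_grad():
    enc_out = model.get_encoder()(**inputs)

print(enc_out.last_hidden_state.shape)  # (batch, seq_len, hidden)
```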

ALR-alr (Author) commented Aug 27, 2024

> Yes, you can just use the text encoder separately.

It is said in the paper that "We follow the practice of BART (Lewis et al., 2020) and feed the same input to the encoder and decoder, and the hidden state of the final decoder token is fed into a new multi-class linear classifier or regression head." In my understanding, isn't the decoder input here similar to the one in the original Transformer, where already-generated tokens are used as the decoder input to generate the next token autoregressively? Why does the paper say the decoder input is passed in manually and is the same as the encoder input?
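For reference, this is how I read the BART practice in HuggingFace's `BartForSequenceClassification` (just my reference point; DAVINCI itself may differ):

```python
from transformers import BartTokenizer, BartForSequenceClassification

tok = BartTokenizer.from_pretrained("facebook/bart-base")
model = BartForSequenceClassification.from_pretrained(
    "facebook/bart-base", num_labels=3
)

inputs = tok("The movie was great.", return_tensors="pt")
# No autoregressive generation happens here: the decoder consumes a
# (right-shifted) copy of the encoder input in a single forward pass,
# and the hidden state of the final token feeds the classification head.
out = model(**inputs)
print(out.logits.shape)  # torch.Size([1, 3])
```

So classification would be a single forward pass rather than token-by-token decoding, which is where my confusion comes from.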

ALR-alr (Author) commented Aug 27, 2024

> How can the model be evaluated on GLUE tasks? These tasks are text-only, but the paper says “Similar to PLM, when prefix image is none, this task will degenerate into “text-to-image generation” task, forcing the model to generate an image with the input caption”, so how can the model complete text-only tasks?

As in the code, where `gen_text=text` is passed, the same text is fed to the DAVINCI model, but we need to predict the relationship between two sentences. Why can the hidden state from just one sequence be passed to the classifier to predict the relationship between two sentences?
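Or are the two sentences already joined into one sequence before being fed in? Something like this (my guess, using the BART tokenizer as a stand-in):

```python
from transformers import BartTokenizer

# Pair inputs are concatenated with separator tokens into one sequence,
# so a single final hidden state can attend over both sentences.
tok = BartTokenizer.from_pretrained("facebook/bart-base")
enc = tok("The cat sat on the mat.", "An animal was resting.",
          return_tensors="pt")
print(tok.decode(enc["input_ids"][0]))
# <s>The cat sat on the mat.</s></s>An animal was resting.</s>
```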
I'm sorry that I have so many beginner-level questions...
