The idea of extracting relationships from self-attention weights is indeed very inspiring!
However, I have a question. My understanding of DETR is not very deep, but as far as I know, the object queries output by later decoder layers are more accurate, and DETR typically uses the object queries from the final layer to regress the final bounding boxes. So, in EGTR, why not use the self-attention weights from the final layer alone for relationship extraction? Have you conducted any ablation studies on this?
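To make the question concrete, here is a toy sketch of the two options I am comparing. It is not EGTR's code: the decoder is a stand-in with no learned projections, and every name in it (`toy_decoder`, `final_layer_only`, `all_layers`) is hypothetical and chosen only for illustration.

```python
import math
import random


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def attention_weights(queries):
    """Scaled dot-product attention map between all query pairs.

    Toy assumption: Q = K = queries (no learned projections, one head).
    """
    scale = 1.0 / math.sqrt(len(queries[0]))
    return [
        softmax([scale * sum(a * b for a, b in zip(qi, qj)) for qj in queries])
        for qi in queries
    ]


def toy_decoder(num_layers, num_queries, dim, seed=0):
    """Run a stand-in decoder and collect one N x N attention map per layer."""
    rng = random.Random(seed)
    queries = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(num_queries)]
    per_layer = []
    for _ in range(num_layers):
        weights = attention_weights(queries)
        per_layer.append(weights)
        # Residual update: each query adds its attention-weighted mix of all queries.
        queries = [
            [q[d] + sum(w[j] * queries[j][d] for j in range(num_queries))
             for d in range(dim)]
            for q, w in zip(queries, weights)
        ]
    return per_layer


def final_layer_only(per_layer):
    """Option A: relation features come from the last layer's map, shape [N][N]."""
    return per_layer[-1]


def all_layers(per_layer):
    """Option B: stack every layer's map per query pair, shape [N][N][L]."""
    n = len(per_layer[0])
    return [[[layer[i][j] for layer in per_layer] for j in range(n)]
            for i in range(n)]


maps = toy_decoder(num_layers=3, num_queries=4, dim=8)
pair_features_a = final_layer_only(maps)  # one weight per (subject, object) pair
pair_features_b = all_layers(maps)        # L weights per (subject, object) pair
```

My question is whether option A alone would be sufficient, given that the final-layer queries are the ones used for box regression.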