Fix indexing of lasttoken pooling for longest sequence #2111

ssharpe42 · 2023-05-24T12:39:23Z

Sequences that span the entire seq length of the batch will have attention mask of all 1's.

The current logic results in retrieving the first token embedding instead of the last since torch.min(torch.tensor([1,1,1 ... , 1])) will be 0.

The new logic uses the attention_mask.shape[1]-1 as the index to retrieve where min(attention_mask, 1)==1

fixing last token indexing for sequences that span the entire length

izhx · 2023-05-25T03:21:14Z

Nice work! Also find this bug.

sentence_transformers/models/Pooling.py

change to seq_len

into pr-2111

tomaarsen · 2023-12-13T19:59:22Z

Hello!

Thanks to you both for spotting this! You've provided a very elegant fix as well - I appreciate it! I'll merge when this goes green.

Update Pooling.py

256154b

fixing last token indexing for sequences that span the entire length

izhx reviewed May 25, 2023

View reviewed changes

sentence_transformers/models/Pooling.py Outdated Show resolved Hide resolved

ssharpe42 and others added 3 commits May 25, 2023 08:58

Update Pooling.py

581fe58

change to seq_len

Merge branch 'master' of https://github.com/UKPLab/sentence-transformers

01d5ae3

into pr-2111

Typo: lenth -> length

b875550

tomaarsen merged commit 6b524f8 into UKPLab:master Dec 13, 2023
8 checks passed

Provide feedback