This repo is based on the Sentence-BERT paper. I hope you enjoy using it for your own projects.
The implementation in src differs from the paper: it uses only the difference of the two sentence embeddings and feeds it to a dense layer with a sigmoid activation instead of a softmax activation.
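To make the idea concrete, here is a minimal sketch of such a head in plain Python. It is not the code from src: the function names `sigmoid` and `difference_head`, the single output unit, and the use of the absolute difference |u - v| (as in the Sentence-BERT paper) are all assumptions made for illustration. The actual layer in this repo may differ, for example by using a signed difference.

```python
import math


def sigmoid(x: float) -> float:
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def difference_head(u, v, weights, bias):
    """Score a sentence pair from the difference of its embeddings.

    u, v    : sentence embeddings of equal length (hypothetical inputs)
    weights : one weight per embedding dimension (dense layer, one unit)
    bias    : scalar bias of that unit
    """
    if not (len(u) == len(v) == len(weights)):
        raise ValueError("u, v and weights must have the same length")
    # Assumption: element-wise absolute difference |u - v|.
    diff = [abs(a - b) for a, b in zip(u, v)]
    # Dense layer with a single unit, followed by sigmoid instead of softmax.
    logit = sum(w * d for w, d in zip(weights, diff)) + bias
    return sigmoid(logit)


if __name__ == "__main__":
    # Toy 3-dimensional embeddings and made-up weights, for illustration only.
    u = [0.2, -0.1, 0.4]
    v = [0.1, 0.3, 0.4]
    print(difference_head(u, v, weights=[1.0, -2.0, 0.5], bias=0.0))
```

Because the output is a single sigmoid unit, the score is a value between 0 and 1 that can be read as the probability of one class in a binary task, whereas a softmax head produces a distribution over several classes.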