This repository contains our code for the default final project of the Stanford CS 224N class. Our final report is here.
In this project, we fine-tune the minBERT model to perform well simultaneously on three downstream tasks: sentiment analysis, paraphrase detection, and semantic textual similarity (STS) prediction. First, we load pre-trained weights into our minBERT implementation and train only on the sentiment task to obtain baseline metrics for all three tasks. Second, we train on all three tasks at once, using multi-task fine-tuning and gradient surgery to fine-tune our embeddings. For the STS task, we take an approach inspired by Sentence-BERT (SBERT) and generate sentence embeddings that can be compared via cosine similarity, avoiding the overhead of scoring every sentence pair with a full BERT forward pass. Overall, our fine-tuned embeddings outperform the baseline on two of the three tasks.
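As a rough illustration of the two techniques mentioned above, here is a minimal PyTorch-style sketch (not the repository's actual code): a PCGrad-style gradient-surgery step that removes conflicting gradient components between tasks, and an SBERT-style STS head that embeds each sentence separately and scores the pair with cosine similarity. The function names and the batch/model field names (`input_ids_1`, `pooler_output`, etc.) are illustrative assumptions rather than this project's real interfaces.

```python
import torch
import torch.nn.functional as F


def pcgrad_project(grads):
    """Gradient surgery (PCGrad-style): for each task's flattened gradient,
    remove the component that conflicts with any other task's gradient."""
    projected = [g.clone() for g in grads]
    for i, g_i in enumerate(projected):
        for j, g_j in enumerate(grads):
            if i == j:
                continue
            dot = torch.dot(g_i, g_j)
            if dot < 0:  # gradients conflict; project g_i off g_j
                g_i -= dot / (g_j.norm() ** 2 + 1e-12) * g_j
    return projected


def sts_cosine_score(bert, batch):
    """SBERT-style STS scoring: encode each sentence independently, then
    compare the two sentence embeddings with cosine similarity, rescaled
    to the 0-5 STS range."""
    emb_a = bert(batch["input_ids_1"], batch["attention_mask_1"])["pooler_output"]
    emb_b = bert(batch["input_ids_2"], batch["attention_mask_2"])["pooler_output"]
    return 2.5 * (F.cosine_similarity(emb_a, emb_b) + 1.0)
```

Because each sentence is embedded on its own, the cosine-similarity head lets pairwise STS scores be computed from cached embeddings instead of running a full cross-attention forward pass per pair.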
The BERT implementation part of the project was adapted from the "minbert" assignment developed for Carnegie Mellon University's CS11-711 Advanced NLP course, created by Shuyan Zhou, Zhengbao Jiang, Ritam Dutt, Brendon Boldt, Aditya Veerubhotla, and Graham Neubig.
Parts of the code are from the Hugging Face transformers library (Apache License 2.0).