NLP at UC Santa Cruz at SemEval-2024 Task 5: Legal Answer Validation using Few-Shot Multi-Choice QA

This repository contains the code for our submission for SemEval-2024 Task 5: The Legal Argument Reasoning Task in Civil Procedure. Our submission secured the 7th place on the leaderboard.

Contained Files

Note: When we refer to data as being in "binary classification format", we mean that it is in the format provided to us by the task organizers. When we refer to it being in "multi-choice format", we mean that we have run the reformatting script on the data provided to us to create a new dataset.

GPT

binary_few_shot.py: Performs few shot prompting for either GPT-3.5 or GPT-4 on the test data. Data must be in binary classification format.
multi_choice_few_shot.py: Performs few shot prompting for either GPT-3.5 or GPT-4 on the test data. Data must be in multi-choice format.
reformat_test_data.py: Converts test data from binary classification format to multi-choice format.

BERT

inference.py: Generates predictions for the intended BERT model.
train.py: Runs training on the intended BERT model.

How to Run

GPT

Make sure the environment you are running the scripts in has an OpenAI access key defined so that the API can be accessed.
To run few shot prompting with binary classification, use binary_few_shot.py.
To run few shot prompting with multi-choice classification, first reformat the data using reformat_test_data.py. Then, use multi_choice_few_shot.py.
For both scripts, set the model variable to the desired version of GPT-3.5 or GPT-4, then you may run the script.

BERT

There is a collection of bash scripts that when run, call either train.py or inference.py for either vanilla BERT or Legal BERT.
If need be, change the --dataset flag in the bash script to the appropriately named dataset you have stored locally.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
bert		bert
gpt		gpt
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NLP at UC Santa Cruz at SemEval-2024 Task 5: Legal Answer Validation using Few-Shot Multi-Choice QA

Contained Files

GPT

BERT

How to Run

GPT

BERT

About

Releases

Packages

Contributors 2

Languages

devashat/UCSC-NLP-SemEval-2024-Task-5

Folders and files

Latest commit

History

Repository files navigation

NLP at UC Santa Cruz at SemEval-2024 Task 5: Legal Answer Validation using Few-Shot Multi-Choice QA

Contained Files

GPT

BERT

How to Run

GPT

BERT

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages