- Run train.py
transformers == 4.25.1
pandas == 1.3.5
numpy == 1.21.6
torch == 1.13.1+cu116
scikit-learn == 1.0.2
tqdm == 4.64.1
pyperclip == 1.8.2
selenium == 4.7.2
- weighted F1 score
- Public score : 74.78 (35/565) (with koElectra ensemble seed = 777) (it isn't public 6th solution)
- Private score : non checked
- loss can't coverge with klue/RoBERTa-large (I think it's because of hyperparameter. it can be get higher score)
- Seed ensemble
- CV ensemble
- Exploratory Data Analysis
- Code refactoring
- Project Managing
- Focal Loss function debug
- RoBERTaForSequenceClassification debug
- Backtranslatation function for data augmentation
- Focal Loss function (for fix data imbalance)
- Add BI_LSTM layer in RoBERTaForSequenceClassification