level2_Relation-Extraction_nlp-05

🐴Members


변성훈	서보성	이도현	이상민	이승우	이예원

📎RE (Relation Extraction)

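This is the first Level 2 competition project of the NLP track in boostcamp AI Tech (5th cohort). Relation extraction (RE) is the task of identifying and classifying the relation between two entities in a sentence or text. RE is widely used in applications such as information retrieval and extraction and knowledge graph construction. The goal of the project is, given a sentence and two entities in that sentence, to automatically extract the relation between the two entities.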

Data (Private)

Total data: 40,235 sentence pairs
- Train data: 32,470 (81%)
- Test data: 7,765 (19%)
- Label: integer from 0 to 29 (KLUE RE)

Metric

Micro F1, AUPRC
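
As a rough illustration of the first metric, the sketch below computes micro-averaged F1 over integer labels in plain Python. It assumes, as in the public KLUE RE benchmark, that one label (here hypothetically index 0, standing for `no_relation`) is left out of the score. The function name `micro_f1` and the `ignore_label` parameter are illustrative and are not taken from `utils/compute_metrics.py`.

```python
def micro_f1(y_true, y_pred, ignore_label=None):
    """Micro-averaged F1 over integer class labels.

    ignore_label: optional class id excluded from the TP/FP/FN counts
    (assumption: the "no relation" class).
    """
    tp = fp = fn = 0
    for t, p in zip(y_true, y_pred):
        if t == p:
            if t != ignore_label:
                tp += 1          # correct prediction of a counted class
        else:
            if p != ignore_label:
                fp += 1          # wrongly predicted a counted class
            if t != ignore_label:
                fn += 1          # missed a counted class
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy labels, not project data.
y_true = [0, 1, 2, 2, 3]
y_pred = [0, 1, 2, 3, 0]
score_all = micro_f1(y_true, y_pred)                      # every class counted
score_rel = micro_f1(y_true, y_pred, ignore_label=0)      # class 0 excluded
```

When no label is ignored, micro F1 over single-label predictions equals plain accuracy, which is why excluding the dominant "no relation" class changes the score.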

✔️Project

Structure

root/
|
|-- train.py
|-- inference.py
|
|-- custom/
|   |-- CustomDataCollator.py
|   |-- CustomModel.py
|   |-- CustomTrainer.py
|
|-- module/
|   |-- add_token.py
|   |-- config.yaml
|   |-- load_data.py
|   |-- pretrain.py
|   |-- seed_everything.py
|   |-- train_val_split.py
|
|-- utils/
|   |-- compute_metrics.py
|   |-- ensemble.py
|   |-- kfold.py
|   |-- label_to_num.py
|   |-- num_to_label.py

Preprocessing

Data Augmentation
- Remove duplicated data
- Token masking for an abnormal label (per:place_of_residence)
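
Duplicate removal can be sketched as below. This is a minimal example, assuming each sample is a dict and that two samples count as duplicates when their sentence, both entities, and label all match. The field names and the helper `drop_duplicates` are hypothetical and do not come from `module/load_data.py`.

```python
def drop_duplicates(rows, keys=("sentence", "subject_entity", "object_entity", "label")):
    """Keep the first occurrence of each distinct combination of `keys`."""
    seen = set()
    unique = []
    for row in rows:
        signature = tuple(row[k] for k in keys)
        if signature not in seen:
            seen.add(signature)
            unique.append(row)
    return unique


# Toy rows, not project data.
rows = [
    {"sentence": "A founded B.", "subject_entity": "A", "object_entity": "B", "label": 5},
    {"sentence": "A founded B.", "subject_entity": "A", "object_entity": "B", "label": 5},
    {"sentence": "A founded B.", "subject_entity": "B", "object_entity": "A", "label": 7},
]
deduped = drop_duplicates(rows)  # the second row is dropped, the third is kept
```

Dropping `"label"` from `keys` would instead surface samples that share an input but disagree on the label, which usually need manual review rather than silent removal.
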
Entity Marker
- Special Token
  - Normal Special Token Ver1, Ver2(No CLS)
  - Korean Special Token
  - Special Token with CLS
- Punctuation
  - Normal punctuation
  - Korean Punctuation
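
The two marker families above differ only in the strings wrapped around each entity. The sketch below shows the idea under stated assumptions: the marker strings (`[SUBJ]`, `@`, `#`, and so on) and the helper `mark_entities` are illustrative, not the exact variants used in this project, and spans are character offsets with an exclusive end.

```python
def mark_entities(sentence, subj_span, obj_span, subj_marks, obj_marks):
    """Wrap the subject and object spans with (open, close) marker strings.

    Spans are (start, end) character offsets, `end` exclusive, and are
    assumed not to overlap.
    """
    spans = [(subj_span, subj_marks), (obj_span, obj_marks)]
    # Insert from the rightmost span first so earlier offsets stay valid.
    for (start, end), (open_mark, close_mark) in sorted(
        spans, key=lambda item: item[0][0], reverse=True
    ):
        sentence = (
            sentence[:start] + open_mark + sentence[start:end] + close_mark + sentence[end:]
        )
    return sentence


text = "Steve Jobs founded Apple."
subj, obj = (0, 10), (19, 24)

# Special-token style (these tokens would also have to be added to the tokenizer).
special = mark_entities(text, subj, obj, ("[SUBJ]", "[/SUBJ]"), ("[OBJ]", "[/OBJ]"))

# Punctuation style (reuses symbols already in the vocabulary).
punct = mark_entities(text, subj, obj, ("@ ", " @"), ("# ", " #"))
```

Here `special` is `"[SUBJ]Steve Jobs[/SUBJ] founded [OBJ]Apple[/OBJ]."` and `punct` is `"@ Steve Jobs @ founded # Apple #."`.
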
Description
- Description version 1
- Description version 2

Modeling

Focal loss
Label Smoothing
Add modules to Pre-Trained Model
Use two types of BERT Models
- working independently
- working sequentially
Entity Type Restriction
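
For the focal loss item above, a framework-free sketch of the per-example loss is given here. It follows the standard definition FL = -(1 - p_t)^gamma * log(p_t); the default `gamma=2.0` is an assumption, not a value taken from this project, and the actual training code presumably computes the loss on batched tensors inside `custom/CustomTrainer.py`.

```python
import math


def focal_loss(logits, target, gamma=2.0):
    """Focal loss for a single example.

    logits: list of raw class scores
    target: index of the true class
    gamma:  focusing parameter; gamma=0 reduces to cross-entropy
    """
    # Numerically stable log-softmax for the target class.
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    log_p_t = logits[target] - log_z
    p_t = math.exp(log_p_t)
    return -((1.0 - p_t) ** gamma) * log_p_t


# An easy example (confident and correct) is down-weighted far more
# than a hard one, relative to plain cross-entropy.
easy_ce = focal_loss([4.0, 0.0], target=0, gamma=0.0)
easy_fl = focal_loss([4.0, 0.0], target=0, gamma=2.0)
hard_ce = focal_loss([0.0, 0.0], target=0, gamma=0.0)
hard_fl = focal_loss([0.0, 0.0], target=0, gamma=2.0)
```

With two equal logits, `p_t` is 0.5, so the focal loss is exactly a quarter of the cross-entropy; for the confident example the ratio is much smaller, which shifts training toward hard, often minority-class, samples.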

Ensemble

KFold → StratifiedKFold
Soft Voting Ensemble
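
Soft voting averages the class probabilities of several models (for example, one per fold) and takes the argmax of the average. The sketch below uses plain Python lists and equal weights; the function name `soft_vote` and the toy probabilities are illustrative and are not taken from `utils/ensemble.py`.

```python
def soft_vote(prob_sets):
    """Average per-model class probabilities and pick the best class.

    prob_sets[m][i][c] is the probability model m assigns to class c
    for example i. All models must score the same examples and classes.
    Returns (predictions, averaged_probabilities).
    """
    n_models = len(prob_sets)
    averaged = []
    for per_example in zip(*prob_sets):              # one example, all models
        avg = [sum(col) / n_models for col in zip(*per_example)]
        averaged.append(avg)
    predictions = [max(range(len(avg)), key=avg.__getitem__) for avg in averaged]
    return predictions, averaged


# Toy probabilities from three hypothetical fold models, two examples, three classes.
fold_a = [[0.5, 0.4, 0.1], [0.2, 0.7, 0.1]]
fold_b = [[0.4, 0.5, 0.1], [0.3, 0.3, 0.4]]
fold_c = [[0.1, 0.8, 0.1], [0.3, 0.3, 0.4]]
preds, probs = soft_vote([fold_a, fold_b, fold_c])
```

In the second example two of the three models narrowly prefer class 2, yet the averaged probabilities favor class 1 because one model is far more confident. This is the main difference from hard (majority) voting.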

💡 For details, please refer to the Wrap-up Report.

🐞Usage

# TRAIN
python3 code/train.py

# INFERENCE
python3 code/inference.py

🏆Result

Public: 1st place

Private: 2nd place
