Team AI-it Repo for KLUE tasks
김재현/T2050
박진영/T2096
안성민/T2127
양재욱/T2130
이연걸/T2163
조범준/T2211
진혜원/T2217
GPU : V100
Language : Python
Develop tools : Jupyter Notebook, VSCode, Pycharm, Google Colab
code/
├── train
├── inference
├── inference_ensemble
├── data_helper
├── load_data
├── utils
├── error_handler
└── requirements.txt
dataset/
└── train
└── train.csv
└── test
└── test.csv
torch==1.6.0
transformers==4.11.0
wandb==0.12.3
koeda==0.0.4
konlpy==0.5.2
kss==3.2.0
soynlp==0.0.493
pip install -r requirements.txt
python train.py --[args] [value]
python train.py --model klue-roberta-large
TTA(Test Time Augmentation) 사용
python inference.py --[args] [value]
python inference.py --ensemble True
python inference_ensemble.py --[args] [value]
python inference_ensemble.py --model_dir ./ensemble
Train : Given KLUE dataset
klue/roberta-large
klue/bert-base
Optimizer : AdamP
Loss : Focal Loss
+------------------------+------------+-------+-----------------+--------------------+-------------------------------+-------------+
| Trial name | status | loc | learning_rate | num_train_epochs | per_device_train_batch_size | objective |
|------------------------+------------+-------+-----------------+--------------------+-------------------------------+-------------|
| _objective_6b574_00000 | TERMINATED | | 2.80576e-05 | 5 | 128 | 154.363 |
| _objective_6b574_00001 | TERMINATED | | 0.000145532 | 5 | 32 | 149.825 |
| _objective_6b574_00002 | TERMINATED | | 1.02567e-05 | 5 | 128 | 132.344 |
| _objective_6b574_00003 | TERMINATED | | 6.53337e-06 | 6 | 32 | 147.45 |
| _objective_6b574_00004 | TERMINATED | | 7.96526e-05 | 6 | 128 | 160.306 |
| _objective_6b574_00005 | TERMINATED | | 5.49717e-06 | 6 | 256 | 106.767 |
| _objective_6b574_00006 | TERMINATED | | 0.000231129 | 6 | 64 | 143.499 |
| _objective_6b574_00007 | TERMINATED | | 1.1551e-05 | 5 | 32 | 154.669 |
| _objective_6b574_00008 | TERMINATED | | 2.02981e-05 | 6 | 32 | 160.616 |
| _objective_6b574_00009 | TERMINATED | | 3.65477e-05 | 5 | 128 | 157.183 |
+------------------------+------------+-------+-----------------+--------------------+-------------------------------+-------------+
BestRun(run_id='6b574_00008', objective=160.6155945027986, hyperparameters={'learning_rate': 2.0298058052421517e-05, 'num_train_epochs': 6, 'per_device_train_batch_size': 32})