Results for English STS using BERT-mp #6

zhujiatong628 · 2023-03-06T15:15:55Z

Hi, I've recently been reproducing your experiments. When I evaluated BERT-mp on English STS, the results I got were much better than those reported in the paper, so I also checked with the code you provide here, and the results were still better than the paper shows. However, when I use your code to evaluate BERT-CLS on English STS, I get the same results as in the paper. So I was wondering: is there something wrong with the reported BERT-mp results for English STS?

hardyqr · 2023-03-06T15:37:36Z

Hi, if you check out line 19 of mirror-bert/evaluation/eval.py (at commit 8b6b9de):

    parser.add_argument('--agg_mode', type=str, default="cls", help="{cls|mean|mean_std|...}")

there are two options for mp: mean and mean_std, where the first one also considers padding tokens. I used mean for reporting the results, which might have caused the discrepancy.
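To make the distinction concrete, here is a minimal sketch of the two pooling behaviours described above: averaging over every position (padding included) versus averaging over real tokens only. It uses plain Python lists so it runs without dependencies; the function names `mean_pool_all` and `mean_pool_masked` and the toy values are illustrative assumptions, not the code in eval.py, and which `--agg_mode` value maps to which behaviour is taken from the reply above rather than checked against the repository.

```python
# Illustrative sketch only: hypothetical helpers, not the repository's implementation.

def mean_pool_all(hidden, mask):
    """Average over every position, padding included (the mask is ignored)."""
    dim = len(hidden[0])
    return [sum(tok[d] for tok in hidden) / len(hidden) for d in range(dim)]


def mean_pool_masked(hidden, mask):
    """Average over real tokens only (positions where mask == 1)."""
    dim = len(hidden[0])
    n_real = sum(mask)
    return [
        sum(tok[d] * m for tok, m in zip(hidden, mask)) / n_real
        for d in range(dim)
    ]


# Toy sentence: 2 real tokens followed by 2 padding positions, hidden size 2.
hidden = [[1.0, 2.0], [3.0, 4.0], [4.0, 2.0], [4.0, 2.0]]
mask = [1, 1, 0, 0]

pooled_all = mean_pool_all(hidden, mask)        # [3.0, 2.5]
pooled_masked = mean_pool_masked(hidden, mask)  # [2.0, 3.0]
```

Because the padding vectors are pulled into the average in the first case, the sentence embedding changes with the amount of padding, which depends on batch composition and maximum length. That is one way two runs that both use mean pooling could end up with different STS scores.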

zhujiatong628 · 2023-03-06T15:45:43Z

Thanks for your reply. I also used mean, so I'm confused about the difference. 😫
