You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi, I'm reproducing your experiment recently, and I found when I evaluated BERT-mp for English STS. The results I got are much better than reported in the paper, so I also use your code provided at here to check. I found it's still better than paper shows. However, when I use your code to evaluate BERT-CLS for English STS, I can get the same results with the paper shows. So I was wondering is there something wrong with the results using BERT-mp for English STS?
The text was updated successfully, but these errors were encountered:
, there are two options for mp: mean and mean_std where the first one considers padding tokens too. I used mean for reporting results which might have caused the discrepancy.
Hi, I'm reproducing your experiment recently, and I found when I evaluated BERT-mp for English STS. The results I got are much better than reported in the paper, so I also use your code provided at here to check. I found it's still better than paper shows. However, when I use your code to evaluate BERT-CLS for English STS, I can get the same results with the paper shows. So I was wondering is there something wrong with the results using BERT-mp for English STS?

The text was updated successfully, but these errors were encountered: