Skip to content

Latest commit

 

History

History
6 lines (3 loc) · 268 Bytes

README.md

File metadata and controls

6 lines (3 loc) · 268 Bytes

This is a simple implementation of a Byte Pair Encoding Algorithm, used to tokenize text, and a Bigram Word Model.

These were created as part of my research work in Language Models and are my original implementation!

https://www.linkedin.com/in/peter-v-334609211/