Skip to content

Latest commit

 

History

History
24 lines (15 loc) · 414 Bytes

README.md

File metadata and controls

24 lines (15 loc) · 414 Bytes

Tokenizer-Changer

Python script for manipulating the existing tokenizer.

The solution was tested on Llama3-8B tokenizer.

Installation

Installation from PyPI:

pip install tokenizerchanger

Requirements

  • Python 3.9+
  • tokenizers>=0.21.0
  • transformers>=4.47.0
  • tqdm>=4.66.4

Docs

https://tokenizer-changer.readthedocs.io/en/latest/