9 files changed (+76 -47 lines changed)
File tree
- src
- cpp
- src
- python
- thirdparty
+2 -2
+51 -16
File renamed without changes.
Submodule nlohmann_json deleted from 199dea1
Submodule openvino_tokenizers updated 56 files
- .github/dependabot.yml +33
- .github/dependency_review.yml +18
- .github/labeler.yml +25 -16
- .github/workflows/labeler.yml +11 -3
- .github/workflows/linux.yml +5 -2
- .github/workflows/mac.yml +5 -2
- .github/workflows/sdl.yml +8 -1
- .github/workflows/windows.yml +5 -2
- CMakeLists.txt +1 -1
- Jenkinsfile +3
- README.md +187 -110
- pyproject.toml +6 -4
- python/openvino_tokenizers/__init__.py +7 -5
- python/openvino_tokenizers/build_tokenizer.py +76
- python/openvino_tokenizers/cli.py +14 -1
- python/openvino_tokenizers/convert_tokenizer.py +3
- python/openvino_tokenizers/hf_parser.py +59 -29
- python/openvino_tokenizers/str_pack.py +1 -1
- python/openvino_tokenizers/tokenizer_pipeline.py +129 -38
- python/openvino_tokenizers/utils.py +4 -2
- requirements-build.txt +1 -1
- src/CMakeLists.txt +29 -8
- src/bpe_tokenizer.cpp +24 -28
- src/bpe_tokenizer.hpp +1 -1
- src/case_fold.cpp +22 -6
- src/case_fold.hpp +9 -2
- src/equal_str.cpp +73
- src/equal_str.hpp +40
- src/fuze.cpp +40
- src/fuze.hpp +35
- src/ov_extension.cpp +28 -11
- src/ragged_to_ragged.cpp +82
- src/ragged_to_ragged.hpp +41
- src/ragged_to_sparse.cpp +47
- src/ragged_to_sparse.hpp +36
- src/regex_normalization.cpp +31 -7
- src/regex_normalization.hpp +13 -7
- src/regex_split.cpp +20 -5
- src/regex_split.hpp +7 -4
- src/sentence_piece.cpp +56 -81
- src/sentence_piece.hpp +3 -3
- src/tensorflow_translators.cpp +371 -104
- src/tensorflow_translators.hpp +9 -4
- src/tokenizer.hpp +6
- src/trie_tokenizer.cpp +111
- src/trie_tokenizer.hpp +54
- src/utils.cpp +14 -18
- src/utils.hpp +3 -1
- src/vocab_decoder.cpp +11 -16
- src/vocab_decoder.hpp +1 -2
- src/vocab_encoder.cpp +74
- src/vocab_encoder.hpp +45
- src/wordpiece_tokenizer.cpp +22 -25
- src/wordpiece_tokenizer.hpp +1 -1
- tests/pass_rates.json +1 -1
- tests/tokenizers_test.py +34 -17