resolve complicated chat templates during tokenizer saving #1151
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
I am fine with the changes. @eaidova, can you please describe the idea briefly for the rest of the reviewers?
Depends on huggingface/optimum-intel#1151 Close openvinotoolkit#1663 Ticket 161313
LGTM! Left a very minor comment.
Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com>
@echarlaix, thanks for the review. I applied your comment. Could this PR be merged?
What does this PR do?
Currently, some models with complex chat templates cannot be run through the C++ OpenVINO API, because parsing such templates in C++ is difficult. For a smooth experience, we allow redefining these templates with our own versions. To simplify usage of such models, this PR provides simplified, C++-compatible chat templates during tokenizer conversion for several known and popular cases (this has no impact on inference with the original Python tokenizer).
Additionally, it was found that for some multimodal models (VLMs), the main chat template is provided in the processor instead of the tokenizer; as a result, our tokenizer conversion API ignores it. This PR fixes this issue for such models.
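The simplification idea can be sketched as follows. This is a hypothetical illustration, not the exact templates or API added by this PR: the template string, the helper name `render_chatml`, and the ChatML tokens are all assumptions chosen for the example. The point is that a flattened template using only a single loop and one conditional is much easier to evaluate from C++ than templates relying on advanced Jinja features.

```python
# Hypothetical sketch: a flattened, C++-friendly chat template and a
# pure-Python reference rendering of what it produces. Not the actual
# templates shipped by this PR.

# Simplified ChatML-style template: only one loop and one conditional,
# which a C++ template engine can evaluate easily.
SIMPLIFIED_CHATML = (
    "{% for message in messages %}"
    "<|im_start|>{{ message['role'] }}\n{{ message['content'] }}<|im_end|>\n"
    "{% endfor %}"
    "{% if add_generation_prompt %}<|im_start|>assistant\n{% endif %}"
)

def render_chatml(messages, add_generation_prompt=True):
    """Render the conversation exactly as the simplified template would."""
    out = []
    for message in messages:
        out.append(
            f"<|im_start|>{message['role']}\n{message['content']}<|im_end|>\n"
        )
    if add_generation_prompt:
        out.append("<|im_start|>assistant\n")
    return "".join(out)

prompt = render_chatml([{"role": "user", "content": "Hello!"}])
print(prompt)
```

During conversion, such a simplified template would be written into the saved tokenizer artifacts in place of the original complex one, while the original Python tokenizer keeps its own template untouched.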
Before submitting