OpenVINO™ GenAI: Supported Models

Large language models

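| Architecture | Models | Example HuggingFace Models |
| --- | --- | --- |
| ChatGLMModel | ChatGLM | |
| GemmaForCausalLM | Gemma | |
| GPTNeoXForCausalLM | Dolly | |
| GPTNeoXForCausalLM | RedPajama | |
| LlamaForCausalLM | Llama 3 | |
| LlamaForCausalLM | Llama 2 | |
| LlamaForCausalLM | OpenLLaMA | |
| LlamaForCausalLM | TinyLlama | |
| MistralForCausalLM | Mistral | |
| MistralForCausalLM | Notus | |
| MistralForCausalLM | Zephyr | |
| PhiForCausalLM | Phi | |
| QWenLMHeadModel | Qwen | |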

The pipeline can also work with other similar topologies produced by optimum-intel, provided they have the same model signature. After conversion, the model must have the following inputs:

  1. input_ids contains the tokens.
  2. attention_mask is filled with 1.
  3. beam_idx selects beams.
  4. position_ids (optional) encodes the position of the token currently being generated in the sequence.

The model must also have a single logits output.
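
The signature requirement above can be checked mechanically before a converted model is handed to the pipeline. The following is a minimal sketch, not part of OpenVINO GenAI: `check_signature` and `names_from_openvino` are hypothetical helper names, and the second helper assumes the `openvino` Python package is installed and that the model was saved as an IR `.xml` file.

```python
# Input and output names taken from the list above.
REQUIRED_INPUTS = {"input_ids", "attention_mask", "beam_idx"}
REQUIRED_OUTPUTS = ["logits"]


def check_signature(input_names, output_names):
    """Return a list of problems; an empty list means the signature matches.

    position_ids is optional, so its absence is never reported.
    """
    problems = []
    missing = REQUIRED_INPUTS - set(input_names)
    if missing:
        problems.append(f"missing inputs: {sorted(missing)}")
    if sorted(output_names) != REQUIRED_OUTPUTS:
        problems.append(
            f"expected a single 'logits' output, got: {sorted(output_names)}"
        )
    return problems


def names_from_openvino(xml_path):
    """Hypothetical helper: read input/output names from an OpenVINO IR file."""
    import openvino as ov  # assumption: the openvino package is available

    model = ov.Core().read_model(xml_path)
    inputs = [port.get_any_name() for port in model.inputs]
    outputs = [port.get_any_name() for port in model.outputs]
    return inputs, outputs


if __name__ == "__main__":
    # Names written out by hand here; in practice they could come from
    # names_from_openvino("path/to/openvino_model.xml").
    print(check_signature(
        ["input_ids", "attention_mask", "position_ids", "beam_idx"],
        ["logits"],
    ))
```

Keeping the check separate from the model loading means it can be run on any list of names, for example ones printed by another tool.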

Note

Models should belong to the same family and have the same tokenizers.

Image generation models

| Architecture | Text 2 image | Image 2 image | Example HuggingFace Models |
| --- | --- | --- | --- |
| Latent Consistency Model | Supported | Supported | |
| Stable Diffusion | Supported | Supported | |
| Stable Diffusion XL | Supported | Supported | |
| Stable Diffusion 3 | Supported | Not supported | |
| Flux | Supported | Not supported | |

Inpainting models

In addition to image generation models, InpaintingPipeline supports specialized inpainting models:

| Architecture | Example HuggingFace Models |
| --- | --- |
| Stable Diffusion | |
| Stable Diffusion XL | |

Visual language models

| Architecture | Models | Example HuggingFace Models |
| --- | --- | --- |
| InternVL2 | InternVL2 | |
| LLaVA | LLaVA-v1.5 | |
| LLaVA-NeXT | LLaVA-v1.6 | |
| MiniCPMV | MiniCPM-V-2_6 | |

Whisper models

| Architecture | Models | Example HuggingFace Models |
| --- | --- | --- |
| WhisperForConditionalGeneration | Whisper | |
| WhisperForConditionalGeneration | Distil-Whisper | |

Some models require submitting an access request on their Hugging Face page before they can be downloaded.

If https://huggingface.co/ is unavailable, the conversion step cannot download the models.