Skip to content

Run Generative AI models using native OpenVINO C++ API

License

Notifications You must be signed in to change notification settings

andrei-kochin/openvino.genai

This branch is 2 commits behind openvinotoolkit/openvino.genai:refs/heads/releases/2024/4.

Folders and files

NameName
Last commit message
Last commit date
Sep 6, 2024
Jun 14, 2024
Sep 4, 2024
Sep 25, 2024
Sep 13, 2024
Sep 16, 2024
Sep 13, 2024
Sep 25, 2024
Mar 22, 2024
Jul 10, 2024
Jun 24, 2024
Sep 24, 2024
Jun 12, 2024
Oct 23, 2023
Aug 2, 2024
Nov 28, 2023
May 9, 2024
Sep 24, 2024
Jul 8, 2024
Jun 7, 2024

Repository files navigation

OpenVINO™ GenAI

The OpenVINO™ GenAI repository consists of the GenAI library and additional GenAI samples.

OpenVINO™ GenAI Library

OpenVINO™ GenAI is a flavor of OpenVINO, aiming to simplify running inference of generative AI models. It hides the complexity of the generation process and minimizes the amount of code required.

For installation and usage instructions, refer to the GenAI Library README.

OpenVINO™ GenAI Samples

The OpenVINO™ GenAI repository contains pipelines that implement image and text generation tasks. The implementation uses OpenVINO capabilities to optimize the pipelines. Each sample covers a family of models and suggests certain modifications to adapt the code to specific needs. It includes the following pipelines:

  1. Benchmarking script for large language models
  2. Text generation samples that support most popular models like LLaMA 2:
  3. Stable Diffuison (with LoRA) C++ image generation pipeline
  4. Latent Consistency Model (with LoRA) C++ image generation pipeline

Requirements

Requirements may vary for different samples. See respective readme files for more details, and make sure to install the OpenVINO version listed there. Refer to documentation to see how to install OpenVINO.

The supported devices are CPU and GPU including Intel discrete GPU.

See also: https://docs.openvino.ai/2023.3/gen_ai_guide.html.

License

The OpenVINO™ GenAI repository is licensed under Apache License Version 2.0. By contributing to the project, you agree to the license and copyright terms therein and release your contribution under these terms.

About

Run Generative AI models using native OpenVINO C++ API

Resources

License

Security policy

Stars

Watchers

Forks

Packages

No packages published

Languages

  • C++ 64.0%
  • Python 29.4%
  • TypeScript 2.3%
  • MDX 1.4%
  • CMake 1.2%
  • C 1.1%
  • Other 0.6%