README.md (+5)
@@ -239,3 +239,8 @@ Do not forget to install requirements for every example:
 cd <example-folder>
 pip install -r requirements.txt
 ```
+
+
+## Gaudi
+
+To train your model on [Intel Gaudi AI Accelerators (HPU)](https://docs.habana.ai/en/latest/index.html), check out [Optimum Habana](https://github.com/huggingface/optimum-habana) which provides a set of tools enabling easy model loading, training and inference on single- and multi-HPU settings for different downstream tasks. After training your model, feel free to submit it to the Intel [leaderboard](https://huggingface.co/spaces/Intel/powered_by_intel_llm_leaderboard) which is designed to evaluate, score, and rank open-source LLMs that have been pre-trained or fine-tuned on Intel Hardwares. Models submitted to the leaderboard will be evaluated on the Intel Developer Cloud. The evaluation platform consists of Gaudi Accelerators and Xeon CPUs running benchmarks from the Eleuther AI Language Model Evaluation Harness.
+Models larger than 1 billion parameters are exported to the OpenVINO format with 8-bit weights by default. You can disable it with `--weight-format fp32`.
+
+</Tip>
+
 Once the model is exported, you can load the OpenVINO model using :
 
 ```python
@@ -126,7 +136,7 @@ model = OVModelForCausalLM.from_pretrained(model_id, load_in_8bit=True)
 
 <Tip warning={true}>
 
-`load_in_8bit` is enabled by default for the models larger than 1 billion parameters. You can disable it with `load_in_8bit=False`.
+If not specified, `load_in_8bit` will be set to `True` by default when models larger than 1 billion parameters are exported to the OpenVINO format (with `export=True`). You can disable it with `load_in_8bit=False`.
-`load_in_8bit` is enabled by default for the models larger than 1 billion parameters. You can disable it with `load_in_8bit=False`.
+If not specified, `load_in_8bit` will be set to `True` by default when models larger than 1 billion parameters are exported to the OpenVINO format (with `export=True`). You can disable it with `load_in_8bit=False`.
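
The reworded tip describes a three-state default: an explicit `load_in_8bit` value is used as given, and only an unspecified value falls back to the export-and-size rule. A minimal sketch of that reading follows. It assumes "larger than 1 billion" is a strict comparison, and `resolve_load_in_8bit` and `EIGHT_BIT_PARAM_THRESHOLD` are hypothetical names used for illustration only, not part of the library's API or its actual implementation.

```python
from typing import Optional

# Hypothetical constant: the "1 billion parameters" figure from the tip text.
EIGHT_BIT_PARAM_THRESHOLD = 1_000_000_000


def resolve_load_in_8bit(
    load_in_8bit: Optional[bool], export: bool, num_params: int
) -> bool:
    """Illustrative reading of the documented default, not the library's code."""
    # An explicit True/False from the caller is used as given.
    if load_in_8bit is not None:
        return load_in_8bit
    # Unspecified: default to 8-bit only when exporting a model above the threshold.
    return bool(export and num_params > EIGHT_BIT_PARAM_THRESHOLD)
```

Under this reading, loading an already exported model without specifying `load_in_8bit` would not trigger 8-bit compression, which is the case the old wording left ambiguous.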