Commit 5037688

Update README.md
1 parent 6772655 commit 5037688

1 file changed: +1 -2 lines changed

README.md

@@ -204,8 +204,7 @@ You can find more examples in the [documentation](https://huggingface.co/docs/op
 
 
 ## IPEX
-With `export=True`, IPEX model will replace torch linear to ipex linear which prepacks the weights. It will also apply linear fusion and [IAKV](https://intel.github.io/intel-extension-for-pytorch/cpu/latest/tutorials/llm.html#indirect-access-kv-cache) for generation. Finally, jit.trace will be applied to change the model to graph mode.
-Here is the example of how to use IPEX optimized model to generate texts.
+To load your IPEX model, you can just replace your `AutoModelForXxx` class with the corresponding `IPEXModelForXxx` class. You can set `export=True` to load a PyTorch checkpoint, export your model via TorchScript and apply IPEX optimizations : both operators optimization (replaced with customized IPEX operators) and graph-level optimization (like operators fusion) will be applied on your model.
 ```diff
 from transformers import AutoTokenizer, pipeline
 - from transformers import AutoModelForCausalLM
