@@ -7,23 +7,24 @@ For more information about how to use Huggingface text-generation-inference with
7
7
> [ !TIP]
8
8
> For Gaudi-related documentation, check out [ tgi-gaudi] ( https://github.com/huggingface/tgi-gaudi ) .
9
9
10
- ![ Version: 0.1 .0] ( https://img.shields.io/badge/Version-0.1 .0-informational?style=flat-square ) ![ Type: application] ( https://img.shields.io/badge/Type-application-informational?style=flat-square ) ![ AppVersion: 1.16.0] ( https://img.shields.io/badge/AppVersion-1.16.0-informational?style=flat-square )
10
+ ![ Version: 0.2 .0] ( https://img.shields.io/badge/Version-0.2 .0-informational?style=flat-square ) ![ Type: application] ( https://img.shields.io/badge/Type-application-informational?style=flat-square ) ![ AppVersion: 1.16.0] ( https://img.shields.io/badge/AppVersion-1.16.0-informational?style=flat-square )
11
11
12
12
## Values
13
13
14
14
| Key | Type | Default | Description |
15
15
| -----| ------| ---------| -------------|
16
- | deploy.configMapName | string | ` "intel-proxy- config"` | ConfigMap of Environment Variables |
16
+ | deploy.configMap | object | ` {"enabled":true,"name":"tgi- config"} ` | ConfigMap of Environment Variables |
17
17
| deploy.image | string | ` "ghcr.io/huggingface/text-generation-inference:latest-intel" ` | Intel TGI Image |
18
- | deploy.model | string | ` "HuggingFaceTB/SmolLM-135M" ` | Model to be loaded |
19
- | deploy.quantize | string | ` "" ` | Enable Quantization (ex: bitsandbytes-nf4) |
20
18
| deploy.replicaCount | int | ` 1 ` | Number of pods |
21
19
| deploy.resources | object | ` {"limits":{"cpu":"4000m","gpu.intel.com/i915":1},"requests":{"cpu":"1000m","memory":"1Gi"}} ` | Resource configuration |
22
20
| deploy.resources.limits."gpu.intel.com/i915" | int | ` 1 ` | Intel GPU Device Configuration |
23
21
| fullnameOverride | string | ` "" ` | Full qualified Domain Name |
24
22
| ingress | object | ` {"annotations":{},"className":"","enabled":false,"hosts":[{"host":"chart-example.local","paths":[{"path":"/","pathType":"ImplementationSpecific"}]}],"tls":[]} ` | Ingress configuration |
25
23
| nameOverride | string | ` "" ` | Name of the serving service |
24
+ | pvc.size | string | ` "15Gi" ` | |
25
+ | pvc.storageClassName | string | ` "nil" ` | |
26
26
| secret.encodedToken | string | ` "" ` | Base64 Encoded Huggingface Hub API Token |
27
+ | securityContext | object | ` {} ` | Security Context Configuration |
27
28
| service | object | ` {"port":80,"type":"NodePort"} ` | Service configuration |
28
29
29
30
----------------------------------------------
0 commit comments