You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
fn_get_submodels (`Optional[Callable]`, defaults to `None`):
172
161
Experimental usage: Override the default submodels that are used at the export. This is
173
162
especially useful when exporting a custom architecture that needs to split the ONNX (e.g. encoder-decoder). If unspecified with custom models, optimum will try to use the default submodels used for the given task, with no guarantee of success.
174
-
compression_option (`Optional[str]`, defaults to `None`):
175
-
The weight compression option, e.g. `f16` stands for float16 weights, `i8` - INT8 weights, `int4_sym_g128` - INT4 symmetric weights w/ group size 128, `int4_asym_g128` - as previous but asymmetric w/ zero-point,
176
-
`int4_sym_g64` - INT4 symmetric weights w/ group size 64, "int4_asym_g64" - as previous but asymmetric w/ zero-point, `f32` - means no compression.
177
-
compression_ratio (`Optional[float]`, defaults to `None`):
178
-
Compression ratio between primary and backup precision (only relevant to INT4).
179
163
stateful (`bool`, defaults to `True`):
180
164
Produce stateful model where all kv-cache inputs and outputs are hidden in the model and are not exposed as model inputs and outputs. Applicable only for decoder models.
181
165
**kwargs_shapes (`Dict`):
@@ -198,28 +182,6 @@ def main_export(
198
182
raiseValueError("You cannot use both `use_auth_token` and `token` arguments at the same time.")
199
183
token=use_auth_token
200
184
201
-
ifcompression_optionisnotNone:
202
-
logger.warning(
203
-
"The `compression_option` argument is deprecated and will be removed in optimum-intel v1.17.0. "
204
-
"Please, pass an `ov_config` argument instead `OVConfig(..., quantization_config=quantization_config)`."
205
-
)
206
-
207
-
ifcompression_ratioisnotNone:
208
-
logger.warning(
209
-
"The `compression_ratio` argument is deprecated and will be removed in optimum-intel v1.17.0. "
210
-
"Please, pass an `ov_config` argument instead `OVConfig(quantization_config={ratio=compression_ratio})`."
211
-
)
212
-
213
-
ifov_configisNoneandcompression_optionisnotNone:
214
-
from ...intel.openvino.configurationimportOVConfig
0 commit comments