[PTQ][OV] Sequential models support for Engine without TensorStatistics modifications #17

daniil-lyakhov · 2023-05-16T14:04:34Z

Changes
Sequential dataset with correspondent changes in engine in OV is presented: now user can specify two functions:
** get_tokens_from_sequence_func
** fill_sequential_inputs_fn
to infer sequential a model

TensorReducersSequence is presented

Reason for changes
To allow quantization of sequential models
TensorReducersSequence is needed to reduce statistics of sequential model: first reducer applies for each element in sample, second reducer applies on reduced element of each element
Related tickets
110654

Tests
Not yet

daniil-lyakhov added 4 commits May 15, 2023 15:49

Test with a model

f2bf59b

WIP

4327165

Sequential tensor reducer

7eba198

TensorReducerSequence

12ee720

github-actions bot added experimental NNCF ONNX NNCF OpenVINO labels May 16, 2023

daniil-lyakhov changed the title ~~Dl/ov/engine seq proposal~~ [PTQ][OV] Sequential models support for Engine without TensorStatistics modifications May 19, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[PTQ][OV] Sequential models support for Engine without TensorStatistics modifications #17

[PTQ][OV] Sequential models support for Engine without TensorStatistics modifications #17

daniil-lyakhov commented May 16, 2023 •

edited

Loading

[PTQ][OV] Sequential models support for Engine without TensorStatistics modifications #17

Are you sure you want to change the base?

[PTQ][OV] Sequential models support for Engine without TensorStatistics modifications #17

Conversation

daniil-lyakhov commented May 16, 2023 • edited Loading

daniil-lyakhov commented May 16, 2023 •

edited

Loading