[POC] PyTorch Inlined Extension #29546

Draft · slyalin wants to merge 7 commits into base: master
Conversation

@slyalin (Contributor) commented Mar 18, 2025

This PR adds a seamless way to embed a Python portion of the original PyTorch model into an OpenVINO model as a custom operation, without explicitly creating a custom operation class.

This feature combines two core technologies: Python OpenVINO custom ops and PyTorch's opaque autograd.Function (the same mechanism used underneath ModuleExtension).
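
For reference, here is a minimal sketch (not code from this PR) of the autograd.Function side of that mechanism: when a model is traced, applying such a Function is recorded as a single opaque node rather than being traced through, which is what lets each decorated call map to one custom op node.

import torch

class OpaqueAddMul(torch.autograd.Function):
    # The tracer does not look inside forward(); the whole application is
    # recorded as one opaque node in the traced graph.
    @staticmethod
    def forward(ctx, tensor1, tensor2):
        return tensor1 + tensor2, tensor1 * tensor2

    @staticmethod
    def backward(ctx, grad1, grad2):
        raise NotImplementedError  # not needed for inference-only export

t1, t2 = OpaqueAddMul.apply(torch.ones(3), torch.full((3,), 2.0))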

The main target users are model enablers exporting complex models from the PyTorch ecosystem to the OpenVINO ecosystem. If parts of the original PyTorch model have tracing/conversion issues, they can hide those parts inside opaque custom operations defined on the fly.

The resulting OpenVINO model can be run as-is with the vanilla OpenVINO API in the same Python process. For performance, and to be ready to deploy without Python/PyTorch dependencies, an optimized implementation of such operations can later be provided as C++ OpenVINO custom operations.
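
A hypothetical deployment flow for that scenario (the library and file names below are illustrative, not from this PR): the C++ custom op implementations are built into a shared library and registered via Core.add_extension, after which the saved model loads and runs without Python/PyTorch.

import openvino as ov

# Assumes the converted model was saved earlier with
# ov.save_model(ov_model, 'model.xml') and that the inlined ops were
# reimplemented as C++ custom operations in libmy_custom_ops.so (hypothetical).
core = ov.Core()
core.add_extension('libmy_custom_ops.so')
model = core.read_model('model.xml')
compiled = core.compile_model(model, 'CPU')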

Example:

import openvino as ov
import torch

# The ov.inlined_extension decorator instructs the converter to auto-generate a custom operation for each `my_func` call.
# Alternatively, use @ov.inlined_extension(MyOp) to substitute the `MyOp` Python custom operation class instead of auto-generating one.
# In both cases, each call of `my_func` appears as a single custom op node in the OpenVINO model graph.

@ov.inlined_extension
def my_func(tensor1, tensor2):
    # Arbitrary Python code; it is connected to the caller context through the torch.Tensor objects in its inputs/outputs.
    # Inputs and outputs may mix traceable (torch.Tensor) and non-traceable types.
    # All `list`s, `tuple`s and `dict`s in the input/output arguments are unpacked recursively to extract `torch.Tensor` objects.
    return tensor1 + tensor2, tensor1 * tensor2

class MyModel(torch.nn.Module):
    def __init__(self):
        super().__init__()
    def forward(self, tensor):
        t1, t2 = my_func(tensor + 1, tensor + 2)  # <-- here the wrapped function is called as part of the PyTorch model
        return t1 / t2

my_model = MyModel()
sample_data = torch.tensor([1.0, 2.0, 3.0])
print(f'Original model inference: {my_model(sample_data)}')

ov_model = ov.convert_model(my_model, example_input=sample_data)  # convert model as usual
core = ov.Core()
compiled = core.compile_model(ov_model, 'CPU')
print(f'Compiled model inference: {compiled(sample_data)}')

Resulting OpenVINO model (the custom op is InlinedCustomOp; its id attribute is unique for each op instance):

t0 = opset.Parameter({'shape': [-1], 'element_type': 'f32'})  #  -> f32[?]
t1 = opset.Constant(model, 1)                                 #  -> f32[1]([1.0])
t2 = opset.Add([t0, t1], {'auto_broadcast': 'numpy'})         # f32[?], f32[1] -> f32[?]
t3 = opset.Constant(model, 3)                                 #  -> f32[1]([2.0])
t4 = opset.Add([t0, t3], {'auto_broadcast': 'numpy'})         # f32[?], f32[1] -> f32[?]
t5, t6 = opset.InlinedCustomOp([t2, t4], {'id': 6})           # f32[?], f32[?] -> f32[?], f32[?]
t7 = opset.Divide([t5, t6], {'auto_broadcast': 'numpy', 'm_pythondiv': True})  # f32[?], f32[?] -> f32[?]
t8 = opset.Result([t7], {})                                   # f32[?] -> f32[?]
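
To illustrate the recursive unpacking described above, here is a small sketch (based on the description, not on a test from this PR) of a decorated function that mixes traceable and non-traceable inputs:

@ov.inlined_extension
def my_mixed_func(tensors, scale):
    # The torch.Tensor values inside the `tensors` dict are extracted
    # recursively and become inputs of the generated custom op; `scale`
    # is a plain float, i.e. a non-traceable input.
    return tensors['a'] * scale + tensors['b']

class MyMixedModel(torch.nn.Module):
    def forward(self, tensor):
        return my_mixed_func({'a': tensor, 'b': tensor + 1}, 0.5)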

Discovered limitations

Thanks to @eaidova, it was found that when a torch.Tensor object is used inside an inlined_extension-decorated function but is not present among the input arguments as a traced object (i.e., it is hidden inside an opaque data structure in one of the arguments, or is not passed as an argument at all), an exception like the following is thrown from the C++ torch implementation:

  File "/home/developer/.local/lib/python3.10/site-packages/torch/autograd/function.py", line 575, in apply
    return super().apply(*args, **kwargs)  # type: ignore[misc]
RuntimeError: _Map_base::at
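
A minimal sketch of a pattern that triggers this limitation (reconstructed from the description above, not a reproducer from the PR), where the tensor reaches the function through a closure rather than a traced argument:

hidden = torch.ones(3)

@ov.inlined_extension
def bad_func(tensor):
    # `hidden` is used here but never arrives as a traced torch.Tensor
    # argument, so conversion fails with the RuntimeError shown above.
    return tensor + hidden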

@github-actions bot added labels category: Python API, category: tools, category: CPP API, category: PyTorch FE, category: OVC (Mar 18, 2025)
@slyalin requested a review from rkazants (Mar 18, 2025 15:56)