Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[NPU] Add compile interface with custom allocator #26748

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

ggladilo
Copy link
Contributor

Details:

New API method allows to compile an OV model with blob storage allocated via given interface. It allows to save extra memory consumption on copying a blob from compiler to the caller (UMD at the moment). With new method blob could be allocated directly at desired location (e.g. cache) and used afterwards without any copy of the data.

Note that compiler does not own blob storage and it is responsibility of the caller to deallocate memory after compilation.

Tickets:

@ggladilo ggladilo added the category: NPU OpenVINO NPU plugin label Sep 23, 2024
@ggladilo ggladilo self-assigned this Sep 23, 2024
@ggladilo ggladilo requested review from a team as code owners September 23, 2024 13:52
New API method allows to compile an OV model with blob storage allocated
via given interface. It allows to save extra memory consumption on
copying a blob from compiler to the caller (UMD at the moment). With new
method blob could be allocated directly at desired location (e.g. cache)
and used afterwards without any copy of the data.

Note that compiler does not own blob storage and it is responsibility
of the caller to deallocate memory after compilation.

Signed-off-by: Gladilov, Gleb <gleb.gladilov@intel.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
category: NPU OpenVINO NPU plugin
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant