[OpenVINO backend] OpenVINOQuantizer by daniil-lyakhov · Pull Request #15 · ynimmaga/executorch

daniil-lyakhov · 2025-02-12T14:32:26Z

Summary

OpenVINOQuantizer is introduced
aot_openvino_compiler.py is updated with the quantization pipeline (timm and torchvision backends only)
openvino_executor_runner.cpp is updated to take several inputs and produce several outputs in a row. This allows validating models converted to the edge (inspired by the Qualcomm example). Code style is refactored by clang-format.
aot_openvino_compiler.py is updated with a validation pipeline (timm and torchvision backends only)
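
As a rough illustration of what such a validation step computes, here is a minimal sketch of top-1 accuracy over several output batches. It is not the code from aot_openvino_compiler.py: the function name `top1_accuracy`, the list-of-lists data layout, and the sample values are all assumptions made for this example.

```python
from typing import Sequence


def top1_accuracy(
    batched_logits: Sequence[Sequence[Sequence[float]]],
    batched_labels: Sequence[Sequence[int]],
) -> float:
    """Return the fraction of samples whose highest-scoring class matches the label."""
    correct = 0
    total = 0
    for logits_batch, labels_batch in zip(batched_logits, batched_labels):
        for logits, label in zip(logits_batch, labels_batch):
            # Index of the largest score is the predicted class.
            predicted = max(range(len(logits)), key=logits.__getitem__)
            correct += int(predicted == label)
            total += 1
    if total == 0:
        raise ValueError("no samples to evaluate")
    return correct / total


# Two hypothetical output batches, as if the runner had processed two inputs in a row.
outputs = [
    [[0.1, 0.7, 0.2], [0.9, 0.05, 0.05]],
    [[0.2, 0.3, 0.5]],
]
labels = [[1, 0], [1]]
accuracy = top1_accuracy(outputs, labels)
print(f"top-1 accuracy: {accuracy:.3f}")
```

Accumulating counts across batches, rather than averaging per-batch accuracies, keeps the result correct when the last batch is smaller than the others.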

Test plan

Model	FP32 acc	INT8 acc	FP32 avg latency	INT8 avg latency	Batch size
Resnet50d	0.80534	0.8038	538	178.167	125
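
For readers who want the deltas rather than the raw numbers, the short calculation below derives the accuracy drop and the latency ratio from the row above. The latency unit is not stated in the table, so only the ratio is computed; the variable names are chosen for this example.

```python
# Values copied from the Resnet50d row of the table above.
fp32_acc, int8_acc = 0.80534, 0.8038
fp32_latency, int8_latency = 538.0, 178.167  # unit not stated in the source

accuracy_drop = fp32_acc - int8_acc                     # absolute drop
relative_drop_pct = 100.0 * accuracy_drop / fp32_acc    # drop relative to FP32
speedup = fp32_latency / int8_latency                   # unit-free latency ratio

print(f"absolute accuracy drop: {accuracy_drop:.5f}")
print(f"relative accuracy drop: {relative_drop_pct:.2f}%")
print(f"latency ratio FP32/INT8: {speedup:.2f}x")
```

In other words, INT8 loses about 0.0015 of absolute accuracy while the average latency is roughly three times lower.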

Co-authored-by: Alexander Suslov <alexander.suslov@intel.com>

ynimmaga · 2025-02-12T22:31:38Z

backends/openvino/quantizer/quantizer.py

+            preset=preset, model_type=model_type, **kwargs
+        )
+
+    def set_ignored_scope(


Is this function exposed to the user so that they can tune the quantization process? If so, it would be good to mention it in the documentation.

Agree! But I'm not sure which documentation to update: we don't have a document that describes the OpenVINOQuantizer yet. Should we perhaps create one?
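
To make the discussion concrete: the idea behind an ignored scope is that the user names parts of the model that should be left unquantized. The sketch below is a toy stand-in written for illustration only, not the OpenVINOQuantizer implementation. The class `ToyQuantizer`, its `names` and `patterns` arguments, and the node names are all hypothetical.

```python
import re
from typing import Iterable, List, Optional


class ToyQuantizer:
    """Hypothetical stand-in that only tracks which nodes to skip."""

    def __init__(self) -> None:
        self._ignored_names = set()
        self._ignored_patterns = []

    def set_ignored_scope(
        self,
        names: Optional[Iterable[str]] = None,
        patterns: Optional[Iterable[str]] = None,
    ) -> None:
        """Record exact node names and regex patterns to exclude from quantization."""
        self._ignored_names = set(names or [])
        self._ignored_patterns = [re.compile(p) for p in (patterns or [])]

    def _is_ignored(self, node_name: str) -> bool:
        if node_name in self._ignored_names:
            return True
        return any(p.fullmatch(node_name) for p in self._ignored_patterns)

    def nodes_to_quantize(self, node_names: Iterable[str]) -> List[str]:
        """Return the nodes that are still eligible for quantization."""
        return [n for n in node_names if not self._is_ignored(n)]


quantizer = ToyQuantizer()
quantizer.set_ignored_scope(names=["fc"], patterns=[r"layer4\..*"])

graph_nodes = ["conv1", "layer1.0.conv1", "layer4.0.conv1", "layer4.1.conv2", "fc"]
kept = quantizer.nodes_to_quantize(graph_nodes)
print(kept)
```

Here the final classifier and everything under `layer4` stay in floating point, which is the kind of tuning a user-facing document would need to describe.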

examples/openvino/aot/aot_openvino_compiler.py

examples/openvino/executor_runner/openvino_executor_runner.cpp

Co-authored-by: Yamini Nimmagadda <yamini.nimmagadda@intel.com>

BNNS copy crashes the process when the dtypes differ (pytorch#11714).

With the example in this PR (pytorch#11714), we crash the process on main. Here is the stack trace from LLDB:

```
Process 19234 stopped
* thread #1, queue = 'com.apple.main-thread', stop reason = signal SIGABRT
    frame #0: 0x0000000190ac9388 libsystem_kernel.dylib`__pthread_kill + 8
libsystem_kernel.dylib`__pthread_kill:
->  0x190ac9388 <+8>:  b.lo   0x190ac93a8    ; <+40>
    0x190ac938c <+12>: pacibsp
    0x190ac9390 <+16>: stp    x29, x30, [sp, #-0x10]!
    0x190ac9394 <+20>: mov    x29, sp
(lldb) bt
* thread #1, queue = 'com.apple.main-thread', stop reason = signal SIGABRT
  * frame #0: 0x0000000190ac9388 libsystem_kernel.dylib`__pthread_kill + 8
    frame #1: 0x0000000190b0288c libsystem_pthread.dylib`pthread_kill + 296
    frame #2: 0x0000000190a0bc60 libsystem_c.dylib`abort + 124
    frame #3: 0x0000000190910174 libsystem_malloc.dylib`malloc_vreport + 892
    frame #4: 0x0000000190913c90 libsystem_malloc.dylib`malloc_report + 64
    frame #5: 0x000000019091821c libsystem_malloc.dylib`___BUG_IN_CLIENT_OF_LIBMALLOC_POINTER_BEING_FREED_WAS_NOT_ALLOCATED + 32
    frame #6: 0x000000019d2f4084 libBNNS.dylib`___lldb_unnamed_symbol1620 + 564
    frame #7: 0x000000019d2f5bac libBNNS.dylib`___lldb_unnamed_symbol1628 + 680
    frame #8: 0x000000019d69ce48 libBNNS.dylib`BNNSCopy + 616
    frame #9: 0x000000030c74d950 _portable_lib.cpython-310-darwin.so`(anonymous namespace)::copy_using_bnns(executorchcoreml::MultiArray const&, executorchcoreml::MultiArray&) + 188
    frame #10: 0x000000030c74cfdc _portable_lib.cpython-310-darwin.so`(anonymous namespace)::copy(executorchcoreml::MultiArray const&, executorchcoreml::MultiArray&, executorchcoreml::MultiArray::CopyOptions) + 72
    frame #11: 0x000000030c74ceec _portable_lib.cpython-310-darwin.so`executorchcoreml::MultiArray::copy(executorchcoreml::MultiArray&, executorchcoreml::MultiArray::CopyOptions) const + 148
    frame #12: 0x000000030c7488d4 _portable_lib.cpython-310-darwin.so`invocation function for block in (anonymous namespace)::copy(MLMultiArray*, executorchcoreml::MultiArray&) + 376
    frame #13: 0x000000030c748ac8 _portable_lib.cpython-310-darwin.so`invocation function for block in (anonymous namespace)::copy(MLMultiArray*, executorchcoreml::MultiArray&) + 52
    frame #14: 0x000000019ad33f4c CoreML`CoreML::MultiArrayBuffer::getBytesWithHandler(void (void const*, unsigned long) block_pointer) const + 340
    frame #15: 0x000000019ad34138 CoreML`-[MLMultiArray(ScopedBufferAccess) getBytesWithHandler:] + 152
    frame #16: 0x000000030c7485ec _portable_lib.cpython-310-darwin.so`(anonymous namespace)::copy(MLMultiArray*, executorchcoreml::MultiArray&) + 296
    frame #17: 0x000000030c744f68 _portable_lib.cpython-310-darwin.so`(anonymous namespace)::set_outputs(std::__1::vector<executorchcoreml::MultiArray, std::__1::allocator<executorchcoreml::MultiArray>>&, NSArray<MLMultiArray*>*) + 180
```

With this PR, the process succeeds.

alexsu52 and others added 14 commits February 12, 2025 15:08

added init integration of quantization

5d2784d

deit3_small_patch16_224_in21ft1k

61488d5

Resnet-like model checked

42155a1

WIP

7c66314

Formating

c1fa9e2

openvino_executor_runner.cpp can run on several inputs

e2415af

Validate option / minor

8cbb117

Input shape from the input dataset

4b60fb4

--batch_size

e0cd644

Adapt subset size to keep +- 300 pics for calibration

2a04ee6

Apply suggestions from code review

db7dc13

Co-authored-by: Alexander Suslov <alexander.suslov@intel.com>

Comments

de3f50b

OpenVINOQuantizer: constructor arguments have been refined

17fe62f

set_ignored_scope | readme updates

c7e0758

ynimmaga reviewed Feb 12, 2025

examples/openvino/aot/aot_openvino_compiler.py (outdated)

ynimmaga reviewed Feb 12, 2025

examples/openvino/aot/aot_openvino_compiler.py (outdated)

cavusmustafa reviewed Feb 13, 2025

examples/openvino/aot/aot_openvino_compiler.py (outdated)

examples/openvino/executor_runner/openvino_executor_runner.cpp (outdated)

daniil-lyakhov and others added 4 commits February 14, 2025 14:10

openvino_executor_runner.cpp: comments

19cbc69

Apply suggestions from code review

0892b9d

Co-authored-by: Yamini Nimmagadda <yamini.nimmagadda@intel.com>

aot_openvino_compiler.py: comments

d1aa425

README

b9b604d

daniil-lyakhov requested review from cavusmustafa and ynimmaga February 14, 2025 17:06

daniil-lyakhov mentioned this pull request Feb 14, 2025

Move files from nncf/common/quantization to nncf/quantization openvinotoolkit/nncf#3280

Closed

cavusmustafa merged commit 9d07bbb into ynimmaga:openvino_backend Feb 14, 2025

cavusmustafa merged 18 commits into ynimmaga:openvino_backend from daniil-lyakhov:dl/openvino/model_enabling


4 participants
