Added weightless cache attributes for intel GPU plugin. #31468

susbhere · 2025-07-25T08:38:44Z

Weightless cache attributes are added when the weights are not coming from bin file. That happens for non-IR inputs like ORT.

https://jira.devtools.intel.com/browse/CVS-167691

susbhere · 2025-07-26T00:51:50Z

Jenkins build passed.
https://openvino-ci.toolbox.iotg.sclab.intel.com/job/private-ci/job/github_trigger/job/openvino/view/change-requests/job/PR-31468/

praasz · 2025-07-28T05:53:18Z

build_jenkins

sshlyapn · 2025-07-29T05:33:05Z

src/plugins/intel_gpu/include/intel_gpu/primitives/data.hpp

+                }
+            } else if (auto ti = ov::as_type<const ov::op::v0::TensorIterator>(node.get())) {
+                auto ti_body = ti->get_body();
+                fill_offset_to_constant_map(ti_body);


Suggested change

fill_offset_to_constant_map(ti_body);

fill_offset_to_constant_map(ti_body, cache_attr);

There is no logic for handling TensorIterator in set_weightless_cache_attributes(), so either its body should be ignored here, or the logic should be added to the corresponding functions

There are unit test failure with suggested changes. 4 tests out of 193 weightless cache related tests are failing. One such test is below one.

[ FAILED ] smoke_CheckWeightlessCacheAccuracy/CheckWeightlessCacheAccuracy.TiWithLstmCell/import_api=compile_model_do_encryption=1_inference_mode=f32_model_dtype=f32, where GetParam() = (4-byte object <02-00 00-00>, true, f32, f32) (273 ms)

@susbhere, did you check the tests with proper TensorIterator handling in set_weightless_cache_attributes() / create_weightless_cache_attributes() functioncs?

@sshlyapn, I have run below command and saw the failures. 4 out of 193 tests failed. Those are not failing with current state of the PR.

ov_gpu_func_tests.exe --gtest_filter=Weightless

@susbhere, my point is that either this TensorIterator-related condition is unnecessary, or TensorIterator should be properly handled during runtime attribute configuration. Currently, only constants are processed there:

for (const auto& node : model->get_ordered_ops()) { if (ov::op::util::is_constant(node)) { // Offset behaves as a unique key for each constant. Size = 1 is used as dummy. cache_attr_map->emplace(node->get_instance_id(), ov::WeightlessCacheAttribute(1, offset++, node->get_element_type())); } }

src/plugins/intel_gpu/include/intel_gpu/runtime/internal_properties.hpp

src/plugins/intel_gpu/include/intel_gpu/runtime/options.inl

sshlyapn · 2025-07-29T06:05:20Z

src/plugins/intel_gpu/src/graph/program.cpp

@@ -1860,14 +1860,20 @@ void program::save(cldnn::BinaryOutputBuffer& ob) const {
    }
 }

-void program::load(cldnn::BinaryInputBuffer& ib, std::shared_ptr<const ov::Model> model_ptr) {
+void program::load(cldnn::BinaryInputBuffer& ib,


[random spot]
Could you please add some functional tests for this caching mechanism?

Will do post merging this PR.

Renaming variables and minor updates as per review comments.

github-actions bot added the category: GPU OpenVINO GPU plugin label Jul 25, 2025

sys-openvino-ci added the ExternalIntelPR External contributor from Intel label Jul 25, 2025

susbhere mentioned this pull request Jul 25, 2025

Weightless caching for onnx models #31192

Closed

susbhere marked this pull request as ready for review July 25, 2025 08:57

susbhere requested review from a team as code owners July 25, 2025 08:57

susbhere force-pushed the gpu_weighless_caching branch 3 times, most recently from f108dbe to c9f955c Compare July 25, 2025 11:02

susbhere force-pushed the gpu_weighless_caching branch from c9f955c to 84e65dd Compare July 28, 2025 03:24

praasz assigned sshlyapn Jul 29, 2025

praasz approved these changes Jul 29, 2025

View reviewed changes

sshlyapn reviewed Jul 29, 2025

View reviewed changes

Added weightless cache attributes for intel GPU plugin.

e4a48d2

susbhere force-pushed the gpu_weighless_caching branch from 84e65dd to e4a48d2 Compare July 29, 2025 10:36

Added weightless cache attributes for intel GPU plugin

e735172

Renaming variables and minor updates as per review comments.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Added weightless cache attributes for intel GPU plugin. #31468

Added weightless cache attributes for intel GPU plugin. #31468

susbhere commented Jul 25, 2025 •

edited

Loading

Uh oh!

susbhere commented Jul 26, 2025

Uh oh!

praasz commented Jul 28, 2025

Uh oh!

sshlyapn Jul 29, 2025

Uh oh!

susbhere Jul 29, 2025

Uh oh!

sshlyapn Jul 29, 2025

Uh oh!

susbhere Jul 29, 2025 •

edited

Loading

Uh oh!

sshlyapn Jul 29, 2025

Uh oh!

Uh oh!

Uh oh!

sshlyapn Jul 29, 2025

Uh oh!

susbhere Jul 29, 2025

Uh oh!

Uh oh!

	fill_offset_to_constant_map(ti_body);
	fill_offset_to_constant_map(ti_body, cache_attr);

Added weightless cache attributes for intel GPU plugin. #31468

Are you sure you want to change the base?

Added weightless cache attributes for intel GPU plugin. #31468

Conversation

susbhere commented Jul 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

susbhere commented Jul 26, 2025

Uh oh!

praasz commented Jul 28, 2025

Uh oh!

sshlyapn Jul 29, 2025

Choose a reason for hiding this comment

Uh oh!

susbhere Jul 29, 2025

Choose a reason for hiding this comment

Uh oh!

sshlyapn Jul 29, 2025

Choose a reason for hiding this comment

Uh oh!

susbhere Jul 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sshlyapn Jul 29, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

sshlyapn Jul 29, 2025

Choose a reason for hiding this comment

Uh oh!

susbhere Jul 29, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

susbhere commented Jul 25, 2025 •

edited

Loading

susbhere Jul 29, 2025 •

edited

Loading