[tmva][sofie] New Keras Parser #19692

PrasannaKasar · 2025-08-20T08:18:39Z

This Pull request:

This pull request adds the support for parsing Keras models with SOFIE

Changes or fixes:

Currently, SOFIE's existing Keras parser is written in C++ and is quite old. Although it's written in C++ but its actual parsing logic is written in Python. Additionally, it lacks support for parsing layers such as MaxPool.
This new parser is natively written in Python and uses of pythnoization to access C++ methods from SOFIE. The parser support latest version of Keras (Keras 3) and also has backwards compatibility with earlier versions of Keras 2. It also supports for both types of models i.e. models built using Keras' Functional as well as Sequential API.

Layers supported by the parser:

Checklist:

tested changes locally

guitargeek · 2025-08-20T16:31:47Z

Thank you very much for the PR! It's indeed better to implement this in Python.

Currently, SOFIE's existing Keras parser is written in C++ and is quite old.

While waiting for the review by @lmoneta and @sanjibansg, I have a general question.

What is the plan with the existing Keras parser in C++? If it is superseded by your new parser, can we remove it maybe in this PR as well? Like this we ensure that the users are not confused by two implementations, and also that the total maintenance cost doesn't increase.

sanjibansg · 2025-08-21T07:55:43Z

Thank you very much for the PR! It's indeed better to implement this in Python.

Currently, SOFIE's existing Keras parser is written in C++ and is quite old.

While waiting for the review by @lmoneta and @sanjibansg, I have a general question.

What is the plan with the existing Keras parser in C++? If it is superseded by your new parser, can we remove it maybe in this PR as well? Like this we ensure that the users are not confused by two implementations, and also that the total maintenance cost doesn't increase.

I agree with @guitargeek, we will remove the existing C++ parser for Keras within this PR.

sanjibansg

Did an initial review

sanjibansg · 2025-08-21T07:57:59Z

bindings/pyroot/pythonizations/CMakeLists.txt

@@ -58,7 +58,31 @@ if(tmva)
        ROOT/_pythonization/_tmva/_rtensor.py
        ROOT/_pythonization/_tmva/_tree_inference.py
        ROOT/_pythonization/_tmva/_utils.py
-        ROOT/_pythonization/_tmva/_gnn.py)
+        ROOT/_pythonization/_tmva/_gnn.py
+        ROOT/_pythonization/_tmva/_sofie/_parser/_keras/__init__.py


maybe better to have an additional CMake file for the keras parser files within their directory, so as to not add all of them here.

sanjibansg · 2025-08-21T07:58:21Z

...izations/python/ROOT/_pythonization/_tmva/_sofie/_parser/_keras/generate_keras_functional.py

@@ -0,0 +1,202 @@
+# functional_models.py


do we need this comment here?

sanjibansg · 2025-08-21T07:58:57Z

...izations/python/ROOT/_pythonization/_tmva/_sofie/_parser/_keras/generate_keras_functional.py

+    # # 1. Dropout (to test SOFIE's Identity operator)
+    # inp = layers.Input(shape=(10,))
+    # out = layers.Dropout(0.5)(inp)
+    # model = models.Model(inputs=inp, outputs=out)
+    # train_and_save(model, "Functional_Dropout_test")


maybe we remove the commented out code if this is not used anyway?

sanjibansg · 2025-08-21T07:59:36Z

...izations/python/ROOT/_pythonization/_tmva/_sofie/_parser/_keras/generate_keras_sequential.py

+    # 1. Dropout
+    # model = models.Sequential([
+    #     layers.Input(shape=(10,)),
+    #     layers.Dropout(0.5) # Dropout
+    # ])
+    # train_and_save(model, "Sequential_Dropout_test")
+
+    # 2. Binary Ops: Add, Subtract, Multiply are not typical in Sequential — skipping here
+
+    # 3. Concat (not applicable in Sequential without multi-input)


same comment as earlier for functional model

sanjibansg · 2025-08-21T08:00:45Z

.../pyroot/pythonizations/python/ROOT/_pythonization/_tmva/_sofie/_parser/_keras/layers/conv.py

+        return op
+    else:
+        raise RuntimeError(
+            "TMVA::SOFIE - Unsupported - Operator Gemm does not yet support input type " + fLayerDType


Suggested change

"TMVA::SOFIE - Unsupported - Operator Gemm does not yet support input type " + fLayerDType

"TMVA::SOFIE - Unsupported - Operator Conv does not yet support input type " + fLayerDType

sanjibansg · 2025-08-21T08:01:09Z

...yroot/pythonizations/python/ROOT/_pythonization/_tmva/_sofie/_parser/_keras/layers/binary.py

+          op =  gbl_namespace.TMVA.Experimental.SOFIE.ROperator_BasicBinary(float,'TMVA::Experimental::SOFIE::EBasicBinaryOperator::Mul')(fX1, fX2, fY)
+    else:
+        raise RuntimeError(
+            "TMVA::SOFIE - Unsupported - Operator Identity does not yet support input type " + fLayerDType


Suggested change

"TMVA::SOFIE - Unsupported - Operator Identity does not yet support input type " + fLayerDType

"TMVA::SOFIE - Unsupported - Operator BasicBinary does not yet support input type " + fLayerDType

sanjibansg · 2025-08-21T08:02:31Z

...ot/pythonizations/python/ROOT/_pythonization/_tmva/_sofie/_parser/_keras/layers/batchnorm.py

@@ -0,0 +1,48 @@
+from cppyy import gbl as gbl_namespace


maybe good to check if the datatypes for the input tensors are float since we don't support the other cases?

sanjibansg · 2025-08-21T08:03:07Z

...yroot/pythonizations/python/ROOT/_pythonization/_tmva/_sofie/_parser/_keras/layers/concat.py

+    input = [str(i) for i in finput]
+    output = str(foutput[0])
+    axis = int(attributes["axis"])
+    op =  gbl_namespace.TMVA.Experimental.SOFIE.ROperator_Concat(input, axis, 0,  output)


maybe good to check the datatype for the input tensor?

guitargeek

TensoFlow/Keras is too fragile to import unconditionally. You need to make sure it's not imported unconditionally, otherwise its presence will break several ROOT usecases, as we see in the tests. Always importing keras will also slow down importing ROOT, which is not desired

guitargeek · 2025-08-21T21:35:01Z

bindings/pyroot/pythonizations/python/ROOT/_pythonization/_tmva/_sofie/_parser/_keras/parser.py

@@ -0,0 +1,479 @@
+from ......_pythonization import pythonization
+from cppyy import gbl as gbl_namespace
+import keras


Here for example: instead of importing keras whenever the parsers are imported, you need to import it locally in the functions.

guitargeek · 2025-08-21T21:36:46Z

bindings/pyroot/pythonizations/python/ROOT/_pythonization/_tmva/__init__.py

@@ -44,6 +44,7 @@ def inject_rbatchgenerator(ns):


 from ._gnn import RModel_GNN, RModel_GraphIndependent
+from ._sofie._parser._keras.parser import RModelParser_Keras


But here is the main culprit: this global import is the entry point for all other imports I think.

github-actions · 2025-08-21T21:41:06Z

Test Results

21 files 21 suites 3d 23h 48m 50s ⏱️
3 496 tests 3 106 ✅ 0 💤 390 ❌
71 680 runs 69 584 ✅ 469 💤 1 627 ❌

For more details on these failures, see this check.

Results for commit 93a55ac.

pcanal · 2025-08-21T22:27:41Z

bindings/pyroot/pythonizations/python/ROOT/_pythonization/_tmva/__init__.py

@@ -44,6 +44,7 @@ def inject_rbatchgenerator(ns):


 from ._gnn import RModel_GNN, RModel_GraphIndependent
+from ._sofie._parser._keras.parser import RModelParser_Keras


Suggested change

from ._sofie._parser._keras.parser import RModelParser_Keras

[tmva][sofie] Keras Parser

93a55ac

PrasannaKasar requested review from bellenot, lmoneta, dpiparo and vepadulano as code owners August 20, 2025 08:18

devajithvs assigned lmoneta Aug 20, 2025

guitargeek added in:TMVA new contributor labels Aug 20, 2025

sanjibansg requested changes Aug 21, 2025

View reviewed changes

guitargeek reviewed Aug 21, 2025

View reviewed changes

pcanal reviewed Aug 21, 2025

View reviewed changes

vepadulano removed their request for review August 25, 2025 15:43

	"TMVA::SOFIE - Unsupported - Operator Gemm does not yet support input type " + fLayerDType
	"TMVA::SOFIE - Unsupported - Operator Conv does not yet support input type " + fLayerDType

	"TMVA::SOFIE - Unsupported - Operator Identity does not yet support input type " + fLayerDType
	"TMVA::SOFIE - Unsupported - Operator BasicBinary does not yet support input type " + fLayerDType

		@@ -44,6 +44,7 @@ def inject_rbatchgenerator(ns):


		from ._gnn import RModel_GNN, RModel_GraphIndependent
		from ._sofie._parser._keras.parser import RModelParser_Keras

[tmva][sofie] New Keras Parser #19692

Are you sure you want to change the base?

[tmva][sofie] New Keras Parser #19692

Uh oh!

Conversation

PrasannaKasar commented Aug 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

This Pull request:

Changes or fixes:

Checklist:

Uh oh!

guitargeek commented Aug 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sanjibansg commented Aug 21, 2025

Uh oh!

sanjibansg left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

guitargeek left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Aug 21, 2025

Test Results

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

PrasannaKasar commented Aug 20, 2025 •

edited

Loading

guitargeek commented Aug 20, 2025 •

edited

Loading