Feat: jaqpot-223 #14

johnsaveus · 2024-09-05T12:37:46Z

Torch Graph Inference for Binary Classification

alarv · 2024-09-10T09:31:28Z

src/handlers/predict.py

+    onnx_model = base64.b64decode(request.model["rawModel"])
+    ort_session = onnxruntime.InferenceSession(onnx_model)
+    feat_config = request.extraConfig["torchConfig"]["featurizer"]
+    # Load the featurizer
+    featurizer = SmilesGraphFeaturizer()
+    featurizer.load_json_rep(feat_config)
+    smiles = request.dataset["input"][0]["SMILES"]


This only works for SmilesGraphFeaturizer with a single SMILES string as the only independent feature. What about the other torch models that users may upload?

The SmilesGraphFeaturizer works for all the torch models at the moment.

Then we should rename it to JaqpotGraphFeaturizer. We should also stop hardcoding dataset["input"][0]["SMILES"]: it assumes the model input is always an array containing a single SMILES, which won't always be the case :/
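
A minimal sketch of the non-hardcoded version, assuming each row of dataset["input"] is a dict that carries the molecule under a "SMILES" key, as the snippet above suggests. The helper name extract_smiles and the sample rows are hypothetical and only for illustration; the real request object is not used here.

```python
from typing import Any, Dict, List


def extract_smiles(rows: List[Dict[str, Any]], key: str = "SMILES") -> List[str]:
    """Collect the SMILES string from every input row, not just the first one."""
    if not isinstance(rows, list):
        raise ValueError("dataset input must be a list of rows")
    smiles = []
    for index, row in enumerate(rows):
        # Fail loudly and name the offending row instead of guessing.
        if not isinstance(row, dict) or key not in row:
            raise ValueError(f"row {index} has no {key!r} field")
        smiles.append(row[key])
    return smiles


# Hypothetical stand-in for request.dataset["input"].
rows = [{"SMILES": "CCO"}, {"SMILES": "c1ccccc1"}]
all_smiles = extract_smiles(rows)
```

With this shape, a request carrying two molecules yields two SMILES strings, whereas indexing with [0] would silently drop the second one.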

alarv · 2024-09-10T10:01:01Z

src/handlers/predict.py

+    # Load the featurizer
+    featurizer = SmilesGraphFeaturizer()
+    featurizer.load_json_rep(feat_config)
+    smiles = request.dataset["input"][0]["SMILES"]


No need to hardcode the SMILES here; just pass the input through to the model.

Fixed. Needs approval to merge.

alarv · 2024-09-10T10:07:59Z

src/handlers/predict.py

+    # Load the featurizer
+    featurizer = SmilesGraphFeaturizer()
+    featurizer.load_json_rep(feat_config)
+    smiles = request.dataset["input"][0]


Suggested change

-    smiles = request.dataset["input"][0]
+def featurize_smiles_array(smiles_array, featurizer):
+    if not isinstance(smiles_array, list):
+        raise ValueError("Input must be a list of SMILES strings")
+    featurized_data = []
+    for smiles in smiles_array:
+        try:
+            features = featurizer.featurize(smiles)
+            featurized_data.append(features)
+        except Exception as e:
+            print(f"Error featurizing SMILES: {smiles}. Error: {str(e)}")
+            featurized_data.append(None)  # or handle the error as appropriate
+    return featurized_data
+# Usage
+smiles_array = request.dataset["input"]
+data = featurize_smiles_array(smiles_array, featurizer)

This keeps the code as generic as possible, so we won't need to change it in the future. Hardcoding [0] of the input means that if we ever upload two SMILES inputs, this won't work, and we won't know why until we find this [0] here 😄
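
One way the "handle the error as appropriate" part could look, sketched under assumptions: failures are logged and returned alongside the features, so the caller can report which inputs failed instead of only seeing None. The only thing taken from the suggestion is that the featurizer exposes featurize(smiles); UppercaseFeaturizer and featurize_all are hypothetical names used so the sketch runs without the real featurizer.

```python
import logging
from typing import Any, List, Optional, Tuple

logger = logging.getLogger(__name__)


class UppercaseFeaturizer:
    """Hypothetical stand-in for the real featurizer; only `featurize` is assumed."""

    def featurize(self, smiles: str) -> str:
        if not smiles:
            raise ValueError("empty SMILES")
        return smiles.upper()


def featurize_all(
    smiles_array: List[str], featurizer: Any
) -> Tuple[List[Optional[Any]], List[Tuple[int, str]]]:
    """Featurize every input; return the features plus (index, error) pairs."""
    if not isinstance(smiles_array, list):
        raise ValueError("Input must be a list of SMILES strings")
    features: List[Optional[Any]] = []
    failures: List[Tuple[int, str]] = []
    for index, smiles in enumerate(smiles_array):
        try:
            features.append(featurizer.featurize(smiles))
        except Exception as exc:
            logger.warning("Could not featurize input %d (%r): %s", index, smiles, exc)
            features.append(None)  # keep positions aligned with the input
            failures.append((index, str(exc)))
    return features, failures


features, failures = featurize_all(["CCO", "", "c1ccccc1"], UppercaseFeaturizer())
```

Keeping a None placeholder preserves the one-to-one alignment between inputs and outputs, and the failures list records which positions failed and why.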

alarv

Approving so we can merge; we'll address the comments in another PR.

johnsaveus added 6 commits July 31, 2024 16:27

starting_code_for_graph_inference

2cc295c

no_pickling_for_graph

41c7a45

remove predict_graph

f703a79

minor_fix

df939df

remove serializers

c7d95f3

comment_out

14db7d7

johnsaveus requested a review from vassilismin September 5, 2024 12:37

ruff_format

bccc3b6

vassilismin approved these changes Sep 10, 2024

johnsaveus added 2 commits September 10, 2024 12:26

Merge branch 'main' into feat/JAQPOT-223/Inference_for_graphs

8e113a0

Update predict.py

01a99c6

alarv reviewed Sep 10, 2024

johnsaveus and others added 2 commits September 10, 2024 12:33

Update predict.py

0aa0a0b

fix: build

d00d2a6

alarv requested changes Sep 10, 2024

input fix

ba0bd21

alarv reviewed Sep 10, 2024

alarv approved these changes Sep 10, 2024

johnsaveus merged commit a960e5e into main Sep 10, 2024
2 checks passed

johnsaveus deleted the feat/JAQPOT-223/Inference_for_graphs branch September 10, 2024 10:15
