Service cleanup #184

romain-intel · 2021-05-28T19:57:46Z

No description provided.

savingoyal · 2021-05-28T20:48:53Z

services/metadata_service/api/admin.py

        tags:
        - Auth
        produces:
-        - text/plain
+        - application/json


Have you verified that this change wouldn't affect the sandbox? We have a similar issue where the heartbeat endpoint returns an actual JSON but is tagged as text/plain which has caused issues in the metaflow codebase before.

I am going to rework things and push a PR to make it so that everything works on application/json. I will also check with Ferras if there was a reason for the string only implementation.

savingoyal · 2021-05-28T20:49:27Z

services/metadata_service/api/artifact.py

@@ -78,10 +77,12 @@ async def get_artifact(self, request):
          required: true
          type: "string"
        produces:
-        - text/plain
+        - application/json


Same as the comment above.

saikonen · 2021-05-31T07:05:09Z

services/metadata_service/api/artifact.py

+        run = await self._db.get_run_ids(flow_name, run_number)
+        task = await self._db.get_task_ids(flow_name, run_number,
+                                            step_name, task_id)
+        if run.response_code != 200 or task.response_code != 200:
+            return DBResponse(400, {"message": "need to register run_id and task_id first"})


A more generic thought and not a blocker for merging the cleanup, but is there a reason why this kind of integrity check is not covered by a foreign key constraint in the table?

also affects the metadata create handler

It is covered I think; the issue is that it used to be silent (ie: the db would raise the error but it could silently make it through.

saikonen · 2021-05-31T07:21:25Z

services/metadata_service/api/utils.py

-            return http_500(str(err))
+            # either use provided traceback from subprocess, or generate trace from current process
+            err_trace = getattr(err, 'traceback_str', None) or get_traceback_str()
+            print(err_trace)


Necessary print or leftover from some testing? If this is for logging purposes, consider using the logger from services.utils instead.

If this is for generic logging over the process, it should probably be set as a logging handler with loop.set_exception_handler(some_error_handling_function) in the server setup phase instead. You can see https://github.com/Netflix/metaflow-service/blob/ui/services/ui_backend_service/ui_server.py#L98 for an example

I will use the logger. I did want to log things to aid in debugging after.

saikonen · 2021-05-31T07:24:20Z

services/utils/__init__.py

-async def read_body(request_content):
-    byte_array = bytearray()
-    while not request_content.at_eof():
-        data = await request_content.read(4)
-        byte_array.extend(data)
-
-    return json.loads(byte_array.decode("utf-8"))
-
-


Were there some issues encountered with using await response.json() or why was this previously necessary? Concerned about some possible edge cases that motivated the previous implementation

I will check with Ferras for the why of the previous implementation but I felt it would be better to use the standard implementation instead of doing our own.

saikonen · 2021-05-31T07:33:28Z

services/metadata_service/api/admin.py

@@ -139,5 +139,5 @@ async def get_authorization_token(self, request):

            return web.Response(status=200, body=json.dumps(credentials))
        except Exception as ex:
-            body = {"err_msg": str(ex), "traceback": get_traceback_str()}
+            body = {"message": str(ex), "traceback": get_traceback_str()}


is this the expected body for the authorization route? the http_500 helper only has a detail field for the error message. Consider keeping it consistent with the helper output if possible?

I am trying to make them all consistent. Still working on it. After I pushed this, I realized I missed af ew things.

saikonen · 2021-05-31T07:38:40Z

services/metadata_service/api/utils.py

+                body = await request.text()
+            except:
+                body = '<no body>'
+            print("Error caused when %s %s with query %s and body %s" %


Could also instead be using the logger here to adhere to configured log levels.

Yep, will change to a logger here and elsewhere. I will try to get the proper one (I couldn't find it initially).

saikonen · 2021-05-31T07:46:08Z

services/metadata_service/api/step.py

-        run_number, run_id = await self._db.get_run_ids(flow_name, run_number)
+        run = await self._db.get_run_ids(flow_name, run_number)
+        if run.response_code != 200:
+            return DBResponse(400, {"message": "need to register run_id first"})


Steps have a foreign key for runs at least in the V3 schema, so consider handling error code 409 specifically for this instead?

Is it necessary to rewrite the error messages on the response handler level, or could this be handled at a higher level in aiopg_exception_handling instead?

I am reworking this and yes, plan to raise an exception all the way through. The issue before though was that it would be silent.

saikonen · 2021-05-31T07:48:05Z

services/metadata_service/api/task.py

+        if run.response_code != 200:
+            return DBResponse(400, {"message": "need to register run_id and task_id first"})


same comment as for steps. Tasks in the V3 schema also have a foreign key covering this, so consider handling the specific error instead.

romain-intel · 2021-05-31T09:02:51Z

Thanks for all the comments. I am going to keep working on this. After I pushed it out, I realized it was still not as clean as I wanted it to be and that I would need something on the Metaflow side to talk properly to an actual REST service.

romain-intel added 3 commits May 28, 2021 11:15

Log more context on error and use .json instead of home-made method

ac16be7

Add pool recycle to pool

90965b5

Various cleanups and better error handling

e12afb2

romain-intel requested a review from ferras May 28, 2021 19:57

romain-intel mentioned this pull request May 28, 2021

Add a 'tags' endpoint to support runtime tagging #185

Closed

savingoyal requested changes May 28, 2021

View reviewed changes

romain-intel added a commit that referenced this pull request May 28, 2021

Make it independent of #184

2127ea0

saikonen reviewed May 31, 2021

View reviewed changes

romain-intel mentioned this pull request Jul 7, 2021

Make sure aiohttp web response body is a binary #199

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Service cleanup #184

Service cleanup #184

romain-intel commented May 28, 2021

savingoyal May 28, 2021

romain-intel May 31, 2021

savingoyal May 28, 2021

saikonen May 31, 2021

romain-intel May 31, 2021

saikonen May 31, 2021 •

edited

Loading

romain-intel May 31, 2021

saikonen May 31, 2021

romain-intel May 31, 2021

saikonen May 31, 2021

romain-intel May 31, 2021

saikonen May 31, 2021 •

edited

Loading

romain-intel May 31, 2021

saikonen May 31, 2021

romain-intel May 31, 2021

saikonen May 31, 2021

romain-intel commented May 31, 2021

		if run.response_code != 200:
		return DBResponse(400, {"message": "need to register run_id and task_id first"})

Service cleanup #184

Are you sure you want to change the base?

Service cleanup #184

Conversation

romain-intel commented May 28, 2021

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

saikonen May 31, 2021 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

saikonen May 31, 2021 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

romain-intel commented May 31, 2021

saikonen May 31, 2021 •

edited

Loading

saikonen May 31, 2021 •

edited

Loading