Replies: 1 comment
CC: @jbkyang-nvi
The Python client API is fairly low-level, and requires that the user (i.e., the developer invoking the client API) write a lot of boilerplate to make a remote inference call. Consider this client example for doing inference on an image, which is over 400 lines long.
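For context, here is roughly what a single request looks like with the low-level gRPC client (the model and tensor names are placeholders, and a real client also needs argument parsing, preprocessing, batching, and error handling, which is where the 400 lines go):

```python
import numpy as np
import tritonclient.grpc as grpcclient

client = grpcclient.InferenceServerClient(url="localhost:8001")

# Every request means hand-building InferInput / InferRequestedOutput
# objects whose names, shapes, and dtypes match the model config exactly.
image = np.zeros((1, 3, 224, 224), dtype=np.float32)  # already-preprocessed image
infer_input = grpcclient.InferInput("input__0", list(image.shape), "FP32")
infer_input.set_data_from_numpy(image)
requested = grpcclient.InferRequestedOutput("output__0")

response = client.infer(model_name="resnet50",
                        inputs=[infer_input],
                        outputs=[requested])
scores = response.as_numpy("output__0")
```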
While it's valuable to have access to the low-level API for building the RPC request objects directly, it would be nice to have a higher-level API that handles the many simpler cases.
I wrote a wrapper API that shows one way this could be done. Using this API, classifying an image takes just a few lines:
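Something like the following, where the `TritonModel` name and its arguments are assumptions rather than the wrapper's actual API (`ImageInput` and `ClassificationOutput` are described below, and sketched at the end of this post):

```python
from PIL import Image

# Hypothetical wrapper usage; TritonModel, ImageInput, and
# ClassificationOutput are sketched at the end of this post.
model = TritonModel(
    "resnet50",                      # assumed model name
    url="localhost:8001",
    input=ImageInput(),
    output=ClassificationOutput(),
)

result = model.infer(Image.open("cat.jpg"))
print(result.class_name, result.score)
```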
In this example, `ImageInput` preprocesses the user's input (a PIL `Image` object), and `ClassificationOutput` postprocesses Triton's output into an object with attributes like `score` and `class_name`. The user doesn't need to cargo-cult this code from Nvidia's examples (which, at the time of writing, are inconsistent across example scripts, by the way).

I don't contend that this is the ideal higher-level API, but I think it shows the potential of a simpler way to make inference calls without a lot of boilerplate. The whole implementation is not much more than the example code file that I linked earlier, but it is easier to reuse and extend, and it nicely separates the user's application logic from the fiddly parts of making inference requests.
This is just a sketch and shouldn't be used as-is. I did add support for multiple inputs and outputs, but I didn't implement the HTTP API, streaming, or async, and I didn't test on many different models.
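For concreteness, here is a minimal sketch of what such a wrapper could look like. All class names, default tensor names, and the preprocessing below are assumptions (and it simplifies to a single input and output, unlike the actual wrapper):

```python
from dataclasses import dataclass

import numpy as np
import tritonclient.grpc as grpcclient


@dataclass
class Classification:
    """What the application sees: a score and a label, not tensors."""
    score: float
    class_name: str


class ImageInput:
    """Converts a PIL image into the input tensor the model expects."""

    def __init__(self, name="input__0", size=(224, 224)):
        self.name = name
        self.size = size

    def to_infer_input(self, image):
        # Resize, scale to [0, 1], and reorder HWC -> NCHW with a batch dim.
        array = np.asarray(image.convert("RGB").resize(self.size), dtype=np.float32)
        array = array.transpose(2, 0, 1)[np.newaxis] / 255.0
        infer_input = grpcclient.InferInput(self.name, list(array.shape), "FP32")
        infer_input.set_data_from_numpy(array)
        return infer_input


class ClassificationOutput:
    """Turns the raw output tensor into a Classification object."""

    def __init__(self, name="output__0", labels=None):
        self.name = name
        self.labels = labels or []

    def to_requested_output(self):
        return grpcclient.InferRequestedOutput(self.name)

    def from_response(self, response):
        scores = response.as_numpy(self.name).ravel()
        best = int(scores.argmax())
        label = self.labels[best] if best < len(self.labels) else str(best)
        return Classification(score=float(scores[best]), class_name=label)


class TritonModel:
    """Ties one input adapter and one output adapter to a model."""

    def __init__(self, model_name, url, input, output):
        self.client = grpcclient.InferenceServerClient(url=url)
        self.model_name = model_name
        self.input = input
        self.output = output

    def infer(self, value):
        response = self.client.infer(
            model_name=self.model_name,
            inputs=[self.input.to_infer_input(value)],
            outputs=[self.output.to_requested_output()],
        )
        return self.output.from_response(response)
```

The pattern is plain dependency injection: the input and output adapters own all the tensor plumbing, so application code only ever touches PIL images and `Classification` objects, and swapping in a different preprocessing or output format means swapping an adapter, not editing the request code.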