[ML] [PyTorch] Communications with between ES and the native process #1700

davidkyle · 2021-01-28T09:59:28Z

Request ID

The design has to accommodate multiple concurrent inference requests to the server, so a mechanism is required to tie each model output to a specific request. This could be inferred from the processing order, which is strictly FIFO, but adding an ID token to each request provides additional context for development and debugging. The token has no semantics and is passed through the C++ code unchanged. The Anomaly Detector flushID is the prior art here.
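
As an illustration of the request ID idea above, here is a minimal sketch of how the calling side could correlate outputs with requests. PendingRequests and the "req-N" ID scheme are hypothetical names made up for this example; they are not taken from the ES or native code.

```python
import itertools


class PendingRequests:
    """Tracks in-flight requests by ID (hypothetical helper, for illustration only)."""

    def __init__(self):
        self._counter = itertools.count(1)
        self._pending = {}

    def register(self, payload):
        """Assign an opaque ID to a payload and remember it until a result arrives."""
        request_id = f"req-{next(self._counter)}"
        self._pending[request_id] = payload
        return request_id

    def resolve(self, request_id):
        """Return and forget the payload for a completed request (KeyError if unknown)."""
        return self._pending.pop(request_id)

    def __len__(self):
        return len(self._pending)


pending = PendingRequests()
first = pending.register([101, 7592, 102])
second = pending.register([101, 2088, 102])

# Results are looked up by ID, so correlation does not depend on FIFO ordering.
assert pending.resolve(second) == [101, 2088, 102]
assert pending.resolve(first) == [101, 7592, 102]
```

Because the ID is opaque to the native process, any unique string would do; a counter is used here only to keep the sketch short.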

Payload

The inference payload is a series of numeric tokens. An individual inference request will consist of the request ID, the payload tokens and a marker to delineate each request.

Anomaly Detection uses a concise, length-encoded binary protocol because of the high volume of data sent across pipes. The inference input is small by comparison, so a more verbose input format can be used, which has the advantage of being descriptive.

Input Format

A JSON document:

{
  "request_id" : "string",
  "token_ids" : [int, int,...],
  "attention_mask" : [int, int,...],
  "token_type_ids" : [int, int,...],
  "position_ids" : [int, int,...]
}

token_ids and attention_mask are required for all uses; token_type_ids and position_ids are optional, depending on the model type.
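
A request in the input format above could be built along these lines. This is a sketch only: build_request is a hypothetical helper, and the check that token_ids and attention_mask have equal length is an assumption based on how such models are normally fed, not something this proposal specifies.

```python
import json


def build_request(request_id, token_ids, attention_mask,
                  token_type_ids=None, position_ids=None):
    """Serialize one inference request as a single-line JSON string."""
    # Assumption: the mask has one entry per token.
    if len(token_ids) != len(attention_mask):
        raise ValueError("token_ids and attention_mask must have the same length")

    doc = {
        "request_id": request_id,
        "token_ids": list(token_ids),
        "attention_mask": list(attention_mask),
    }
    # Optional fields are omitted entirely when the model does not need them.
    if token_type_ids is not None:
        doc["token_type_ids"] = list(token_type_ids)
    if position_ids is not None:
        doc["position_ids"] = list(position_ids)

    # json.dumps emits no newlines by default, so the document fits on one line.
    return json.dumps(doc)


request = build_request("req-1", [101, 7592, 102], [1, 1, 1])
assert json.loads(request)["token_ids"] == [101, 7592, 102]
```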

Output Format

A JSON document, chosen for flexibility, containing the request ID token, the result tensor and, optionally, the predicted tokens, depending on the model type:

{
  "request_id" : "string",
  "predictions" : [float, float,...],
  "tokens" : [int, int,...]
}
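
On the receiving side, an output document of this shape could be parsed as sketched below. InferenceResult and parse_result are hypothetical names for this example, and the field names follow the proposal as written above.

```python
import json
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class InferenceResult:
    """One parsed output document (hypothetical container, for illustration only)."""
    request_id: str
    predictions: List[float]
    tokens: Optional[List[int]] = None  # only present for some model types


def parse_result(text):
    """Parse a JSON output document into an InferenceResult."""
    doc = json.loads(text)
    return InferenceResult(
        request_id=doc["request_id"],
        predictions=[float(x) for x in doc["predictions"]],
        tokens=doc.get("tokens"),  # None when the field is absent
    )


result = parse_result('{"request_id": "req-1", "predictions": [0.25, 0.75]}')
assert result.request_id == "req-1"
assert result.tokens is None
```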


droberts195 · 2021-01-28T10:07:40Z

Should the output have token_ids rather than tokens? Presumably these are IDs that are looked up against the same mapping table as the input tokens?

droberts195 · 2021-01-28T10:37:39Z

It might be worth saying the input format is ND-JSON rather than arbitrary JSON. Then each input document or command document can be one line of a text file.

We have functionality for parsing a stream of arbitrarily formatted JSON documents separated by \0 characters, but this is not a friendly format for testing at the command line using simple text files. So ND-JSON is probably a better format for the long term.
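
To illustrate the ND-JSON suggestion, here is a minimal sketch of reading one document per line from a text stream. read_ndjson is a hypothetical helper and the sample token values are made up; skipping blank lines is a convenience assumed here.

```python
import io
import json


def read_ndjson(stream):
    """Yield one parsed JSON document per non-empty line of a text stream."""
    for line in stream:
        line = line.strip()
        if line:  # tolerate blank lines between documents
            yield json.loads(line)


# A plain text file with one request per line behaves the same as this in-memory stream.
sample = io.StringIO(
    '{"request_id": "a", "token_ids": [101, 102], "attention_mask": [1, 1]}\n'
    '{"request_id": "b", "token_ids": [101, 2088, 102], "attention_mask": [1, 1, 1]}\n'
)
docs = list(read_ndjson(sample))
assert [d["request_id"] for d in docs] == ["a", "b"]
```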

davidkyle · 2021-04-23T09:51:18Z

Closed by elastic/elasticsearch#70713 and #1770

davidkyle added :ml >feature labels Jan 28, 2021

davidkyle mentioned this issue Jan 28, 2021

[ML] [PyTorch] Create command processor for inference app #1701

Closed

davidkyle added 3rd party models and removed :ml labels Jan 28, 2021

davidkyle mentioned this issue Feb 24, 2021

[ML] PyTorch Command Processor #1770

Merged

davidkyle closed this as completed Apr 23, 2021
