Add schema for embed API requests #117

bjester · 2023-11-02T18:08:51Z

Summary

Defines a JSON schema for the structure of recommendations API requests that will be received by TorchServe
Fields level, parent, and has_content will be generated in preprocessing in the TorchServe handler
TODO: unit tests to ensure schema does what we expect

Reference

Co-authored-by: Samson Akol <[email protected]>

bjester · 2023-11-02T18:11:40Z

@jamalex :: @akolson and I worked on a JSON schema to nail down the format of the data sent to TorchServe for topic recommendations. We followed your example in the example pipeline code. Does it look sufficient?

Other questions:

Do we need has_content to represent whether a topic has non-topic resources? Or is that only representative of whether it has subtopics?
What is category and do we need it?

jamalex

I added some clarifying comments in here -- the main thing is that the current schema seems designed to send the entire tree at once, whereas we only need the target topic(s) and their ancestors.

Fields level, parent, and has_content will be generated in preprocessing in the TorchServe handler

Level and has_content aren't important for embedding. The parent/ancestors are needed, though.

spec/recommendations-v1.json

jamalex

Thanks, great changes -- getting closer! I had a few high level points/questions (such as this being the schema for calling an embedding vs recommender endpoint, and needing to support embedding of content resources in addition to topics). And then some smaller notes. Happy to hop on a call to chat through any of my notes that may be confusing! Thanks.

spec/recommendations-v1.json

jamalex · 2024-01-11T17:42:40Z

spec/recommendations-v1.json

@@ -0,0 +1,77 @@
+{
+  "$id": "/schemas/recommendations_request",


Just wondering about naming: is this endpoint for recommendations, or embeddings? (I know there was discussion of having endpoints for making actual recommendations, as well -- but then we'd probably want to add a few additional parameters in addition to what's here, e.g. around desired number of recommended items to return, etc. So it may be best to have this named as an embedding endpoint schema?)

If I understand you correctly, I think "$id": "/schemas/recommendations_request", here is a URI for the schema to refer to elements of the schema from inside the same document or from external JSON documents. However, your comment on endpoint naming still stands and will be put into consideration for the endpoints we intend to implement.

Thus said, this schema is for embedding purposes only. Serve responds with embeddings that we hope to store and make comparisons against later in studio. I will change the URI to embed_request for better clarity.

spec/recommendations-v1.json

akolson · 2024-01-12T13:03:48Z

Hi @jamalex I was able to incorporate the feedback. It was pretty clear so need for the call. We should now be more closer to the final request body 🤞. Please let me know incase there is anything that may require an update. @bjester, a few files have been renamed as noted by Jamie in his comment. Also, any feedback is welcome. Thanks

tests/test_schemas.py

Resolved

js/EmbedRequest.js

jamalex

Looks good to me -- thanks! I left one small comment that may be good to address if you have a chance, but not a blocker.

Add schema for recommendations API requests

4472c12

Co-authored-by: Samson Akol <[email protected]>

bjester requested a review from jamalex November 2, 2023 18:09

jamalex requested changes Nov 9, 2023

View reviewed changes

spec/recommendations-v1.json Outdated Show resolved Hide resolved

spec/recommendations-v1.json Outdated Show resolved Hide resolved

spec/recommendations-v1.json Outdated Show resolved Hide resolved

spec/recommendations-v1.json Outdated Show resolved Hide resolved

akolson added 2 commits January 11, 2024 00:19

flattens schema and updates annotations

87b5cd1

Removes uuid validation from id

2bc8287

akolson requested a review from jamalex January 10, 2024 21:34

Enable building with generate script, add test

f29bcfe

jamalex previously requested changes Jan 11, 2024

View reviewed changes

akolson added 5 commits January 12, 2024 15:25

Adds feedback and fixes failing validation test

eb0a726

Fixes linting noise

30fe07f

Fixes more linting noise

aea321e

updates js version of schema

d43090d

Renames files for clarity

c050428

add __init__.py to validators/

d08adbe

bjester commented Jan 12, 2024

View reviewed changes

tests/test_schemas.py Show resolved Hide resolved

akolson added 2 commits January 12, 2024 18:31

run regenerate schema file

9ae7732

rename test

0317474

akolson requested a review from jamalex January 12, 2024 15:38

bjester marked this pull request as ready for review January 12, 2024 15:53

akolson changed the title ~~Add schema for recommendations API requests~~ Add schema for embed API requests Jan 15, 2024

akolson mentioned this pull request Jan 15, 2024

Implement pre-processing for the embed api learningequality/studio#4399

Closed

jamalex reviewed Jan 15, 2024

View reviewed changes

js/EmbedRequest.js Outdated Show resolved Hide resolved

jamalex approved these changes Jan 15, 2024

View reviewed changes

akolson added 3 commits January 15, 2024 23:48

Remove required fields from metadata

0737764

Remove required fields from metadata

303dafd

Fixes linting errors

54a2e34

rtibbles assigned jamalex Jan 16, 2024

akolson merged commit 006c130 into learningequality:main Jan 17, 2024
10 checks passed

akolson mentioned this pull request Jan 25, 2024

Split the embed_request json schema #119

Closed

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add schema for embed API requests #117

Add schema for embed API requests #117

bjester commented Nov 2, 2023 •

edited by akolson

Loading

bjester commented Nov 2, 2023

jamalex left a comment

jamalex left a comment

jamalex Jan 11, 2024

akolson Jan 11, 2024

akolson Jan 12, 2024

akolson commented Jan 12, 2024 •

edited

Loading

jamalex left a comment

Add schema for embed API requests #117

Add schema for embed API requests #117

Conversation

bjester commented Nov 2, 2023 • edited by akolson Loading

Summary

Reference

bjester commented Nov 2, 2023

jamalex left a comment

Choose a reason for hiding this comment

jamalex left a comment

Choose a reason for hiding this comment

jamalex Jan 11, 2024

Choose a reason for hiding this comment

akolson Jan 11, 2024

Choose a reason for hiding this comment

akolson Jan 12, 2024

Choose a reason for hiding this comment

akolson commented Jan 12, 2024 • edited Loading

jamalex left a comment

Choose a reason for hiding this comment

bjester commented Nov 2, 2023 •

edited by akolson

Loading

akolson commented Jan 12, 2024 •

edited

Loading